BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy7460
MLMEVEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSK
SQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTG
KDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY
GKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVIQRLVLEKKAIM
LIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRG
RQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYERIVADR
EKVEKMLMEVEKDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKL
VEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSK
VKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHA
VLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVIQRLVL
EKKAIMLIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKA
FIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYE
RIVADREKVEKMLMEVEKDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYA
IKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCA
YDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPY
DLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVK
NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAG
YATIDV

High Scoring Gene Products

Symbol, full name Information P value
tag-196 gene from Caenorhabditis elegans 1.9e-42
CG12163 protein from Drosophila melanogaster 8.5e-42
CTSW
Uncharacterized protein
protein from Bos taurus 1.7e-41
CTSW
Uncharacterized protein
protein from Canis lupus familiaris 6.4e-41
Ctsw
cathepsin W
gene from Rattus norvegicus 2.4e-39
ctsf
cathepsin F
gene_product from Danio rerio 2.5e-39
CTSF
Uncharacterized protein
protein from Sus scrofa 2.6e-39
CTSF
Uncharacterized protein
protein from Canis lupus familiaris 8.6e-39
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 8.6e-39
CTSH
Pro-cathepsin H
protein from Bos taurus 8.8e-39
CTSF
Cathepsin F
protein from Homo sapiens 1.1e-38
CTSH
Pro-cathepsin H
protein from Sus scrofa 1.1e-38
CTSF
Uncharacterized protein
protein from Bos taurus 1.1e-38
Ctsw
cathepsin W
protein from Mus musculus 2.2e-38
AT3G45310 protein from Arabidopsis thaliana 2.3e-38
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 2.3e-38
LOC100525853
Uncharacterized protein
protein from Sus scrofa 2.2e-37
CTSH
Uncharacterized protein
protein from Equus caballus 2.8e-37
Ctsf
cathepsin F
gene from Rattus norvegicus 3.5e-37
CTSH
Uncharacterized protein
protein from Callithrix jacchus 4.5e-37
CTSH
Uncharacterized protein
protein from Callithrix jacchus 5.8e-37
CTSH
Uncharacterized protein
protein from Macaca mulatta 7.4e-37
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 1.2e-36
CTSH
Pro-cathepsin H
protein from Homo sapiens 2.5e-36
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 2.5e-36
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 2.5e-36
CTSH
Uncharacterized protein
protein from Gorilla gorilla gorilla 5.3e-36
Ctsh
cathepsin H
gene from Rattus norvegicus 5.3e-36
Ctsf
cathepsin F
protein from Mus musculus 6.8e-36
ALP
aleurain-like protease
protein from Arabidopsis thaliana 8.7e-36
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 9.7e-36
Ctsk
cathepsin K
gene from Rattus norvegicus 1.8e-35
CTSW
Cathepsin W
protein from Homo sapiens 1.9e-35
AT3G54940 protein from Arabidopsis thaliana 2.3e-35
Ctsh
cathepsin H
protein from Mus musculus 3.0e-35
AT2G21430 protein from Arabidopsis thaliana 3.0e-35
RD19
RESPONSIVE TO DEHYDRATION 19
protein from Arabidopsis thaliana 6.3e-35
Ctsk
cathepsin K
protein from Mus musculus 8.1e-35
CTSS
Cathepsin S
protein from Bos taurus 4.5e-34
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 4.5e-34
CTSK
Cathepsin K
protein from Homo sapiens 5.8e-34
Ctss
cathepsin S
protein from Mus musculus 7.4e-34
CTSK
Cathepsin K
protein from Bos taurus 9.5e-34
Ctss
cathepsin S
gene from Rattus norvegicus 1.2e-33
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 2.0e-33
CTSK
Cathepsin K
protein from Sus scrofa 2.0e-33
ctssb.1
cathepsin Sb, tandem duplicate 1
gene_product from Danio rerio 2.0e-33
ctssb.2
cathepsin Sb, tandem duplicate 2
gene_product from Danio rerio 4.2e-33
cprA
cysteine proteinase 1
gene from Dictyostelium discoideum 5.4e-33
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.9e-33
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.9e-33
cpl-1 gene from Caenorhabditis elegans 1.4e-32
R09F10.1 gene from Caenorhabditis elegans 1.4e-32
ctsh
cathepsin H
gene_product from Danio rerio 1.5e-32
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.8e-32
CTSS
Cathepsin S
protein from Canis lupus familiaris 2.4e-32
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 3.8e-32
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 8.1e-32
ctsl.1
cathepsin L.1
gene_product from Danio rerio 1.0e-31
AT4G16190 protein from Arabidopsis thaliana 1.0e-31
CTSL2
Uncharacterized protein
protein from Gallus gallus 1.3e-31
CG4847 protein from Drosophila melanogaster 1.7e-31
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.7e-31
CTSL2
Cathepsin L2
protein from Homo sapiens 2.2e-31
CTSK
Cathepsin K
protein from Gallus gallus 2.2e-31
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 3.5e-31
Ctsl1
cathepsin L1
gene from Rattus norvegicus 3.5e-31
CTSL1
Cathepsin L1
protein from Bos taurus 3.5e-31
RD21B
responsive to dehydration 21B
protein from Arabidopsis thaliana 3.5e-31
Ctsl
cathepsin L
protein from Mus musculus 4.5e-31
DDB_G0272298 gene from Dictyostelium discoideum 5.8e-31
AT3G19390 protein from Arabidopsis thaliana 7.4e-31
ctsla
cathepsin La
gene_product from Danio rerio 7.4e-31
ctso
cathepsin O
gene_product from Danio rerio 9.5e-31
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 9.5e-31
Ctso
cathepsin O
protein from Mus musculus 9.6e-31
CTSL
Cathepsin L1
protein from Ovis aries 1.2e-30
CTSL1
Cathepsin L1
protein from Gallus gallus 1.2e-30
AT4G23520 protein from Arabidopsis thaliana 1.6e-30
CTSL1
Cathepsin L1
protein from Bos taurus 2.0e-30
LOC100153090
Uncharacterized protein
protein from Sus scrofa 2.0e-30
CTSO
Uncharacterized protein
protein from Canis lupus familiaris 2.0e-30
XCP2
AT1G20850
protein from Arabidopsis thaliana 2.0e-30
CTSS
Uncharacterized protein
protein from Gallus gallus 2.5e-30
CTSL2
Cathepsin L2
protein from Bos taurus 3.3e-30
CTSW
Cathepsin W
protein from Homo sapiens 3.8e-30
Ctss
Cathepsin S
protein from Rattus norvegicus 4.2e-30
ctssa
cathepsin Sa
gene_product from Danio rerio 5.3e-30
AT1G06260 protein from Arabidopsis thaliana 6.8e-30
CTSS
Cathepsin S
protein from Homo sapiens 8.7e-30
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 8.7e-30
ctskl
cathepsin K, like
gene_product from Danio rerio 1.1e-29
CTSL1
Cathepsin L1
protein from Sus scrofa 1.1e-29
Ctsj
cathepsin J
protein from Mus musculus 1.1e-29
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 1.1e-29
R07E3.1 gene from Caenorhabditis elegans 1.1e-29
CTSO
Cathepsin O
protein from Homo sapiens 1.4e-29
zgc:174153 gene_product from Danio rerio 1.4e-29

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy7460
        (1026 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

WB|WBGene00007055 - symbol:tag-196 species:6239 "Caenorha...   459  1.9e-42   1
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   453  8.5e-42   1
UNIPROTKB|F1MHV4 - symbol:CTSW "Uncharacterized protein" ...   318  1.7e-41   3
UNIPROTKB|E2RPX3 - symbol:CTSW "Uncharacterized protein" ...   305  6.4e-41   3
RGD|1309354 - symbol:Ctsw "cathepsin W" species:10116 "Ra...   373  2.4e-39   3
ZFIN|ZDB-GENE-030131-9831 - symbol:ctsf "cathepsin F" spe...   430  2.5e-39   1
UNIPROTKB|F1RU48 - symbol:CTSF "Uncharacterized protein" ...   424  2.6e-39   2
UNIPROTKB|E2RR02 - symbol:CTSF "Uncharacterized protein" ...   425  8.6e-39   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   425  8.6e-39   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   415  8.8e-39   2
UNIPROTKB|Q9UBX1 - symbol:CTSF "Cathepsin F" species:9606...   424  1.1e-38   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   424  1.1e-38   1
UNIPROTKB|Q0VCU3 - symbol:CTSF "Uncharacterized protein" ...   424  1.1e-38   1
MGI|MGI:1338045 - symbol:Ctsw "cathepsin W" species:10090...   365  2.2e-38   3
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   410  2.3e-38   2
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   421  2.3e-38   1
UNIPROTKB|F1RU23 - symbol:CTSW "Uncharacterized protein" ...   412  2.2e-37   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   411  2.8e-37   1
RGD|1308181 - symbol:Ctsf "cathepsin F" species:10116 "Ra...   410  3.5e-37   1
UNIPROTKB|F1P3U9 - symbol:CTSH "Uncharacterized protein" ...   409  4.5e-37   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   409  4.5e-37   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   408  5.8e-37   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   407  7.4e-37   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   405  1.2e-36   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   402  2.5e-36   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   402  2.5e-36   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   402  2.5e-36   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   399  5.3e-36   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   399  5.3e-36   1
MGI|MGI:1861434 - symbol:Ctsf "cathepsin F" species:10090...   398  6.8e-36   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   397  8.7e-36   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   286  9.7e-36   4
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   394  1.8e-35   1
UNIPROTKB|P56202 - symbol:CTSW "Cathepsin W" species:9606...   280  1.9e-35   3
TAIR|locus:2082687 - symbol:AT3G54940 species:3702 "Arabi...   393  2.3e-35   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   392  3.0e-35   1
TAIR|locus:2050145 - symbol:AT2G21430 species:3702 "Arabi...   392  3.0e-35   1
TAIR|locus:2120222 - symbol:RD19 "RESPONSIVE TO DEHYDRATI...   389  6.3e-35   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   388  8.1e-35   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   381  4.5e-34   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   381  4.5e-34   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   380  5.8e-34   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   379  7.4e-34   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   378  9.5e-34   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   377  1.2e-33   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   375  2.0e-33   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   375  2.0e-33   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   365  2.0e-33   2
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   366  4.2e-33   2
DICTYBASE|DDB_G0290957 - symbol:cprA "cysteine proteinase...   371  5.4e-33   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   370  6.9e-33   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   370  6.9e-33   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   368  1.1e-32   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   367  1.4e-32   1
WB|WBGene00019986 - symbol:R09F10.1 species:6239 "Caenorh...   367  1.4e-32   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   354  1.5e-32   2
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   366  1.8e-32   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   365  2.4e-32   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   363  3.8e-32   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   351  8.1e-32   2
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   359  1.0e-31   1
TAIR|locus:2130180 - symbol:AT4G16190 species:3702 "Arabi...   359  1.0e-31   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   358  1.3e-31   1
FB|FBgn0034229 - symbol:CG4847 species:7227 "Drosophila m...   357  1.7e-31   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   357  1.7e-31   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   356  2.2e-31   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   356  2.2e-31   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   354  3.5e-31   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   354  3.5e-31   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   354  3.5e-31   1
TAIR|locus:2167821 - symbol:RD21B "responsive to dehydrat...   354  3.5e-31   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   353  4.5e-31   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   352  5.8e-31   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   351  7.4e-31   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   351  7.4e-31   1
ZFIN|ZDB-GENE-080724-8 - symbol:ctso "cathepsin O" specie...   350  9.5e-31   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   350  9.5e-31   1
MGI|MGI:2139628 - symbol:Ctso "cathepsin O" species:10090...   338  9.6e-31   2
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   349  1.2e-30   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   349  1.2e-30   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   348  1.6e-30   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   347  2.0e-30   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   347  2.0e-30   1
UNIPROTKB|F1PGK4 - symbol:CTSO "Uncharacterized protein" ...   347  2.0e-30   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   347  2.0e-30   1
UNIPROTKB|H9KYW5 - symbol:CTSS "Uncharacterized protein" ...   346  2.5e-30   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   345  3.3e-30   1
UNIPROTKB|E9PI30 - symbol:CTSW "Cathepsin W" species:9606...   280  3.8e-30   3
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   344  4.2e-30   1
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   343  5.3e-30   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   342  6.8e-30   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   341  8.7e-30   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   341  8.7e-30   1
ZFIN|ZDB-GENE-050208-336 - symbol:ctskl "cathepsin K, lik...   340  1.1e-29   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   340  1.1e-29   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   340  1.1e-29   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   340  1.1e-29   1
WB|WBGene00011102 - symbol:R07E3.1 species:6239 "Caenorha...   340  1.1e-29   1
UNIPROTKB|P43234 - symbol:CTSO "Cathepsin O" species:9606...   339  1.4e-29   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   339  1.4e-29   1

WARNING:  Descriptions of 175 database sequences were not reported due to the
          limiting value of parameter V = 100.
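
The one-line descriptions above follow a regular layout: a database prefix and accession separated by "|", a "symbol:" field, a truncated description, and three trailing numeric columns (High Score, Smallest Sum Probability P(N), and N). As a minimal sketch, assuming only the layout visible in this report, such lines could be parsed as follows. The names Hit and parse_hit_line are hypothetical helpers introduced for illustration; they are not part of WU-BLAST or of any library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' list."""
    database: str    # e.g. "WB", "UNIPROTKB", "TAIR"
    accession: str   # e.g. "WBGene00007055", "MGI:1338045"
    symbol: str      # e.g. "tag-196"
    score: int       # High Score column
    p_value: float   # Smallest Sum Probability P(N)
    n: int           # N column


# Assumed layout (taken from the lines in this report, not from a format spec):
#   DB|ACCESSION - symbol:SYMBOL <truncated description>   SCORE  P(N)  N
_HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+)"
    r".*?\s+(?P<score>\d+)\s+(?P<p>[\d.]+(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a description line, or None if the line does not match."""
    m = _HIT_LINE.match(line)
    if m is None:
        return None
    return Hit(
        database=m["db"],
        accession=m["acc"],
        symbol=m["symbol"],
        score=int(m["score"]),
        p_value=float(m["p"]),
        n=int(m["n"]),
    )
```

Applying parse_hit_line to each line of the list and discarding the None results gives one record per reported hit; header lines, the progress line, and the warning do not match the pattern and are skipped.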


>WB|WBGene00007055
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 459 (166.6 bits), Expect = 1.9e-42, P = 1.9e-42
 Identities = 103/317 (32%), Positives = 160/317 (50%)

Query:   288 ILETFKAFIVKRGRQYANDEEIKERFEYFKQDGH-----KKHER----YGTSEFSDRSPE 338
             I  +F  F+ +  ++Y N  E+ +RF  FK++       +K+E+    YG ++FSD +  
Sbjct:   170 IWNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTM 229

Query:   339 EIL-CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAAC 397
             E       ++W +  Y    A+             +  +P+++DWR+K       +Q  C
Sbjct:   230 EFKKIMLPYQWEQPVYPMEQAN----FEKHDVTINEEDLPESFDWREKGAVTQVKNQGNC 285

Query:   398 GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPI--EYTHQAG 455
             GSCWAFS  G +EG + I   KLV  S+ +LV+C     GC G  GL      E     G
Sbjct:   286 GSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQGCNG--GLPSNAYKEIIRMGG 343

Query:   456 LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSH 515
             LE E  YPY +G GE   C   +  + ++        +    M+K L   GP+S+GLN++
Sbjct:   344 LEPEDAYPY-DGRGET--CHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNAN 400

Query:   516 LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIER 575
              + FY    +      C P+ L H VL+VGYGK    PYW+ +NSWGP   + G+FK+ R
Sbjct:   401 TLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYR 460

Query:   576 GNNACGIEQIAGYATID 592
             G N CG++++A  A ++
Sbjct:   461 GKNVCGVQEMATSALVN 477

 Score = 440 (159.9 bits), Expect = 2.1e-40, P = 2.1e-40
 Identities = 104/318 (32%), Positives = 162/318 (50%)

Query:   654 ILETFKAFIVKRGRQYANDEEIKERFEYFKQDGH-----KKHER----YGTSEFSDRSPE 704
             I  +F  F+ +  ++Y N  E+ +RF  FK++       +K+E+    YG ++FSD +  
Sbjct:   170 IWNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTM 229

Query:   705 EIL-CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAAC 763
             E       ++W +  Y    A+             +  +P+++DWR+K       +Q  C
Sbjct:   230 EFKKIMLPYQWEQPVYPMEQAN----FEKHDVTINEEDLPESFDWREKGAVTQVKNQGNC 285

Query:   764 GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH---QAG 820
             GSCWAFS  G +EG + I   KLV  S+ +LV+C     GC+G    PS  Y       G
Sbjct:   286 GSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGL--PSNAYKEIIRMGG 343

Query:   821 LESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLNS 879
             LE E  YPY +  GE   C   +  + ++  G   L  +  E M+K L   GP+S+ LN+
Sbjct:   344 LEPEDAYPY-DGRGET--CHLVRKDIAVYINGSVELPHDEVE-MQKWLVTKGPISIGLNA 399

Query:   880 DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIE 939
             + +  Y    +      C P+ L H VL+VGYGK    PYW+V+NSWGP   + G+FK+ 
Sbjct:   400 NTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLY 459

Query:   940 RGNNACGIEQIAGYATID 957
             RG N CG++++A  A ++
Sbjct:   460 RGKNVCGVQEMATSALVN 477

 Score = 397 (144.8 bits), Expect = 8.7e-36, P = 8.7e-36
 Identities = 85/225 (37%), Positives = 125/225 (55%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E+D  +P+++DWR+K       +Q +CGSCWAFS  G +EG + I   KLV  S+ +LV+
Sbjct:   261 EED--LPESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVD 318

Query:    66 CAKQCSGCDGCFFEPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGK 121
             C     GC+G    PS  Y       GLE E  YPY +  GE   C   +  + ++  G 
Sbjct:   319 CDSMDQGCNGGL--PSNAYKEIIRMGGLEPEDAYPY-DGRGET--CHLVRKDIAVYINGS 373

Query:   122 DFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
               L  +  E M+K L   GP+S+ LN++ +  Y    +      C P+ L H VL+VGYG
Sbjct:   374 VELPHDEVE-MQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYG 432

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             K    PYW+V+NSWGP   + G+FK+ RG N CG++++A  A ++
Sbjct:   433 KDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQEMATSALVN 477

 Score = 188 (71.2 bits), Expect = 4.4e-11, P = 4.4e-11
 Identities = 30/61 (49%), Positives = 43/61 (70%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             C P+ L H VL+VGYGK    PYW+V+NSWGP   + G+FK+ RG N CG++++A  A +
Sbjct:   417 CEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQEMATSALV 476

Query:  1025 D 1025
             +
Sbjct:   477 N 477


>FB|FBgn0260462
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 453 (164.5 bits), Expect = 8.5e-42, P = 8.5e-42
 Identities = 111/329 (33%), Positives = 175/329 (53%)

Query:   283 FDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFS 333
             FD  + L  F  F V+ GR+Y +  E + R   F+Q+     E         +YG +EF+
Sbjct:   301 FDKVDHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFA 358

Query:   334 DRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGD 393
             D +  E   +TG  W     +R  A +             G +P  +DWR+K+      +
Sbjct:   359 DMTSSEYKERTGL-W-----QRDEA-KATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKN 411

Query:   394 QAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG--CDGLEQPIEYT 451
             Q +CGSCWAFS+ G +EG YA+KTG+L EFS+ +L++C    S C G   D   + I+  
Sbjct:   412 QGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDI 471

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDK--SKVKLFTGKDFLYFNGSET-MKKILYKYGPL 508
                GLE E +YPY+    +K +C +++  S V++    D     G+ET M++ L   GP+
Sbjct:   472 --GGLEYEAEYPYK---AKKNQCHFNRTLSHVQVAGFVDLP--KGNETAMQEWLLANGPI 524

Query:   509 SVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD------DIPYWLARNSWG 562
             S+G+N++ + FY G         CS  +L H VL+VGYG  D       +PYW+ +NSWG
Sbjct:   525 SIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWG 584

Query:   563 PIGPDEGFFKIERGNNACGIEQIAGYATI 591
             P   ++G++++ RG+N CG+ ++A  A +
Sbjct:   585 PRWGEQGYYRVYRGDNTCGVSEMATSAVL 613

 Score = 447 (162.4 bits), Expect = 3.0e-40, P = 3.0e-40
 Identities = 107/326 (32%), Positives = 173/326 (53%)

Query:   649 FDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFS 699
             FD  + L  F  F V+ GR+Y +  E + R   F+Q+     E         +YG +EF+
Sbjct:   301 FDKVDHL--FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFA 358

Query:   700 DRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGD 759
             D +  E   +TG  W     +R  A +             G +P  +DWR+K+      +
Sbjct:   359 DMTSSEYKERTGL-W-----QRDEA-KATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKN 411

Query:   760 QAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQ- 818
             Q +CGSCWAFS+ G +EG YA+KTG+L EFS+ +L++C    S C+G   + + +     
Sbjct:   412 QGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDI 471

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGSET-MKKILYKYGPLSVL 876
              GLE E +YPYK    +K +C ++++   +     F+    G+ET M++ L   GP+S+ 
Sbjct:   472 GGLEYEAEYPYK---AKKNQCHFNRTLSHVQVA-GFVDLPKGNETAMQEWLLANGPISIG 527

Query:   877 LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD------NIPYWLVRNSWGPIG 930
             +N++ +  Y G         CS  +L H VL+VGYG  D       +PYW+V+NSWGP  
Sbjct:   528 INANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRW 587

Query:   931 PDEGFFKIERGNNACGIEQIAGYATI 956
              ++G++++ RG+N CG+ ++A  A +
Sbjct:   588 GEQGYYRVYRGDNTCGVSEMATSAVL 613

 Score = 409 (149.0 bits), Expect = 2.7e-35, P = 2.7e-35
 Identities = 82/226 (36%), Positives = 133/226 (58%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 68
             G +P  +DWR+K+      +Q  CGSCWAFS+ G +EG YA+KTG+L EFS+ +L++C  
Sbjct:   392 GELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDT 451

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF- 126
               S C+G   + + +      GLE E +YPYK    +K +C ++++   +     F+   
Sbjct:   452 TDSACNGGLMDNAYKAIKDIGGLEYEAEYPYK---AKKNQCHFNRTLSHVQVA-GFVDLP 507

Query:   127 NGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD- 184
              G+ET M++ L   GP+S+ +N++ +  Y G         CS  +L H VL+VGYG  D 
Sbjct:   508 KGNETAMQEWLLANGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDY 567

Query:   185 -----NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
                   +PYW+V+NSWGP   ++G++++ RG+N CG+ ++A  A +
Sbjct:   568 PNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMATSAVL 613

 Score = 164 (62.8 bits), Expect = 2.8e-08, P = 2.8e-08
 Identities = 28/66 (42%), Positives = 44/66 (66%)

Query:   965 CSPYDLGHAVLLVGYGKQD------DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             CS  +L H VL+VGYG  D       +PYW+V+NSWGP   ++G++++ RG+N CG+ ++
Sbjct:   548 CSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEM 607

Query:  1019 AGYATI 1024
             A  A +
Sbjct:   608 ATSAVL 613


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 318 (117.0 bits), Expect = 1.7e-41, Sum P(3) = 1.7e-41
 Identities = 88/270 (32%), Positives = 130/270 (48%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK----KHERYGTSEF-----SDRSPEEI 706
             E F+ F ++  R Y N  E   R + F Q+  K    + E  GT+EF     SD + EE 
Sbjct:    40 EVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEF 99

Query:   707 LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSC 766
             +   G   S+   E +   R                P   DWRK     P  DQ  C  C
Sbjct:   100 VQLYG---SQVAGEALGVSRKVGSEEWGESE-----PQTCDWRKVGTISPVRDQRNCNCC 151

Query:   767 WAFSIAGMLEGQYAIKTGKLVEFS-KSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESE 824
             WA + AG +E  +AIK    VE S + +L++C +  +GC G F ++  +   + +GL SE
Sbjct:   152 WAMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCGNGCRGGFVWDAFLTVLNNSGLASE 211

Query:   825 KDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIH 883
             KDYP+ N +G+  +C   K K K+   +DF+     E +M + L   GP++V +N  L+ 
Sbjct:   212 KDYPF-NGSGKTHRCLAKKYK-KVAWIQDFIILQACEQSMARHLATEGPITVTINMTLLQ 269

Query:   884 DYNGTPIRKNDETCSPYDLGHAVLLVGYGK 913
              Y    I+    TC P  + H+VLLVG+GK
Sbjct:   270 QYQKGVIKATPTTCDPTQVDHSVLLVGFGK 299

 Score = 313 (115.2 bits), Expect = 6.0e-41, Sum P(3) = 6.0e-41
 Identities = 88/270 (32%), Positives = 129/270 (47%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK----KHERYGTSEF-----SDRSPEEI 340
             E F+ F ++  R Y N  E   R + F Q+  K    + E  GT+EF     SD + EE 
Sbjct:    40 EVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEF 99

Query:   341 LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSC 400
             +   G   S+   E +   R                P   DWRK     P  DQ  C  C
Sbjct:   100 VQLYG---SQVAGEALGVSRKVGSEEWGESE-----PQTCDWRKVGTISPVRDQRNCNCC 151

Query:   401 WAFSIAGMLEGQYAIKTGKLVEFS-KSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESE 459
             WA + AG +E  +AIK    VE S + +L++C +  +GC G    +  +   + +GL SE
Sbjct:   152 WAMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCGNGCRGGFVWDAFLTVLNNSGLASE 211

Query:   460 KDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLIH 518
             KDYP+ NG+G+  +C   K K K+   +DF+     E +M + L   GP++V +N  L+ 
Sbjct:   212 KDYPF-NGSGKTHRCLAKKYK-KVAWIQDFIILQACEQSMARHLATEGPITVTINMTLLQ 269

Query:   519 FYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
              Y    I+    TC P  + H+VLLVG+GK
Sbjct:   270 QYQKGVIKATPTTCDPTQVDHSVLLVGFGK 299

 Score = 292 (107.8 bits), Expect = 2.6e-34, Sum P(2) = 2.6e-34
 Identities = 64/174 (36%), Positives = 96/174 (55%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFS-KSQLVECAKQC 70
             P   DWRK     P  DQ +C  CWA + AG +E  +AIK    VE S + +L++C +  
Sbjct:   128 PQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCG 187

Query:    71 SGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
             +GC G F ++  +   + +GL SEKDYP+ N +G+  +C   K K K+   +DF+     
Sbjct:   188 NGCRGGFVWDAFLTVLNNSGLASEKDYPF-NGSGKTHRCLAKKYK-KVAWIQDFIILQAC 245

Query:   130 E-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
             E +M + L   GP++V +N  L+  Y    I+    TC P  + H+VLLVG+GK
Sbjct:   246 EQSMARHLATEGPITVTINMTLLQQYQKGVIKATPTTCDPTQVDHSVLLVGFGK 299

 Score = 126 (49.4 bits), Expect = 9.9e-38, Sum P(2) = 9.9e-38
 Identities = 21/54 (38%), Positives = 34/54 (62%)

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE--TCSP 967
             ++ YW+++NSWGP   +EG+F++ RG+N CGI +    A +D  K     +C P
Sbjct:   322 SMAYWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVDKPKKQHQVSCPP 375

 Score = 120 (47.3 bits), Expect = 2.6e-34, Sum P(2) = 2.6e-34
 Identities = 18/42 (42%), Positives = 30/42 (71%)

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             ++ YW+++NSWGP   +EG+F++ RG+N CGI +    A +D
Sbjct:   322 SMAYWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVD 363

 Score = 119 (46.9 bits), Expect = 1.7e-41, Sum P(3) = 1.7e-41
 Identities = 18/39 (46%), Positives = 28/39 (71%)

Query:   987 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 1025
             YW+++NSWGP   +EG+F++ RG+N CGI +    A +D
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVD 363

 Score = 117 (46.2 bits), Expect = 3.0e-36, Sum P(2) = 3.0e-36
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query:   554 YWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 592
             YW+ +NSWGP   +EG+F++ RG+N CGI +    A +D
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVD 363

 Score = 65 (27.9 bits), Expect = 1.7e-41, Sum P(3) = 1.7e-41
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query:   958 VVK-NDETCSPYDLGHAVLLVGYGK 981
             V+K    TC P  + H+VLLVG+GK
Sbjct:   275 VIKATPTTCDPTQVDHSVLLVGFGK 299


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 305 (112.4 bits), Expect = 6.4e-41, Sum P(3) = 6.4e-41
 Identities = 81/269 (30%), Positives = 126/269 (46%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 711
             + F  F ++  R Y+N EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 QVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGV-TPFSDLTEEE 98

Query:   712 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 770
             F      ++R+  +               PVP   DWRK   +  P   Q  C  CWA +
Sbjct:    99 FG-QFYGHQRMAGEAPSVGRKVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMA 157

Query:   771 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPY 829
              AG +E  + I+  + VE S  +L++C +   GC G F ++  I   + +GL S KDYP+
Sbjct:   158 AAGNIEALWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPF 217

Query:   830 KNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGT 888
                N +  +C   K K K+   +DF+   G+E  +   L   GP++V +N  L+  Y   
Sbjct:   218 LG-NTKPHRCLAKKYK-KVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKG 275

Query:   889 PIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
              I+    TC P  + H+VLLVG+GK  ++
Sbjct:   276 VIQATHTTCDPQRVDHSVLLVGFGKSKSV 304

 Score = 302 (111.4 bits), Expect = 1.3e-40, Sum P(3) = 1.3e-40
 Identities = 81/269 (30%), Positives = 124/269 (46%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 345
             + F  F ++  R Y+N EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 QVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGV-TPFSDLTEEE 98

Query:   346 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 404
             F      ++R+  +               PVP   DWRK   +  P   Q  C  CWA +
Sbjct:    99 FG-QFYGHQRMAGEAPSVGRKVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMA 157

Query:   405 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPY 464
              AG +E  + I+  + VE S  +L++C +   GC G    +  I   + +GL S KDYP+
Sbjct:   158 AAGNIEALWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPF 217

Query:   465 RNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLIHFYNGT 523
               GN +  +C   K K K+   +DF+   G+E  +   L   GP++V +N  L+  Y   
Sbjct:   218 L-GNTKPHRCLAKKYK-KVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKG 275

Query:   524 PIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
              I+    TC P  + H+VLLVG+GK   +
Sbjct:   276 VIQATHTTCDPQRVDHSVLLVGFGKSKSV 304

 Score = 279 (103.3 bits), Expect = 2.7e-34, Sum P(2) = 2.7e-34
 Identities = 65/188 (34%), Positives = 100/188 (53%)

Query:     3 MEVEKDG-PVPDAWDWRK-KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSK 60
             +E E+ G PVP   DWRK   +  P   Q +C  CWA + AG +E  + I+  + VE S 
Sbjct:   119 VESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVSV 178

Query:    61 SQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT 119
              +L++C +   GC G F ++  I   + +GL S KDYP+   N +  +C   K K K+  
Sbjct:   179 QELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPFLG-NTKPHRCLAKKYK-KVAW 236

Query:   120 GKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLV 178
              +DF+   G+E  +   L   GP++V +N  L+  Y    I+    TC P  + H+VLLV
Sbjct:   237 IQDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQATHTTCDPQRVDHSVLLV 296

Query:   179 GYGKQDNI 186
             G+GK  ++
Sbjct:   297 GFGKSKSV 304

 Score = 133 (51.9 bits), Expect = 2.7e-34, Sum P(2) = 2.7e-34
 Identities = 24/49 (48%), Positives = 35/49 (71%)

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV-VIQRLV 233
             IPYW+++NSWG    +EG+F++ RGNN CGI +    A +D+ V +RLV
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTARVDLRVKKRLV 370

 Score = 132 (51.5 bits), Expect = 5.7e-37, Sum P(2) = 5.7e-37
 Identities = 24/53 (45%), Positives = 34/53 (64%)

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID--VVKNDETCSP 967
             IPYW+++NSWG    +EG+F++ RGNN CGI +    A +D  V K   +C P
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTARVDLRVKKRLVSCPP 374

 Score = 131 (51.2 bits), Expect = 1.5e-36, Sum P(2) = 1.5e-36
 Identities = 24/49 (48%), Positives = 34/49 (69%)

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV-VIQRLV 599
             IPYW+ +NSWG    +EG+F++ RGNN CGI +    A +D+ V +RLV
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTARVDLRVKKRLV 370

 Score = 127 (49.8 bits), Expect = 6.4e-41, Sum P(3) = 6.4e-41
 Identities = 20/42 (47%), Positives = 30/42 (71%)

Query:   985 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV 1026
             IPYW+++NSWG    +EG+F++ RGNN CGI +    A +D+
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKYPVTARVDL 363

 Score = 65 (27.9 bits), Expect = 6.4e-41, Sum P(3) = 6.4e-41
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query:   964 TCSPYDLGHAVLLVGYGKQDDI 985
             TC P  + H+VLLVG+GK   +
Sbjct:   283 TCDPQRVDHSVLLVGFGKSKSV 304


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 373 (136.4 bits), Expect = 3.3e-33, P = 3.3e-33
 Identities = 100/334 (29%), Positives = 162/334 (48%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 345
             E FK F ++  R Y+N  E   R   F     Q    + E  GT+EF  ++P   L +  
Sbjct:    38 EVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFG-QTPFSDLTEEE 96

Query:   346 FKW---SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCW 401
             F      +R  ERI+                  VP   DWRK KN+     +Q  C  CW
Sbjct:    97 FGQLYGHQRAPERIL----NMAKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRCCW 152

Query:   402 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKD 461
             A + A  ++  + IKT + V+ S  +L++C +  +GC G    +  I   + +GL SE+D
Sbjct:   153 AIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASEED 212

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLIHFY 520
             YP++ G+ +  +C  DK + K+   +DF   + +E  +   L  +GP++V +N  L+ +Y
Sbjct:   213 YPFQ-GHQKPHRCLADKYR-KVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYY 270

Query:   521 NGTPIRKNDETCSPYDLGHAVLLVGYGKQD-----------------DIPYWLARNSWGP 563
                 I+    TC P+ + H+VLLVG+GK+                    PYW+ +NSWG 
Sbjct:   271 QKGVIKATPSTCDPHLVNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGA 330

Query:   564 IGPDEGFFKIERGNNACGIEQIAGYATIDVVIQR 597
                ++G+F++ RGNN CGI +    A +D  +++
Sbjct:   331 EWGEKGYFRLYRGNNTCGIAKYPITARVDRPVKK 364

 Score = 298 (110.0 bits), Expect = 2.4e-39, Sum P(3) = 2.4e-39
 Identities = 82/269 (30%), Positives = 133/269 (49%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 711
             E FK F ++  R Y+N  E   R   F     Q    + E  GT+EF  ++P   L +  
Sbjct:    38 EVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFG-QTPFSDLTEEE 96

Query:   712 FKW---SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCW 767
             F      +R  ERI+                  VP   DWRK KN+     +Q  C  CW
Sbjct:    97 FGQLYGHQRAPERIL----NMAKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRCCW 152

Query:   768 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKD 826
             A + A  ++  + IKT + V+ S  +L++C +  +GC+G F ++  I   + +GL SE+D
Sbjct:   153 AIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASEED 212

Query:   827 YPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDY 885
             YP++  + +  +C  DK + K+   +DF   + +E  +   L  +GP++V +N  L+  Y
Sbjct:   213 YPFQG-HQKPHRCLADKYR-KVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYY 270

Query:   886 NGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
                 I+    TC P+ + H+VLLVG+GK+
Sbjct:   271 QKGVIKATPSTCDPHLVNHSVLLVGFGKE 299

 Score = 272 (100.8 bits), Expect = 4.6e-31, Sum P(2) = 4.6e-31
 Identities = 59/176 (33%), Positives = 101/176 (57%)

Query:    11 VPDAWDWRK-KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             VP   DWRK KN+     +Q +C  CWA + A  ++  + IKT + V+ S  +L++C + 
Sbjct:   126 VPPTCDWRKVKNIISSIKNQGNCRCCWAIAAADNIQTLWRIKTQQFVDVSVQELLDCDRC 185

Query:    70 CSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              +GC+G F ++  I   + +GL SE+DYP++  + +  +C  DK + K+   +DF   + 
Sbjct:   186 GNGCNGGFVWDAYITVLNNSGLASEEDYPFQG-HQKPHRCLADKYR-KVAWIQDFTMLSS 243

Query:   129 SE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
             +E  +   L  +GP++V +N  L+  Y    I+    TC P+ + H+VLLVG+GK+
Sbjct:   244 NEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKE 299

 Score = 134 (52.2 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 49/190 (25%), Positives = 88/190 (46%)

Query:   797 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN------GEKF-KCAYDKSKVKLF 849
             C   C+G  G  ++  I   + +GL SE+DYP++          +K+ K A+ +    L 
Sbjct:   185 CGNGCNG--GFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRKVAWIQDFTMLS 242

Query:   850 TGKD----FLHFNGSETMK---KIL--YKYGPLSVLLNSDLIHDYNGTPIRKN-DETCSP 899
             + +     +L  +G  T+    K+L  Y+ G +    ++   H  N + +     +    
Sbjct:   243 SNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKGG 302

Query:   900 YDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID-- 957
                G  +L      + + PYW+++NSWG    ++G+F++ RGNN CGI +    A +D  
Sbjct:   303 MQTG-TLLSHSRKPRRSTPYWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARVDRP 361

Query:   958 VVKNDETCSP 967
             V K   +C P
Sbjct:   362 VKKAPVSCPP 371

 Score = 128 (50.1 bits), Expect = 9.9e-05, P = 9.9e-05
 Identities = 45/183 (24%), Positives = 86/183 (46%)

Query:    66 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN------GEKF-KCAYDKSKVKLF 118
             C   C+G  G  ++  I   + +GL SE+DYP++          +K+ K A+ +    L 
Sbjct:   185 CGNGCNG--GFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRKVAWIQDFTMLS 242

Query:   119 TGKD----FLHFNGSETMK---KIL--YKYGPLSVLLNSDLIHDYNGTPIRKN-DETCSP 168
             + +     +L  +G  T+    K+L  Y+ G +    ++   H  N + +     +    
Sbjct:   243 SNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKGG 302

Query:   169 YDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVV 228
                G  +L      + + PYW+++NSWG    ++G+F++ RGNN CGI +    A +D  
Sbjct:   303 MQTG-TLLSHSRKPRRSTPYWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARVDRP 361

Query:   229 IQR 231
             +++
Sbjct:   362 VKK 364

 Score = 115 (45.5 bits), Expect = 2.4e-39, Sum P(3) = 2.4e-39
 Identities = 18/40 (45%), Positives = 28/40 (70%)

Query:   986 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 1025
             PYW+++NSWG    ++G+F++ RGNN CGI +    A +D
Sbjct:   320 PYWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARVD 359

 Score = 69 (29.3 bits), Expect = 2.4e-39, Sum P(3) = 2.4e-39
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query:   958 VVK-NDETCSPYDLGHAVLLVGYGKQ 982
             V+K    TC P+ + H+VLLVG+GK+
Sbjct:   274 VIKATPSTCDPHLVNHSVLLVGFGKE 299


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 430 (156.4 bits), Expect = 2.5e-39, P = 2.5e-39
 Identities = 96/320 (30%), Positives = 156/320 (48%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG---------HKKHERYGTSEFSD 334
             ++  +L  FK F++   R Y++ EE ++R   F+Q+           +    YG ++FSD
Sbjct:   167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query:   335 RSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ 394
              + +E      F+        +++                P PD WDWR      P  +Q
Sbjct:   227 LTEDE------FRMMY--LNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQ 278

Query:   395 AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGC--DGLEQPIEYTH 452
               CGSCWAFS+ G +EGQ+  KTG+L+  S+ +LV+C K    CGG       + IE  +
Sbjct:   279 GMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIE--N 336

Query:   453 QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGL 512
               GLE+E DY Y    G K  C +   KV  +           + +   L + GP+S  L
Sbjct:   337 LGGLETETDYSY---TGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAAL 393

Query:   513 NSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
             N+  + FY           C+P+ + HAVLLVG+G+++ +P+W  +NSWG    ++G++ 
Sbjct:   394 NAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYY 453

Query:   573 IERGNNACGIEQIAGYATID 592
             + RG+  CGI ++   A ++
Sbjct:   454 LYRGSGLCGIHKMCSSAIVN 473

 Score = 418 (152.2 bits), Expect = 4.9e-38, P = 4.9e-38
 Identities = 95/320 (29%), Positives = 155/320 (48%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG---------HKKHERYGTSEFSD 700
             ++  +L  FK F++   R Y++ EE ++R   F+Q+           +    YG ++FSD
Sbjct:   167 ESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSD 226

Query:   701 RSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ 760
              + +E      F+        +++                P PD WDWR      P  +Q
Sbjct:   227 LTEDE------FRMMY--LNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQ 278

Query:   761 AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYT---H 817
               CGSCWAFS+ G +EGQ+  KTG+L+  S+ +LV+C K    C G    PS  Y    +
Sbjct:   279 GMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGL--PSNAYEAIEN 336

Query:   818 QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLL 877
               GLE+E DY Y    G K  C +   KV  +           + +   L + GP+S  L
Sbjct:   337 LGGLETETDYSY---TGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAAL 393

Query:   878 NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 937
             N+  +  Y           C+P+ + HAVLLVG+G+++ +P+W ++NSWG    ++G++ 
Sbjct:   394 NAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYY 453

Query:   938 IERGNNACGIEQIAGYATID 957
             + RG+  CGI ++   A ++
Sbjct:   454 LYRGSGLCGIHKMCSSAIVN 473

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 77/225 (34%), Positives = 118/225 (52%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             +    P PD WDWR      P  +Q  CGSCWAFS+ G +EGQ+  KTG+L+  S+ +LV
Sbjct:   254 IPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELV 313

Query:    65 ECAKQCSGCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGK 121
             +C K    C G    PS  Y    +  GLE+E DY Y    G K  C +   KV  +   
Sbjct:   314 DCDKLDQACGGGL--PSNAYEAIENLGGLETETDYSY---TGHKQSCDFSTGKVAAYINS 368

Query:   122 DFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
                     + +   L + GP+S  LN+  +  Y           C+P+ + HAVLLVG+G
Sbjct:   369 SVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFG 428

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             +++ +P+W ++NSWG    ++G++ + RG+  CGI ++   A ++
Sbjct:   429 QRNGVPFWAIKNSWGEDYGEQGYYYLYRGSGLCGIHKMCSSAIVN 473

 Score = 159 (61.0 bits), Expect = 6.3e-08, P = 6.3e-08
 Identities = 23/61 (37%), Positives = 44/61 (72%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             C+P+ + HAVLLVG+G+++ +P+W ++NSWG    ++G++ + RG+  CGI ++   A +
Sbjct:   413 CNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYYLYRGSGLCGIHKMCSSAIV 472

Query:  1025 D 1025
             +
Sbjct:   473 N 473


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 424 (154.3 bits), Expect = 1.1e-38, P = 1.1e-38
 Identities = 102/314 (32%), Positives = 155/314 (49%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 342
             FK F+    R Y   EE + R   F  +  +  +         RYG ++FSD + EE   
Sbjct:   163 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEF-- 220

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
             +T +         ++ +               P P+ WDWRKK       DQ  CGSCWA
Sbjct:   221 RTIY------LNPLLQEEPGRKMRLAKSVSSLPPPE-WDWRKKGAVTKVKDQGMCGSCWA 273

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGCDGLEQPIEYTHQAGLESEKD 461
             FS+ G +EGQ+ +K G L+  S+ +L++C K   GC GG          T   GLE+E+D
Sbjct:   274 FSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKT-LGGLETEED 332

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFY- 520
             Y YR   G    C+++  K K++           + +   L + GP+SV +N+  + FY 
Sbjct:   333 YSYR---GHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFGMQFYR 389

Query:   521 NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNN 578
             +G   P+R     CSP+ + HAVLLVGYG +   P+W  +NSWG    +EG++ + RG+ 
Sbjct:   390 HGISHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRGSG 446

Query:   579 ACGIEQIAGYATID 592
             ACG+  +A  A ++
Sbjct:   447 ACGVNIMASSAVVN 460

 Score = 415 (151.1 bits), Expect = 2.6e-39, Sum P(2) = 2.6e-39
 Identities = 101/315 (32%), Positives = 157/315 (49%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 708
             FK F+    R Y   EE + R   F  +  +  +         RYG ++FSD + EE   
Sbjct:   163 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEF-- 220

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
             +T +         ++ +               P P+ WDWRKK       DQ  CGSCWA
Sbjct:   221 RTIY------LNPLLQEEPGRKMRLAKSVSSLPPPE-WDWRKKGAVTKVKDQGMCGSCWA 273

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH---QAGLESEK 825
             FS+ G +EGQ+ +K G L+  S+ +L++C K   GC G    PS  Y+      GLE+E+
Sbjct:   274 FSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGL--PSNAYSAIKTLGGLETEE 331

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDY 885
             DY Y+   G    C+++  K K++           + +   L + GP+SV +N+  +  Y
Sbjct:   332 DYSYR---GHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFGMQFY 388

Query:   886 -NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGN 942
              +G   P+R     CSP+ + HAVLLVGYG +   P+W ++NSWG    +EG++ + RG+
Sbjct:   389 RHGISHPLRP---LCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRGS 445

Query:   943 NACGIEQIAGYATID 957
              ACG+  +A  A ++
Sbjct:   446 GACGVNIMASSAVVN 460

 Score = 393 (143.4 bits), Expect = 2.3e-35, P = 2.3e-35
 Identities = 82/221 (37%), Positives = 124/221 (56%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P  WDWRKK       DQ  CGSCWAFS+ G +EGQ+ +K G L+  S+ +L++C K   
Sbjct:   248 PPEWDWRKKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDK 307

Query:    72 GCDGCFFEPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
             GC G    PS  Y+      GLE+E+DY Y+   G    C+++  K K++          
Sbjct:   308 GCMGGL--PSNAYSAIKTLGGLETEEDYSYR---GHLQTCSFNAEKAKVYINDSVELSQN 362

Query:   129 SETMKKILYKYGPLSVLLNSDLIHDY-NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDN 185
              + +   L + GP+SV +N+  +  Y +G   P+R     CSP+ + HAVLLVGYG +  
Sbjct:   363 EQKLAAWLAEKGPISVAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSA 419

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
              P+W ++NSWG    +EG++ + RG+ ACG+  +A  A ++
Sbjct:   420 TPFWAIKNSWGTDWGEEGYYYLYRGSGACGVNIMASSAVVN 460

 Score = 167 (63.8 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 27/61 (44%), Positives = 42/61 (68%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG +   P+W ++NSWG    +EG++ + RG+ ACG+  +A  A +
Sbjct:   400 CSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRGSGACGVNIMASSAVV 459

Query:  1025 D 1025
             +
Sbjct:   460 N 460

 Score = 42 (19.8 bits), Expect = 2.6e-39, Sum P(2) = 2.6e-39
 Identities = 19/78 (24%), Positives = 33/78 (42%)

Query:   255 LCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQ-YANDEEIKERF 313
             LC   + D +   ++ R D   ++  +T D+ N  ETF +F+    +     D  +K   
Sbjct:   105 LCSFEVLDELGKHMLLRRDCGPVDTKVT-DDTN--ETFSSFLPLLNKDPLPQDFSVKMA- 160

Query:   314 EYFKQDGHKKHERYGTSE 331
               FK+     +  Y T E
Sbjct:   161 SIFKEFVTTYNRTYDTKE 178

 Score = 42 (19.8 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 19/78 (24%), Positives = 33/78 (42%)

Query:   621 LCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQ-YANDEEIKERF 679
             LC   + D +   ++ R D   ++  +T D+ N  ETF +F+    +     D  +K   
Sbjct:   105 LCSFEVLDELGKHMLLRRDCGPVDTKVT-DDTN--ETFSSFLPLLNKDPLPQDFSVKMA- 160

Query:   680 EYFKQDGHKKHERYGTSE 697
               FK+     +  Y T E
Sbjct:   161 SIFKEFVTTYNRTYDTKE 178


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 425 (154.7 bits), Expect = 8.6e-39, P = 8.6e-39
 Identities = 115/387 (29%), Positives = 180/387 (46%)

Query:   226 DVVIQRLVLEKKAIMLIQAVFLLCGVASCL---CLPSLTDRITD---QVVARVDTLAIEG 279
             D  + +L + KK ++    V    G    L   C P  T ++TD   + ++ V  L  + 
Sbjct:    90 DPTVCQLPVSKKTLLCSFEVLDELGKHMLLRRDCGPVDT-KVTDDRNETLSSVLPLLNKD 148

Query:   280 SLTFD-NENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGT 329
              L  D +  +   FK F+    R Y   EE + R   F  +  +  +         +YG 
Sbjct:   149 PLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGI 208

Query:   330 SEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTG 389
             ++FSD + EE   +T +         ++ +             D   P  WDWR K    
Sbjct:   209 TKFSDLTEEEF--RTIY------LNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVT 260

Query:   390 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGCDGLEQPI 448
                DQ  CGSCWAFS+ G +EGQ+ +K G L+  S+ +L++C K    C GG        
Sbjct:   261 KVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYSA 320

Query:   449 EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPL 508
               T   GLE+E DY Y+   G    C++   K +++           + +   L K GP+
Sbjct:   321 IMT-LGGLETEDDYSYQ---GHLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPI 376

Query:   509 SVGLNSHLIHFY-NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIG 565
             SV +N+  + FY +G   P+R     CSP+ + HAVLLVGYG +  IP+W  +NSWG   
Sbjct:   377 SVAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDW 433

Query:   566 PDEGFFKIERGNNACGIEQIAGYATID 592
              +EG++ + RG+ ACG+  +A  A ++
Sbjct:   434 GEEGYYYLHRGSGACGVNTMASSAVVN 460

 Score = 418 (152.2 bits), Expect = 4.9e-38, P = 4.9e-38
 Identities = 115/388 (29%), Positives = 182/388 (46%)

Query:   592 DVVIQRLVLEKKAIMLIQAVFLLCGVASCL---CLPSLTDRITD---QVVARVDTLAIEG 645
             D  + +L + KK ++    V    G    L   C P  T ++TD   + ++ V  L  + 
Sbjct:    90 DPTVCQLPVSKKTLLCSFEVLDELGKHMLLRRDCGPVDT-KVTDDRNETLSSVLPLLNKD 148

Query:   646 SLTFD-NENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGT 695
              L  D +  +   FK F+    R Y   EE + R   F  +  +  +         +YG 
Sbjct:   149 PLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGI 208

Query:   696 SEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTG 755
             ++FSD + EE   +T +         ++ +             D   P  WDWR K    
Sbjct:   209 TKFSDLTEEEF--RTIY------LNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVT 260

Query:   756 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEY 815
                DQ  CGSCWAFS+ G +EGQ+ +K G L+  S+ +L++C K    C G    PS  Y
Sbjct:   261 KVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGL--PSNAY 318

Query:   816 TH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGP 872
             +      GLE+E DY Y+   G    C++   K +++           + +   L K GP
Sbjct:   319 SAIMTLGGLETEDDYSYQ---GHLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGP 375

Query:   873 LSVLLNSDLIHDY-NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPI 929
             +SV +N+  +  Y +G   P+R     CSP+ + HAVLLVGYG +  IP+W ++NSWG  
Sbjct:   376 ISVAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTD 432

Query:   930 GPDEGFFKIERGNNACGIEQIAGYATID 957
               +EG++ + RG+ ACG+  +A  A ++
Sbjct:   433 WGEEGYYYLHRGSGACGVNTMASSAVVN 460

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 82/225 (36%), Positives = 122/225 (54%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             D   P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +K G L+  S+ +L++C 
Sbjct:   244 DHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCD 303

Query:    68 KQCSGCDGCFFEPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             K    C G    PS  Y+      GLE+E DY Y+   G    C++   K +++      
Sbjct:   304 KVDKACLGGL--PSNAYSAIMTLGGLETEDDYSYQ---GHLQACSFSAKKARVYINDSME 358

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY-NGT--PIRKNDETCSPYDLGHAVLLVGYG 181
                  + +   L K GP+SV +N+  +  Y +G   P+R     CSP+ + HAVLLVGYG
Sbjct:   359 LSQNEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYG 415

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
              +  IP+W ++NSWG    +EG++ + RG+ ACG+  +A  A ++
Sbjct:   416 NRSGIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNTMASSAVVN 460

 Score = 177 (67.4 bits), Expect = 6.5e-10, P = 6.5e-10
 Identities = 28/61 (45%), Positives = 43/61 (70%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG +  IP+W ++NSWG    +EG++ + RG+ ACG+  +A  A +
Sbjct:   400 CSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNTMASSAVV 459

Query:  1025 D 1025
             +
Sbjct:   460 N 460


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 425 (154.7 bits), Expect = 8.6e-39, P = 8.6e-39
 Identities = 110/314 (35%), Positives = 158/314 (50%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 709
             FK+++V+  ++Y++ EE + R   F     K + H    H  + G ++FSD S  EI  K
Sbjct:    37 FKSWMVQHQKKYSS-EEYQHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEI--K 93

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN-VTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P   DWRKK     P  +Q  CGSCW 
Sbjct:    94 RKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGKFVSPVKNQGGCGSCWT 145

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AIKTGKL+  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   146 FSTTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGED 205

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNSDL 881
              YPYK  +G+   C +  SK   F  KD  +   N  + M + +  + P+S    +  D 
Sbjct:   206 SYPYKGQDGD---CKFQPSKAIAFV-KDVANITINDEQAMVEAVALFNPVSFAFEVTGDF 261

Query:   882 IHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIE 939
             +    G     +  +C  +P  + HAVL VGYG+Q+ +PYW+V+NSWGP     G+F IE
Sbjct:   262 MMYRKGV---YSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIE 318

Query:   940 RGNNACGIEQIAGY 953
             RG N CG+   A Y
Sbjct:   319 RGKNMCGLAACASY 332

 Score = 415 (151.1 bits), Expect = 1.0e-37, P = 1.0e-37
 Identities = 110/316 (34%), Positives = 159/316 (50%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 343
             FK+++V+  ++Y++ EE + R   F     K + H    H  + G ++FSD S  EI  K
Sbjct:    37 FKSWMVQHQKKYSS-EEYQHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEI--K 93

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN-VTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P   DWRKK     P  +Q  CGSCW 
Sbjct:    94 RKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGKFVSPVKNQGGCGSCWT 145

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AIKTGKL+  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  
Sbjct:   146 FSTTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIRYNRGIMG 203

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNS 514
             E  YPY+  +G+   C +  SK   F  KD   +  N  + M + +  + P+S    +  
Sbjct:   204 EDSYPYKGQDGD---CKFQPSKAIAFV-KDVANITINDEQAMVEAVALFNPVSFAFEVTG 259

Query:   515 HLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
               + +  G     +  +C  +P  + HAVL VGYG+Q+ +PYW+ +NSWGP     G+F 
Sbjct:   260 DFMMYRKGV---YSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFL 316

Query:   573 IERGNNACGIEQIAGY 588
             IERG N CG+   A Y
Sbjct:   317 IERGKNMCGLAACASY 332

 Score = 394 (143.8 bits), Expect = 1.8e-35, P = 1.8e-35
 Identities = 87/224 (38%), Positives = 119/224 (53%)

Query:     9 GPVPDAWDWRKKN-VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P   DWRKK     P  +Q  CGSCW FS  G LE   AIKTGKL+  ++ QLV+CA
Sbjct:   116 GPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQQLVDCA 175

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPYK  +G+   C +  SK   F  KD  
Sbjct:   176 QDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDGD---CKFQPSKAIAFV-KDVA 231

Query:   125 HF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +   N  + M + +  + P+S    +  D +    G     +  +C  +P  + HAVL V
Sbjct:   232 NITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGV---YSSTSCHKTPDKVNHAVLAV 288

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+Q+ +PYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   289 GYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLAACASY 332

 Score = 177 (67.4 bits), Expect = 2.9e-10, P = 2.9e-10
 Identities = 32/65 (49%), Positives = 43/65 (66%)

Query:   959 VKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             V +  +C  +P  + HAVL VGYG+Q+ +PYW+V+NSWGP     G+F IERG N CG+ 
Sbjct:   268 VYSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMCGLA 327

Query:  1017 QIAGY 1021
               A Y
Sbjct:   328 ACASY 332


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 415 (151.1 bits), Expect = 1.0e-37, P = 1.0e-37
 Identities = 110/321 (34%), Positives = 164/321 (51%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYFKQD-----GH--KKHE-RYGTSEFSDRS 702
             N LE F  ++++V+  ++Y++ EE   R + F  +      H  + H  + G ++FSD S
Sbjct:    28 NSLEKFHFQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQFSDMS 86

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
              +E+  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FDEL--KRKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVTPVKNQG 136

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             +CGSCW FS  G LE   AI TGKL   ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   137 SCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYN 196

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL 876
              G+  E  YPY+  +G+   C Y  SK   F  KD  +   N  E M + +  + P+S  
Sbjct:   197 KGIMGEDTYPYRGQDGD---CKYQPSKAIAFV-KDVANITLNDEEAMVEAVALHNPVSFA 252

Query:   877 --LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
               + +D +    G     +  +C  +P  + HAVL VGYG++  IPYW+V+NSWGP    
Sbjct:   253 FEVTADFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGM 309

Query:   933 EGFFKIERGNNACGIEQIAGY 953
             +G+F IERG N CG+   A +
Sbjct:   310 KGYFLIERGKNMCGLAACASF 330

 Score = 411 (149.7 bits), Expect = 8.8e-39, Sum P(2) = 8.8e-39
 Identities = 112/323 (34%), Positives = 165/323 (51%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYFKQD-----GH--KKHE-RYGTSEFSDRS 336
             N LE F  ++++V+  ++Y++ EE   R + F  +      H  + H  + G ++FSD S
Sbjct:    28 NSLEKFHFQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQFSDMS 86

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
              +E+  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FDEL--KRKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVTPVKNQG 136

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             +CGSCW FS  G LE   AI TGKL   ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   137 SCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQG--GLPSQAFEYIR 194

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPYR  +G+   C Y  SK   F  KD   +  N  E M + +  + P+S
Sbjct:   195 YNKGIMGEDTYPYRGQDGD---CKYQPSKAIAFV-KDVANITLNDEEAMVEAVALHNPVS 250

Query:   510 VG--LNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIG 565
                 + +  + +  G     +  +C  +P  + HAVL VGYG++  IPYW+ +NSWGP  
Sbjct:   251 FAFEVTADFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNW 307

Query:   566 PDEGFFKIERGNNACGIEQIAGY 588
               +G+F IERG N CG+   A +
Sbjct:   308 GMKGYFLIERGKNMCGLAACASF 330

 Score = 387 (141.3 bits), Expect = 3.3e-36, Sum P(2) = 3.3e-36
 Identities = 87/224 (38%), Positives = 120/224 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGKL   ++ QLV+CA
Sbjct:   114 GPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+  +G+   C Y  SK   F  KD  
Sbjct:   174 QNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGD---CKYQPSKAIAFV-KDVA 229

Query:   125 HF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +   N  E M + +  + P+S    + +D +    G     +  +C  +P  + HAVL V
Sbjct:   230 NITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGI---YSSTSCHKTPDKVNHAVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG++  IPYW+V+NSWGP    +G+F IERG N CG+   A +
Sbjct:   287 GYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMCGLAACASF 330

 Score = 169 (64.5 bits), Expect = 2.2e-09, P = 2.2e-09
 Identities = 29/56 (51%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG++  IPYW+V+NSWGP    +G+F IERG N CG+   A +
Sbjct:   275 TPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMCGLAACASF 330

 Score = 41 (19.5 bits), Expect = 8.8e-39, Sum P(2) = 8.8e-39
 Identities = 8/22 (36%), Positives = 12/22 (54%)

Query:   603 KAIMLIQAVFLLCGVASCLCLP 624
             K   LI+    +CG+A+C   P
Sbjct:   310 KGYFLIERGKNMCGLAACASFP 331

 Score = 41 (19.5 bits), Expect = 3.3e-36, Sum P(2) = 3.3e-36
 Identities = 8/22 (36%), Positives = 12/22 (54%)

Query:   237 KAIMLIQAVFLLCGVASCLCLP 258
             K   LI+    +CG+A+C   P
Sbjct:   310 KGYFLIERGKNMCGLAACASFP 331


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 424 (154.3 bits), Expect = 1.1e-38, P = 1.1e-38
 Identities = 104/341 (30%), Positives = 160/341 (46%)

Query:   266 DQVVARVDTLAIEGSLTFDNE-NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKH 324
             ++  + V +L  E  L+ D    +   FK F++   R Y + EE + R   F  +  +  
Sbjct:   160 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ 219

Query:   325 E---------RYGTSEFSDRSPEEILCKTGFKWSERT-YERIVADRXXXXXXXXXXXXDG 374
             +         +YG ++FSD + EE           RT Y   +  +              
Sbjct:   220 KIQALDRGTAQYGVTKFSDLTEEEF----------RTIYLNTLLRKEPGNKMKQAKSVGD 269

Query:   375 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 434
               P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K 
Sbjct:   270 LAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 329

Query:   435 CSGCGGCDGLEQPIEYT---HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY 491
                C G  GL     Y+   +  GLE+E DY Y+   G    C +   K K++       
Sbjct:   330 DKACMG--GLPSNA-YSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVEL 383

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
                 + +   L K GP+SV +N+  + FY     R     CSP+ + HAVLLVGYG + D
Sbjct:   384 SQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSD 443

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 592
             +P+W  +NSWG    ++G++ + RG+ ACG+  +A  A +D
Sbjct:   444 VPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD 484

 Score = 416 (151.5 bits), Expect = 8.0e-38, P = 8.0e-38
 Identities = 102/340 (30%), Positives = 160/340 (47%)

Query:   632 DQVVARVDTLAIEGSLTFDNE-NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKH 690
             ++  + V +L  E  L+ D    +   FK F++   R Y + EE + R   F  +  +  
Sbjct:   160 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ 219

Query:   691 E---------RYGTSEFSDRSPEEILCKTGFKWSERT-YERIVADRXXXXXXXXXXXXDG 740
             +         +YG ++FSD + EE           RT Y   +  +              
Sbjct:   220 KIQALDRGTAQYGVTKFSDLTEEEF----------RTIYLNTLLRKEPGNKMKQAKSVGD 269

Query:   741 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
               P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K 
Sbjct:   270 LAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 329

Query:   801 CSGCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF 857
                C G    PS  Y+   +  GLE+E DY Y+   G    C +   K K++        
Sbjct:   330 DKACMGGL--PSNAYSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVELS 384

Query:   858 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
                + +   L K GP+SV +N+  +  Y     R     CSP+ + HAVLLVGYG + ++
Sbjct:   385 QNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDV 444

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 957
             P+W ++NSWG    ++G++ + RG+ ACG+  +A  A +D
Sbjct:   445 PFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD 484

 Score = 396 (144.5 bits), Expect = 1.1e-35, P = 1.1e-35
 Identities = 78/218 (35%), Positives = 117/218 (53%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K   
Sbjct:   272 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 331

Query:    72 GCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              C G    PS  Y+   +  GLE+E DY Y+   G    C +   K K++          
Sbjct:   332 ACMGGL--PSNAYSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVELSQN 386

Query:   129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
              + +   L K GP+SV +N+  +  Y     R     CSP+ + HAVLLVGYG + ++P+
Sbjct:   387 EQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPF 446

Query:   189 WLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             W ++NSWG    ++G++ + RG+ ACG+  +A  A +D
Sbjct:   447 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD 484

 Score = 184 (69.8 bits), Expect = 1.2e-10, P = 1.2e-10
 Identities = 28/61 (45%), Positives = 44/61 (72%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG + D+P+W ++NSWG    ++G++ + RG+ ACG+  +A  A +
Sbjct:   424 CSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 483

Query:  1025 D 1025
             D
Sbjct:   484 D 484


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 424 (154.3 bits), Expect = 1.1e-38, P = 1.1e-38
 Identities = 111/315 (35%), Positives = 161/315 (51%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 709
             FK+++V+  ++Y+  EE   R + F     K + H    H  + G ++FSD S +EI  K
Sbjct:    35 FKSWMVQHQKKYSL-EEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHK 93

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P + DWRKK N   P  +Q +CGSCW 
Sbjct:    94 --YLWSEP--QNCSATKGNYLRGT------GPYPPSMDWRKKGNFVSPVKNQGSCGSCWT 143

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   144 FSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGED 203

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVLLNSDLIH 883
              YPYK   G+   C +   K   F  KD  +   N  E M + +  Y P+S     ++ +
Sbjct:   204 TYPYK---GQDDHCKFQPDKAIAFV-KDVANITMNDEEAMVEAVALYNPVSFAF--EVTN 257

Query:   884 DYNGTPIRK---NDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKI 938
             D+     RK   +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F I
Sbjct:   258 DF--LMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLI 315

Query:   939 ERGNNACGIEQIAGY 953
             ERG N CG+   A Y
Sbjct:   316 ERGKNMCGLAACASY 330

 Score = 419 (152.6 bits), Expect = 3.8e-38, P = 3.8e-38
 Identities = 113/318 (35%), Positives = 160/318 (50%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 343
             FK+++V+  ++Y+  EE   R + F     K + H    H  + G ++FSD S +EI  K
Sbjct:    35 FKSWMVQHQKKYSL-EEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHK 93

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P + DWRKK N   P  +Q +CGSCW 
Sbjct:    94 --YLWSEP--QNCSATKGNYLRGT------GPYPPSMDWRKKGNFVSPVKNQGSCGSCWT 143

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  
Sbjct:   144 FSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQG--GLPSQAFEYIRYNKGIMG 201

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVGL---N 513
             E  YPY+   G+   C +   K   F  KD   +  N  E M + +  Y P+S      N
Sbjct:   202 EDTYPYK---GQDDHCKFQPDKAIAFV-KDVANITMNDEEAMVEAVALYNPVSFAFEVTN 257

Query:   514 SHLIH---FYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGF 570
               L++    Y+ T   K     +P  + HAVL VGYG+++ IPYW+ +NSWGP     G+
Sbjct:   258 DFLMYRKGIYSSTSCHK-----TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGY 312

Query:   571 FKIERGNNACGIEQIAGY 588
             F IERG N CG+   A Y
Sbjct:   313 FLIERGKNMCGLAACASY 330

 Score = 395 (144.1 bits), Expect = 1.4e-35, P = 1.4e-35
 Identities = 88/225 (39%), Positives = 121/225 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   114 GPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPYK   G+   C +   K   F  KD  
Sbjct:   174 QNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYK---GQDDHCKFQPDKAIAFV-KDVA 229

Query:   125 HF--NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRK---NDETC--SPYDLGHAVLL 177
             +   N  E M + +  Y P+S     ++ +D+     RK   +  +C  +P  + HAVL 
Sbjct:   230 NITMNDEEAMVEAVALYNPVSFAF--EVTNDF--LMYRKGIYSSTSCHKTPDKVNHAVLA 285

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   286 VGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 174 (66.3 bits), Expect = 6.1e-10, P = 6.1e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 424 (154.3 bits), Expect = 1.1e-38, P = 1.1e-38
 Identities = 104/314 (33%), Positives = 152/314 (48%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 342
             FK F+    R Y + EE   R   F  +  +  +         RYG ++FSD + EE   
Sbjct:   163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEF-- 220

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
             +T +         ++ D             D P P  WDWR K       DQ  CGSCWA
Sbjct:   221 RTIY------LNPLLKDAPGRNMRPAQPVTDVPPPQ-WDWRNKGAVTNVKDQGMCGSCWA 273

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGCDGLEQPIEYTHQAGLESEKD 461
             FS+ G +EGQ+ +K G L+  S+ +L++C K    C GG          T   GLE+E D
Sbjct:   274 FSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRT-LGGLETEDD 332

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFY- 520
             Y YR   G    C++   K K++           + +   L K GP+S+ +N+  + FY 
Sbjct:   333 YSYR---GRLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFGMQFYR 389

Query:   521 NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNN 578
             +G   P+R     CSP+ + HAVLLVGYG +  IP+W  +NSWG    +EG++ + RG+ 
Sbjct:   390 HGISHPLRP---LCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRGSG 446

Query:   579 ACGIEQIAGYATID 592
             ACG+  +A  A I+
Sbjct:   447 ACGVNIMASSAVIN 460

 Score = 414 (150.8 bits), Expect = 1.3e-37, P = 1.3e-37
 Identities = 103/315 (32%), Positives = 154/315 (48%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 708
             FK F+    R Y + EE   R   F  +  +  +         RYG ++FSD + EE   
Sbjct:   163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEF-- 220

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
             +T +         ++ D             D P P  WDWR K       DQ  CGSCWA
Sbjct:   221 RTIY------LNPLLKDAPGRNMRPAQPVTDVPPPQ-WDWRNKGAVTNVKDQGMCGSCWA 273

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH---QAGLESEK 825
             FS+ G +EGQ+ +K G L+  S+ +L++C K    C G    PS  Y+      GLE+E 
Sbjct:   274 FSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGL--PSNAYSAIRTLGGLETED 331

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDY 885
             DY Y+   G    C++   K K++           + +   L K GP+S+ +N+  +  Y
Sbjct:   332 DYSYR---GRLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFGMQFY 388

Query:   886 -NGT--PIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGN 942
              +G   P+R     CSP+ + HAVLLVGYG +  IP+W ++NSWG    +EG++ + RG+
Sbjct:   389 RHGISHPLRP---LCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRGS 445

Query:   943 NACGIEQIAGYATID 957
              ACG+  +A  A I+
Sbjct:   446 GACGVNIMASSAVIN 460

 Score = 389 (142.0 bits), Expect = 6.3e-35, P = 6.3e-35
 Identities = 84/225 (37%), Positives = 123/225 (54%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             D P P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +K G L+  S+ +L++C 
Sbjct:   245 DVPPPQ-WDWRNKGAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCD 303

Query:    68 KQCSGCDGCFFEPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             K    C G    PS  Y+      GLE+E DY Y+   G    C++   K K++      
Sbjct:   304 KTDKACLGGL--PSNAYSAIRTLGGLETEDDYSYR---GRLQTCSFSAEKAKVYINDSVE 358

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY-NGT--PIRKNDETCSPYDLGHAVLLVGYG 181
                  + +   L K GP+S+ +N+  +  Y +G   P+R     CSP+ + HAVLLVGYG
Sbjct:   359 LSKNEQKLAAWLAKNGPVSIAINAFGMQFYRHGISHPLRP---LCSPWLIDHAVLLVGYG 415

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
              +  IP+W ++NSWG    +EG++ + RG+ ACG+  +A  A I+
Sbjct:   416 NRSAIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNIMASSAVIN 460

 Score = 175 (66.7 bits), Expect = 1.1e-09, P = 1.1e-09
 Identities = 29/61 (47%), Positives = 43/61 (70%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG +  IP+W ++NSWG    +EG++ + RG+ ACG+  +A  A I
Sbjct:   400 CSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVNIMASSAVI 459

Query:  1025 D 1025
             +
Sbjct:   460 N 460


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 365 (133.5 bits), Expect = 2.4e-32, P = 2.4e-32
 Identities = 97/334 (29%), Positives = 158/334 (47%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 345
             E FK F ++  R Y N  E   R   F     Q    + E  GT+EF + +P   L +  
Sbjct:    38 EVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGE-TPFSDLTEEE 96

Query:   346 FKW---SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCW 401
             F      ER+ ER                    VP   DWRK KN+     +Q +C  CW
Sbjct:    97 FGQLYGQERSPERTP----NMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCW 152

Query:   402 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKD 461
             A + A  ++  + IK  + V+ S  +L++C +  +GC G    +  +   + +GL SEKD
Sbjct:   153 AMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKD 212

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDF-LYFNGSETMKKILYKYGPLSVGLNSHLIHFY 520
             YP++ G+ +  +C   K K K+   +DF +  N  + +   L  +GP++V +N  L+  Y
Sbjct:   213 YPFQ-GDRKPHRCLAKKYK-KVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHY 270

Query:   521 NGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-----------------IPYWLARNSWGP 563
                 I+    +C P  + H+VLLVG+GK+ +                  PYW+ +NSWG 
Sbjct:   271 QKGVIKATPSSCDPRQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGA 330

Query:   564 IGPDEGFFKIERGNNACGIEQIAGYATIDVVIQR 597
                ++G+F++ RGNN CG+ +    A +D  +++
Sbjct:   331 HWGEKGYFRLYRGNNTCGVTKYPFTAQVDSPVKK 364

 Score = 293 (108.2 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 80/269 (29%), Positives = 128/269 (47%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 711
             E FK F ++  R Y N  E   R   F     Q    + E  GT+EF + +P   L +  
Sbjct:    38 EVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGE-TPFSDLTEEE 96

Query:   712 FKW---SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCW 767
             F      ER+ ER                    VP   DWRK KN+     +Q +C  CW
Sbjct:    97 FGQLYGQERSPERTP----NMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCW 152

Query:   768 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKD 826
             A + A  ++  + IK  + V+ S  +L++C +  +GC+G F ++  +   + +GL SEKD
Sbjct:   153 AMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKD 212

Query:   827 YPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGSETMKKILYKYGPLSVLLNSDLIHDY 885
             YP++  + +  +C   K K K+   +DF    N  + +   L  +GP++V +N  L+  Y
Sbjct:   213 YPFQG-DRKPHRCLAKKYK-KVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLLQHY 270

Query:   886 NGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
                 I+    +C P  + H+VLLVG+GK+
Sbjct:   271 QKGVIKATPSSCDPRQVDHSVLLVGFGKE 299

 Score = 265 (98.3 bits), Expect = 7.4e-29, Sum P(2) = 7.4e-29
 Identities = 57/176 (32%), Positives = 96/176 (54%)

Query:    11 VPDAWDWRK-KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             VP   DWRK KN+     +Q  C  CWA + A  ++  + IK  + V+ S  +L++C + 
Sbjct:   126 VPRTCDWRKAKNIISSVKNQGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERC 185

Query:    70 CSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 127
              +GC+G F ++  +   + +GL SEKDYP++  + +  +C   K K K+   +DF    N
Sbjct:   186 GNGCNGGFVWDAYLTVLNNSGLASEKDYPFQG-DRKPHRCLAKKYK-KVAWIQDFTMLSN 243

Query:   128 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
               + +   L  +GP++V +N  L+  Y    I+    +C P  + H+VLLVG+GK+
Sbjct:   244 NEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKE 299

 Score = 137 (53.3 bits), Expect = 1.0e-05, P = 1.0e-05
 Identities = 52/191 (27%), Positives = 90/191 (47%)

Query:   797 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN------GEKFK-CAYDKSKVKLF 849
             C   C+G  G  ++  +   + +GL SEKDYP++          +K+K  A+ +    L 
Sbjct:   185 CGNGCNG--GFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLS 242

Query:   850 TGKDFL-HF---NGSETMK---KIL--YKYGPLSVLLNS-DLIH-DYNGTPIRKNDETCS 898
               +  + H+   +G  T+    K+L  Y+ G +    +S D    D++   +    E   
Sbjct:   243 NNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKEK-E 301

Query:   899 PYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV 958
                 G  VL     ++ + PYW+++NSWG    ++G+F++ RGNN CG+ +    A +D 
Sbjct:   302 GMQTG-TVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQVDS 360

Query:   959 -VKNDET-CSP 967
              VK   T C P
Sbjct:   361 PVKKARTSCPP 371

 Score = 132 (51.5 bits), Expect = 3.6e-05, P = 3.6e-05
 Identities = 47/184 (25%), Positives = 88/184 (47%)

Query:    66 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN------GEKFK-CAYDKSKVKLF 118
             C   C+G  G  ++  +   + +GL SEKDYP++          +K+K  A+ +    L 
Sbjct:   185 CGNGCNG--GFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLS 242

Query:   119 TGKDFL-HF---NGSETMK---KIL--YKYGPLSVLLNS-DLIH-DYNGTPIRKNDETCS 167
               +  + H+   +G  T+    K+L  Y+ G +    +S D    D++   +    E   
Sbjct:   243 NNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKEK-E 301

Query:   168 PYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV 227
                 G  VL     ++ + PYW+++NSWG    ++G+F++ RGNN CG+ +    A +D 
Sbjct:   302 GMQTG-TVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQVDS 360

Query:   228 VIQR 231
              +++
Sbjct:   361 PVKK 364

 Score = 115 (45.5 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 17/40 (42%), Positives = 28/40 (70%)

Query:   986 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 1025
             PYW+++NSWG    ++G+F++ RGNN CG+ +    A +D
Sbjct:   320 PYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQVD 359

 Score = 65 (27.9 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 12/28 (42%), Positives = 19/28 (67%)

Query:   958 VVK-NDETCSPYDLGHAVLLVGYGKQDD 984
             V+K    +C P  + H+VLLVG+GK+ +
Sbjct:   274 VIKATPSSCDPRQVDHSVLLVGFGKEKE 301


>TAIR|locus:2078312
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 410 (149.4 bits), Expect = 2.3e-38, Sum P(2) = 2.3e-38
 Identities = 103/310 (33%), Positives = 145/310 (46%)

Query:   291 TFKAFIVKRGRQYANDEEIKERFEYFKQ--DGHKKHERYGTSEFSDRSPEEILCKTGFKW 348
             +F  F  + G++Y + EE+K RF  FK+  D  +   + G S     S  +    T   W
Sbjct:    58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSY--KLSLNQFADLT---W 112

Query:   349 SE-RTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAG 407
              E + Y+   A              +  VPD  DWR+  +  P  +Q  CGSCW FS  G
Sbjct:   113 QEFQRYKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTG 172

Query:   408 MLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGL-EQPIEYT-HQAGLESEKDYPYR 465
              LE  Y    GK +  S+ QLV+CA   +  G   GL  Q  EY  +  GL++E+ YPY 
Sbjct:   173 ALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYT 232

Query:   466 NGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKILYKYGPLSVGLNS-HLIHFYNGT 523
               +G    C +    + +          G+E  +K  +    P+SV     H   FY   
Sbjct:   233 GKDGG---CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKG 289

Query:   524 PIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACG 581
                 N  TC  +P D+ HAVL VGYG +DD+PYWL +NSWG    D G+FK+E G N CG
Sbjct:   290 VFTSN--TCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCG 347

Query:   582 IEQIAGYATI 591
             +   + Y  +
Sbjct:   348 VATCSSYPVV 357

 Score = 404 (147.3 bits), Expect = 1.6e-36, P = 1.6e-36
 Identities = 100/313 (31%), Positives = 154/313 (49%)

Query:   657 TFKAFIVKRGRQYANDEEIKERFEYFKQ--DGHKKHERYGTSEFSDRSPEEILCKTGFKW 714
             +F  F  + G++Y + EE+K RF  FK+  D  +   + G S     S  +    T   W
Sbjct:    58 SFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSY--KLSLNQFADLT---W 112

Query:   715 SE-RTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAG 773
              E + Y+   A              +  VPD  DWR+  +  P  +Q  CGSCW FS  G
Sbjct:   113 QEFQRYKLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTG 172

Query:   774 MLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYK 830
              LE  Y    GK +  S+ QLV+CA   +  GC G     + EY  +  GL++E+ YPY 
Sbjct:   173 ALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYT 232

Query:   831 NANGEKFKCAYDKSKVKLFTGKDFLHFN-GSET-MKKILYKYGPLSVLLNSDLIHDYNGT 888
               +G    C +    + +   +D ++   G+E  +K  +    P+SV    +++H++   
Sbjct:   233 GKDGG---CKFSAKNIGVQV-RDSVNITLGAEDELKHAVGLVRPVSVAF--EVVHEFRF- 285

Query:   889 PIRKN---DETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNN 943
               +K      TC  +P D+ HAVL VGYG +D++PYWL++NSWG    D G+FK+E G N
Sbjct:   286 -YKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKN 344

Query:   944 ACGIEQIAGYATI 956
              CG+   + Y  +
Sbjct:   345 MCGVATCSSYPVV 357

 Score = 373 (136.4 bits), Expect = 2.2e-34, Sum P(2) = 2.2e-34
 Identities = 80/228 (35%), Positives = 122/228 (53%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             +  VPD  DWR+  +  P  +Q  CGSCW FS  G LE  Y    GK +  S+ QLV+CA
Sbjct:   138 EATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
                +  GC G     + EY  +  GL++E+ YPY   +G    C +    + +   +D +
Sbjct:   198 GTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG---CKFSAKNIGVQV-RDSV 253

Query:   125 HFN-GSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKN---DETC--SPYDLGHAVLL 177
             +   G+E  +K  +    P+SV    +++H++     +K      TC  +P D+ HAVL 
Sbjct:   254 NITLGAEDELKHAVGLVRPVSVAF--EVVHEFRF--YKKGVFTSNTCGNTPMDVNHAVLA 309

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
             VGYG +D++PYWL++NSWG    D G+FK+E G N CG+   + Y  +
Sbjct:   310 VGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYPVV 357

 Score = 194 (73.4 bits), Expect = 4.3e-12, P = 4.3e-12
 Identities = 33/68 (48%), Positives = 44/68 (64%)

Query:   959 VKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             V    TC  +P D+ HAVL VGYG +DD+PYWL++NSWG    D G+FK+E G N CG+ 
Sbjct:   290 VFTSNTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVA 349

Query:  1017 QIAGYATI 1024
               + Y  +
Sbjct:   350 TCSSYPVV 357

 Score = 38 (18.4 bits), Expect = 2.3e-38, Sum P(2) = 2.3e-38
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   614 LCGVASCLCLP 624
             +CGVA+C   P
Sbjct:   345 MCGVATCSSYP 355

 Score = 38 (18.4 bits), Expect = 2.2e-34, Sum P(2) = 2.2e-34
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   248 LCGVASCLCLP 258
             +CGVA+C   P
Sbjct:   345 MCGVATCSSYP 355


>UNIPROTKB|F6X9C1
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 421 (153.3 bits), Expect = 2.3e-38, P = 2.3e-38
 Identities = 110/314 (35%), Positives = 161/314 (51%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 709
             FK++ V+  ++Y+++E + +R + F     K + H    H  + G ++FSD +  EI  K
Sbjct:     5 FKSWAVQHQKKYSSEEYL-QRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEI--K 61

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN-VTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P   DWRKK     P  +Q +CGSCW 
Sbjct:    62 HKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGKFVSPVKNQGSCGSCWT 113

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AIK+GKL+  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   114 FSTTGALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGED 173

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNSDL 881
              YPYK  +G+   C Y  SK   F  KD  +   N  + M + +  Y P+S    + SD 
Sbjct:   174 SYPYKGQDGD---CKYQPSKAIAFV-KDVANITINDEQAMVEAVALYNPVSFAFEVTSDF 229

Query:   882 IHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIE 939
             +    G     +  +C  +P  + HAVL VGYG+Q+ IPYW+V+NSWGP     G+F +E
Sbjct:   230 MMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLME 286

Query:   940 RGNNACGIEQIAGY 953
             RG N CG+   A Y
Sbjct:   287 RGKNMCGLAACASY 300

 Score = 414 (150.8 bits), Expect = 1.3e-37, P = 1.3e-37
 Identities = 109/315 (34%), Positives = 161/315 (51%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 343
             FK++ V+  ++Y+++E + +R + F     K + H    H  + G ++FSD +  EI  K
Sbjct:     5 FKSWAVQHQKKYSSEEYL-QRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEI--K 61

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN-VTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P   DWRKK     P  +Q +CGSCW 
Sbjct:    62 HKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGKFVSPVKNQGSCGSCWT 113

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGLEQPIEYT-HQAGLESE 459
             FS  G LE   AIK+GKL+  ++ QLV+CA+  +  GC G   L Q  EY  +  G+  E
Sbjct:   114 FSTTGALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPL-QAFEYIRYNKGIMGE 172

Query:   460 KDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNSH 515
               YPY+  +G+   C Y  SK   F  KD   +  N  + M + +  Y P+S    + S 
Sbjct:   173 DSYPYKGQDGD---CKYQPSKAIAFV-KDVANITINDEQAMVEAVALYNPVSFAFEVTSD 228

Query:   516 LIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
              + +  G     +  +C  +P  + HAVL VGYG+Q+ IPYW+ +NSWGP     G+F +
Sbjct:   229 FMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLM 285

Query:   574 ERGNNACGIEQIAGY 588
             ERG N CG+   A Y
Sbjct:   286 ERGKNMCGLAACASY 300

 Score = 395 (144.1 bits), Expect = 1.4e-35, P = 1.4e-35
 Identities = 89/224 (39%), Positives = 120/224 (53%)

Query:     9 GPVPDAWDWRKKN-VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P   DWRKK     P  +Q  CGSCW FS  G LE   AIK+GKL+  ++ QLV+CA
Sbjct:    84 GPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALESAIAIKSGKLLSLAEQQLVDCA 143

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPYK  +G+   C Y  SK   F  KD  
Sbjct:   144 QNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKGQDGD---CKYQPSKAIAFV-KDVA 199

Query:   125 HF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +   N  + M + +  Y P+S    + SD +    G     +  +C  +P  + HAVL V
Sbjct:   200 NITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGI---YSSTSCHKTPDKVNHAVLAV 256

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+Q+ IPYW+V+NSWGP     G+F +ERG N CG+   A Y
Sbjct:   257 GYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 300

 Score = 174 (66.3 bits), Expect = 4.3e-10, P = 4.3e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+Q+ IPYW+V+NSWGP     G+F +ERG N CG+   A Y
Sbjct:   245 TPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 300


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 412 (150.1 bits), Expect = 2.2e-37, P = 2.2e-37
 Identities = 109/338 (32%), Positives = 167/338 (49%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK----KHERYGTSEF-----SDRSPEEI 706
             E F  F ++  R Y+N  E   R + F Q+  K    + E  GT+EF     SD + EE 
Sbjct:    40 EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   707 LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGP-VPDAWDWRKK-NVTGPAGDQAACG 764
                 G  W         A +             G  VP + DWRKK  V      Q  C 
Sbjct:   100 GQLHGHHWG--------AGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCN 151

Query:   765 SCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLES 823
              CWA +    +E Q+AIK  + V+ S  Q+++C +  +GC+G F ++  +   + +GL S
Sbjct:   152 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLAS 211

Query:   824 EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLI 882
             E+DYPYK    +  +C   + + K+   +DFL     E ++ + L   GP++V +N+ L+
Sbjct:   212 EQDYPYKGTV-KTHRCLAKQHR-KVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLL 269

Query:   883 HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-----------IPYWLVRNSWGPIGP 931
               Y    IR    TC P+ + H+VLLVG+GK  +           IPYW+++NSWGP   
Sbjct:   270 QQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWG 329

Query:   932 DEGFFKIERGNNACGIEQIAGYATID--VVKNDETCSP 967
             +EG+F++ RG+N CGI +    A +D  V K+  +C P
Sbjct:   330 EEGYFRLHRGSNTCGITKYPVTARVDKPVKKHQISCPP 367

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 104/331 (31%), Positives = 160/331 (48%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK----KHERYGTSEF-----SDRSPEEI 340
             E F  F ++  R Y+N  E   R + F Q+  K    + E  GT+EF     SD + EE 
Sbjct:    40 EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   341 LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGP-VPDAWDWRKK-NVTGPAGDQAACG 398
                 G  W         A +             G  VP + DWRKK  V      Q  C 
Sbjct:   100 GQLHGHHWG--------AGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCN 151

Query:   399 SCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLES 458
              CWA +    +E Q+AIK  + V+ S  Q+++C +  +GC G    +  +   + +GL S
Sbjct:   152 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLAS 211

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLI 517
             E+DYPY+ G  +  +C   + + K+   +DFL     E ++ + L   GP++V +N+ L+
Sbjct:   212 EQDYPYK-GTVKTHRCLAKQHR-KVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLL 269

Query:   518 HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-----------IPYWLARNSWGPIGP 566
               Y    IR    TC P+ + H+VLLVG+GK              IPYW+ +NSWGP   
Sbjct:   270 QQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWG 329

Query:   567 DEGFFKIERGNNACGIEQIAGYATIDVVIQR 597
             +EG+F++ RG+N CGI +    A +D  +++
Sbjct:   330 EEGYFRLHRGSNTCGITKYPVTARVDKPVKK 360

 Score = 382 (139.5 bits), Expect = 3.5e-34, P = 3.5e-34
 Identities = 84/241 (34%), Positives = 135/241 (56%)

Query:     6 EKDGP-VPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             E+ G  VP + DWRKK  V      Q DC  CWA +    +E Q+AIK  + V+ S  Q+
Sbjct:   122 EESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQV 181

Query:    64 VECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKD 122
             ++C +  +GC+G F ++  +   + +GL SE+DYPYK    +  +C   + + K+   +D
Sbjct:   182 LDCDRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKGTV-KTHRCLAKQHR-KVAWIQD 239

Query:   123 FLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             FL     E ++ + L   GP++V +N+ L+  Y    IR    TC P+ + H+VLLVG+G
Sbjct:   240 FLMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGFG 299

Query:   182 KQDN-----------IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVIQ 230
             K  +           IPYW+++NSWGP   +EG+F++ RG+N CGI +    A +D  ++
Sbjct:   300 KSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNTCGITKYPVTARVDKPVK 359

Query:   231 R 231
             +
Sbjct:   360 K 360

 Score = 164 (62.8 bits), Expect = 1.0e-08, P = 1.0e-08
 Identities = 31/73 (42%), Positives = 45/73 (61%)

Query:   964 TCSPYDLGHAVLLVGYGKQDD-----------IPYWLVRNSWGPIGPDEGFFKIERGNNA 1012
             TC P+ + H+VLLVG+GK              IPYW+++NSWGP   +EG+F++ RG+N 
Sbjct:   283 TCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGSNT 342

Query:  1013 CGIEQIAGYATID 1025
             CGI +    A +D
Sbjct:   343 CGITKYPVTARVD 355


>UNIPROTKB|F7BJD8
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 411 (149.7 bits), Expect = 2.8e-37, P = 2.8e-37
 Identities = 108/315 (34%), Positives = 160/315 (50%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 709
             FK+++V+  ++Y++ EE   R + F     K + H    H  R G ++FS  +  E+  K
Sbjct:     5 FKSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAEL--K 61

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P + DWRKK N   P  +Q  CGSCW 
Sbjct:    62 HKYLWSEP--QNCSATKGNYLRGA------GPYPPSVDWRKKGNFVSPVKNQGGCGSCWT 113

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AI +GKL+  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   114 FSTTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGED 173

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVLLNSDLIH 883
              YPYK  +G+   C +  +K   F  KD  +   N  + M + +  Y P+S     ++  
Sbjct:   174 TYPYKGQDGD---CKFQPNKAIAFV-KDVANITLNDEKAMVEAVALYNPVSFAF--EVTE 227

Query:   884 DYNGTPIRK---NDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKI 938
             D+     RK   +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F I
Sbjct:   228 DF--MMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLI 285

Query:   939 ERGNNACGIEQIAGY 953
             ERG N CG+   A Y
Sbjct:   286 ERGKNMCGLAACASY 300

 Score = 399 (145.5 bits), Expect = 5.3e-36, P = 5.3e-36
 Identities = 107/316 (33%), Positives = 159/316 (50%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHK--KHE-RYGTSEFSDRSPEEILCK 343
             FK+++V+  ++Y++ EE   R + F     K + H    H  R G ++FS  +  E+  K
Sbjct:     5 FKSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAEL--K 61

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P + DWRKK N   P  +Q  CGSCW 
Sbjct:    62 HKYLWSEP--QNCSATKGNYLRGA------GPYPPSVDWRKKGNFVSPVKNQGGCGSCWT 113

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AI +GKL+  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  
Sbjct:   114 FSTTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQG--GLPSQAFEYIRYNKGIMG 171

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNS 514
             E  YPY+  +G+   C +  +K   F  KD   +  N  + M + +  Y P+S    +  
Sbjct:   172 EDTYPYKGQDGD---CKFQPNKAIAFV-KDVANITLNDEKAMVEAVALYNPVSFAFEVTE 227

Query:   515 HLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
               + +  G     +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP     G+F 
Sbjct:   228 DFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFL 284

Query:   573 IERGNNACGIEQIAGY 588
             IERG N CG+   A Y
Sbjct:   285 IERGKNMCGLAACASY 300

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 87/225 (38%), Positives = 122/225 (54%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI +GKL+  ++ QLV+CA
Sbjct:    84 GPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGALESAVAIASGKLLSLAEQQLVDCA 143

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPYK  +G+   C +  +K   F  KD  
Sbjct:   144 QNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDGD---CKFQPNKAIAFV-KDVA 199

Query:   125 HF--NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRK---NDETC--SPYDLGHAVLL 177
             +   N  + M + +  Y P+S     ++  D+     RK   +  +C  +P  + HAVL 
Sbjct:   200 NITLNDEKAMVEAVALYNPVSFAF--EVTEDF--MMYRKGIYSSTSCHKTPDKVNHAVLA 255

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   256 VGYGEENGIPYWIVKNSWGPHWGMNGYFLIERGKNMCGLAACASY 300

 Score = 174 (66.3 bits), Expect = 4.3e-10, P = 4.3e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   245 TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIERGKNMCGLAACASY 300


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 410 (149.4 bits), Expect = 3.5e-37, P = 3.5e-37
 Identities = 99/313 (31%), Positives = 150/313 (47%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 708
             FK F+    R Y + EE + R   F ++  +  +         +YG ++FSD + EE   
Sbjct:   165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEF-- 222

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
                       Y   +  +            +   P  WDWRKK       DQ  CGSCWA
Sbjct:   223 -------HTIYLNPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWA 275

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYT---HQAGLESEK 825
             FS+ G +EGQ+ +  G L+  S+ +L++C K    C G    PS  YT   +  GLE+E 
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGL--PSNAYTAIKNLGGLETED 333

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHD 884
             DY Y+   G    C +     K++   D +  +  E  +   L + GP+SV +N+  +  
Sbjct:   334 DYGYQ---GHVQACNFSTQMAKVYIN-DSVELSRDENKIAAWLAQKGPISVAINAFGMQF 389

Query:   885 YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA 944
             Y           CSP+ + HAVLLVGYG + NIPYW ++NSWG    +EG++ + RG+ A
Sbjct:   390 YRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYYYLYRGSGA 449

Query:   945 CGIEQIAGYATID 957
             CG+  +A  A ++
Sbjct:   450 CGVNTMASSAVVN 462

 Score = 405 (147.6 bits), Expect = 1.2e-36, P = 1.2e-36
 Identities = 97/313 (30%), Positives = 146/313 (46%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 342
             FK F+    R Y + EE + R   F ++  +  +         +YG ++FSD + EE   
Sbjct:   165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEF-- 222

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
                       Y   +  +            +   P  WDWRKK       DQ  CGSCWA
Sbjct:   223 -------HTIYLNPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWA 275

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT---HQAGLESE 459
             FS+ G +EGQ+ +  G L+  S+ +L++C K    C G  GL     YT   +  GLE+E
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMG--GLPSNA-YTAIKNLGGLETE 332

Query:   460 KDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHF 519
              DY Y+   G    C +     K++             +   L + GP+SV +N+  + F
Sbjct:   333 DDYGYQ---GHVQACNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINAFGMQF 389

Query:   520 YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNA 579
             Y           CSP+ + HAVLLVGYG + +IPYW  +NSWG    +EG++ + RG+ A
Sbjct:   390 YRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYYYLYRGSGA 449

Query:   580 CGIEQIAGYATID 592
             CG+  +A  A ++
Sbjct:   450 CGVNTMASSAVVN 462

 Score = 394 (143.8 bits), Expect = 1.8e-35, P = 1.8e-35
 Identities = 82/219 (37%), Positives = 119/219 (54%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P  WDWRKK       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K   
Sbjct:   250 PPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDK 309

Query:    72 GCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              C G    PS  YT   +  GLE+E DY Y+   G    C +     K++   D +  + 
Sbjct:   310 ACMGGL--PSNAYTAIKNLGGLETEDDYGYQ---GHVQACNFSTQMAKVYIN-DSVELSR 363

Query:   129 SET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 187
              E  +   L + GP+SV +N+  +  Y           CSP+ + HAVLLVGYG + NIP
Sbjct:   364 DENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIP 423

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             YW ++NSWG    +EG++ + RG+ ACG+  +A  A ++
Sbjct:   424 YWAIKNSWGRDWGEEGYYYLYRGSGACGVNTMASSAVVN 462

 Score = 181 (68.8 bits), Expect = 2.4e-10, P = 2.4e-10
 Identities = 29/61 (47%), Positives = 44/61 (72%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG + +IPYW ++NSWG    +EG++ + RG+ ACG+  +A  A +
Sbjct:   402 CSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYYYLYRGSGACGVNTMASSAVV 461

Query:  1025 D 1025
             +
Sbjct:   462 N 462


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 409 (149.0 bits), Expect = 4.5e-37, P = 4.5e-37
 Identities = 90/226 (39%), Positives = 122/226 (53%)

Query:     5 VEKDGPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             +  DGP P+A DWRKK N   P  +Q  CGSCW FS  G LE   AI TGKL+  ++  L
Sbjct:    36 LRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLL 95

Query:    64 VECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTG 120
             V+CA+  +  GC G     + EY  +  GL  E  YPY+  NG    C +   K   F  
Sbjct:    96 VDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRAQNGT---CKFQPDKAIAFV- 151

Query:   121 KDFLHFNGSET--MKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDLGHAVL 176
             KD ++    +   M + + K+ P+S    + SD +H   G       E  +P  + HAVL
Sbjct:   152 KDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEH-TPDKVNHAVL 210

Query:   177 LVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
              VGYG++D  PYW+V+NSWGP+   +G+F IERG N CG+   A Y
Sbjct:   211 AVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256

 Score = 408 (148.7 bits), Expect = 5.8e-37, P = 5.8e-37
 Identities = 90/223 (40%), Positives = 121/223 (54%)

Query:   739 DGPVPDAWDWRKK-NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 797
             DGP P+A DWRKK N   P  +Q  CGSCW FS  G LE   AI TGKL+  ++  LV+C
Sbjct:    39 DGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDC 98

Query:   798 AKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 854
             A+  +  GC G     + EY  +  GL  E  YPY+  NG    C +   K   F  KD 
Sbjct:    99 AQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRAQNGT---CKFQPDKAIAFV-KDV 154

Query:   855 LHFNGSET--MKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVG 910
             ++    +   M + + K+ P+S    + SD +H   G       E  +P  + HAVL VG
Sbjct:   155 INITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEH-TPDKVNHAVLAVG 213

Query:   911 YGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
             YG++D  PYW+V+NSWGP+   +G+F IERG N CG+   A Y
Sbjct:   214 YGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256

 Score = 403 (146.9 bits), Expect = 2.0e-36, P = 2.0e-36
 Identities = 92/225 (40%), Positives = 121/225 (53%)

Query:   373 DGPVPDAWDWRKK-NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 431
             DGP P+A DWRKK N   P  +Q  CGSCW FS  G LE   AI TGKL+  ++  LV+C
Sbjct:    39 DGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDC 98

Query:   432 AKQCS--GCGGCDGL-EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGK 487
             A+  +  GC G  GL  Q  EY  +  GL  E  YPYR  NG    C +   K   F  K
Sbjct:    99 AQAFNNHGCSG--GLPSQAFEYILYNKGLMGEDAYPYRAQNGT---CKFQPDKAIAFV-K 152

Query:   488 DFLYFNGSET--MKKILYKYGPLSVG--LNSHLIHFYNGTPIRKNDETCSPYDLGHAVLL 543
             D +     +   M + + K+ P+S    + S  +H+  G       E  +P  + HAVL 
Sbjct:   153 DVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEH-TPDKVNHAVLA 211

Query:   544 VGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGY 588
             VGYG++D  PYW+ +NSWGP+   +G+F IERG N CG+   A Y
Sbjct:   212 VGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256

 Score = 179 (68.1 bits), Expect = 3.3e-11, P = 3.3e-11
 Identities = 30/56 (53%), Positives = 40/56 (71%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG++D  PYW+V+NSWGP+   +G+F IERG N CG+   A Y
Sbjct:   201 TPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGMDGYFLIERGKNMCGLAACASY 256


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 409 (149.0 bits), Expect = 4.5e-37, P = 4.5e-37
 Identities = 109/321 (33%), Positives = 158/321 (49%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 702
             N LE F  K+++ K  + Y+ +EE  +R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMS 87

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
               EI  K  + WSE   +   A +             GP P + DWRKK +   P  +Q 
Sbjct:    88 FAEI--KRKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGHFVSPVKNQG 137

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   138 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYN 197

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL 876
              G+  E  YPY+   G+   C +   K   F  KD  +      + M + +  Y P+S  
Sbjct:   198 NGIMGEDTYPYQ---GKDSDCKFQPGKAIGFV-KDVANITIYDEDAMVEAVALYNPVSFA 253

Query:   877 --LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
               +  D +    G     +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP    
Sbjct:   254 FEVTQDFMMYKRGI---YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM 310

Query:   933 EGFFKIERGNNACGIEQIAGY 953
              G+F IERG N CG+   A Y
Sbjct:   311 NGYFLIERGKNMCGLAACASY 331

 Score = 401 (146.2 bits), Expect = 3.3e-36, P = 3.3e-36
 Identities = 110/323 (34%), Positives = 159/323 (49%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 336
             N LE F  K+++ K  + Y+ +EE  +R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMS 87

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
               EI  K  + WSE   +   A +             GP P + DWRKK +   P  +Q 
Sbjct:    88 FAEI--KRKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGHFVSPVKNQG 137

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   138 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIL 195

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPY+   G+   C +   K   F  KD   +     + M + +  Y P+S
Sbjct:   196 YNNGIMGEDTYPYQ---GKDSDCKFQPGKAIGFV-KDVANITIYDEDAMVEAVALYNPVS 251

Query:   510 VG--LNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIG 565
                 +    + +  G     +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP  
Sbjct:   252 FAFEVTQDFMMYKRGI---YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQW 308

Query:   566 PDEGFFKIERGNNACGIEQIAGY 588
                G+F IERG N CG+   A Y
Sbjct:   309 GMNGYFLIERGKNMCGLAACASY 331

 Score = 366 (133.9 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 83/224 (37%), Positives = 117/224 (52%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK +   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   115 GPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 174

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+   G+   C +   K   F  KD  
Sbjct:   175 QDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQ---GKDSDCKFQPGKAIGFV-KDVA 230

Query:   125 HFN--GSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +      + M + +  Y P+S    +  D +    G     +  +C  +P  + HAVL V
Sbjct:   231 NITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGI---YSSTSCHKTPDKVNHAVLAV 287

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   288 GYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331

 Score = 174 (66.3 bits), Expect = 6.2e-10, P = 6.2e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   276 TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 408 (148.7 bits), Expect = 5.8e-37, P = 5.8e-37
 Identities = 106/314 (33%), Positives = 155/314 (49%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRSPEEILCK 709
             FK+++ K  + Y+ +EE  +R + F     K + H    H  +   ++FSD S  EI  K
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEI--K 92

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P + DWRKK +   P  +Q ACGSCW 
Sbjct:    93 RKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGHFVSPVKNQGACGSCWT 144

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   145 FSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGED 204

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL--LNSDL 881
              YPY+   G+   C +   K   F  KD  +      + M + +  Y P+S    +  D 
Sbjct:   205 TYPYQ---GKDSDCKFQPGKAIGFV-KDVANITIYDEDAMVEAVALYNPVSFAFEVTQDF 260

Query:   882 IHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIE 939
             +    G     +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IE
Sbjct:   261 MMYKRGI---YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIE 317

Query:   940 RGNNACGIEQIAGY 953
             RG N CG+   A Y
Sbjct:   318 RGKNMCGLAACASY 331

 Score = 400 (145.9 bits), Expect = 4.2e-36, P = 4.2e-36
 Identities = 107/316 (33%), Positives = 156/316 (49%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRSPEEILCK 343
             FK+++ K  + Y+ +EE  +R + F     K + H    H  +   ++FSD S  EI  K
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEI--K 92

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P + DWRKK +   P  +Q ACGSCW 
Sbjct:    93 RKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGHFVSPVKNQGACGSCWT 144

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  
Sbjct:   145 FSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYILYNNGIMG 202

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNS 514
             E  YPY+   G+   C +   K   F  KD   +     + M + +  Y P+S    +  
Sbjct:   203 EDTYPYQ---GKDSDCKFQPGKAIGFV-KDVANITIYDEDAMVEAVALYNPVSFAFEVTQ 258

Query:   515 HLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
               + +  G     +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP     G+F 
Sbjct:   259 DFMMYKRGI---YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFL 315

Query:   573 IERGNNACGIEQIAGY 588
             IERG N CG+   A Y
Sbjct:   316 IERGKNMCGLAACASY 331

 Score = 366 (133.9 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 83/224 (37%), Positives = 117/224 (52%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK +   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   115 GPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 174

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+   G+   C +   K   F  KD  
Sbjct:   175 QDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQ---GKDSDCKFQPGKAIGFV-KDVA 230

Query:   125 HFN--GSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +      + M + +  Y P+S    +  D +    G     +  +C  +P  + HAVL V
Sbjct:   231 NITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGI---YSSTSCHKTPDKVNHAVLAV 287

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   288 GYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331

 Score = 174 (66.3 bits), Expect = 6.2e-10, P = 6.2e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   276 TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 331


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 407 (148.3 bits), Expect = 7.4e-37, P = 7.4e-37
 Identities = 112/321 (34%), Positives = 159/321 (49%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 702
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHHKTYST-EEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVSPVKNQG 136

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYN 196

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL 876
              G+  E  YPY+  +G+   C +   K   F  KD  +      E M + +  Y P+S  
Sbjct:   197 KGIMGEDTYPYQGKDGD---CKFRPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVSFA 252

Query:   877 LNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
                ++  D+    T I  +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP    
Sbjct:   253 F--EVTQDFMIYKTGIYSST-SCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM 309

Query:   933 EGFFKIERGNNACGIEQIAGY 953
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 399 (145.5 bits), Expect = 5.3e-36, P = 5.3e-36
 Identities = 113/321 (35%), Positives = 156/321 (48%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 336
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHHKTYST-EEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVSPVKNQG 136

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIL 194

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPY+  +G+   C +   K   F  KD   +     E M + +  Y P+S
Sbjct:   195 YNKGIMGEDTYPYQGKDGD---CKFRPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVS 250

Query:   510 VGLNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPD 567
                          T I  +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP    
Sbjct:   251 FAFEVTQDFMIYKTGIYSST-SCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM 309

Query:   568 EGFFKIERGNNACGIEQIAGY 588
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 381 (139.2 bits), Expect = 4.5e-34, P = 4.5e-34
 Identities = 86/224 (38%), Positives = 120/224 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   114 GPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+  +G+   C +   K   F  KD  
Sbjct:   174 QDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGD---CKFRPGKAIGFV-KDVA 229

Query:   125 HFN--GSETMKKILYKYGPLSVLLNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLV 178
             +      E M + +  Y P+S     ++  D+    T I  +  +C  +P  + HAVL V
Sbjct:   230 NITIYDEEAMVEAVALYNPVSFAF--EVTQDFMIYKTGIYSST-SCHKTPDKVNHAVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   287 GYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 174 (66.3 bits), Expect = 1.9e-09, Sum P(2) = 1.9e-09
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 39 (18.8 bits), Expect = 1.9e-09, Sum P(2) = 1.9e-09
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY 154
             H NG+ T K  L ++  +S    +++ H Y
Sbjct:    68 HNNGNHTFKMALNQFSDMSF---AEIKHKY 94


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 405 (147.6 bits), Expect = 1.2e-36, P = 1.2e-36
 Identities = 110/333 (33%), Positives = 164/333 (49%)

Query:   641 LAIEGSLTFDNENILET-FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE- 691
             L   G+  F   N+ +  FK+++ +  ++Y+  EE   R + F     K + H    H  
Sbjct:    15 LGAPGADAFSANNLEKFHFKSWMSQHHKKYSA-EEYPRRLQTFVRNWRKINAHNNGNHTF 73

Query:   692 RYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK 751
             + G ++FSD S  EI  K  + W+E   +   A +             GP P + DWRKK
Sbjct:    74 QMGLNQFSDMSFAEI--KHKYLWTEP--QNCSATKSNYLRGT------GPYPSSVDWRKK 123

Query:   752 -NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCF 808
              N   P  +Q ACGSCW FS  G LE   AI  GK++  ++ QLV+CA+  +  GC+G  
Sbjct:   124 GNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGL 183

Query:   809 FEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKK 865
                + EY  +  G+  E  YPY+   G   +C +   K   F  KD  +   N  E M +
Sbjct:   184 PSQAFEYILYNKGIMGEDSYPYRAMEG---RCKFQPQKAIAFV-KDVANITLNDEEAMVE 239

Query:   866 ILYKYGPLSVLLNSDLIHDYNGTPIRK---NDETC--SPYDLGHAVLLVGYGKQDNIPYW 920
              +  Y P+S     ++  D+     RK   +  +C  +P  + HAVL VGYG+++ +PYW
Sbjct:   240 AVALYNPVSFAF--EVTEDF--MQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYW 295

Query:   921 LVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
             +V+NSWG      G+F IERG N CG+   A Y
Sbjct:   296 IVKNSWGSHWGMNGYFYIERGKNMCGLAACASY 328

 Score = 397 (144.8 bits), Expect = 8.7e-36, P = 8.7e-36
 Identities = 110/333 (33%), Positives = 162/333 (48%)

Query:   275 LAIEGSLTFDNENILET-FKAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE- 325
             L   G+  F   N+ +  FK+++ +  ++Y+  EE   R + F     K + H    H  
Sbjct:    15 LGAPGADAFSANNLEKFHFKSWMSQHHKKYSA-EEYPRRLQTFVRNWRKINAHNNGNHTF 73

Query:   326 RYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK 385
             + G ++FSD S  EI  K  + W+E   +   A +             GP P + DWRKK
Sbjct:    74 QMGLNQFSDMSFAEI--KHKYLWTEP--QNCSATKSNYLRGT------GPYPSSVDWRKK 123

Query:   386 -NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG- 443
              N   P  +Q ACGSCW FS  G LE   AI  GK++  ++ QLV+CA+  +   GC+G 
Sbjct:   124 GNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNN-HGCEGG 182

Query:   444 L-EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMK 499
             L  Q  EY  +  G+  E  YPYR   G   +C +   K   F  KD   +  N  E M 
Sbjct:   183 LPSQAFEYILYNKGIMGEDSYPYRAMEG---RCKFQPQKAIAFV-KDVANITLNDEEAMV 238

Query:   500 KILYKYGPLSVG--LNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYW 555
             + +  Y P+S    +    + +  G     +  +C  +P  + HAVL VGYG+++ +PYW
Sbjct:   239 EAVALYNPVSFAFEVTEDFMQYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEENGVPYW 295

Query:   556 LARNSWGPIGPDEGFFKIERGNNACGIEQIAGY 588
             + +NSWG      G+F IERG N CG+   A Y
Sbjct:   296 IVKNSWGSHWGMNGYFYIERGKNMCGLAACASY 328

 Score = 378 (138.1 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 84/225 (37%), Positives = 119/225 (52%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI  GK++  ++ QLV+CA
Sbjct:   112 GPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCA 171

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC+G     + EY  +  G+  E  YPY+   G   +C +   K   F  KD  
Sbjct:   172 QNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYRAMEG---RCKFQPQKAIAFV-KDVA 227

Query:   125 HF--NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRK---NDETC--SPYDLGHAVLL 177
             +   N  E M + +  Y P+S     ++  D+     RK   +  +C  +P  + HAVL 
Sbjct:   228 NITLNDEEAMVEAVALYNPVSFAF--EVTEDF--MQYRKGIYSSTSCHKTPDKVNHAVLA 283

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             VGYG+++ +PYW+V+NSWG      G+F IERG N CG+   A Y
Sbjct:   284 VGYGEENGVPYWIVKNSWGSHWGMNGYFYIERGKNMCGLAACASY 328

 Score = 165 (63.1 bits), Expect = 6.1e-09, P = 6.1e-09
 Identities = 28/56 (50%), Positives = 38/56 (67%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ +PYW+V+NSWG      G+F IERG N CG+   A Y
Sbjct:   273 TPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERGKNMCGLAACASY 328


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 402 (146.6 bits), Expect = 2.5e-36, P = 2.5e-36
 Identities = 112/321 (34%), Positives = 158/321 (49%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 702
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFVSPVKNQG 136

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYN 196

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL 876
              G+  E  YPY+  +G    C +   K   F  KD  +      E M + +  Y P+S  
Sbjct:   197 KGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVSFA 252

Query:   877 LNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
                ++  D+    T I  +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP    
Sbjct:   253 F--EVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGM 309

Query:   933 EGFFKIERGNNACGIEQIAGY 953
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 394 (143.8 bits), Expect = 1.8e-35, P = 1.8e-35
 Identities = 113/321 (35%), Positives = 155/321 (48%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 336
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFVSPVKNQG 136

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIL 194

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPY+  +G    C +   K   F  KD   +     E M + +  Y P+S
Sbjct:   195 YNKGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVS 250

Query:   510 VGLNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPD 567
                          T I  +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP    
Sbjct:   251 FAFEVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGM 309

Query:   568 EGFFKIERGNNACGIEQIAGY 588
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 376 (137.4 bits), Expect = 1.6e-33, P = 1.6e-33
 Identities = 86/224 (38%), Positives = 119/224 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   114 GPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+  +G    C +   K   F  KD  
Sbjct:   174 QDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVA 229

Query:   125 HFN--GSETMKKILYKYGPLSVLLNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLV 178
             +      E M + +  Y P+S     ++  D+    T I  +  +C  +P  + HAVL V
Sbjct:   230 NITIYDEEAMVEAVALYNPVSFAF--EVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   287 GYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 173 (66.0 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 39 (18.8 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY 154
             H NG+ T K  L ++  +S    +++ H Y
Sbjct:    68 HNNGNHTFKMALNQFSDMSF---AEIKHKY 94


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 402 (146.6 bits), Expect = 2.5e-36, P = 2.5e-36
 Identities = 111/321 (34%), Positives = 156/321 (48%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 702
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHHKTYST-EEYHHRLQMFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVSPVKNQG 136

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYN 196

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL 876
              G+  E  YPY+  +G    C +   K   F  KD  +      E M + +  Y P+S  
Sbjct:   197 KGIMGEDTYPYQGKDGY---CKFRPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVSFA 252

Query:   877 --LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
               +  D +    G     +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP    
Sbjct:   253 FEVTQDFMMYRRGI---YSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGM 309

Query:   933 EGFFKIERGNNACGIEQIAGY 953
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 393 (143.4 bits), Expect = 2.3e-35, P = 2.3e-35
 Identities = 112/323 (34%), Positives = 157/323 (48%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 336
             N LE F  K+++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFHFKSWMSKHHKTYST-EEYHHRLQMFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSMDWRKKGNFVSPVKNQG 136

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIL 194

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPY+  +G    C +   K   F  KD   +     E M + +  Y P+S
Sbjct:   195 YNKGIMGEDTYPYQGKDGY---CKFRPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVS 250

Query:   510 VG--LNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIG 565
                 +    + +  G     +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP  
Sbjct:   251 FAFEVTQDFMMYRRGI---YSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQW 307

Query:   566 PDEGFFKIERGNNACGIEQIAGY 588
                G+F IERG N CG+   A Y
Sbjct:   308 GMNGYFLIERGKNMCGLAACASY 330

 Score = 375 (137.1 bits), Expect = 2.0e-33, P = 2.0e-33
 Identities = 85/224 (37%), Positives = 117/224 (52%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   114 GPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+  +G    C +   K   F  KD  
Sbjct:   174 QDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGY---CKFRPGKAIGFV-KDVA 229

Query:   125 HFN--GSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETC--SPYDLGHAVLLV 178
             +      E M + +  Y P+S    +  D +    G     +  +C  +P  + HAVL V
Sbjct:   230 NITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRRGI---YSSTSCHKTPDKVNHAVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   287 GYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 173 (66.0 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330

 Score = 39 (18.8 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY 154
             H NG+ T K  L ++  +S    +++ H Y
Sbjct:    68 HNNGNHTFKMALNQFSDMSF---AEIKHKY 94


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 402 (146.6 bits), Expect = 2.5e-36, P = 2.5e-36
 Identities = 106/311 (34%), Positives = 153/311 (49%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGH--KKHE-RYGTSEFSDRSPEEILCK 709
             F++++ +  ++Y++ EE  +R + F     K + H  + H  +   ++FSD +  EI  K
Sbjct:    35 FQSWMAQHQKKYSS-EEYHQRQQTFVSNWRKINAHNARNHTFKMALNQFSDMTFAEI--K 91

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 768
               + WSE   +   A +             GP P   DWRKK +   P  +Q ACGSCW 
Sbjct:    92 QKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGHFVSPVKNQGACGSCWT 143

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AI  GKL+  ++ QLV+CAK  +  GC G     + EY  +  G+  E 
Sbjct:   144 FSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGED 203

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVLLN-SDLI 882
              YPYK   G+   C +   K   F  KD  +   N  E M + +  Y P+S     +D  
Sbjct:   204 TYPYK---GQDDVCKFQPKKAIAFV-KDVANITLNDEEAMVEAVALYNPVSFAFEVTDDF 259

Query:   883 HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGN 942
               Y+           +P  + HAVL VGYG++  IPYW+V+NSWGP    +G+F IERG 
Sbjct:   260 MKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIERGK 319

Query:   943 NACGIEQIAGY 953
             N CG+   A Y
Sbjct:   320 NMCGLAACASY 330

 Score = 392 (143.0 bits), Expect = 3.0e-35, P = 3.0e-35
 Identities = 107/316 (33%), Positives = 157/316 (49%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGH--KKHE-RYGTSEFSDRSPEEILCK 343
             F++++ +  ++Y++ EE  +R + F     K + H  + H  +   ++FSD +  EI  K
Sbjct:    35 FQSWMAQHQKKYSS-EEYHQRQQTFVSNWRKINAHNARNHTFKMALNQFSDMTFAEI--K 91

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 402
               + WSE   +   A +             GP P   DWRKK +   P  +Q ACGSCW 
Sbjct:    92 QKYLWSEP--QNCSATKGNYLRGT------GPYPPFVDWRKKGHFVSPVKNQGACGSCWT 143

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AI  GKL+  ++ QLV+CAK  +  GC G  GL  Q  EY  +  G+  
Sbjct:   144 FSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQG--GLPSQAFEYILYNKGIMG 201

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNS 514
             E  YPY+   G+   C +   K   F  KD   +  N  E M + +  Y P+S    +  
Sbjct:   202 EDTYPYK---GQDDVCKFQPKKAIAFV-KDVANITLNDEEAMVEAVALYNPVSFAFEVTD 257

Query:   515 HLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
               + +  G     +  +C  +P  + HAVL VGYG++  IPYW+ +NSWGP    +G+F 
Sbjct:   258 DFMKYSKGI---YSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFL 314

Query:   573 IERGNNACGIEQIAGY 588
             IERG N CG+   A Y
Sbjct:   315 IERGKNMCGLAACASY 330

 Score = 382 (139.5 bits), Expect = 3.5e-34, P = 3.5e-34
 Identities = 86/221 (38%), Positives = 113/221 (51%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P   DWRKK +   P  +Q  CGSCW FS  G LE   AI  GKL+  ++ QLV+CA
Sbjct:   114 GPYPPFVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             K  +  GC G     + EY  +  G+  E  YPYK   G+   C +   K   F  KD  
Sbjct:   174 KDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYK---GQDDVCKFQPKKAIAFV-KDVA 229

Query:   125 HF--NGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +   N  E M + +  Y P+S     +D    Y+           +P  + HAVL VGYG
Sbjct:   230 NITLNDEEAMVEAVALYNPVSFAFEVTDDFMKYSKGIYSSTSCHKTPDKVNHAVLAVGYG 289

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             ++  IPYW+V+NSWGP    +G+F IERG N CG+   A Y
Sbjct:   290 EEKGIPYWIVKNSWGPYWGMDGYFLIERGKNMCGLAACASY 330

 Score = 176 (67.0 bits), Expect = 3.6e-10, P = 3.6e-10
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG++  IPYW+V+NSWGP    +G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIERGKNMCGLAACASY 330


>UNIPROTKB|G3R9A7
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 399 (145.5 bits), Expect = 5.3e-36, P = 5.3e-36
 Identities = 111/321 (34%), Positives = 158/321 (49%)

Query:   653 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 702
             N LE F  ++++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFYFRSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 761
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFVSPVKNQG 136

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQ 818
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     + EY  + 
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYN 196

Query:   819 AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVL 876
              G+  E  YPY+  +G    C +   K   F  KD  +      E M + +  Y P+S  
Sbjct:   197 KGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVSFA 252

Query:   877 LNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
                ++  D+    T I  +  +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP    
Sbjct:   253 F--EVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGM 309

Query:   933 EGFFKIERGNNACGIEQIAGY 953
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 112/321 (34%), Positives = 155/321 (48%)

Query:   287 NILETF--KAFIVKRGRQYANDEEIKERFEYF-----KQDGHKK--HE-RYGTSEFSDRS 336
             N LE F  ++++ K  + Y+  EE   R + F     K + H    H  +   ++FSD S
Sbjct:    28 NSLEKFYFRSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMS 86

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQA 395
               EI  K  + WSE   +   A +             GP P + DWRKK N   P  +Q 
Sbjct:    87 FAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFVSPVKNQG 136

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT- 451
             ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  Q  EY  
Sbjct:   137 ACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPSQAFEYIL 194

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLS 509
             +  G+  E  YPY+  +G    C +   K   F  KD   +     E M + +  Y P+S
Sbjct:   195 YNKGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVANITIYDEEAMVEAVALYNPVS 250

Query:   510 VGLNSHLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPD 567
                          T I  +  +C  +P  + HAVL VGYG+++ IPYW+ +NSWGP    
Sbjct:   251 FAFEVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGM 309

Query:   568 EGFFKIERGNNACGIEQIAGY 588
              G+F IERG N CG+   A Y
Sbjct:   310 NGYFLIERGKNMCGLAACASY 330

 Score = 376 (137.4 bits), Expect = 1.6e-33, P = 1.6e-33
 Identities = 86/224 (38%), Positives = 119/224 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:   114 GPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 173

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY+  +G    C +   K   F  KD  
Sbjct:   174 QDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGY---CKFQPGKAIGFV-KDVA 229

Query:   125 HFN--GSETMKKILYKYGPLSVLLNSDLIHDYN--GTPIRKNDETC--SPYDLGHAVLLV 178
             +      E M + +  Y P+S     ++  D+    T I  +  +C  +P  + HAVL V
Sbjct:   230 NITIYDEEAMVEAVALYNPVSFAF--EVTQDFMMYRTGIYSST-SCHKTPDKVNHAVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   287 GYGEKNGIPYWIVKNSWGPKWGMNGYFLIERGKNMCGLAACASY 330

 Score = 173 (66.0 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+++ IPYW+V+NSWGP     G+F IERG N CG+   A Y
Sbjct:   275 TPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNGYFLIERGKNMCGLAACASY 330

 Score = 39 (18.8 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDY 154
             H NG+ T K  L ++  +S    +++ H Y
Sbjct:    68 HNNGNHTFKMALNQFSDMSF---AEIKHKY 94


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 399 (145.5 bits), Expect = 5.3e-36, P = 5.3e-36
 Identities = 99/297 (33%), Positives = 147/297 (49%)

Query:   667 RQYANDEEI-KERFEYFKQDGHKKHE-RYGTSEFSDRSPEEILCKTGFKWSERTYERIVA 724
             R+Y++  ++    +   +    + H  + G ++FSD S  EI  K  + WSE   +   A
Sbjct:    47 REYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEI--KHKYLWSEP--QNCSA 102

Query:   725 DRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT 783
              +             GP P + DWRKK NV  P  +Q ACGSCW FS  G LE   AI +
Sbjct:   103 TKSNYLRGT------GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIAS 156

Query:   784 GKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCA 840
             GK++  ++ QLV+CA+  +  GC G     + EY  +  G+  E  YPY   NG+   C 
Sbjct:   157 GKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQ---CK 213

Query:   841 YDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDET 896
             ++  K   F  K+ ++   N    M + +  Y P+S    +  D +  Y       N   
Sbjct:   214 FNPEKAVAFV-KNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMM-YKSGVYSSNSCH 271

Query:   897 CSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
              +P  + HAVL VGYG+Q+ + YW+V+NSWG    + G+F IERG N CG+   A Y
Sbjct:   272 KTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASY 328

 Score = 391 (142.7 bits), Expect = 3.8e-35, P = 3.8e-35
 Identities = 99/297 (33%), Positives = 143/297 (48%)

Query:   301 RQYANDEEI-KERFEYFKQDGHKKHE-RYGTSEFSDRSPEEILCKTGFKWSERTYERIVA 358
             R+Y++  ++    +   +    + H  + G ++FSD S  EI  K  + WSE   +   A
Sbjct:    47 REYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEI--KHKYLWSEP--QNCSA 102

Query:   359 DRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT 417
              +             GP P + DWRKK NV  P  +Q ACGSCW FS  G LE   AI +
Sbjct:   103 TKSNYLRGT------GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIAS 156

Query:   418 GKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLESEKDYPYRNGNGEKFK 473
             GK++  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  E  YPY   NG+   
Sbjct:   157 GKMMTLAEQQLVDCAQNFNNHGCQG--GLPSQAFEYILYNKGIMGEDSYPYIGKNGQ--- 211

Query:   474 CAYDKSKVKLFTGKDF-LYFNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDET 531
             C ++  K   F      +  N    M + +  Y P+S     +     Y       N   
Sbjct:   212 CKFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCH 271

Query:   532 CSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGY 588
              +P  + HAVL VGYG+Q+ + YW+ +NSWG    + G+F IERG N CG+   A Y
Sbjct:   272 KTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASY 328

 Score = 379 (138.5 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 83/222 (37%), Positives = 117/222 (52%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK NV  P  +Q  CGSCW FS  G LE   AI +GK++  ++ QLV+CA
Sbjct:   112 GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY   NG+   C ++  K   F  K+ +
Sbjct:   172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQ---CKFNPEKAVAFV-KNVV 227

Query:   125 HF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
             +   N    M + +  Y P+S    +  D +  Y       N    +P  + HAVL VGY
Sbjct:   228 NITLNDEAAMVEAVALYNPVSFAFEVTEDFMM-YKSGVYSSNSCHKTPDKVNHAVLAVGY 286

Query:   181 GKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             G+Q+ + YW+V+NSWG    + G+F IERG N CG+   A Y
Sbjct:   287 GEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASY 328

 Score = 163 (62.4 bits), Expect = 1.0e-08, P = 1.0e-08
 Identities = 30/65 (46%), Positives = 42/65 (64%)

Query:   959 VKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             V +  +C  +P  + HAVL VGYG+Q+ + YW+V+NSWG    + G+F IERG N CG+ 
Sbjct:   264 VYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLA 323

Query:  1017 QIAGY 1021
               A Y
Sbjct:   324 ACASY 328


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 398 (145.2 bits), Expect = 6.8e-36, P = 6.8e-36
 Identities = 97/313 (30%), Positives = 150/313 (47%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 708
             FK F+    R Y + EE + R   F ++  +  +         +YG ++FSD + EE   
Sbjct:   165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEF-- 222

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
                       Y   +  +            +   P  WDWRKK       +Q  CGSCWA
Sbjct:   223 -------HTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWA 275

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYT---HQAGLESEK 825
             FS+ G +EGQ+ +  G L+  S+ +L++C K    C G    PS  Y    +  GLE+E 
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGL--PSNAYAAIKNLGGLETED 333

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHD 884
             DY Y+   G    C +     K++   D +  + +E  +   L + GP+SV +N+  +  
Sbjct:   334 DYGYQ---GHVQTCNFSAQMAKVYIN-DSVELSRNENKIAAWLAQKGPISVAINAFGMQF 389

Query:   885 YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA 944
             Y           CSP+ + HAVLLVGYG + NIPYW ++NSWG    +EG++ + RG+ A
Sbjct:   390 YRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGA 449

Query:   945 CGIEQIAGYATID 957
             CG+  +A  A ++
Sbjct:   450 CGVNTMASSAVVN 462

 Score = 395 (144.1 bits), Expect = 1.4e-35, P = 1.4e-35
 Identities = 93/311 (29%), Positives = 143/311 (45%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYGTSEFSDRSPEEILC 342
             FK F+    R Y + EE + R   F ++  +  +         +YG ++FSD + EE   
Sbjct:   165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEF-- 222

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
                       Y   +  +            +   P  WDWRKK       +Q  CGSCWA
Sbjct:   223 -------HTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWA 275

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGCDGLEQPIEYTHQAGLESEKD 461
             FS+ G +EGQ+ +  G L+  S+ +L++C K    C GG           +  GLE+E D
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPS-NAYAAIKNLGGLETEDD 334

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYN 521
             Y Y+   G    C +     K++             +   L + GP+SV +N+  + FY 
Sbjct:   335 YGYQ---GHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 391

Query:   522 GTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACG 581
                       CSP+ + HAVLLVGYG + +IPYW  +NSWG    +EG++ + RG+ ACG
Sbjct:   392 HGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGACG 451

Query:   582 IEQIAGYATID 592
             +  +A  A ++
Sbjct:   452 VNTMASSAVVN 462

 Score = 383 (139.9 bits), Expect = 2.8e-34, P = 2.8e-34
 Identities = 80/219 (36%), Positives = 119/219 (54%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P  WDWRKK       +Q  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K   
Sbjct:   250 PPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDK 309

Query:    72 GCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              C G    PS  Y    +  GLE+E DY Y+   G    C +     K++   D +  + 
Sbjct:   310 ACLGGL--PSNAYAAIKNLGGLETEDDYGYQ---GHVQTCNFSAQMAKVYIN-DSVELSR 363

Query:   129 SET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 187
             +E  +   L + GP+SV +N+  +  Y           CSP+ + HAVLLVGYG + NIP
Sbjct:   364 NENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIP 423

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             YW ++NSWG    +EG++ + RG+ ACG+  +A  A ++
Sbjct:   424 YWAIKNSWGSDWGEEGYYYLYRGSGACGVNTMASSAVVN 462

 Score = 182 (69.1 bits), Expect = 1.9e-10, P = 1.9e-10
 Identities = 29/61 (47%), Positives = 44/61 (72%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             CSP+ + HAVLLVGYG + +IPYW ++NSWG    +EG++ + RG+ ACG+  +A  A +
Sbjct:   402 CSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461

Query:  1025 D 1025
             +
Sbjct:   462 N 462


>TAIR|locus:2175088
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 397 (144.8 bits), Expect = 8.7e-36, P = 8.7e-36
 Identities = 100/304 (32%), Positives = 146/304 (48%)

Query:   291 TFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHERY--GTSEFSDRSPEEILC 342
             +F  F  + G++Y N EE+K RF  FK++       +KK   Y  G ++F+D + +E   
Sbjct:    58 SFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEF-- 115

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
                    +RT +   A              +  +P+  DWR+  +  P  DQ  CGSCW 
Sbjct:   116 -------QRT-KLGAAQNCSATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWT 167

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYTHQ-AGLES 458
             FS  G LE  Y    GK +  S+ QLV+CA   +  GC G  GL  Q  EY     GL++
Sbjct:   168 FSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNG--GLPSQAFEYIKSNGGLDT 225

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS-HLI 517
             EK YPY  G  E  K + +   V++    + +     + +K  +    P+S+     H  
Sbjct:   226 EKAYPY-TGKDETCKFSAENVGVQVLNSVN-ITLGAEDELKHAVGLVRPVSIAFEVIHSF 283

Query:   518 HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGN 577
               Y       +    +P D+ HAVL VGYG +D +PYWL +NSWG    D+G+FK+E G 
Sbjct:   284 RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGK 343

Query:   578 NACG 581
             N CG
Sbjct:   344 NMCG 347

 Score = 394 (143.8 bits), Expect = 1.8e-35, P = 1.8e-35
 Identities = 98/304 (32%), Positives = 150/304 (49%)

Query:   657 TFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHERY--GTSEFSDRSPEEILC 708
             +F  F  + G++Y N EE+K RF  FK++       +KK   Y  G ++F+D + +E   
Sbjct:    58 SFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEF-- 115

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
                    +RT +   A              +  +P+  DWR+  +  P  DQ  CGSCW 
Sbjct:   116 -------QRT-KLGAAQNCSATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWT 167

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYTHQ-AGLESEK 825
             FS  G LE  Y    GK +  S+ QLV+CA   +  GC+G     + EY     GL++EK
Sbjct:   168 FSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEK 227

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDY 885
              YPY   + E  K + +   V++    + +     + +K  +    P+S+    ++IH +
Sbjct:   228 AYPYTGKD-ETCKFSAENVGVQVLNSVN-ITLGAEDELKHAVGLVRPVSIAF--EVIHSF 283

Query:   886 NGTPIRK-NDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGN 942
                      D  C  +P D+ HAVL VGYG +D +PYWL++NSWG    D+G+FK+E G 
Sbjct:   284 RLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGK 343

Query:   943 NACG 946
             N CG
Sbjct:   344 NMCG 347

 Score = 358 (131.1 bits), Expect = 1.3e-31, P = 1.3e-31
 Identities = 76/214 (35%), Positives = 113/214 (52%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             +  +P+  DWR+  +  P  DQ  CGSCW FS  G LE  Y    GK +  S+ QLV+CA
Sbjct:   138 EAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCA 197

Query:    68 KQCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
                +  GC+G     + EY     GL++EK YPY   + E  K + +   V++    + +
Sbjct:   198 GAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD-ETCKFSAENVGVQVLNSVN-I 255

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRK-NDETC--SPYDLGHAVLLVGYG 181
                  + +K  +    P+S+    ++IH +         D  C  +P D+ HAVL VGYG
Sbjct:   256 TLGAEDELKHAVGLVRPVSIAF--EVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYG 313

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACG 215
              +D +PYWL++NSWG    D+G+FK+E G N CG
Sbjct:   314 VEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCG 347

 Score = 179 (68.1 bits), Expect = 2.1e-10, P = 2.1e-10
 Identities = 31/58 (53%), Positives = 40/58 (68%)

Query:   959 VKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACG 1014
             V  D  C  +P D+ HAVL VGYG +D +PYWL++NSWG    D+G+FK+E G N CG
Sbjct:   290 VYTDSHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCG 347


>DICTYBASE|DDB_G0279799
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 286 (105.7 bits), Expect = 5.3e-34, Sum P(3) = 5.3e-34
 Identities = 67/186 (36%), Positives = 99/186 (53%)

Query:     2 LMEVEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKS 61
             ++ VE     P + DWR KN   P  DQ  CGSCW+FS  G  EG +A+KT KLV  S+ 
Sbjct:   114 VLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQ 173

Query:    62 QLVECA--KQCSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF 118
              LV+C+  ++  GCDG     + +Y     G+++E  YPY    G    C ++KS +   
Sbjct:   174 NLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGST--CLFNKSDIGA- 230

Query:   119 TGKDFLHFN-GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 177
             T K +++   GSE   +   ++GP+SV +++        T     +  CSP +L H VL+
Sbjct:   231 TIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLV 290

Query:   178 VGYGKQ 183
             VGYG Q
Sbjct:   291 VGYGVQ 296

 Score = 284 (105.0 bits), Expect = 9.7e-36, Sum P(4) = 9.7e-36
 Identities = 65/176 (36%), Positives = 95/176 (53%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 800
             P + DWR KN   P  DQ  CGSCW+FS  G  EG +A+KT KLV  S+  LV+C+  ++
Sbjct:   124 PKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEE 183

Query:   801 CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN- 858
               GCDG     + +Y     G+++E  YPY    G    C ++KS +   T K +++   
Sbjct:   184 NFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGST--CLFNKSDIGA-TIKGYVNITA 240

Query:   859 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
             GSE   +   ++GP+SV +++        T     +  CSP +L H VL+VGYG Q
Sbjct:   241 GSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGVQ 296

 Score = 279 (103.3 bits), Expect = 3.8e-30, Sum P(3) = 3.8e-30
 Identities = 68/181 (37%), Positives = 95/181 (52%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR KN   P  DQ  CGSCW+FS  G  EG +A+KT KLV  S+  LV+C+    
Sbjct:   124 PKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEE 183

Query:   437 GCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN 493
               G CDG  +    +Y     G+++E  YPY    G    C ++KS +   T K ++   
Sbjct:   184 NFG-CDGGLMNNAFDYIIKNKGIDTESSYPYTAETGST--CLFNKSDIGA-TIKGYVNIT 239

Query:   494 -GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--D 550
              GSE   +   ++GP+SV +++    F   T     +  CSP +L H VL+VGYG Q  D
Sbjct:   240 AGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGVQGKD 299

Query:   551 D 551
             D
Sbjct:   300 D 300

 Score = 78 (32.5 bits), Expect = 9.7e-36, Sum P(4) = 9.7e-36
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query:   987 YWLVRNSWGPIGPDEGFFKI--ERGNNACGIEQIAGY 1021
             YW+V+NSWG     +G+  +  +R NN CGI  ++ Y
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNN-CGIASVSSY 373

 Score = 78 (32.5 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query:   919 YWLVRNSWGPIGPDEGFFKI--ERGNNACGIEQIAGY 953
             YW+V+NSWG     +G+  +  +R NN CGI  ++ Y
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNN-CGIASVSSY 373

 Score = 78 (32.5 bits), Expect = 2.8e-29, Sum P(2) = 2.8e-29
 Identities = 15/37 (40%), Positives = 23/37 (62%)

Query:   188 YWLVRNSWGPIGPDEGFFKI--ERGNNACGIEQIAGY 222
             YW+V+NSWG     +G+  +  +R NN CGI  ++ Y
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNN-CGIASVSSY 373

 Score = 74 (31.1 bits), Expect = 7.4e-29, Sum P(2) = 7.4e-29
 Identities = 14/37 (37%), Positives = 22/37 (59%)

Query:   554 YWLARNSWGPIGPDEGFFKI--ERGNNACGIEQIAGY 588
             YW+ +NSWG     +G+  +  +R NN CGI  ++ Y
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNN-CGIASVSSY 373

 Score = 67 (28.6 bits), Expect = 9.7e-36, Sum P(4) = 9.7e-36
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query:   965 CSPYDLGHAVLLVGYGKQ--DD 984
             CSP +L H VL+VGYG Q  DD
Sbjct:   279 CSPTELDHGVLVVGYGVQGKDD 300

 Score = 37 (18.1 bits), Expect = 9.7e-36, Sum P(4) = 9.7e-36
 Identities = 7/14 (50%), Positives = 11/14 (78%)

Query:   322 KKHERYGTSEFSDR 335
             K + +Y +SEFS+R
Sbjct:    42 KFNRQYSSSEFSNR 55

 Score = 37 (18.1 bits), Expect = 9.7e-36, Sum P(4) = 9.7e-36
 Identities = 7/14 (50%), Positives = 11/14 (78%)

Query:   688 KKHERYGTSEFSDR 701
             K + +Y +SEFS+R
Sbjct:    42 KFNRQYSSSEFSNR 55


>RGD|61810
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 394 (143.8 bits), Expect = 1.8e-35, P = 1.8e-35
 Identities = 83/221 (37%), Positives = 122/221 (55%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E +G VPD+ D+RKK    P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+
Sbjct:   110 EWEGRVPDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVD 169

Query:    66 CAKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDF 123
             C  +  GC G +   + +Y  Q  G++SE  YPY    G+   C Y+ + K     G   
Sbjct:   170 CVSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYRE 226

Query:   124 LHFNGSETMKKILYKYGPLSVLLNSDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
             +     + +K+ + + GP+SV +++ L    +    +   DE C   ++ HAVL+VGYG 
Sbjct:   227 IPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYY-DENCDRDNVNHAVLVVGYGT 285

Query:   183 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             Q    YW+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   286 QKGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLASF 326

 Score = 393 (143.4 bits), Expect = 2.3e-35, P = 2.3e-35
 Identities = 105/326 (32%), Positives = 157/326 (48%)

Query:   646 SLTFDNENILET-FKAFIVKRGRQYAND-EEIKERFEYFKQ----DGHKKHERYG--TSE 697
             S     E  L+T ++ +    G+QY +  +EI  R  + K       H      G  T E
Sbjct:    13 SFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYE 72

Query:   698 FS-----DRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK 751
              +     D + EE++ K TG         R+   R            +G VPD+ D+RKK
Sbjct:    73 LAMNHLGDMTSEEVVQKMTGL--------RVPPSRSFSNDTLYTPEWEGRVPDSIDYRKK 124

Query:   752 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEP 811
                 P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+C  +  GC G +   
Sbjct:   125 GYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTT 184

Query:   812 SIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGSETMKKILYK 869
             + +Y  Q  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + +
Sbjct:   185 AFQYVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVAR 241

Query:   870 YGPLSVLLNSDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGP 928
              GP+SV +++ L    +    +   DE C   ++ HAVL+VGYG Q    YW+++NSWG 
Sbjct:   242 VGPVSVSIDASLTSFQFYSRGVYY-DENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGE 300

Query:   929 IGPDEGFFKIERG-NNACGIEQIAGY 953
                ++G+  + R  NNACGI  +A +
Sbjct:   301 SWGNKGYVLLARNKNNACGITNLASF 326

 Score = 389 (142.0 bits), Expect = 6.3e-35, P = 6.3e-35
 Identities = 107/326 (32%), Positives = 156/326 (47%)

Query:   280 SLTFDNENILET-FKAFIVKRGRQYAND-EEIKERFEYFKQ----DGHKKHERYG--TSE 331
             S     E  L+T ++ +    G+QY +  +EI  R  + K       H      G  T E
Sbjct:    13 SFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYE 72

Query:   332 FS-----DRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK 385
              +     D + EE++ K TG         R+   R            +G VPD+ D+RKK
Sbjct:    73 LAMNHLGDMTSEEVVQKMTGL--------RVPPSRSFSNDTLYTPEWEGRVPDSIDYRKK 124

Query:   386 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLE 445
                 P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+C  +  GCGG   + 
Sbjct:   125 GYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGG-GYMT 183

Query:   446 QPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNGSETMKKILY 503
                +Y  Q  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + 
Sbjct:   184 TAFQYVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVA 240

Query:   504 KYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGP 563
             + GP+SV +++ L  F   +     DE C   ++ HAVL+VGYG Q    YW+ +NSWG 
Sbjct:   241 RVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGE 300

Query:   564 IGPDEGFFKIERG-NNACGIEQIAGY 588
                ++G+  + R  NNACGI  +A +
Sbjct:   301 SWGNKGYVLLARNKNNACGITNLASF 326

 Score = 152 (58.6 bits), Expect = 3.3e-08, Sum P(2) = 3.3e-08
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C   ++ HAVL+VGYG Q    YW+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   266 DENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLAS 325

Query:  1021 Y 1021
             +
Sbjct:   326 F 326

 Score = 51 (23.0 bits), Expect = 3.3e-08, Sum P(2) = 3.3e-08
 Identities = 17/66 (25%), Positives = 32/66 (48%)

Query:   225 IDVVIQRLVLEK--KAIMLIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLT 282
             +D + +RL+ EK  K I +      L      L +  L D  +++VV ++  L +  S +
Sbjct:    41 VDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHLGDMTSEEVVQKMTGLRVPPSRS 100

Query:   283 FDNENI 288
             F N+ +
Sbjct:   101 FSNDTL 106

 Score = 51 (23.0 bits), Expect = 3.3e-08, Sum P(2) = 3.3e-08
 Identities = 17/66 (25%), Positives = 32/66 (48%)

Query:   591 IDVVIQRLVLEK--KAIMLIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLT 648
             +D + +RL+ EK  K I +      L      L +  L D  +++VV ++  L +  S +
Sbjct:    41 VDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHLGDMTSEEVVQKMTGLRVPPSRS 100

Query:   649 FDNENI 654
             F N+ +
Sbjct:   101 FSNDTL 106


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 280 (103.6 bits), Expect = 1.9e-35, Sum P(3) = 1.9e-35
 Identities = 79/264 (29%), Positives = 119/264 (45%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 345
             E FK F ++  R Y + EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV-TPFSDLTEEE 98

Query:   346 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 404
             F      Y R                 +  VP + DWRK  +   P  DQ  C  CWA +
Sbjct:    99 FG-QLYGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMA 157

Query:   405 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPY 464
              AG +E  + I     V+ S  +L++C +   GC G    +  I   + +GL SEKDYP+
Sbjct:   158 AAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPF 217

Query:   465 RNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLIHFYNGT 523
             + G     +C + K   K+   +DF+    +E  + + L  YGP++V +N   +  Y   
Sbjct:   218 Q-GKVRAHRC-HPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKG 275

Query:   524 PIRKNDETCSPYDLGHAVLLVGYG 547
              I+    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 279 (103.3 bits), Expect = 2.5e-35, Sum P(3) = 2.5e-35
 Identities = 79/264 (29%), Positives = 120/264 (45%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 711
             E FK F ++  R Y + EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV-TPFSDLTEEE 98

Query:   712 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 770
             F      Y R                 +  VP + DWRK  +   P  DQ  C  CWA +
Sbjct:    99 FG-QLYGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMA 157

Query:   771 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPY 829
              AG +E  + I     V+ S  +L++C +   GC G F ++  I   + +GL SEKDYP+
Sbjct:   158 AAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPF 217

Query:   830 KNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGT 888
             +       +C + K   K+   +DF+    +E  + + L  YGP++V +N   +  Y   
Sbjct:   218 QG-KVRAHRC-HPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKG 275

Query:   889 PIRKNDETCSPYDLGHAVLLVGYG 912
              I+    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 255 (94.8 bits), Expect = 3.4e-26, Sum P(2) = 3.4e-26
 Identities = 59/179 (32%), Positives = 93/179 (51%)

Query:     6 EKDGPVPDAWDWRK-KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             E +  VP + DWRK  +   P  DQ +C  CWA + AG +E  + I     V+ S  +L+
Sbjct:   123 EPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELL 182

Query:    65 ECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 123
             +C +   GC G F ++  I   + +GL SEKDYP++       +C + K   K+   +DF
Sbjct:   183 DCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQG-KVRAHRC-HPKKYQKVAWIQDF 240

Query:   124 LHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +    +E  + + L  YGP++V +N   +  Y    I+    TC P  + H+VLLVG+G
Sbjct:   241 IMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 110 (43.8 bits), Expect = 5.4e-32, Sum P(2) = 5.4e-32
 Identities = 19/52 (36%), Positives = 32/52 (61%)

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV--VKNDETCSP 967
             PYW+++NSWG    ++G+F++ RG+N CGI +    A +    +K   +C P
Sbjct:   325 PYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTARVQKPDMKPRVSCPP 376

 Score = 108 (43.1 bits), Expect = 1.9e-35, Sum P(3) = 1.9e-35
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query:   986 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             PYW+++NSWG    ++G+F++ RG+N CGI
Sbjct:   325 PYWILKNSWGAQWGEKGYFRLHRGSNTCGI 354

 Score = 108 (43.1 bits), Expect = 3.4e-26, Sum P(2) = 3.4e-26
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             PYW+++NSWG    ++G+F++ RG+N CGI
Sbjct:   325 PYWILKNSWGAQWGEKGYFRLHRGSNTCGI 354

 Score = 106 (42.4 bits), Expect = 1.4e-31, Sum P(2) = 1.4e-31
 Identities = 15/30 (50%), Positives = 23/30 (76%)

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNACGI 582
             PYW+ +NSWG    ++G+F++ RG+N CGI
Sbjct:   325 PYWILKNSWGAQWGEKGYFRLHRGSNTCGI 354

 Score = 57 (25.1 bits), Expect = 1.9e-35, Sum P(3) = 1.9e-35
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query:   958 VVK-NDETCSPYDLGHAVLLVGYG 980
             V+K    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 393 (143.4 bits), Expect = 2.3e-35, P = 2.3e-35
 Identities = 118/359 (32%), Positives = 173/359 (48%)

Query:   268 VVARVDTLAIEGSLTFDNE----NILET-----FKAFIVKRGRQYANDEEIKERFEYFKQ 318
             VVA V+ L I   +T DN     N+L T     F+ F+   G+ Y+  EE   R   F +
Sbjct:    19 VVASVEDLTIR-QVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAK 77

Query:   319 DGHK--KHER------YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXX-X 369
             +  K  +H+       +G ++FSD + EE      FK        +   R          
Sbjct:    78 NVLKAAEHQMMDPSAVHGVTQFSDLTEEE------FKRMYTGVADVGGSRGGTVGAEAPM 131

Query:   370 XXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
                DG +P+ +DWR+K       +Q ACGSCWAFS  G  EG + + TGKL+  S+ QLV
Sbjct:   132 VEVDG-LPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLV 190

Query:   430 ECAKQCS-----GC-GGCDG--LEQPIEYTHQAG-LESEKDYPYRNGNGEKFKCAYDKSK 480
             +C + C       C  GC G  +    EY  +AG LE E+ YPY    G++  C +D  K
Sbjct:   191 DCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPY---TGKRGHCKFDPEK 247

Query:   481 VKLFTGKDFLYFNGSET-MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGH 539
             V +    +F      E  +   L ++GPL+VGLN+  +  Y G         CS  ++ H
Sbjct:   248 VAVRV-LNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV--SCPLICSKRNVNH 304

Query:   540 AVLLVGYGKQ-------DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ-IAGYAT 590
              VLLVGYG +        + PYW+ +NSWG    + G++K+ RG++ CGI   ++  AT
Sbjct:   305 GVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVAT 363

 Score = 392 (143.0 bits), Expect = 3.0e-35, P = 3.0e-35
 Identities = 117/359 (32%), Positives = 173/359 (48%)

Query:   634 VVARVDTLAIEGSLTFDNE----NILET-----FKAFIVKRGRQYANDEEIKERFEYFKQ 684
             VVA V+ L I   +T DN     N+L T     F+ F+   G+ Y+  EE   R   F +
Sbjct:    19 VVASVEDLTIR-QVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAK 77

Query:   685 DGHK--KHER------YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXX-X 735
             +  K  +H+       +G ++FSD + EE      FK        +   R          
Sbjct:    78 NVLKAAEHQMMDPSAVHGVTQFSDLTEEE------FKRMYTGVADVGGSRGGTVGAEAPM 131

Query:   736 XXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 795
                DG +P+ +DWR+K       +Q ACGSCWAFS  G  EG + + TGKL+  S+ QLV
Sbjct:   132 VEVDG-LPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLV 190

Query:   796 ECAKQC---------SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSK 845
             +C + C         +GC G     + EY  +AG LE E+ YPY    G++  C +D  K
Sbjct:   191 DCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPY---TGKRGHCKFDPEK 247

Query:   846 VKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGH 904
             V +    +F      E  +   L ++GPL+V LN+  +  Y G         CS  ++ H
Sbjct:   248 VAVRV-LNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV--SCPLICSKRNVNH 304

Query:   905 AVLLVGYGKQ-------DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ-IAGYAT 955
              VLLVGYG +        N PYW+++NSWG    + G++K+ RG++ CGI   ++  AT
Sbjct:   305 GVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVAT 363

 Score = 361 (132.1 bits), Expect = 6.3e-32, P = 6.3e-32
 Identities = 87/239 (36%), Positives = 127/239 (53%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             VE DG +P+ +DWR+K       +Q  CGSCWAFS  G  EG + + TGKL+  S+ QLV
Sbjct:   132 VEVDG-LPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLV 190

Query:    65 ECAKQC---------SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSK 114
             +C + C         +GC G     + EY  +AG LE E+ YPY    G++  C +D  K
Sbjct:   191 DCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPY---TGKRGHCKFDPEK 247

Query:   115 VKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGH 173
             V +    +F      E  +   L ++GPL+V LN+  +  Y G         CS  ++ H
Sbjct:   248 VAVRV-LNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV--SCPLICSKRNVNH 304

Query:   174 AVLLVGYGKQ-------DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ-IAGYAT 224
              VLLVGYG +        N PYW+++NSWG    + G++K+ RG++ CGI   ++  AT
Sbjct:   305 GVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVSAVAT 363

 Score = 140 (54.3 bits), Expect = 4.6e-06, P = 4.6e-06
 Identities = 31/94 (32%), Positives = 49/94 (52%)

Query:   938 IERGNNACGIEQIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQ-------DDIPYWLV 990
             +  G  A G+  +     I  V     CS  ++ H VLLVGYG +        + PYW++
Sbjct:   270 VRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWII 329

Query:   991 RNSWGPIGPDEGFFKIERGNNACGIEQ-IAGYAT 1023
             +NSWG    + G++K+ RG++ CGI   ++  AT
Sbjct:   330 KNSWGKKWGENGYYKLCRGHDICGINSMVSAVAT 363


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 392 (143.0 bits), Expect = 3.0e-35, P = 3.0e-35
 Identities = 104/312 (33%), Positives = 154/312 (49%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGH--KKHE-RYGTSEFSDRSPEEILCK 709
             FK+++ +  + Y++  E   R + F     K   H  + H  +   ++FSD S  EI  K
Sbjct:    33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEI--K 89

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 768
               F WSE   +   A +             GP P + DWRKK NV  P  +Q ACGSCW 
Sbjct:    90 HKFLWSEP--QNCSATKSNYLRGT------GPYPSSMDWRKKGNVVSPVKNQGACGSCWT 141

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEK 825
             FS  G LE   AI +GK++  ++ QLV+CA+  +  GC G     + EY  +  G+  E 
Sbjct:   142 FSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEED 201

Query:   826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNSDL 881
              YPY    G+   C ++  K   F  K+ ++   N    M + +  Y P+S    +  D 
Sbjct:   202 SYPYI---GKDSSCRFNPQKAVAFV-KNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDF 257

Query:   882 IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG 941
             +   +G    K+    +P  + HAVL VGYG+Q+ + YW+V+NSWG    + G+F IERG
Sbjct:   258 LMYKSGVYSSKSCHK-TPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERG 316

Query:   942 NNACGIEQIAGY 953
              N CG+   A Y
Sbjct:   317 KNMCGLAACASY 328

 Score = 385 (140.6 bits), Expect = 1.7e-34, P = 1.7e-34
 Identities = 104/313 (33%), Positives = 152/313 (48%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYF-----KQDGH--KKHE-RYGTSEFSDRSPEEILCK 343
             FK+++ +  + Y++  E   R + F     K   H  + H  +   ++FSD S  EI  K
Sbjct:    33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEI--K 89

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVTGPAGDQAACGSCWA 402
               F WSE   +   A +             GP P + DWRKK NV  P  +Q ACGSCW 
Sbjct:    90 HKFLWSEP--QNCSATKSNYLRGT------GPYPSSMDWRKKGNVVSPVKNQGACGSCWT 141

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLES 458
             FS  G LE   AI +GK++  ++ QLV+CA+  +  GC G  GL  Q  EY  +  G+  
Sbjct:   142 FSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKG--GLPSQAFEYILYNKGIME 199

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF-LYFNGSETMKKILYKYGPLSVG--LNSH 515
             E  YPY    G+   C ++  K   F      +  N    M + +  Y P+S    +   
Sbjct:   200 EDSYPYI---GKDSSCRFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED 256

Query:   516 LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIER 575
              + + +G    K+    +P  + HAVL VGYG+Q+ + YW+ +NSWG    + G+F IER
Sbjct:   257 FLMYKSGVYSSKSCHK-TPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIER 315

Query:   576 GNNACGIEQIAGY 588
             G N CG+   A Y
Sbjct:   316 GKNMCGLAACASY 328

 Score = 371 (135.7 bits), Expect = 5.4e-33, P = 5.4e-33
 Identities = 82/222 (36%), Positives = 118/222 (53%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK NV  P  +Q  CGSCW FS  G LE   AI +GK++  ++ QLV+CA
Sbjct:   112 GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCA 171

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +  +  GC G     + EY  +  G+  E  YPY    G+   C ++  K   F  K+ +
Sbjct:   172 QAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYI---GKDSSCRFNPQKAVAFV-KNVV 227

Query:   125 HF--NGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
             +   N    M + +  Y P+S    +  D +   +G    K+    +P  + HAVL VGY
Sbjct:   228 NITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHK-TPDKVNHAVLAVGY 286

Query:   181 GKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             G+Q+ + YW+V+NSWG    + G+F IERG N CG+   A Y
Sbjct:   287 GEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACASY 328

 Score = 165 (63.1 bits), Expect = 6.1e-09, P = 6.1e-09
 Identities = 30/65 (46%), Positives = 43/65 (66%)

Query:   959 VKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             V + ++C  +P  + HAVL VGYG+Q+ + YW+V+NSWG    + G+F IERG N CG+ 
Sbjct:   264 VYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLA 323

Query:  1017 QIAGY 1021
               A Y
Sbjct:   324 ACASY 328


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 392 (143.0 bits), Expect = 3.0e-35, P = 3.0e-35
 Identities = 107/319 (33%), Positives = 153/319 (47%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQD-----GHKKHE---RYGTSEFSDRSPEEILCK 709
             F  F  K G+ Y + EE   RF  FK +      H+K +   R+G ++FSD +  E   K
Sbjct:    48 FTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRK 107

Query:   710 -TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
               G K   +    +  D                +P+ +DWR +    P  +Q +CGSCW+
Sbjct:   108 HLGVKGGFK----LPKDANQAPILPTQN-----LPEEFDWRDRGAVTPVKNQGSCGSCWS 158

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC---------SGCDGCFFEPSIEYTHQA 819
             FS  G LEG + + TGKLV  S+ QLV+C  +C         SGC+G     + EYT + 
Sbjct:   159 FSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT 218

Query:   820 G-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN 878
             G L  EKDYPY   +G    C  D+SK+        +     + +   L K GPL+V +N
Sbjct:   219 GGLMREKDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAIN 276

Query:   879 SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK----QDNI---PYWLVRNSWGPIGP 931
             +  +  Y G         CS   L H VLLVGYG     Q  +   PYW+++NSWG    
Sbjct:   277 AAYMQTYIGGV--SCPYICSRR-LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWG 333

Query:   932 DEGFFKIERGNNACGIEQI 950
             + GF+KI +G N CG++ +
Sbjct:   334 ENGFYKICKGRNICGVDSL 352

 Score = 385 (140.6 bits), Expect = 1.7e-34, P = 1.7e-34
 Identities = 106/319 (33%), Positives = 152/319 (47%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQD-----GHKKHE---RYGTSEFSDRSPEEILCK 343
             F  F  K G+ Y + EE   RF  FK +      H+K +   R+G ++FSD +  E   K
Sbjct:    48 FTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRK 107

Query:   344 -TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
               G K   +    +  D                +P+ +DWR +    P  +Q +CGSCW+
Sbjct:   108 HLGVKGGFK----LPKDANQAPILPTQN-----LPEEFDWRDRGAVTPVKNQGSCGSCWS 158

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-----GC-GGCDG--LEQPIEYTHQA 454
             FS  G LEG + + TGKLV  S+ QLV+C  +C       C  GC+G  +    EYT + 
Sbjct:   159 FSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT 218

Query:   455 G-LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLN 513
             G L  EKDYPY   +G    C  D+SK+        +     + +   L K GPL+V +N
Sbjct:   219 GGLMREKDYPYTGTDGGS--CKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAIN 276

Query:   514 SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-------KQDDIPYWLARNSWGPIGP 566
             +  +  Y G         CS   L H VLLVGYG       +  + PYW+ +NSWG    
Sbjct:   277 AAYMQTYIGGV--SCPYICSRR-LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWG 333

Query:   567 DEGFFKIERGNNACGIEQI 585
             + GF+KI +G N CG++ +
Sbjct:   334 ENGFYKICKGRNICGVDSL 352

 Score = 374 (136.7 bits), Expect = 2.6e-33, P = 2.6e-33
 Identities = 84/226 (37%), Positives = 118/226 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P+ +DWR +    P  +Q  CGSCW+FS  G LEG + + TGKLV  S+ QLV+C  +C
Sbjct:   132 LPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHEC 191

Query:    71 ---------SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTG 120
                      SGC+G     + EYT + G L  EKDYPY   +G    C  D+SK+     
Sbjct:   192 DPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGS--CKLDRSKIVASVS 249

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
                +     + +   L K GPL+V +N+  +  Y G         CS   L H VLLVGY
Sbjct:   250 NFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGV--SCPYICSRR-LNHGVLLVGY 306

Query:   181 GK----QDNI---PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             G     Q  +   PYW+++NSWG    + GF+KI +G N CG++ +
Sbjct:   307 GSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 352

 Score = 133 (51.9 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query:   970 LGHAVLLVGYG-------KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             L H VLLVGYG       +  + PYW+++NSWG    + GF+KI +G N CG++ +
Sbjct:   297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSL 352


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 389 (142.0 bits), Expect = 6.3e-35, P = 6.3e-35
 Identities = 106/322 (32%), Positives = 155/322 (48%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHER------YGTSEFSDRSPEEILCK 709
             F  F  K G+ YA++EE   RF  FK +  +  +H++      +G ++FSD +  E   K
Sbjct:    51 FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKK 110

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAF 769
                    R+  ++  D                +P+ +DWR      P  +Q +CGSCW+F
Sbjct:   111 ---HLGVRSGFKLPKDANKAPILPTEN-----LPEDFDWRDHGAVTPVKNQGSCGSCWSF 162

Query:   770 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC---------SGCDGCFFEPSIEYTHQAG 820
             S  G LEG   + TGKLV  S+ QLV+C  +C         SGC+G     + EYT + G
Sbjct:   163 SATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTG 222

Query:   821 -LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 879
              L  E+DYPY   +G+   C  DKSK+        +     E +   L K GPL+V +N+
Sbjct:   223 GLMKEEDYPYTGKDGKT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINA 280

Query:   880 DLIHDYNGTPIRKNDETCSPY----DLGHAVLLVGYG-------KQDNIPYWLVRNSWGP 928
               +  Y G        +C PY     L H VLLVGYG       +    PYW+++NSWG 
Sbjct:   281 GYMQTYIG------GVSC-PYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGE 333

Query:   929 IGPDEGFFKIERGNNACGIEQI 950
                + GF+KI +G N CG++ +
Sbjct:   334 TWGENGFYKICKGRNICGVDSM 355

 Score = 384 (140.2 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 106/322 (32%), Positives = 155/322 (48%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHER------YGTSEFSDRSPEEILCK 343
             F  F  K G+ YA++EE   RF  FK +  +  +H++      +G ++FSD +  E   K
Sbjct:    51 FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKK 110

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAF 403
                    R+  ++  D                +P+ +DWR      P  +Q +CGSCW+F
Sbjct:   111 ---HLGVRSGFKLPKDANKAPILPTEN-----LPEDFDWRDHGAVTPVKNQGSCGSCWSF 162

Query:   404 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-----GC-GGCDG--LEQPIEYTHQAG 455
             S  G LEG   + TGKLV  S+ QLV+C  +C       C  GC+G  +    EYT + G
Sbjct:   163 SATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTG 222

Query:   456 -LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS 514
              L  E+DYPY   +G+   C  DKSK+        +     E +   L K GPL+V +N+
Sbjct:   223 GLMKEEDYPYTGKDGKT--CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINA 280

Query:   515 HLIHFYNGTPIRKNDETCSPY----DLGHAVLLVGYG-------KQDDIPYWLARNSWGP 563
               +  Y G        +C PY     L H VLLVGYG       +  + PYW+ +NSWG 
Sbjct:   281 GYMQTYIG------GVSC-PYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGE 333

Query:   564 IGPDEGFFKIERGNNACGIEQI 585
                + GF+KI +G N CG++ +
Sbjct:   334 TWGENGFYKICKGRNICGVDSM 355

 Score = 365 (133.5 bits), Expect = 2.4e-32, P = 2.4e-32
 Identities = 85/230 (36%), Positives = 118/230 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P+ +DWR      P  +Q  CGSCW+FS  G LEG   + TGKLV  S+ QLV+C  +C
Sbjct:   135 LPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHEC 194

Query:    71 ---------SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTG 120
                      SGC+G     + EYT + G L  E+DYPY   +G+   C  DKSK+     
Sbjct:   195 DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKT--CKLDKSKIVASVS 252

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPY----DLGHAVL 176
                +     E +   L K GPL+V +N+  +  Y G        +C PY     L H VL
Sbjct:   253 NFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIG------GVSC-PYICTRRLNHGVL 305

Query:   177 LVGYG-------KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             LVGYG       +    PYW+++NSWG    + GF+KI +G N CG++ +
Sbjct:   306 LVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSM 355

 Score = 132 (51.5 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 24/56 (42%), Positives = 35/56 (62%)

Query:   970 LGHAVLLVGYG-------KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             L H VLLVGYG       +  + PYW+++NSWG    + GF+KI +G N CG++ +
Sbjct:   300 LNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICGVDSM 355


>MGI|MGI:107823
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 388 (141.6 bits), Expect = 8.1e-35, P = 8.1e-35
 Identities = 82/221 (37%), Positives = 122/221 (55%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E +G VPD+ D+RKK    P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+
Sbjct:   110 EWEGRVPDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVD 169

Query:    66 CAKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDF 123
             C  +  GC G +   + +Y  Q  G++SE  YPY    G+   C Y+ + K     G   
Sbjct:   170 CVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYRE 226

Query:   124 LHFNGSETMKKILYKYGPLSVLLNSDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
             +     + +K+ + + GP+SV +++ L    +    +   DE C   ++ HAVL+VGYG 
Sbjct:   227 IPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYY-DENCDRDNVNHAVLVVGYGT 285

Query:   183 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             Q    +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   286 QKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMASF 326

 Score = 387 (141.3 bits), Expect = 1.0e-34, P = 1.0e-34
 Identities = 90/263 (34%), Positives = 134/263 (50%)

Query:   696 SEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 754
             +   D + EE++ K TG         RI   R            +G VPD+ D+RKK   
Sbjct:    76 NHLGDMTSEEVVQKMTGL--------RIPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYV 127

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE 814
              P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+C  +  GC G +   + +
Sbjct:   128 TPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQ 187

Query:   815 YTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGSETMKKILYKYGP 872
             Y  Q  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + + GP
Sbjct:   188 YVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGP 244

Query:   873 LSVLLNSDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGP 931
             +SV +++ L    +    +   DE C   ++ HAVL+VGYG Q    +W+++NSWG    
Sbjct:   245 ISVSIDASLASFQFYSRGVYY-DENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWG 303

Query:   932 DEGFFKIERG-NNACGIEQIAGY 953
             ++G+  + R  NNACGI  +A +
Sbjct:   304 NKGYALLARNKNNACGITNMASF 326

 Score = 383 (139.9 bits), Expect = 2.8e-34, P = 2.8e-34
 Identities = 92/263 (34%), Positives = 133/263 (50%)

Query:   330 SEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 388
             +   D + EE++ K TG         RI   R            +G VPD+ D+RKK   
Sbjct:    76 NHLGDMTSEEVVQKMTGL--------RIPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYV 127

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPI 448
              P  +Q  CGSCWAFS AG LEGQ   KTGKL+  S   LV+C  +  GCGG   +    
Sbjct:   128 TPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGG-GYMTTAF 186

Query:   449 EYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNGSETMKKILYKYG 506
             +Y  Q  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + + G
Sbjct:   187 QYVQQNGGIDSEDAYPYV---GQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVG 243

Query:   507 PLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGP 566
             P+SV +++ L  F   +     DE C   ++ HAVL+VGYG Q    +W+ +NSWG    
Sbjct:   244 PISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWG 303

Query:   567 DEGFFKIERG-NNACGIEQIAGY 588
             ++G+  + R  NNACGI  +A +
Sbjct:   304 NKGYALLARNKNNACGITNMASF 326

 Score = 146 (56.5 bits), Expect = 7.9e-07, P = 7.9e-07
 Identities = 26/61 (42%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C   ++ HAVL+VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   266 DENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325

Query:  1021 Y 1021
             +
Sbjct:   326 F 326


>UNIPROTKB|P25326
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 381 (139.2 bits), Expect = 4.5e-34, P = 4.5e-34
 Identities = 86/222 (38%), Positives = 121/222 (54%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD+ DWR+K        Q ACGSCWAFS  G LE Q  +KTGKLV  S   LV+C+   
Sbjct:   115 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAK 174

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLY 491
              G  GC+G  + +  +Y     G++SE  YPY+  +G   KC YD K++    +    L 
Sbjct:   175 YGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG---KCQYDVKNRAATCSRYIELP 231

Query:   492 FNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
             F   E +K+ +   GP+SVG++ SH   F   T +   D +C+  ++ H VL+VGYG  D
Sbjct:   232 FGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYY-DPSCTQ-NVNHGVLVVGYGNLD 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
                YWL +NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   290 GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYPEI 331

 Score = 368 (134.6 bits), Expect = 1.1e-32, P = 1.1e-32
 Identities = 85/221 (38%), Positives = 119/221 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             +PD+ DWR+K        Q ACGSCWAFS  G LE Q  +KTGKLV  S   LV+C  AK
Sbjct:   115 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAK 174

Query:   800 QCS-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 856
               + GC+G F   + +Y     G++SE  YPYK  +G   KC YD K++    +    L 
Sbjct:   175 YGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG---KCQYDVKNRAATCSRYIELP 231

Query:   857 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
             F   E +K+ +   GP+SV +++     +        D +C+  ++ H VL+VGYG  D 
Sbjct:   232 FGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQ-NVNHGVLVVGYGNLDG 290

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
               YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   291 KDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYPEI 331

 Score = 362 (132.5 bits), Expect = 4.9e-32, P = 4.9e-32
 Identities = 84/221 (38%), Positives = 118/221 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             +PD+ DWR+K        Q  CGSCWAFS  G LE Q  +KTGKLV  S   LV+C  AK
Sbjct:   115 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAK 174

Query:    69 QCS-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 125
               + GC+G F   + +Y     G++SE  YPYK  +G   KC YD K++    +    L 
Sbjct:   175 YGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG---KCQYDVKNRAATCSRYIELP 231

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
             F   E +K+ +   GP+SV +++     +        D +C+  ++ H VL+VGYG  D 
Sbjct:   232 FGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQ-NVNHGVLVVGYGNLDG 290

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
               YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   291 KDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYPEI 331

 Score = 140 (54.3 bits), Expect = 3.7e-06, P = 3.7e-06
 Identities = 35/104 (33%), Positives = 55/104 (52%)

Query:   928 PIGPDEGFFKI--ERGNNACGIEQIAGYATIDVVKN----DETCSPYDLGHAVLLVGYGK 981
             P G +E   +    +G  + GI+  A +++  + K     D +C+  ++ H VL+VGYG 
Sbjct:   231 PFGSEEALKEAVANKGPVSVGID--ASHSSFFLYKTGVYYDPSCTQ-NVNHGVLVVGYGN 287

Query:   982 QDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 1024
              D   YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   288 LDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYPEI 331


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 381 (139.2 bits), Expect = 4.5e-34, P = 4.5e-34
 Identities = 100/321 (31%), Positives = 156/321 (48%)

Query:   646 SLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK---KHERYGTSEFS-DR 701
             S +  +++I E F  +  K G+ Y ++EE ++R + FK D H    +H     + +S   
Sbjct:    20 SSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFK-DNHDFVTQHNLITNATYSLSL 78

Query:   702 SPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQA 761
             +    L    FK S R    + A                 VPD+ DWRKK       DQ 
Sbjct:    79 NAFADLTHHEFKAS-RLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCFFEPSIEYT-HQA 819
             +CG+CW+FS  G +EG   I TG L+  S+ +L++C K  + GC+G   + + E+     
Sbjct:   138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query:   820 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYG--PLSV-L 876
             G+++EKDYPY+  +G    C  DK K K+ T   +     ++  K ++      P+SV +
Sbjct:   198 GIDTEKDYPYQERDGT---CKKDKLKQKVVTIDSYAGVKSNDE-KALMEAVAAQPVSVGI 253

Query:   877 LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFF 936
               S+       + I      CS   L HAVL+VGYG Q+ + YW+V+NSWG     +GF 
Sbjct:   254 CGSERAFQLYSSGIFSGP--CST-SLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 310

Query:   937 KIERG----NNACGIEQIAGY 953
              ++R     +  CGI  +A Y
Sbjct:   311 HMQRNTENSDGVCGINMLASY 331

 Score = 371 (135.7 bits), Expect = 5.4e-33, P = 5.4e-33
 Identities = 99/324 (30%), Positives = 155/324 (47%)

Query:   280 SLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK---KHERYGTSEFS-DR 335
             S +  +++I E F  +  K G+ Y ++EE ++R + FK D H    +H     + +S   
Sbjct:    20 SSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFK-DNHDFVTQHNLITNATYSLSL 78

Query:   336 SPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQA 395
             +    L    FK S R    + A                 VPD+ DWRKK       DQ 
Sbjct:    79 NAFADLTHHEFKAS-RLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT-H 452
             +CG+CW+FS  G +EG   I TG L+  S+ +L++C K  +   GC+G  ++   E+   
Sbjct:   138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNA--GCNGGLMDYAFEFVIK 195

Query:   453 QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG--PLSV 510
               G+++EKDYPY+  +G    C  DK K K+ T   +     ++  K ++      P+SV
Sbjct:   196 NHGIDTEKDYPYQERDGT---CKKDKLKQKVVTIDSYAGVKSNDE-KALMEAVAAQPVSV 251

Query:   511 GL--NSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDE 568
             G+  +      Y+          CS   L HAVL+VGYG Q+ + YW+ +NSWG     +
Sbjct:   252 GICGSERAFQLYSSGIF---SGPCST-SLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMD 307

Query:   569 GFFKIERG----NNACGIEQIAGY 588
             GF  ++R     +  CGI  +A Y
Sbjct:   308 GFMHMQRNTENSDGVCGINMLASY 331

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 78/221 (35%), Positives = 117/221 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VPD+ DWRKK       DQ  CG+CW+FS  G +EG   I TG L+  S+ +L++C K  
Sbjct:   118 VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query:    71 S-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
             + GC+G   + + E+     G+++EKDYPY+  +G    C  DK K K+ T   +     
Sbjct:   178 NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGT---CKKDKLKQKVVTIDSYAGVKS 234

Query:   129 SETMKKILYKYG--PLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
             ++  K ++      P+SV +  S+       + I      CS   L HAVL+VGYG Q+ 
Sbjct:   235 NDE-KALMEAVAAQPVSVGICGSERAFQLYSSGIFSGP--CST-SLDHAVLIVGYGSQNG 290

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             + YW+V+NSWG     +GF  ++R     +  CGI  +A Y
Sbjct:   291 VDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASY 331

 Score = 140 (54.3 bits), Expect = 6.5e-06, P = 6.5e-06
 Identities = 27/61 (44%), Positives = 37/61 (60%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAG 1020
             CS   L HAVL+VGYG Q+ + YW+V+NSWG     +GF  ++R     +  CGI  +A 
Sbjct:   272 CST-SLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLAS 330

Query:  1021 Y 1021
             Y
Sbjct:   331 Y 331


>UNIPROTKB|P43235
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 380 (138.8 bits), Expect = 5.8e-34, P = 5.8e-34
 Identities = 80/220 (36%), Positives = 121/220 (55%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+
Sbjct:   110 EWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 169

Query:    66 CAKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDF 123
             C  +  GC G +   + +Y  +  G++SE  YPY    G++  C Y+ + K     G   
Sbjct:   170 CVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQEESCMYNPTGKAAKCRGYRE 226

Query:   124 LHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
             +     + +K+ + + GP+SV +++ L      +     DE+C+  +L HAVL VGYG Q
Sbjct:   227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ 286

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
                 +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 326

 Score = 377 (137.8 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 79/218 (36%), Positives = 120/218 (55%)

Query:   739 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 798
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   112 EGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 171

Query:   799 KQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLH 856
              +  GC G +   + +Y  +  G++SE  YPY    G++  C Y+ + K     G   + 
Sbjct:   172 SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQEESCMYNPTGKAAKCRGYREIP 228

Query:   857 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
                 + +K+ + + GP+SV +++ L      +     DE+C+  +L HAVL VGYG Q  
Sbjct:   229 EGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKG 288

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
               +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   289 NKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 326

 Score = 375 (137.1 bits), Expect = 2.0e-33, P = 2.0e-33
 Identities = 81/219 (36%), Positives = 120/219 (54%)

Query:   373 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 432
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   112 EGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 171

Query:   433 KQCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFL 490
              +  GCGG   +    +Y  +  G++SE  YPY    G++  C Y+ + K     G   +
Sbjct:   172 SENDGCGG-GYMTNAFQYVQKNRGIDSEDAYPYV---GQEESCMYNPTGKAAKCRGYREI 227

Query:   491 YFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
                  + +K+ + + GP+SV +++ L  F   +     DE+C+  +L HAVL VGYG Q 
Sbjct:   228 PEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK 287

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
                +W+ +NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   288 GNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 326

 Score = 148 (57.2 bits), Expect = 4.7e-07, P = 4.7e-07
 Identities = 27/61 (44%), Positives = 40/61 (65%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE+C+  +L HAVL VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   266 DESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325

Query:  1021 Y 1021
             +
Sbjct:   326 F 326


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 379 (138.5 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 93/272 (34%), Positives = 139/272 (51%)

Query:   328 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 387
             G ++  D + EEILC+ G     R   + V  R               +PD  DWR+K  
Sbjct:    84 GMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRSYSNRT---------LPDTVDWREKGC 134

Query:   388 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCGGCDG--L 444
                   Q +CG+CWAFS  G LEGQ  +KTGKL+  S   LV+C+ +   G  GC G  +
Sbjct:   135 VTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYM 194

Query:   445 EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY--FNGSETMKKI 501
              +  +Y     G+E++  YPY+    EK  C Y+ SK +  T   ++   F   + +K+ 
Sbjct:   195 TEAFQYIIDNGGIEADASYPYK-ATDEK--CHYN-SKNRAATCSRYIQLPFGDEDALKEA 250

Query:   502 LYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNS 560
             +   GP+SVG++ SH   F+  + +  +D +C+  ++ H VL+VGYG  D   YWL +NS
Sbjct:   251 VATKGPVSVGIDASHSSFFFYKSGVY-DDPSCTG-NVNHGVLVVGYGTLDGKDYWLVKNS 308

Query:   561 WGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             WG    D+G+ ++ R N N CGI     Y  I
Sbjct:   309 WGLNFGDQGYIRMARNNKNHCGIASYCSYPEI 340

 Score = 368 (134.6 bits), Expect = 1.1e-32, P = 1.1e-32
 Identities = 91/271 (33%), Positives = 135/271 (49%)

Query:   694 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 753
             G ++  D + EEILC+ G     R   + V  R               +PD  DWR+K  
Sbjct:    84 GMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRSYSNRT---------LPDTVDWREKGC 134

Query:   754 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ----CSGCDGCFF 809
                   Q +CG+CWAFS  G LEGQ  +KTGKL+  S   LV+C+ +      GC G + 
Sbjct:   135 VTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYM 194

Query:   810 EPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH--FNGSETMKKI 866
               + +Y     G+E++  YPYK A  EK  C Y+ SK +  T   ++   F   + +K+ 
Sbjct:   195 TEAFQYIIDNGGIEADASYPYK-ATDEK--CHYN-SKNRAATCSRYIQLPFGDEDALKEA 250

Query:   867 LYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 926
             +   GP+SV +++     +       +D +C+  ++ H VL+VGYG  D   YWLV+NSW
Sbjct:   251 VATKGPVSVGIDASHSSFFFYKSGVYDDPSCTG-NVNHGVLVVGYGTLDGKDYWLVKNSW 309

Query:   927 GPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
             G    D+G+ ++ R N N CGI     Y  I
Sbjct:   310 GLNFGDQGYIRMARNNKNHCGIASYCSYPEI 340

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ- 69
             +PD  DWR+K        Q  CG+CWAFS  G LEGQ  +KTGKL+  S   LV+C+ + 
Sbjct:   123 LPDTVDWREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEE 182

Query:    70 ---CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 125
                  GC G +   + +Y     G+E++  YPYK A  EK  C Y+ SK +  T   ++ 
Sbjct:   183 KYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYK-ATDEK--CHYN-SKNRAATCSRYIQ 238

Query:   126 --FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
               F   + +K+ +   GP+SV +++     +       +D +C+  ++ H VL+VGYG  
Sbjct:   239 LPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTG-NVNHGVLVVGYGTL 297

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
             D   YWLV+NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   298 DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCSYPEI 340

 Score = 143 (55.4 bits), Expect = 1.8e-06, P = 1.8e-06
 Identities = 29/67 (43%), Positives = 40/67 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQ 1017
             V +D +C+  ++ H VL+VGYG  D   YWLV+NSWG    D+G+ ++ R N N CGI  
Sbjct:   275 VYDDPSCTG-NVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIAS 333

Query:  1018 IAGYATI 1024
                Y  I
Sbjct:   334 YCSYPEI 340


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 378 (138.1 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 88/262 (33%), Positives = 131/262 (50%)

Query:   696 SEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 754
             +   D + EE++ K TG K        + A R            +G  PD+ D+RKK   
Sbjct:    76 NHLGDMTSEEVVQKMTGLK--------VPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYV 127

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE 814
              P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  GC G +   + +
Sbjct:   128 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQ 187

Query:   815 YTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGSETMKKILYKYGP 872
             Y  +  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + + GP
Sbjct:   188 YVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGP 244

Query:   873 LSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
             +SV +++ L            DE C+  +L HAVL VGYG Q    +W+++NSWG    +
Sbjct:   245 ISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGN 304

Query:   933 EGFFKIERG-NNACGIEQIAGY 953
             +G+  + R  NNACGI  +A +
Sbjct:   305 KGYILMARNKNNACGIANLASF 326

 Score = 376 (137.4 bits), Expect = 1.6e-33, P = 1.6e-33
 Identities = 90/263 (34%), Positives = 131/263 (49%)

Query:   330 SEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 388
             +   D + EE++ K TG K        + A R            +G  PD+ D+RKK   
Sbjct:    76 NHLGDMTSEEVVQKMTGLK--------VPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYV 127

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPI 448
              P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  GCGG   +    
Sbjct:   128 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGG-GYMTNAF 186

Query:   449 EYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNGSETMKKILYKYG 506
             +Y  +  G++SE  YPY    G+   C Y+ + K     G   +     + +K+ + + G
Sbjct:   187 QYVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVG 243

Query:   507 PLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGP 566
             P+SV +++ L  F         DE C+  +L HAVL VGYG Q    +W+ +NSWG    
Sbjct:   244 PISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG 303

Query:   567 DEGFFKIERG-NNACGIEQIAGY 588
             ++G+  + R  NNACGI  +A +
Sbjct:   304 NKGYILMARNKNNACGIANLASF 326

 Score = 374 (136.7 bits), Expect = 2.6e-33, P = 2.6e-33
 Identities = 79/218 (36%), Positives = 117/218 (53%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   112 EGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 171

Query:    68 KQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLH 125
              +  GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   + 
Sbjct:   172 SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREIP 228

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
                 + +K+ + + GP+SV +++ L            DE C+  +L HAVL VGYG Q  
Sbjct:   229 EGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKG 288

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
               +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   289 NKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 326

 Score = 147 (56.8 bits), Expect = 6.1e-07, P = 6.1e-07
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C+  +L HAVL VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   266 DENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325

Query:  1021 Y 1021
             +
Sbjct:   326 F 326


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 377 (137.8 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 94/271 (34%), Positives = 137/271 (50%)

Query:   328 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 387
             G +   D +PEE++   G+  S R        R            +  +PD+ DWR+K  
Sbjct:    74 GMNHMGDMTPEEVI---GYMGSLRI------PRPWNRSGTLKSSSNQTLPDSVDWREKGC 124

Query:   388 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCGGCDG--L 444
                   Q +CGSCWAFS  G LEGQ  +KTGKLV  S   LV+C+ ++  G  GC G  +
Sbjct:   125 VTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFM 184

Query:   445 EQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYFNGSETMKKILY 503
              +  +Y     ++SE  YPY+    EK  C YD K++    +    L F   E +K+ + 
Sbjct:   185 TEAFQYIIDTSIDSEASYPYK-AMDEK--CLYDPKNRAATCSRYIELPFGDEEALKEAVA 241

Query:   504 KYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSW 561
               GP+SVG++  SH   F   + +  +D +C+  ++ H VL+VGYG  D   YWL +NSW
Sbjct:   242 TKGPVSVGIDDASHSSFFLYQSGVY-DDPSCTE-NMNHGVLVVGYGTLDGKDYWLVKNSW 299

Query:   562 GPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             G    D+G+ ++ R N N CGI     Y  I
Sbjct:   300 GLHFGDQGYIRMARNNKNHCGIASYCSYPEI 330

 Score = 374 (136.7 bits), Expect = 2.6e-33, P = 2.6e-33
 Identities = 96/273 (35%), Positives = 134/273 (49%)

Query:   694 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 753
             G +   D +PEE++   G+  S R        R            +  +PD+ DWR+K  
Sbjct:    74 GMNHMGDMTPEEVI---GYMGSLRI------PRPWNRSGTLKSSSNQTLPDSVDWREKGC 124

Query:   754 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ----CSGCDGCFF 809
                   Q +CGSCWAFS  G LEGQ  +KTGKLV  S   LV+C+ +      GC G F 
Sbjct:   125 VTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFM 184

Query:   810 EPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGSETMKKILY 868
               + +Y     ++SE  YPYK A  EK  C YD K++    +    L F   E +K+ + 
Sbjct:   185 TEAFQYIIDTSIDSEASYPYK-AMDEK--CLYDPKNRAATCSRYIELPFGDEEALKEAVA 241

Query:   869 KYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRN 924
               GP+SV ++ D  H     Y       +D +C+  ++ H VL+VGYG  D   YWLV+N
Sbjct:   242 TKGPVSVGID-DASHSSFFLYQSGVY--DDPSCTE-NMNHGVLVVGYGTLDGKDYWLVKN 297

Query:   925 SWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
             SWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   298 SWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 330

 Score = 367 (134.2 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 87/225 (38%), Positives = 118/225 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ- 69
             +PD+ DWR+K        Q  CGSCWAFS  G LEGQ  +KTGKLV  S   LV+C+ + 
Sbjct:   113 LPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEE 172

Query:    70 ---CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 125
                  GC G F   + +Y     ++SE  YPYK A  EK  C YD K++    +    L 
Sbjct:   173 KYGNKGCGGGFMTEAFQYIIDTSIDSEASYPYK-AMDEK--CLYDPKNRAATCSRYIELP 229

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             F   E +K+ +   GP+SV ++ D  H     Y       +D +C+  ++ H VL+VGYG
Sbjct:   230 FGDEEALKEAVATKGPVSVGID-DASHSSFFLYQSGVY--DDPSCTE-NMNHGVLVVGYG 285

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
               D   YWLV+NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   286 TLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 330

 Score = 146 (56.5 bits), Expect = 7.9e-07, P = 7.9e-07
 Identities = 36/104 (34%), Positives = 56/104 (53%)

Query:   928 PIGPDEGFFKI--ERGNNACGIEQIAGYATIDV----VKNDETCSPYDLGHAVLLVGYGK 981
             P G +E   +    +G  + GI+  A +++  +    V +D +C+  ++ H VL+VGYG 
Sbjct:   229 PFGDEEALKEAVATKGPVSVGIDD-ASHSSFFLYQSGVYDDPSCTE-NMNHGVLVVGYGT 286

Query:   982 QDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 1024
              D   YWLV+NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   287 LDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 330


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 375 (137.1 bits), Expect = 2.0e-33, P = 2.0e-33
 Identities = 84/217 (38%), Positives = 116/217 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VPD  DWR+        DQ +CGSCWAFS  G +EGQY       + FS+ QLV+C+   
Sbjct:   108 VPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPW 167

Query:    71 --SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSK-VKLFTGKDFLHFN 127
               +GC G   E + +Y  Q GLE+E  YPY    G+   C Y+K   V   TG   +H +
Sbjct:   168 GNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEGQ---CRYNKQLGVAKVTGYYTVH-S 223

Query:   128 GSET-MKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             GSE  +K ++    P +V ++  SD +   +G       +TCSP  + HAVL VGYG Q 
Sbjct:   224 GSEVELKNLVGARRPAAVAVDVESDFMMYRSGI---YQSQTCSPLRVNHAVLAVGYGTQG 280

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIA 220
                YW+V+NSWG    + G+ ++ R   N CGI  +A
Sbjct:   281 GTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIASLA 317

 Score = 372 (136.0 bits), Expect = 4.2e-33, P = 4.2e-33
 Identities = 84/217 (38%), Positives = 115/217 (52%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VPD  DWR+        DQ  CGSCWAFS  G +EGQY       + FS+ QLV+C+   
Sbjct:   108 VPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPW 167

Query:   802 --SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSK-VKLFTGKDFLHFN 858
               +GC G   E + +Y  Q GLE+E  YPY    G+   C Y+K   V   TG   +H +
Sbjct:   168 GNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEGQ---CRYNKQLGVAKVTGYYTVH-S 223

Query:   859 GSET-MKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             GSE  +K ++    P +V ++  SD +   +G       +TCSP  + HAVL VGYG Q 
Sbjct:   224 GSEVELKNLVGARRPAAVAVDVESDFMMYRSGI---YQSQTCSPLRVNHAVLAVGYGTQG 280

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIA 951
                YW+V+NSWG    + G+ ++ R   N CGI  +A
Sbjct:   281 GTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIASLA 317

 Score = 358 (131.1 bits), Expect = 1.3e-31, P = 1.3e-31
 Identities = 82/218 (37%), Positives = 113/218 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VPD  DWR+        DQ  CGSCWAFS  G +EGQY       + FS+ QLV+C+   
Sbjct:   108 VPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPW 167

Query:   436 SGCGGCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSK-VKLFTGKDFLYF 492
              G  GC G  +E   +Y  Q GLE+E  YPY    G+   C Y+K   V   TG  +   
Sbjct:   168 -GNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEGQ---CRYNKQLGVAKVTGY-YTVH 222

Query:   493 NGSET-MKKILYKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
             +GSE  +K ++    P +V ++  S  + + +G       +TCSP  + HAVL VGYG Q
Sbjct:   223 SGSEVELKNLVGARRPAAVAVDVESDFMMYRSGI---YQSQTCSPLRVNHAVLAVGYGTQ 279

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIA 586
                 YW+ +NSWG    + G+ ++ R   N CGI  +A
Sbjct:   280 GGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIASLA 317

 Score = 153 (58.9 bits), Expect = 6.7e-07, Sum P(2) = 6.7e-07
 Identities = 27/62 (43%), Positives = 37/62 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQ 1017
             +   +TCSP  + HAVL VGYG Q    YW+V+NSWG    + G+ ++ R   N CGI  
Sbjct:   256 IYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIAS 315

Query:  1018 IA 1019
             +A
Sbjct:   316 LA 317

 Score = 37 (18.1 bits), Expect = 6.7e-07, Sum P(2) = 6.7e-07
 Identities = 6/17 (35%), Positives = 10/17 (58%)

Query:   209 RGNNACGIEQIAGYATI 225
             R N   G+ ++ GY T+
Sbjct:   205 RYNKQLGVAKVTGYYTV 221

 Score = 37 (18.1 bits), Expect = 6.7e-07, Sum P(2) = 6.7e-07
 Identities = 6/17 (35%), Positives = 10/17 (58%)

Query:   575 RGNNACGIEQIAGYATI 591
             R N   G+ ++ GY T+
Sbjct:   205 RYNKQLGVAKVTGYYTV 221

 Score = 37 (18.1 bits), Expect = 6.7e-07, Sum P(2) = 6.7e-07
 Identities = 6/17 (35%), Positives = 10/17 (58%)

Query:   940 RGNNACGIEQIAGYATI 956
             R N   G+ ++ GY T+
Sbjct:   205 RYNKQLGVAKVTGYYTV 221


>UNIPROTKB|Q9GLE3
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 375 (137.1 bits), Expect = 2.0e-33, P = 2.0e-33
 Identities = 79/218 (36%), Positives = 118/218 (54%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   113 EGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 172

Query:    68 KQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLH 125
              +  GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   + 
Sbjct:   173 SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREIP 229

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
                 + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q  
Sbjct:   230 EGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKG 289

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
               +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   290 KKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 374 (136.7 bits), Expect = 2.6e-33, P = 2.6e-33
 Identities = 79/218 (36%), Positives = 118/218 (54%)

Query:   739 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 798
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   113 EGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 172

Query:   799 KQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLH 856
              +  GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   + 
Sbjct:   173 SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREIP 229

Query:   857 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
                 + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q  
Sbjct:   230 EGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKG 289

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
               +W+++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   290 KKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 372 (136.0 bits), Expect = 4.2e-33, P = 4.2e-33
 Identities = 81/219 (36%), Positives = 118/219 (53%)

Query:   373 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 432
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   113 EGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 172

Query:   433 KQCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFL 490
              +  GCGG   +    +Y  +  G++SE  YPY    G+   C Y+ + K     G   +
Sbjct:   173 SENDGCGG-GYMTNAFQYVQKNRGIDSEDAYPYV---GQDENCMYNPTGKAAKCRGYREI 228

Query:   491 YFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
                  + +K+ + + GP+SV +++ L  F   +     DE C+  +L HAVL VGYG Q 
Sbjct:   229 PEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQK 288

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
                +W+ +NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   289 GKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 147 (56.8 bits), Expect = 6.1e-07, P = 6.1e-07
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C+  +L HAVL VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   267 DENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 326

Query:  1021 Y 1021
             +
Sbjct:   327 F 327


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 365 (133.5 bits), Expect = 2.4e-32, P = 2.4e-32
 Identities = 84/220 (38%), Positives = 118/220 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VPD+ DWR+K        Q ACGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKY 174

Query:   802 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G F   + +Y     G+ S+  YPY+   G + +C+Y  S+      K +    
Sbjct:   175 GNKGCNGGFMSDAFQYVIDNGGIASDSAYPYR---GVQQQCSYSSSQRAANCTKYYFVRQ 231

Query:   859 GSET-MKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
             G E  +K+ +   GP+SV +++     +  ++G     ND TCS   + HAVL+VGYG  
Sbjct:   232 GDENALKQAVASVGPISVAIDATRPQFVLYHSGV---YNDPTCSKR-VNHAVLVVGYGTL 287

Query:   915 DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
                 +WLV+NSWG    D G+ ++ R  NN CGI   A Y
Sbjct:   288 SGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACY 327

 Score = 362 (132.5 bits), Expect = 2.0e-33, Sum P(2) = 2.0e-33
 Identities = 84/221 (38%), Positives = 118/221 (53%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VPD+ DWR+K        Q ACGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKY 174

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  +    +Y     G+ S+  YPYR   G + +C+Y  S+      K +   
Sbjct:   175 -GNKGCNGGFMSDAFQYVIDNGGIASDSAYPYR---GVQQQCSYSSSQRAANCTKYYFVR 230

Query:   493 NGSET-MKKILYKYGPLSVGLNS---HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
              G E  +K+ +   GP+SV +++     + +++G     ND TCS   + HAVL+VGYG 
Sbjct:   231 QGDENALKQAVASVGPISVAIDATRPQFVLYHSGV---YNDPTCSKR-VNHAVLVVGYGT 286

Query:   549 QDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
                  +WL +NSWG    D G+ ++ R  NN CGI   A Y
Sbjct:   287 LSGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACY 327

 Score = 359 (131.4 bits), Expect = 4.3e-33, Sum P(2) = 4.3e-33
 Identities = 83/220 (37%), Positives = 117/220 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VPD+ DWR+K        Q  CGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKY 174

Query:    71 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G F   + +Y     G+ S+  YPY+   G + +C+Y  S+      K +    
Sbjct:   175 GNKGCNGGFMSDAFQYVIDNGGIASDSAYPYR---GVQQQCSYSSSQRAANCTKYYFVRQ 231

Query:   128 GSET-MKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
             G E  +K+ +   GP+SV +++     +  ++G     ND TCS   + HAVL+VGYG  
Sbjct:   232 GDENALKQAVASVGPISVAIDATRPQFVLYHSGV---YNDPTCSKR-VNHAVLVVGYGTL 287

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
                 +WLV+NSWG    D G+ ++ R  NN CGI   A Y
Sbjct:   288 SGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIASYACY 327

 Score = 144 (55.7 bits), Expect = 1.3e-06, P = 1.3e-06
 Identities = 31/64 (48%), Positives = 38/64 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQ 1017
             V ND TCS   + HAVL+VGYG      +WLV+NSWG    D G+ ++ R  NN CGI  
Sbjct:   265 VYNDPTCSKR-VNHAVLVVGYGTLSGQDHWLVKNSWGTRFGDGGYIRMARNKNNMCGIAS 323

Query:  1018 IAGY 1021
              A Y
Sbjct:   324 YACY 327

 Score = 40 (19.1 bits), Expect = 2.0e-33, Sum P(2) = 2.0e-33
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   614 LCGVASCLCLP 624
             +CG+AS  C P
Sbjct:   318 MCGIASYACYP 328

 Score = 40 (19.1 bits), Expect = 4.3e-33, Sum P(2) = 4.3e-33
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   248 LCGVASCLCLP 258
             +CG+AS  C P
Sbjct:   318 MCGIASYACYP 328


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 366 (133.9 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 84/223 (37%), Positives = 114/223 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VPD  DWR K       +Q ACGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDTLDWRDKGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKY 174

Query:   802 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHF 857
                GC+G +   + +Y     G++SE  YPY+   G    C YD S +    T   F+  
Sbjct:   175 GNLGCNGGYMSQAFQYVIDNGGIDSESSYPYQGTQGS---CRYDPSQRAANCTSYKFVSQ 231

Query:   858 NGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
                + +K+ L   GP+SV +++     I   +G     +D +C+   + H VL VGYG  
Sbjct:   232 GDEQALKEALANIGPVSVAIDATRPQFIFYRSGV---YDDPSCTQ-KVNHGVLAVGYGTL 287

Query:   915 DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                 YWLV+NSWG    D G+ +I R  NN CGI   A Y  +
Sbjct:   288 SGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIASEACYPIV 330

 Score = 361 (132.1 bits), Expect = 4.2e-33, Sum P(2) = 4.2e-33
 Identities = 84/224 (37%), Positives = 114/224 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VPD  DWR K       +Q ACGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDTLDWRDKGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKY 174

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLY 491
                G C+G  + Q  +Y     G++SE  YPY+   G    C YD S +    T   F+ 
Sbjct:   175 GNLG-CNGGYMSQAFQYVIDNGGIDSESSYPYQGTQGS---CRYDPSQRAANCTSYKFVS 230

Query:   492 FNGSETMKKILYKYGPLSVGLNS---HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
                 + +K+ L   GP+SV +++     I + +G     +D +C+   + H VL VGYG 
Sbjct:   231 QGDEQALKEALANIGPVSVAIDATRPQFIFYRSGV---YDDPSCTQ-KVNHGVLAVGYGT 286

Query:   549 QDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                  YWL +NSWG    D G+ +I R  NN CGI   A Y  +
Sbjct:   287 LSGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIASEACYPIV 330

 Score = 360 (131.8 bits), Expect = 5.4e-33, Sum P(2) = 5.4e-33
 Identities = 83/223 (37%), Positives = 113/223 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VPD  DWR K       +Q  CGSCWAFS  G LEGQ    TGKLV+ S   LV+C+ + 
Sbjct:   115 VPDTLDWRDKGYVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKY 174

Query:    71 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHF 126
                GC+G +   + +Y     G++SE  YPY+   G    C YD S +    T   F+  
Sbjct:   175 GNLGCNGGYMSQAFQYVIDNGGIDSESSYPYQGTQGS---CRYDPSQRAANCTSYKFVSQ 231

Query:   127 NGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
                + +K+ L   GP+SV +++     I   +G     +D +C+   + H VL VGYG  
Sbjct:   232 GDEQALKEALANIGPVSVAIDATRPQFIFYRSGV---YDDPSCTQ-KVNHGVLAVGYGTL 287

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                 YWLV+NSWG    D G+ +I R  NN CGI   A Y  +
Sbjct:   288 SGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIASEACYPIV 330

 Score = 133 (51.9 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 29/67 (43%), Positives = 37/67 (55%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQ 1017
             V +D +C+   + H VL VGYG      YWLV+NSWG    D G+ +I R  NN CGI  
Sbjct:   265 VYDDPSCTQ-KVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNNMCGIAS 323

Query:  1018 IAGYATI 1024
              A Y  +
Sbjct:   324 EACYPIV 330

 Score = 38 (18.4 bits), Expect = 4.2e-33, Sum P(2) = 4.2e-33
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   614 LCGVASCLCLP 624
             +CG+AS  C P
Sbjct:   318 MCGIASEACYP 328

 Score = 38 (18.4 bits), Expect = 5.4e-33, Sum P(2) = 5.4e-33
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   248 LCGVASCLCLP 258
             +CG+AS  C P
Sbjct:   318 MCGIASEACYP 328


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 371 (135.7 bits), Expect = 5.4e-33, P = 5.4e-33
 Identities = 98/329 (29%), Positives = 153/329 (46%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQD------------GHKKHERYGTSEFSDRSPEE 339
             F  F  K  ++Y+++E + ERFE FK +             HK   ++G ++F+D S +E
Sbjct:    29 FLEFQDKFNKKYSHEEYL-ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87

Query:   340 ILCKTGFK-WSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACG 398
                   FK +     E I  D             +  +P A+DWR +    P  +Q  CG
Sbjct:    88 ------FKNYYLNNKEAIFTDDLPVADYLDDEFINS-IPTAFDWRTRGAVTPVKNQGQCG 140

Query:   399 SCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC------SGCG-GCDGLEQPIEYT 451
             SCW+FS  G +EGQ+ I   KLV  S+  LV+C  +C        C  GC+G  QP  Y 
Sbjct:   141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYN 200

Query:   452 H---QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKILYKYGP 507
             +     G+++E  YPY    G +  C ++ + +      +F     +ET M   +   GP
Sbjct:   201 YIIKNGGIQTESSYPYTAETGTQ--CNFNSANIGAKIS-NFTMIPKNETVMAGYIVSTGP 257

Query:   508 LSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-----PYWLARNSWG 562
             L++  ++    FY G      D  C+P  L H +L+VGY  ++ I     PYW+ +NSWG
Sbjct:   258 LAIAADAVEWQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 314

Query:   563 PIGPDEGFFKIERGNNACGIEQIAGYATI 591
                 ++G+  + RG N CG+      + I
Sbjct:   315 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 343

 Score = 354 (129.7 bits), Expect = 3.5e-31, P = 3.5e-31
 Identities = 97/330 (29%), Positives = 154/330 (46%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQD------------GHKKHERYGTSEFSDRSPEE 705
             F  F  K  ++Y+++E + ERFE FK +             HK   ++G ++F+D S +E
Sbjct:    29 FLEFQDKFNKKYSHEEYL-ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87

Query:   706 ILCKTGFK-WSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACG 764
                   FK +     E I  D             +  +P A+DWR +    P  +Q  CG
Sbjct:    88 ------FKNYYLNNKEAIFTDDLPVADYLDDEFINS-IPTAFDWRTRGAVTPVKNQGQCG 140

Query:   765 SCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC------SGCD-GCF--FEPSIEY 815
             SCW+FS  G +EGQ+ I   KLV  S+  LV+C  +C        CD GC    +P+  Y
Sbjct:   141 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNA-Y 199

Query:   816 TH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYG 871
              +     G+++E  YPY    G +  C ++ + +      +F     +ET M   +   G
Sbjct:   200 NYIIKNGGIQTESSYPYTAETGTQ--CNFNSANIGAKIS-NFTMIPKNETVMAGYIVSTG 256

Query:   872 PLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-----NIPYWLVRNSW 926
             PL++  ++     Y G      D  C+P  L H +L+VGY  ++     N+PYW+V+NSW
Sbjct:   257 PLAIAADAVEWQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 313

Query:   927 GPIGPDEGFFKIERGNNACGIEQIAGYATI 956
             G    ++G+  + RG N CG+      + I
Sbjct:   314 GADWGEQGYIYLRRGKNTCGVSNFVSTSII 343

 Score = 326 (119.8 bits), Expect = 3.5e-28, P = 3.5e-28
 Identities = 74/233 (31%), Positives = 116/233 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P A+DWR +    P  +Q  CGSCW+FS  G +EGQ+ I   KLV  S+  LV+C  +C
Sbjct:   118 IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 177

Query:    71 ------SGCD-GCF--FEPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKLF 118
                     CD GC    +P+  Y +     G+++E  YPY    G +  C ++ + +   
Sbjct:   178 MEYEGEQACDEGCNGGLQPNA-YNYIIKNGGIQTESSYPYTAETGTQ--CNFNSANIGAK 234

Query:   119 TGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 177
                +F     +ET M   +   GPL++  ++     Y G      D  C+P  L H +L+
Sbjct:   235 IS-NFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVF---DIPCNPNSLDHGILI 290

Query:   178 VGYGKQD-----NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
             VGY  ++     N+PYW+V+NSWG    ++G+  + RG N CG+      + I
Sbjct:   291 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343

 Score = 143 (55.4 bits), Expect = 1.9e-06, P = 1.9e-06
 Identities = 25/68 (36%), Positives = 38/68 (55%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDI-----PYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             D  C+P  L H +L+VGY  ++ I     PYW+V+NSWG    ++G+  + RG N CG+ 
Sbjct:   276 DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVS 335

Query:  1017 QIAGYATI 1024
                  + I
Sbjct:   336 NFVSTSII 343


>UNIPROTKB|G1K2A7
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 370 (135.3 bits), Expect = 6.9e-33, P = 6.9e-33
 Identities = 78/214 (36%), Positives = 116/214 (54%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   120 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 179

Query:    72 GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 129
             GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   +     
Sbjct:   180 GCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGNE 236

Query:   130 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 189
             + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q    +W
Sbjct:   237 KALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHW 296

Query:   190 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             +++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   297 IIKNSWGENWGNKGYILMARNKNNACGIANLASF 330

 Score = 369 (135.0 bits), Expect = 8.8e-33, P = 8.8e-33
 Identities = 78/214 (36%), Positives = 116/214 (54%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   120 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 179

Query:   803 GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 860
             GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   +     
Sbjct:   180 GCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGNE 236

Query:   861 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 920
             + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q    +W
Sbjct:   237 KALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHW 296

Query:   921 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
             +++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   297 IIKNSWGENWGNKGYILMARNKNNACGIANLASF 330

 Score = 367 (134.2 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 80/215 (37%), Positives = 116/215 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   120 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 179

Query:   437 GCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNG 494
             GCGG   +    +Y  +  G++SE  YPY    G+   C Y+ + K     G   +    
Sbjct:   180 GCGG-GYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGN 235

Query:   495 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPY 554
              + +K+ + + GP+SV +++ L  F   +     DE C+  +L HAVL VGYG Q    +
Sbjct:   236 EKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKH 295

Query:   555 WLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             W+ +NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   296 WIIKNSWGENWGNKGYILMARNKNNACGIANLASF 330

 Score = 147 (56.8 bits), Expect = 6.2e-07, P = 6.2e-07
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C+  +L HAVL VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   270 DENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 329

Query:  1021 Y 1021
             +
Sbjct:   330 F 330


>UNIPROTKB|Q3ZKN1
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 370 (135.3 bits), Expect = 6.9e-33, P = 6.9e-33
 Identities = 78/214 (36%), Positives = 116/214 (54%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   117 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 176

Query:    72 GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 129
             GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   +     
Sbjct:   177 GCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGNE 233

Query:   130 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 189
             + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q    +W
Sbjct:   234 KALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHW 293

Query:   190 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             +++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   294 IIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 369 (135.0 bits), Expect = 8.8e-33, P = 8.8e-33
 Identities = 78/214 (36%), Positives = 116/214 (54%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   117 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 176

Query:   803 GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 860
             GC G +   + +Y  +  G++SE  YPY    G+   C Y+ + K     G   +     
Sbjct:   177 GCGGGYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGNE 233

Query:   861 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 920
             + +K+ + + GP+SV +++ L      +     DE C+  +L HAVL VGYG Q    +W
Sbjct:   234 KALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHW 293

Query:   921 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
             +++NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   294 IIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 367 (134.2 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 80/215 (37%), Positives = 116/215 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C  +  
Sbjct:   117 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 176

Query:   437 GCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNG 494
             GCGG   +    +Y  +  G++SE  YPY    G+   C Y+ + K     G   +    
Sbjct:   177 GCGG-GYMTNAFQYVQKNRGIDSEDAYPYV---GQDESCMYNPTGKAAKCRGYREIPEGN 232

Query:   495 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPY 554
              + +K+ + + GP+SV +++ L  F   +     DE C+  +L HAVL VGYG Q    +
Sbjct:   233 EKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKH 292

Query:   555 WLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             W+ +NSWG    ++G+  + R  NNACGI  +A +
Sbjct:   293 WIIKNSWGENWGNKGYILMARNKNNACGIANLASF 327

 Score = 147 (56.8 bits), Expect = 6.1e-07, P = 6.1e-07
 Identities = 27/61 (44%), Positives = 39/61 (63%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             DE C+  +L HAVL VGYG Q    +W+++NSWG    ++G+  + R  NNACGI  +A 
Sbjct:   267 DENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 326

Query:  1021 Y 1021
             +
Sbjct:   327 F 327


>UNIPROTKB|F1NEC8
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 368 (134.6 bits), Expect = 1.1e-32, P = 1.1e-32
 Identities = 85/223 (38%), Positives = 119/223 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP-E 60

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF- 492
             G  GC+G  ++Q  +Y     G++SE+ YPY   + E   C Y K++        F+   
Sbjct:    61 GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIP 117

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
              G E  + K +   GP+SV +++ H    FY      + D  CS  DL H VL+VGYG +
Sbjct:   118 QGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFE 175

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             D   YW+ +NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   176 DGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 362 (132.5 bits), Expect = 4.9e-32, P = 4.9e-32
 Identities = 82/222 (36%), Positives = 116/222 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-- 69
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 127
               GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQ 118

Query:   128 GSE-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             G E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG +D
Sbjct:   119 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFED 176

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 361 (132.1 bits), Expect = 6.3e-32, P = 6.3e-32
 Identities = 82/222 (36%), Positives = 116/222 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-- 800
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   801 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 858
               GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQ 118

Query:   859 GSE-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             G E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG +D
Sbjct:   119 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFED 176

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 143 (55.4 bits), Expect = 3.0e-07, P = 3.0e-07
 Identities = 32/102 (31%), Positives = 49/102 (48%)

Query:   928 PIGPDEGFFKIERGNNACGIEQIAGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQD 983
             P G +    K         +   AG+++    ++    +  CS  DL H VL+VGYG +D
Sbjct:   117 PQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFED 176

Query:   984 DIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 1024
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 367 (134.2 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 97/321 (30%), Positives = 152/321 (47%)

Query:   652 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQ----DGHKKHERYGTSEFSDRSPEEIL 707
             E+ +E +  +     ++Y+  EE      + K     + H +  R G   F +     I 
Sbjct:    26 ESAIEKWDDYKEDFDKEYSESEEQTYMEAFVKNMIHIENHNRDHRLGRKTF-EMGLNHIA 84

Query:   708 CKTGFKWSERT-YERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSC 766
                  ++ +   Y R+  D             +  VPD  DWR  ++     +Q  CGSC
Sbjct:    85 DLPFSQYRKLNGYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDTHLVTDVKNQGMCGSC 144

Query:   767 WAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYTHQA-GLES 823
             WAFS  G LEGQ+A K G+LV  S+  LV+C+ +    GC+G   + + EY     G+++
Sbjct:   145 WAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDT 204

Query:   824 EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGSETMKKI-LYKYGPLSVLLNSDL 881
             E+ YPYK   G   KC ++K  V     K ++    G E   KI +   GP+S+ +++  
Sbjct:   205 EESYPYK---GRDMKCHFNKKTVGA-DDKGYVDTPEGDEEQLKIAVATQGPISIAIDAG- 259

Query:   882 IHDYNGTPIRKN----DETCSPYDLGHAVLLVGYGKQ-DNIPYWLVRNSWGPIGPDEGFF 936
                +    + K     DE CS  +L H VLLVGYG   ++  YW+V+NSWG    ++G+ 
Sbjct:   260 ---HRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYI 316

Query:   937 KIERG-NNACGIEQIAGYATI 956
             +I R  NN CG+   A Y  +
Sbjct:   317 RIARNRNNHCGVATKASYPLV 337

 Score = 363 (132.8 bits), Expect = 3.8e-32, P = 3.8e-32
 Identities = 97/318 (30%), Positives = 150/318 (47%)

Query:   286 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQ----DGHKKHERYGTSEFSDRSPEEIL 341
             E+ +E +  +     ++Y+  EE      + K     + H +  R G   F +     I 
Sbjct:    26 ESAIEKWDDYKEDFDKEYSESEEQTYMEAFVKNMIHIENHNRDHRLGRKTF-EMGLNHIA 84

Query:   342 CKTGFKWSERT-YERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSC 400
                  ++ +   Y R+  D             +  VPD  DWR  ++     +Q  CGSC
Sbjct:    85 DLPFSQYRKLNGYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDTHLVTDVKNQGMCGSC 144

Query:   401 WAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYTHQA-GLE 457
             WAFS  G LEGQ+A K G+LV  S+  LV+C+ +  G  GC+G  ++Q  EY     G++
Sbjct:   145 WAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY-GNHGCNGGLMDQAFEYIRDNHGVD 203

Query:   458 SEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF-NGSETMKKI-LYKYGPLSVGLNSH 515
             +E+ YPY+   G   KC ++K  V     K ++    G E   KI +   GP+S+ +++ 
Sbjct:   204 TEESYPYK---GRDMKCHFNKKTVGA-DDKGYVDTPEGDEEQLKIAVATQGPISIAIDAG 259

Query:   516 LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-PYWLARNSWGPIGPDEGFFKIE 574
                F         DE CS  +L H VLLVGYG   +   YW+ +NSWG    ++G+ +I 
Sbjct:   260 HRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIRIA 319

Query:   575 RG-NNACGIEQIAGYATI 591
             R  NN CG+   A Y  +
Sbjct:   320 RNRNNHCGVATKASYPLV 337

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 83/226 (36%), Positives = 122/226 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VPD  DWR  ++     +Q  CGSCWAFS  G LEGQ+A K G+LV  S+  LV+C+ + 
Sbjct:   120 VPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY 179

Query:    71 S--GCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF- 126
                GC+G   + + EY     G+++E+ YPYK   G   KC ++K  V     K ++   
Sbjct:   180 GNHGCNGGLMDQAFEYIRDNHGVDTEESYPYK---GRDMKCHFNKKTVGA-DDKGYVDTP 235

Query:   127 NGSETMKKI-LYKYGPLSVLLNSDLIHDYNGTPIRKN----DETCSPYDLGHAVLLVGYG 181
              G E   KI +   GP+S+ +++     +    + K     DE CS  +L H VLLVGYG
Sbjct:   236 EGDEEQLKIAVATQGPISIAIDAG----HRSFQLYKKGVYYDEECSSEELDHGVLLVGYG 291

Query:   182 KQ-DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                ++  YW+V+NSWG    ++G+ +I R  NN CG+   A Y  +
Sbjct:   292 TDPEHGDYWIVKNSWGAGWGEKGYIRIARNRNNHCGVATKASYPLV 337

 Score = 149 (57.5 bits), Expect = 3.9e-07, P = 3.9e-07
 Identities = 32/80 (40%), Positives = 45/80 (56%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQDDI-PYWLVRNSWGPIGPDEGFFK 1005
             AG+ +  + K     DE CS  +L H VLLVGYG   +   YW+V+NSWG    ++G+ +
Sbjct:   258 AGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEKGYIR 317

Query:  1006 IERG-NNACGIEQIAGYATI 1024
             I R  NN CG+   A Y  +
Sbjct:   318 IARNRNNHCGVATKASYPLV 337


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 367 (134.2 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 91/304 (29%), Positives = 154/304 (50%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFKQ---DGHKKHER-YG----TSEFSDRSPEEIL 707
             + F  FI+K  R+Y + EE + R++ F +   +   + ER  G     +EF+D + EE+ 
Sbjct:    80 QMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQ 139

Query:   708 CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGPAGDQAACGSC 766
                     E  Y +   D              G + P + DWR++    P  +Q  CGSC
Sbjct:   140 KMV----QENKYTKYDFDTPKFEGSYLET---GVIRPASIDWREQGKLTPIKNQGQCGSC 192

Query:   767 WAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKD 826
             WAF+    +E Q AIK GKLV  S+ ++V+C  + +GC G +   ++++  + GLESEK+
Sbjct:   193 WAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKENGLESEKE 252

Query:   827 YPYKNANGEKFKCAYDKSKVKLFTGKDF-LHFNGSETMKKILYKYGPLSVLLNS-DLIHD 884
             YPY     ++  C   ++  ++F   DF +  N  E +   +   GP++  +N    ++ 
Sbjct:   253 YPYSALKHDQ--CFLKENDTRVFID-DFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYS 309

Query:   885 YNGTPIRKNDETCSPYDLG-HAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNN 943
             Y       + E C+   +G HA+ ++GYG +    YW+V+NSWG      G+F++ RG N
Sbjct:   310 YRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARGVN 369

Query:   944 ACGI 947
             +CG+
Sbjct:   370 SCGL 373

 Score = 360 (131.8 bits), Expect = 8.1e-32, P = 8.1e-32
 Identities = 92/306 (30%), Positives = 154/306 (50%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFKQ---DGHKKHER-YG----TSEFSDRSPEEIL 341
             + F  FI+K  R+Y + EE + R++ F +   +   + ER  G     +EF+D + EE+ 
Sbjct:    80 QMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQ 139

Query:   342 CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGPAGDQAACGSC 400
                     E  Y +   D              G + P + DWR++    P  +Q  CGSC
Sbjct:   140 KMV----QENKYTKYDFDTPKFEGSYLET---GVIRPASIDWREQGKLTPIKNQGQCGSC 192

Query:   401 WAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQ-PIEYTHQAGLESE 459
             WAF+    +E Q AIK GKLV  S+ ++V+C  + +GC G  G     +++  + GLESE
Sbjct:   193 WAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSG--GYRPYAMKFVKENGLESE 250

Query:   460 KDYPYRNGNGEKFKCAYDKSKVKLFTGKDF-LYFNGSETMKKILYKYGPLSVGLNS-HLI 517
             K+YPY     ++  C   ++  ++F   DF +  N  E +   +   GP++ G+N    +
Sbjct:   251 KEYPYSALKHDQ--CFLKENDTRVFID-DFRMLSNNEEDIANWVGTKGPVTFGMNVVKAM 307

Query:   518 HFYNGTPIRKNDETCSPYDLG-HAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERG 576
             + Y       + E C+   +G HA+ ++GYG + +  YW+ +NSWG      G+F++ RG
Sbjct:   308 YSYRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARG 367

Query:   577 NNACGI 582
              N+CG+
Sbjct:   368 VNSCGL 373

 Score = 346 (126.9 bits), Expect = 2.5e-30, P = 2.5e-30
 Identities = 69/208 (33%), Positives = 116/208 (55%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P + DWR++    P  +Q  CGSCWAF+    +E Q AIK GKLV  S+ ++V+C  + +
Sbjct:   169 PASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNN 228

Query:    72 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF-LHFNGSE 130
             GC G +   ++++  + GLESEK+YPY     ++  C   ++  ++F   DF +  N  E
Sbjct:   229 GCSGGYRPYAMKFVKENGLESEKEYPYSALKHDQ--CFLKENDTRVFID-DFRMLSNNEE 285

Query:   131 TMKKILYKYGPLSVLLNS-DLIHDYNGTPIRKNDETCSPYDLG-HAVLLVGYGKQDNIPY 188
              +   +   GP++  +N    ++ Y       + E C+   +G HA+ ++GYG +    Y
Sbjct:   286 DIANWVGTKGPVTFGMNVVKAMYSYRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAY 345

Query:   189 WLVRNSWGPIGPDEGFFKIERGNNACGI 216
             W+V+NSWG      G+F++ RG N+CG+
Sbjct:   346 WIVKNSWGTSWGASGYFRLARGVNSCGL 373

 Score = 138 (53.6 bits), Expect = 8.4e-06, P = 8.4e-06
 Identities = 22/54 (40%), Positives = 36/54 (66%)

Query:   963 ETCSPYDLG-HAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             E C+   +G HA+ ++GYG + +  YW+V+NSWG      G+F++ RG N+CG+
Sbjct:   320 EDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARGVNSCGL 373


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 354 (129.7 bits), Expect = 1.5e-32, Sum P(2) = 1.5e-32
 Identities = 95/290 (32%), Positives = 143/290 (49%)

Query:   674 EIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXX 733
             E K+R +   +  HK     G ++FSD +  E      FK   +TY  +   +       
Sbjct:    55 ENKKRIDQHNEGNHKFS--MGLNQFSDMTFAE------FK---KTY-LLTEPQNCSATRG 102

Query:   734 XXXXXDGPVPDAWDWRKKN--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSK 791
                  +G  PDA DWR K   +T    +Q  CGSCW FS  G LE   AI TGKL++ ++
Sbjct:   103 NHVSSNGLYPDAIDWRTKGHYITD-VKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAE 161

Query:   792 SQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKL 848
              QL++CA      GC+G     + EY  +  GL +E DYPY+   G+   C +       
Sbjct:   162 QQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAKGGQ---CRFKPQLAAA 218

Query:   849 FTGKDFLHFNGSETMKKI--LYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDL-G 903
             F  K+ ++    + M  +  + +  P+S    + SD +H  +G  I  + E  +  D+  
Sbjct:   219 FV-KEVVNITKYDEMGMVDAVARLNPVSFAYEVTSDFMHYKDG--IYTSTECHNTTDMVN 275

Query:   904 HAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
             HAVL VGY +++  PYW+V+NSWG     +G+F IERG N CG+   + Y
Sbjct:   276 HAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSY 325

 Score = 348 (127.6 bits), Expect = 1.6e-30, P = 1.6e-30
 Identities = 82/228 (35%), Positives = 121/228 (53%)

Query:     5 VEKDGPVPDAWDWRKKN--VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQ 62
             V  +G  PDA DWR K   +T    +Q  CGSCW FS  G LE   AI TGKL++ ++ Q
Sbjct:   105 VSSNGLYPDAIDWRTKGHYITD-VKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAEQQ 163

Query:    63 LVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT 119
             L++CA      GC+G     + EY  +  GL +E DYPY+   G+   C +       F 
Sbjct:   164 LIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAKGGQ---CRFKPQLAAAFV 220

Query:   120 GKDFLHFNGSETMKKI--LYKYGPLSVL--LNSDLIHDYNGTPIRKNDETCSPYDL-GHA 174
              K+ ++    + M  +  + +  P+S    + SD +H  +G  I  + E  +  D+  HA
Sbjct:   221 -KEVVNITKYDEMGMVDAVARLNPVSFAYEVTSDFMHYKDG--IYTSTECHNTTDMVNHA 277

Query:   175 VLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             VL VGY +++  PYW+V+NSWG     +G+F IERG N CG+   + Y
Sbjct:   278 VLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSY 325

 Score = 341 (125.1 bits), Expect = 3.6e-31, Sum P(2) = 3.6e-31
 Identities = 95/292 (32%), Positives = 141/292 (48%)

Query:   308 EIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXX 367
             E K+R +   +  HK     G ++FSD +  E      FK   +TY  +   +       
Sbjct:    55 ENKKRIDQHNEGNHKFS--MGLNQFSDMTFAE------FK---KTY-LLTEPQNCSATRG 102

Query:   368 XXXXXDGPVPDAWDWRKKN--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSK 425
                  +G  PDA DWR K   +T    +Q  CGSCW FS  G LE   AI TGKL++ ++
Sbjct:   103 NHVSSNGLYPDAIDWRTKGHYITD-VKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAE 161

Query:   426 SQLVECAKQCS--GCGGCDGL-EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKV 481
              QL++CA      GC G  GL     EY  +  GL +E DYPY+   G+   C +     
Sbjct:   162 QQLIDCAGDFDNHGCNG--GLPSHAFEYIMYNKGLMTEDDYPYQAKGGQ---CRFKPQLA 216

Query:   482 KLFTGKDFLYFNGSETMKKI--LYKYGPLSVG--LNSHLIHFYNGTPIRKNDETCSPYDL 537
               F  K+ +     + M  +  + +  P+S    + S  +H+ +G  I  + E  +  D+
Sbjct:   217 AAFV-KEVVNITKYDEMGMVDAVARLNPVSFAYEVTSDFMHYKDG--IYTSTECHNTTDM 273

Query:   538 -GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGY 588
               HAVL VGY +++  PYW+ +NSWG     +G+F IERG N CG+   + Y
Sbjct:   274 VNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSY 325

 Score = 149 (57.5 bits), Expect = 1.0e-06, Sum P(2) = 1.0e-06
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query:   970 LGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             + HAVL VGY +++  PYW+V+NSWG     +G+F IERG N CG+   + Y
Sbjct:   274 VNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSY 325

 Score = 40 (19.1 bits), Expect = 1.5e-32, Sum P(2) = 1.5e-32
 Identities = 18/55 (32%), Positives = 23/55 (41%)

Query:   106 FKCAYDKSKVKLFTGKDFLHFNG--SETMKK--ILYKYGPLSVLL-NSDLIHDYN 155
             F   Y    V L+T +D  HF    S+  KK  I   Y  L + L N   I  +N
Sbjct:    10 FAVLYQVLAVPLYTEEDEYHFKSWMSQYNKKYEINEFYQRLQIFLENKKRIDQHN 64

 Score = 40 (19.1 bits), Expect = 1.0e-06, Sum P(2) = 1.0e-06
 Identities = 18/55 (32%), Positives = 23/55 (41%)

Query:   837 FKCAYDKSKVKLFTGKDFLHFNG--SETMKK--ILYKYGPLSVLL-NSDLIHDYN 886
             F   Y    V L+T +D  HF    S+  KK  I   Y  L + L N   I  +N
Sbjct:    10 FAVLYQVLAVPLYTEEDEYHFKSWMSQYNKKYEINEFYQRLQIFLENKKRIDQHN 64


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 366 (133.9 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 85/225 (37%), Positives = 121/225 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   123 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 182

Query:   802 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
                 GC+G F   + +Y     G++SE  YPYK  NG   KC YD SK +  T   +  L
Sbjct:   183 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNG---KCRYD-SKKRAATCSKYTEL 238

Query:   856 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 912
              F   + +K+ +   GP+SV +  D  H Y+    R     + +C+  ++ H VL+VGYG
Sbjct:   239 PFGSEDALKEAVANKGPVSVAI--DASH-YSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG 294

Query:   913 KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
               +   YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   295 NLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 339

 Score = 365 (133.5 bits), Expect = 2.4e-32, P = 2.4e-32
 Identities = 85/225 (37%), Positives = 120/225 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD+ DWR+K        Q  CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   123 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 182

Query:    71 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
                 GC+G F   + +Y     G++SE  YPYK  NG   KC YD SK +  T   +  L
Sbjct:   183 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNG---KCRYD-SKKRAATCSKYTEL 238

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 181
              F   + +K+ +   GP+SV +  D  H Y+    R     + +C+  ++ H VL+VGYG
Sbjct:   239 PFGSEDALKEAVANKGPVSVAI--DASH-YSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG 294

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
               +   YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   295 NLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 339

 Score = 361 (132.1 bits), Expect = 6.3e-32, P = 6.3e-32
 Identities = 82/223 (36%), Positives = 121/223 (54%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   123 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 182

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
              G  GC+G  +    +Y     G++SE  YPY+  NG   KC YD SK +  T   +  L
Sbjct:   183 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNG---KCRYD-SKKRAATCSKYTEL 238

Query:   491 YFNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
              F   + +K+ +   GP+SV ++ SH   F   + +   + +C+  ++ H VL+VGYG  
Sbjct:   239 PFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYY-EPSCTQ-NVNHGVLVVGYGNL 296

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             +   YWL +NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   297 NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 339

 Score = 127 (49.8 bits), Expect = 0.00011, P = 0.00011
 Identities = 25/64 (39%), Positives = 38/64 (59%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAG 1020
             + +C+  ++ H VL+VGYG  +   YWLV+NSWG    D+G+ ++ R + N CGI     
Sbjct:   277 EPSCTQ-NVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPS 335

Query:  1021 YATI 1024
             Y  I
Sbjct:   336 YPEI 339


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 365 (133.5 bits), Expect = 2.4e-32, P = 2.4e-32
 Identities = 85/225 (37%), Positives = 121/225 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:   802 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
                 GC+G F   + +Y     G++SE  YPYK  NG   KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNG---KCRYD-SKKRAATCSKYTEL 230

Query:   856 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 912
              F   + +K+ +   GP+SV +  D  H Y+    R     + +C+  ++ H VL+VGYG
Sbjct:   231 PFGSEDALKEAVANKGPVSVAI--DASH-YSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG 286

Query:   913 KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
               +   YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   287 NLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 331

 Score = 364 (133.2 bits), Expect = 3.0e-32, P = 3.0e-32
 Identities = 85/225 (37%), Positives = 120/225 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD+ DWR+K        Q  CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:    71 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
                 GC+G F   + +Y     G++SE  YPYK  NG   KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNG---KCRYD-SKKRAATCSKYTEL 230

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 181
              F   + +K+ +   GP+SV +  D  H Y+    R     + +C+  ++ H VL+VGYG
Sbjct:   231 PFGSEDALKEAVANKGPVSVAI--DASH-YSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYG 286

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
               +   YWLV+NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   287 NLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 331

 Score = 361 (132.1 bits), Expect = 6.3e-32, P = 6.3e-32
 Identities = 82/223 (36%), Positives = 121/223 (54%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
              G  GC+G  +    +Y     G++SE  YPY+  NG   KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNG---KCRYD-SKKRAATCSKYTEL 230

Query:   491 YFNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
              F   + +K+ +   GP+SV ++ SH   F   + +   + +C+  ++ H VL+VGYG  
Sbjct:   231 PFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYY-EPSCTQ-NVNHGVLVVGYGNL 288

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             +   YWL +NSWG    D+G+ ++ R + N CGI     Y  I
Sbjct:   289 NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYPEI 331

 Score = 127 (49.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 25/64 (39%), Positives = 38/64 (59%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAG 1020
             + +C+  ++ H VL+VGYG  +   YWLV+NSWG    D+G+ ++ R + N CGI     
Sbjct:   269 EPSCTQ-NVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPS 327

Query:  1021 YATI 1024
             Y  I
Sbjct:   328 YPEI 331


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 363 (132.8 bits), Expect = 3.8e-32, P = 3.8e-32
 Identities = 91/227 (40%), Positives = 119/227 (52%)

Query:   374 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 433
             G VP   DWRK     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSW 171

Query:   434 QCSGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGE-KFKCAYDKSKVKLFTGKDF 489
                G  GCDG   +   +Y     GL++   YPY   NG  ++   Y  +KV    G  F
Sbjct:   172 S-HGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKV---VG--F 225

Query:   490 LYFNGSET-MKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGY 546
             +    SE  + K +   GP+SVG++  H    FY G    + D  CS  +L HAVL+VGY
Sbjct:   226 MSIPPSENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPD--CSSTNLNHAVLVVGY 283

Query:   547 GKQDD-IPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
             G++ D   YWL +NSWG     +G+ K+ +  NN CGI   A Y  +
Sbjct:   284 GEESDGRKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 90/228 (39%), Positives = 120/228 (52%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 68
             G VP   DWRK     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSW 171

Query:    69 QCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFL 124
                  GCDG   + + +Y     GL++   YPY+  NG  ++   Y  +KV    G  F+
Sbjct:   172 SHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKV---VG--FM 226

Query:   125 HFNGSET-MKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVG 179
                 SE  + K +   GP+SV +  D+ H     Y G    + D  CS  +L HAVL+VG
Sbjct:   227 SIPPSENALMKAVATVGPISVGI--DIKHKSFQFYKGGMYYEPD--CSSTNLNHAVLVVG 282

Query:   180 YGKQ-DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             YG++ D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y  +
Sbjct:   283 YGEESDGRKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 90/228 (39%), Positives = 120/228 (52%)

Query:   740 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 799
             G VP   DWRK     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSW 171

Query:   800 QCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFL 855
                  GCDG   + + +Y     GL++   YPY+  NG  ++   Y  +KV    G  F+
Sbjct:   172 SHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKV---VG--FM 226

Query:   856 HFNGSET-MKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVG 910
                 SE  + K +   GP+SV +  D+ H     Y G    + D  CS  +L HAVL+VG
Sbjct:   227 SIPPSENALMKAVATVGPISVGI--DIKHKSFQFYKGGMYYEPD--CSSTNLNHAVLVVG 282

Query:   911 YGKQ-DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
             YG++ D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y  +
Sbjct:   283 YGEESDGRKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330

 Score = 143 (55.4 bits), Expect = 1.7e-06, P = 1.7e-06
 Identities = 29/62 (46%), Positives = 39/62 (62%)

Query:   965 CSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 1022
             CS  +L HAVL+VGYG++ D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y 
Sbjct:   269 CSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYP 328

Query:  1023 TI 1024
              +
Sbjct:   329 IV 330


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 84/224 (37%), Positives = 116/224 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + +  Y     GL+SE+ YPY   + E   C Y K +        F+   
Sbjct:   174 GNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTET--CNY-KPECSAANDTGFVDLP 230

Query:   128 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 183
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYF-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   184 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333

 Score = 350 (128.3 bits), Expect = 8.1e-32, Sum P(2) = 8.1e-32
 Identities = 84/224 (37%), Positives = 116/224 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:   800 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + +  Y     GL+SE+ YPY   + E   C Y K +        F+   
Sbjct:   174 GNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTET--CNY-KPECSAANDTGFVDLP 230

Query:   859 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 914
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYF-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   915 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333

 Score = 348 (127.6 bits), Expect = 1.3e-31, Sum P(2) = 1.3e-31
 Identities = 85/227 (37%), Positives = 118/227 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-A 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++    Y     GL+SE+ YPY   + E   C Y K +        F+  
Sbjct:   173 QGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTET--CNY-KPECSAANDTGFVDL 229

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFY-NGTPIRKNDETCSPYDLGHAVLLVGYGK 548
                E  + K +   GP+SV +++ H    FY +G      D  CS  DL H VL+VGYG 
Sbjct:   230 PQREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYF---DPDCSSKDLDHGVLVVGYGF 286

Query:   549 Q---DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
             +    +  +W+ +NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   287 EGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333

 Score = 141 (54.7 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 32/82 (39%), Positives = 45/82 (54%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQ---DDIPYWLVRNSWGPIGPDEGF 1003
             AG+ +    K+    D  CS  DL H VL+VGYG +    +  +W+V+NSWGP     G+
Sbjct:   252 AGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGY 311

Query:  1004 FKIERG-NNACGIEQIAGYATI 1024
              K+ +  NN CGI   A Y T+
Sbjct:   312 VKMAKDQNNHCGIATAASYPTV 333

 Score = 37 (18.1 bits), Expect = 8.1e-32, Sum P(2) = 8.1e-32
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query:   306 DEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTY 353
             +EE ++    F+   HKK + +    F++  P+ +       W E+ Y
Sbjct:    85 NEEFRQVMNGFQNQKHKKGKMFQEPLFAE-IPKSV------DWREKGY 125

 Score = 37 (18.1 bits), Expect = 8.1e-32, Sum P(2) = 8.1e-32
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query:   672 DEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTY 719
             +EE ++    F+   HKK + +    F++  P+ +       W E+ Y
Sbjct:    85 NEEFRQVMNGFQNQKHKKGKMFQEPLFAE-IPKSV------DWREKGY 125


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 359 (131.4 bits), Expect = 1.0e-31, P = 1.0e-31
 Identities = 83/228 (36%), Positives = 116/228 (50%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             + K   VPD  DWR K       DQ  CGSCWAFS  G LEGQ   KTGKLV  S+ QLV
Sbjct:   112 LRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLV 171

Query:    65 ECAKQCS--GCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TG 120
             +C+      GCDG   + + +Y     GL++E  YPY+  +GE   C ++ S V    TG
Sbjct:   172 DCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQDGE---CRFNPSTVGASCTG 228

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLV 178
                +       +++ +   GP+SV +++       Y+      N+  CS  +L H VL V
Sbjct:   229 YVDIASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVY--NEPDCSSSELDHGVLAV 286

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             GYG  +   YW+V+NSWG     +G+  + R  +N CGI   A Y  +
Sbjct:   287 GYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASYPLV 334

 Score = 357 (130.7 bits), Expect = 1.7e-31, P = 1.7e-31
 Identities = 83/221 (37%), Positives = 114/221 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VPD  DWR K       DQ  CGSCWAFS  G LEGQ   KTGKLV  S+ QLV+C+   
Sbjct:   118 VPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSY 177

Query:   436 SGCGGCDG--LEQPIEYTH-QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLF-TGKDFLY 491
              G  GCDG  ++Q  +Y     GL++E  YPY   +GE   C ++ S V    TG   + 
Sbjct:   178 -GNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQDGE---CRFNPSTVGASCTGYVDIA 233

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
                   +++ +   GP+SV +++    F   +    N+  CS  +L H VL VGYG  + 
Sbjct:   234 SGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNG 293

Query:   552 IPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
               YW+ +NSWG     +G+  + R  +N CGI   A Y  +
Sbjct:   294 DDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASYPLV 334

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 82/222 (36%), Positives = 114/222 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VPD  DWR K       DQ  CGSCWAFS  G LEGQ   KTGKLV  S+ QLV+C+   
Sbjct:   118 VPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSY 177

Query:   802 S--GCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHF 857
                GCDG   + + +Y     GL++E  YPY+  +GE   C ++ S V    TG   +  
Sbjct:   178 GNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQDGE---CRFNPSTVGASCTGYVDIAS 234

Query:   858 NGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
                  +++ +   GP+SV +++       Y+      N+  CS  +L H VL VGYG  +
Sbjct:   235 GDESALQEAVATIGPISVAIDAGHSSFQLYSSGVY--NEPDCSSSELDHGVLAVGYGSSN 292

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                YW+V+NSWG     +G+  + R  +N CGI   A Y  +
Sbjct:   293 GDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIATAASYPLV 334

 Score = 137 (53.3 bits), Expect = 8.1e-06, P = 8.1e-06
 Identities = 27/67 (40%), Positives = 37/67 (55%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQ 1017
             V N+  CS  +L H VL VGYG  +   YW+V+NSWG     +G+  + R  +N CGI  
Sbjct:   268 VYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQCGIAT 327

Query:  1018 IAGYATI 1024
              A Y  +
Sbjct:   328 AASYPLV 334


>TAIR|locus:2130180
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 359 (131.4 bits), Expect = 1.0e-31, P = 1.0e-31
 Identities = 99/317 (31%), Positives = 150/317 (47%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHER--------YGTSEFSDRSPEEILCK 343
             F  F  K  + YA   E   RF  FK +  +            +G ++FSD +P+E   K
Sbjct:    55 FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRK 114

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAF 403
               F   +R   R+  D             D  +P  +DWR++    P  +Q  CGSCW+F
Sbjct:   115 --FLGLKRRGFRLPTD---TQTAPILPTSD--LPTEFDWREQGAVTPVKNQGMCGSCWSF 167

Query:   404 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC-----SGC-GGCDG--LEQPIEYTHQAG 455
             S  G LEG + + T +LV  S+ QLV+C  +C     + C  GC G  +    EY  +AG
Sbjct:   168 SAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAG 227

Query:   456 -LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS 514
              L  E+DYPY  G  +   C +DKSK+        +  +  + +   L ++GPL++ +N+
Sbjct:   228 GLMKEEDYPY-TGR-DHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINA 285

Query:   515 HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-------DIPYWLARNSWGPIGPD 567
               +  Y G         CS     H VLLVG+G          + PYW+ +NSWG +  +
Sbjct:   286 MWMQTYIGGV--SCPYVCSKSQ-DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGE 342

Query:   568 EGFFKIERG-NNACGIE 583
              G++KI RG +N CG++
Sbjct:   343 HGYYKICRGPHNMCGMD 359

 Score = 358 (131.1 bits), Expect = 1.3e-31, P = 1.3e-31
 Identities = 98/317 (30%), Positives = 148/317 (46%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHER--------YGTSEFSDRSPEEILCK 709
             F  F  K  + YA   E   RF  FK +  +            +G ++FSD +P+E   K
Sbjct:    55 FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRK 114

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAF 769
               F   +R   R+  D             D  +P  +DWR++    P  +Q  CGSCW+F
Sbjct:   115 --FLGLKRRGFRLPTD---TQTAPILPTSD--LPTEFDWREQGAVTPVKNQGMCGSCWSF 167

Query:   770 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC---------SGCDGCFFEPSIEYTHQAG 820
             S  G LEG + + T +LV  S+ QLV+C  +C         SGC G     + EY  +AG
Sbjct:   168 SAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAG 227

Query:   821 -LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 879
              L  E+DYPY     +   C +DKSK+        +  +  + +   L ++GPL++ +N+
Sbjct:   228 GLMKEEDYPYTGR--DHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINA 285

Query:   880 DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP-------YWLVRNSWGPIGPD 932
               +  Y G         CS     H VLLVG+G     P       YW+++NSWG +  +
Sbjct:   286 MWMQTYIGGV--SCPYVCSKSQ-DHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGE 342

Query:   933 EGFFKIERG-NNACGIE 948
              G++KI RG +N CG++
Sbjct:   343 HGYYKICRGPHNMCGMD 359

 Score = 331 (121.6 bits), Expect = 1.0e-28, P = 1.0e-28
 Identities = 76/225 (33%), Positives = 116/225 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P  +DWR++    P  +Q  CGSCW+FS  G LEG + + T +LV  S+ QLV+C  +C
Sbjct:   140 LPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHEC 199

Query:    71 ---------SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTG 120
                      SGC G     + EY  +AG L  E+DYPY     +   C +DKSK+     
Sbjct:   200 DPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGR--DHTACKFDKSKIVASVS 257

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
                +  +  + +   L ++GPL++ +N+  +  Y G         CS     H VLLVG+
Sbjct:   258 NFSVVSSDEDQIAANLVQHGPLAIAINAMWMQTYIGGV--SCPYVCSKSQ-DHGVLLVGF 314

Query:   181 GKQDNIP-------YWLVRNSWGPIGPDEGFFKIERG-NNACGIE 217
             G     P       YW+++NSWG +  + G++KI RG +N CG++
Sbjct:   315 GSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMD 359


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 358 (131.1 bits), Expect = 1.3e-31, P = 1.3e-31
 Identities = 99/285 (34%), Positives = 140/285 (49%)

Query:   323 KHE-RYGTSEFSDRSPEEIL-CKTGFKW--SERTYERIVADRXXXXXXXXXXXXDGPVPD 378
             KH  + G ++F D + EE      G+K   SER Y      R                P 
Sbjct:    71 KHSYKLGMNQFGDMTAEEFRQLMNGYKHKKSERKY------RGSQFLEPSFLEA----PR 120

Query:   379 AWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 438
             + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++   G 
Sbjct:   121 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP-EGN 179

Query:   439 GGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF-NG 494
              GC+G  ++Q  +Y     G++SE+ YPY   + E   C Y K++        F+    G
Sbjct:   180 QGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQG 236

Query:   495 SE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-D 550
              E  + K +   GP+SV +++ H    FY      + D  CS  DL H VL+VGYG + +
Sbjct:   237 HERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFEGE 294

Query:   551 DIP---YWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             D+    YW+ +NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   295 DVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 339

 Score = 349 (127.9 bits), Expect = 1.2e-30, P = 1.2e-30
 Identities = 96/284 (33%), Positives = 135/284 (47%)

Query:   689 KHE-RYGTSEFSDRSPEEIL-CKTGFKW--SERTYERIVADRXXXXXXXXXXXXDGPVPD 744
             KH  + G ++F D + EE      G+K   SER Y      R                P 
Sbjct:    71 KHSYKLGMNQFGDMTAEEFRQLMNGYKHKKSERKY------RGSQFLEPSFLEA----PR 120

Query:   745 AWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ--CS 802
             + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++     
Sbjct:   121 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQ 180

Query:   803 GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGS 860
             GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    G 
Sbjct:   181 GCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQGH 237

Query:   861 E-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--- 914
             E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG +   
Sbjct:   238 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFEGED 295

Query:   915 -DNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
              D   YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   296 VDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 339

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 82/226 (36%), Positives = 116/226 (51%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-- 69
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+  KTGKLV  S+  LV+C++   
Sbjct:   119 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 178

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 127
               GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    
Sbjct:   179 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQ 235

Query:   128 GSE-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ- 183
             G E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG + 
Sbjct:   236 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFEG 293

Query:   184 ---DNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                D   YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   294 EDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 339

 Score = 130 (50.8 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 32/106 (30%), Positives = 51/106 (48%)

Query:   928 PIGPDEGFFKIERGNNACGIEQIAGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQ- 982
             P G +    K         +   AG+++    ++    +  CS  DL H VL+VGYG + 
Sbjct:   234 PQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEG 293

Query:   983 DDIP---YWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 1024
             +D+    YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   294 EDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 339


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 357 (130.7 bits), Expect = 1.7e-31, P = 1.7e-31
 Identities = 87/274 (31%), Positives = 130/274 (47%)

Query:   692 RYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK 750
             +   + F+D +  E L + TG K S     R  A                P+PDA+DWR+
Sbjct:   158 KQAVNAFADLTHSEFLSQLTGLKRSPEAKARAAASLKLVNLPAK------PIPDAFDWRE 211

Query:   751 KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ----CSGCDG 806
                  P   Q  CGSCWAF+  G +EG    KTG L   S+  LV+C        +GCDG
Sbjct:   212 HGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDG 271

Query:   807 CFFEPSIEYTH--QAGLESEKDYPYKNANGEKFKCAYDKSKV-KLFTGKDFLHFNGSETM 863
              F E +  +    Q G+  E  YPY +  G    C YD SK      G   +     E +
Sbjct:   272 GFQEAAFCFIDEVQKGVSQEGAYPYIDNKGT---CKYDGSKSGATLQGFAAIPPKDEEQL 328

Query:   864 KKILYKYGPLSVLLNS-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 922
             KK++   GP++  +N  + + +Y G     ND+ C+  +  H++L+VGYG +    YW+V
Sbjct:   329 KKVVATLGPVACSVNGLETLKNYAGGIY--NDDECNKGEPNHSILVVGYGSEKGQDYWIV 386

Query:   923 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 956
             +NSW     ++G+F++ RG N C I +   Y  +
Sbjct:   387 KNSWDDTWGEKGYFRLPRGKNYCFIAEECSYPVV 420

 Score = 353 (129.3 bits), Expect = 4.5e-31, P = 4.5e-31
 Identities = 77/224 (34%), Positives = 115/224 (51%)

Query:    10 PVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             P+PDA+DWR+     P   Q  CGSCWAF+  G +EG    KTG L   S+  LV+C   
Sbjct:   202 PIPDAFDWREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPV 261

Query:    70 ----CSGCDGCFFEPSIEYTH--QAGLESEKDYPYKNANGEKFKCAYDKSKV-KLFTGKD 122
                  +GCDG F E +  +    Q G+  E  YPY +  G    C YD SK      G  
Sbjct:   262 EDFGLNGCDGGFQEAAFCFIDEVQKGVSQEGAYPYIDNKGT---CKYDGSKSGATLQGFA 318

Query:   123 FLHFNGSETMKKILYKYGPLSVLLNS-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
              +     E +KK++   GP++  +N  + + +Y G     ND+ C+  +  H++L+VGYG
Sbjct:   319 AIPPKDEEQLKKVVATLGPVACSVNGLETLKNYAGGIY--NDDECNKGEPNHSILVVGYG 376

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
              +    YW+V+NSW     ++G+F++ RG N C I +   Y  +
Sbjct:   377 SEKGQDYWIVKNSWDDTWGEKGYFRLPRGKNYCFIAEECSYPVV 420

 Score = 349 (127.9 bits), Expect = 1.2e-30, P = 1.2e-30
 Identities = 86/274 (31%), Positives = 126/274 (45%)

Query:   326 RYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK 384
             +   + F+D +  E L + TG K S     R  A                P+PDA+DWR+
Sbjct:   158 KQAVNAFADLTHSEFLSQLTGLKRSPEAKARAAASLKLVNLPAK------PIPDAFDWRE 211

Query:   385 KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK-QCSGCGGCDG 443
                  P   Q  CGSCWAF+  G +EG    KTG L   S+  LV+C   +  G  GCDG
Sbjct:   212 HGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDG 271

Query:   444 LEQPIEYTH----QAGLESEKDYPYRNGNGEKFKCAYDKSKV-KLFTGKDFLYFNGSETM 498
               Q   +      Q G+  E  YPY +  G    C YD SK      G   +     E +
Sbjct:   272 GFQEAAFCFIDEVQKGVSQEGAYPYIDNKGT---CKYDGSKSGATLQGFAAIPPKDEEQL 328

Query:   499 KKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLA 557
             KK++   GP++  +N    +  Y G     ND+ C+  +  H++L+VGYG +    YW+ 
Sbjct:   329 KKVVATLGPVACSVNGLETLKNYAGGIY--NDDECNKGEPNHSILVVGYGSEKGQDYWIV 386

Query:   558 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 591
             +NSW     ++G+F++ RG N C I +   Y  +
Sbjct:   387 KNSWDDTWGEKGYFRLPRGKNYCFIAEECSYPVV 420

 Score = 158 (60.7 bits), Expect = 6.5e-08, P = 6.5e-08
 Identities = 30/87 (34%), Positives = 49/87 (56%)

Query:   941 GNNAC---GIEQIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI 997
             G  AC   G+E +  YA    + ND+ C+  +  H++L+VGYG +    YW+V+NSW   
Sbjct:   336 GPVACSVNGLETLKNYA--GGIYNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDDT 393

Query:   998 GPDEGFFKIERGNNACGIEQIAGYATI 1024
               ++G+F++ RG N C I +   Y  +
Sbjct:   394 WGEKGYFRLPRGKNYCFIAEECSYPVV 420


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 357 (130.7 bits), Expect = 1.7e-31, P = 1.7e-31
 Identities = 85/226 (37%), Positives = 121/226 (53%)

Query:   374 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 433
             G +P + DWR+     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSW 171

Query:   434 QCSGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDF 489
                  G C+G  +E   +Y  +  GL++ + Y Y   +G    C Y+ K      TG   
Sbjct:   172 SYGNLG-CNGGLMEFAFQYVKENRGLDTGESYAYEAQDG---LCRYNPKYSAANVTGFVK 227

Query:   490 LYFNGSETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG 547
             +  +  + M  +    GP+SVG++SH     FY+G    + D  CS  ++ HAVL+VGYG
Sbjct:   228 VPLSEDDLMSAVA-SVGPVSVGIDSHHQSFRFYSGGMYYEPD--CSSTEMDHAVLVVGYG 284

Query:   548 KQDDI-PYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
             ++ D   YWL +NSWG     +G+ K+ +  NN CGI   A Y T+
Sbjct:   285 EESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPTV 330

 Score = 344 (126.2 bits), Expect = 4.2e-30, P = 4.2e-30
 Identities = 83/225 (36%), Positives = 120/225 (53%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 68
             G +P + DWR+     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSW 171

Query:    69 QCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFL 124
                  GC+G   E + +Y  +  GL++ + Y Y+  +G    C Y+ K      TG   +
Sbjct:   172 SYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDG---LCRYNPKYSAANVTGFVKV 228

Query:   125 HFNGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
               +  + M  +    GP+SV ++S       Y+G    + D  CS  ++ HAVL+VGYG+
Sbjct:   229 PLSEDDLMSAVA-SVGPVSVGIDSHHQSFRFYSGGMYYEPD--CSSTEMDHAVLVVGYGE 285

Query:   183 Q-DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             + D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y T+
Sbjct:   286 ESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPTV 330

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 83/225 (36%), Positives = 120/225 (53%)

Query:   740 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 799
             G +P + DWR+     P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ 
Sbjct:   112 GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSW 171

Query:   800 QCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFL 855
                  GC+G   E + +Y  +  GL++ + Y Y+  +G    C Y+ K      TG   +
Sbjct:   172 SYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDG---LCRYNPKYSAANVTGFVKV 228

Query:   856 HFNGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 913
               +  + M  +    GP+SV ++S       Y+G    + D  CS  ++ HAVL+VGYG+
Sbjct:   229 PLSEDDLMSAVA-SVGPVSVGIDSHHQSFRFYSGGMYYEPD--CSSTEMDHAVLVVGYGE 285

Query:   914 Q-DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
             + D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y T+
Sbjct:   286 ESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPTV 330

 Score = 144 (55.7 bits), Expect = 1.3e-06, P = 1.3e-06
 Identities = 29/62 (46%), Positives = 40/62 (64%)

Query:   965 CSPYDLGHAVLLVGYGKQDDI-PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 1022
             CS  ++ HAVL+VGYG++ D   YWLV+NSWG     +G+ K+ +  NN CGI   A Y 
Sbjct:   269 CSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYP 328

Query:  1023 TI 1024
             T+
Sbjct:   329 TV 330


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 87/226 (38%), Positives = 118/226 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWRKK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAY-DKSKVKLFTGKDFLHF 126
                GC+G F   + +Y  +  GL+SE+ YPY  A  E   C Y  ++ V   TG   +  
Sbjct:   174 GNQGCNGGFMARAFQYVKENGGLDSEESYPYV-AVDEI--CKYRPENSVANDTGFTVVAP 230

Query:   127 NGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 181
                + + K +   GP+SV +++       Y      + D  CS  +L H VL+VGYG   
Sbjct:   231 GKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGYGFEG 288

Query:   182 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                +N  YWLV+NSWGP     G+ KI +  NN CGI   A Y  +
Sbjct:   289 ANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

 Score = 355 (130.0 bits), Expect = 2.8e-31, P = 2.8e-31
 Identities = 87/226 (38%), Positives = 118/226 (52%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P + DWRKK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAY-DKSKVKLFTGKDFLHF 857
                GC+G F   + +Y  +  GL+SE+ YPY  A  E   C Y  ++ V   TG   +  
Sbjct:   174 GNQGCNGGFMARAFQYVKENGGLDSEESYPYV-AVDEI--CKYRPENSVANDTGFTVVAP 230

Query:   858 NGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 912
                + + K +   GP+SV +++       Y      + D  CS  +L H VL+VGYG   
Sbjct:   231 GKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGYGFEG 288

Query:   913 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                +N  YWLV+NSWGP     G+ KI +  NN CGI   A Y  +
Sbjct:   289 ANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 85/227 (37%), Positives = 119/227 (52%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWRKK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP- 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAY-DKSKVKLFTGKDFLY 491
              G  GC+G  + +  +Y  +  GL+SE+ YPY   +     C Y  ++ V   TG   + 
Sbjct:   173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI---CKYRPENSVANDTGFTVVA 229

Query:   492 FNGSETMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 547
                 + + K +   GP+SV +++ H    FY      + D  CS  +L H VL+VGYG  
Sbjct:   230 PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPD--CSSKNLDHGVLVVGYGFE 287

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                 ++  YWL +NSWGP     G+ KI +  NN CGI   A Y  +
Sbjct:   288 GANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

 Score = 144 (55.7 bits), Expect = 2.4e-06, Sum P(2) = 2.4e-06
 Identities = 34/104 (32%), Positives = 50/104 (48%)

Query:   930 GPDEGFFKIERGNNACGIEQIAGYATIDVVKN----DETCSPYDLGHAVLLVGYG----K 981
             G ++   K         +   AG+++    K+    +  CS  +L H VL+VGYG     
Sbjct:   231 GKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGAN 290

Query:   982 QDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 1024
              ++  YWLV+NSWGP     G+ KI +  NN CGI   A Y  +
Sbjct:   291 SNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

 Score = 42 (19.8 bits), Expect = 2.4e-06, Sum P(2) = 2.4e-06
 Identities = 26/103 (25%), Positives = 41/103 (39%)

Query:   249 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 308
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +     +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRP----QGNQGCNGGF 182

Query:   309 IKERFEYFKQDGH-KKHERY---GTSEFSDRSPEEILCK-TGF 346
             +   F+Y K++G     E Y      E     PE  +   TGF
Sbjct:   183 MARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGF 225

 Score = 42 (19.8 bits), Expect = 2.4e-06, Sum P(2) = 2.4e-06
 Identities = 26/103 (25%), Positives = 41/103 (39%)

Query:   615 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 674
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +     +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRP----QGNQGCNGGF 182

Query:   675 IKERFEYFKQDGH-KKHERY---GTSEFSDRSPEEILCK-TGF 712
             +   F+Y K++G     E Y      E     PE  +   TGF
Sbjct:   183 MARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGF 225


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 356 (130.4 bits), Expect = 2.2e-31, P = 2.2e-31
 Identities = 78/215 (36%), Positives = 111/215 (51%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P A DWR+K    P  DQ  CGSCWAFS  G LEGQ   +TGKL+  S   LV C    +
Sbjct:   121 PAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNN 180

Query:   437 GCGGCDGLEQPIEYTH-QAGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNG 494
             GCGG   +    EY     G++SE  YPY    G+   C Y  + K     G   +  + 
Sbjct:   181 GCGG-GYMTNAFEYVRLNRGIDSEDAYPYI---GQDESCMYSPTGKAAKCRGYREIPEDN 236

Query:   495 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPY 554
              + +K+ + + GP+SVG+++ L  F   +     D  C+P ++ HAVL VGYG Q    +
Sbjct:   237 EKALKRAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGAQKGTKH 296

Query:   555 WLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             W+ +NSWG    ++G+  + R     CGI  +A +
Sbjct:   297 WIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASF 331

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 75/215 (34%), Positives = 111/215 (51%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P A DWR+K    P  DQ  CGSCWAFS  G LEGQ   +TGKL+  S   LV C    +
Sbjct:   121 PAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNN 180

Query:    72 GCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 129
             GC G +   + EY     G++SE  YPY    G+   C Y  + K     G   +  +  
Sbjct:   181 GCGGGYMTNAFEYVRLNRGIDSEDAYPYI---GQDESCMYSPTGKAAKCRGYREIPEDNE 237

Query:   130 ETMKKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
             + +K+ + + GP+SV +++ L    +    +   D  C+P ++ HAVL VGYG Q    +
Sbjct:   238 KALKRAVARIGPVSVGIDASLPSFQFYSRGVYY-DTGCNPENINHAVLAVGYGAQKGTKH 296

Query:   189 WLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             W+++NSWG    ++G+  + R     CGI  +A +
Sbjct:   297 WIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASF 331

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 75/215 (34%), Positives = 111/215 (51%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P A DWR+K    P  DQ  CGSCWAFS  G LEGQ   +TGKL+  S   LV C    +
Sbjct:   121 PAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNN 180

Query:   803 GCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGS 860
             GC G +   + EY     G++SE  YPY    G+   C Y  + K     G   +  +  
Sbjct:   181 GCGGGYMTNAFEYVRLNRGIDSEDAYPYI---GQDESCMYSPTGKAAKCRGYREIPEDNE 237

Query:   861 ETMKKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 919
             + +K+ + + GP+SV +++ L    +    +   D  C+P ++ HAVL VGYG Q    +
Sbjct:   238 KALKRAVARIGPVSVGIDASLPSFQFYSRGVYY-DTGCNPENINHAVLAVGYGAQKGTKH 296

Query:   920 WLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
             W+++NSWG    ++G+  + R     CGI  +A +
Sbjct:   297 WIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASF 331

 Score = 137 (53.3 bits), Expect = 8.1e-06, P = 8.1e-06
 Identities = 44/150 (29%), Positives = 68/150 (45%)

Query:   875 VLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK--QDNIPYWLVRNSWGPIGPD 932
             V LN   I   +  P    DE+C     G A    GY +  +DN     ++ +   IGP 
Sbjct:   194 VRLNRG-IDSEDAYPYIGQDESCMYSPTGKAAKCRGYREIPEDNEK--ALKRAVARIGP- 249

Query:   933 EGFFKIERGNNACGIEQIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRN 992
                  +  G +A  +     Y+    V  D  C+P ++ HAVL VGYG Q    +W+++N
Sbjct:   250 -----VSVGIDA-SLPSFQFYSR--GVYYDTGCNPENINHAVLAVGYGAQKGTKHWIIKN 301

Query:   993 SWGPIGPDEGFFKIERG-NNACGIEQIAGY 1021
             SWG    ++G+  + R     CGI  +A +
Sbjct:   302 SWGTEWGNKGYVLLARNMKQTCGIANLASF 331


>FB|FBgn0013770
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 354 (129.7 bits), Expect = 3.5e-31, P = 3.5e-31
 Identities = 97/337 (28%), Positives = 164/337 (48%)

Query:   641 LAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KH-ERYG--- 694
             LA+  +++F +  ++E +  F ++  + Y ++ E + R + F ++ HK  KH +R+    
Sbjct:    43 LAVAQAVSFADV-VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGK 101

Query:   695 ------TSEFSDRSPEEIL-CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWD 747
                    ++++D    E      GF ++   ++++ A                 +P + D
Sbjct:   102 VSFKLAVNKYADLLHHEFRQLMNGFNYT--LHKQLRAADESFKGVTFISPAHVTLPKSVD 159

Query:   748 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC--SGCD 805
             WR K       DQ  CGSCWAFS  G LEGQ+  K+G LV  S+  LV+C+ +   +GC+
Sbjct:   160 WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCN 219

Query:   806 GCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGSET- 862
             G   + +  Y     G+++EK YPY+  +     C ++K  V   T + F     G E  
Sbjct:   220 GGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS---CHFNKGTVGA-TDRGFTDIPQGDEKK 275

Query:   863 MKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYW 920
             M + +   GP+SV ++ S     +    +  N+  C   +L H VL+VG+G  ++   YW
Sbjct:   276 MAEAVATVGPVSVAIDASHESFQFYSEGVY-NEPQCDAQNLDHGVLVVGFGTDESGEDYW 334

Query:   921 LVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
             LV+NSWG    D+GF K+ R   N CGI   + Y  +
Sbjct:   335 LVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 371

 Score = 352 (129.0 bits), Expect = 5.8e-31, P = 5.8e-31
 Identities = 100/339 (29%), Positives = 163/339 (48%)

Query:   275 LAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KH-ERYG--- 328
             LA+  +++F +  ++E +  F ++  + Y ++ E + R + F ++ HK  KH +R+    
Sbjct:    43 LAVAQAVSFADV-VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGK 101

Query:   329 ------TSEFSDRSPEEIL-CKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWD 381
                    ++++D    E      GF ++   ++++ A                 +P + D
Sbjct:   102 VSFKLAVNKYADLLHHEFRQLMNGFNYT--LHKQLRAADESFKGVTFISPAHVTLPKSVD 159

Query:   382 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGC 441
             WR K       DQ  CGSCWAFS  G LEGQ+  K+G LV  S+  LV+C+ +  G  GC
Sbjct:   160 WRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKY-GNNGC 218

Query:   442 DG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF-NGSET 497
             +G  ++    Y     G+++EK YPY   +     C ++K  V   T + F     G E 
Sbjct:   219 NGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS---CHFNKGTVGA-TDRGFTDIPQGDEK 274

Query:   498 -MKKILYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IP 553
              M + +   GP+SV ++ SH    FY+      N+  C   +L H VL+VG+G  +    
Sbjct:   275 KMAEAVATVGPVSVAIDASHESFQFYSEGVY--NEPQCDAQNLDHGVLVVGFGTDESGED 332

Query:   554 YWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             YWL +NSWG    D+GF K+ R   N CGI   + Y  +
Sbjct:   333 YWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 371

 Score = 334 (122.6 bits), Expect = 4.9e-29, P = 4.9e-29
 Identities = 78/223 (34%), Positives = 116/223 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWR K       DQ  CGSCWAFS  G LEGQ+  K+G LV  S+  LV+C+ + 
Sbjct:   154 LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKY 213

Query:    71 --SGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF- 126
               +GC+G   + +  Y     G+++EK YPY+  +     C ++K  V   T + F    
Sbjct:   214 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS---CHFNKGTVGA-TDRGFTDIP 269

Query:   127 NGSET-MKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              G E  M + +   GP+SV ++ S     +    +  N+  C   +L H VL+VG+G  +
Sbjct:   270 QGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVY-NEPQCDAQNLDHGVLVVGFGTDE 328

Query:   185 N-IPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
             +   YWLV+NSWG    D+GF K+ R   N CGI   + Y  +
Sbjct:   329 SGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 371

 Score = 141 (54.7 bits), Expect = 6.4e-06, Sum P(2) = 6.4e-06
 Identities = 28/68 (41%), Positives = 38/68 (55%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGN-NACGIE 1016
             V N+  C   +L H VL+VG+G  +    YWLV+NSWG    D+GF K+ R   N CGI 
Sbjct:   304 VYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIA 363

Query:  1017 QIAGYATI 1024
               + Y  +
Sbjct:   364 SASSYPLV 371

 Score = 43 (20.2 bits), Expect = 6.4e-06, Sum P(2) = 6.4e-06
 Identities = 15/56 (26%), Positives = 24/56 (42%)

Query:   179 GYGKQDNIPYWLVRNSW----GPIGP-DEGFFKIERGNNACGIEQIAGYATIDVVI 229
             G   + + PY  + +S     G +G  D GF  I +G+     E +A    + V I
Sbjct:   235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290

 Score = 40 (19.1 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
 Identities = 11/35 (31%), Positives = 16/35 (45%)

Query:   562 GPIGP-DEGFFKIERGNNACGIEQIAGYATIDVVI 595
             G +G  D GF  I +G+     E +A    + V I
Sbjct:   256 GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290


>RGD|2448
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 354 (129.7 bits), Expect = 3.5e-31, P = 3.5e-31
 Identities = 84/229 (36%), Positives = 123/229 (53%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C+   
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD- 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++   +Y  +  GL+SE+ YPY   +G    C Y +++  +     F+  
Sbjct:   173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEYAVANDTGFVDI 228

Query:   493 NGSE-TMKKILYKYGPLSVGLN-SH-LIHFYN-GTPIRKNDETCSPYDLGHAVLLVGYGK 548
                E  + K +   GP+SV ++ SH  + FY+ G     N   CS  DL H VL+VGYG 
Sbjct:   229 PQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKDLDHGVLVVGYGY 285

Query:   549 Q----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 592
             +    +   YWL +NSWG     +G+ KI +  NN CG+   A Y  ++
Sbjct:   286 EGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIVN 334

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 81/228 (35%), Positives = 122/228 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C+   
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQ 173

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + + +Y  +  GL+SE+ YPY+  +G    C Y +++  +     F+   
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEYAVANDTGFVDIP 229

Query:   128 GSE-TMKKILYKYGPLSVLLNSD--LIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
               E  + K +   GP+SV +++    +  Y+ G     N   CS  DL H VL+VGYG +
Sbjct:   230 QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKDLDHGVLVVGYGYE 286

Query:   184 ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 226
                 +   YWLV+NSWG     +G+ KI +  NN CG+   A Y  ++
Sbjct:   287 GTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIVN 334

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 81/228 (35%), Positives = 122/228 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C+   
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQ 173

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + + +Y  +  GL+SE+ YPY+  +G    C Y +++  +     F+   
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEYAVANDTGFVDIP 229

Query:   859 GSE-TMKKILYKYGPLSVLLNSD--LIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
               E  + K +   GP+SV +++    +  Y+ G     N   CS  DL H VL+VGYG +
Sbjct:   230 QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKDLDHGVLVVGYGYE 286

Query:   915 ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 957
                 +   YWLV+NSWG     +G+ KI +  NN CG+   A Y  ++
Sbjct:   287 GTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIVN 334

 Score = 133 (51.9 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query:   962 DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIE 1016
             +  CS  DL H VL+VGYG +    +   YWLV+NSWG     +G+ KI +  NN CG+ 
Sbjct:   266 EPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLA 325

Query:  1017 QIAGYATID 1025
               A Y  ++
Sbjct:   326 TAASYPIVN 334


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 354 (129.7 bits), Expect = 3.5e-31, P = 3.5e-31
 Identities = 89/229 (38%), Positives = 120/229 (52%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C+ Q 
Sbjct:   114 IPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCS-QP 172

Query:   436 SGCGGCDG--LEQPIEYTHQAG-LESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLY 491
              G  GC G  ++   +Y    G L+SE+ YPY    G    C Y+  +     TG  F+ 
Sbjct:   173 EGNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGT---CLYNPNNSAANETG--FVD 227

Query:   492 FNGSE-TMKKILYKYGPLSVGLNSH--LIHFY-NGTPIRKNDETCSPYDLGHAVLLVGYG 547
                 E  + K +   GP+SV +++H     FY +G     N   CS   + HAVL+VGYG
Sbjct:   228 LPKQEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPN---CSSESVDHAVLVVGYG 284

Query:   548 KQ----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              +    DD  YWL +NSWG      G+ K+ +  NN CGI  +A Y T+
Sbjct:   285 FEGADSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 84/226 (37%), Positives = 120/226 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 IPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPE 173

Query:    71 S--GCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 126
                GC G F + + +Y    G L+SE+ YPY    G    C Y+  +     TG  F+  
Sbjct:   174 GNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGT---CLYNPNNSAANETG--FVDL 228

Query:   127 NGSE-TMKKILYKYGPLSVLLNS-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ- 183
                E  + K +   GP+SV +++ +    +  + I   +  CS   + HAVL+VGYG + 
Sbjct:   229 PKQEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYY-EPNCSSESVDHAVLVVGYGFEG 287

Query:   184 ---DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                D+  YWLV+NSWG      G+ K+ +  NN CGI  +A Y T+
Sbjct:   288 ADSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 84/226 (37%), Positives = 120/226 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 IPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPE 173

Query:   802 S--GCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 857
                GC G F + + +Y    G L+SE+ YPY    G    C Y+  +     TG  F+  
Sbjct:   174 GNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGT---CLYNPNNSAANETG--FVDL 228

Query:   858 NGSE-TMKKILYKYGPLSVLLNS-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ- 914
                E  + K +   GP+SV +++ +    +  + I   +  CS   + HAVL+VGYG + 
Sbjct:   229 PKQEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYY-EPNCSSESVDHAVLVVGYGFEG 287

Query:   915 ---DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                D+  YWLV+NSWG      G+ K+ +  NN CGI  +A Y T+
Sbjct:   288 ADSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPTV 333

 Score = 143 (55.4 bits), Expect = 1.7e-06, P = 1.7e-06
 Identities = 30/68 (44%), Positives = 40/68 (58%)

Query:   962 DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIE 1016
             +  CS   + HAVL+VGYG +    DD  YWLV+NSWG      G+ K+ +  NN CGI 
Sbjct:   266 EPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIKMAKDRNNHCGIA 325

Query:  1017 QIAGYATI 1024
              +A Y T+
Sbjct:   326 TMASYPTV 333


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 354 (129.7 bits), Expect = 3.5e-31, P = 3.5e-31
 Identities = 103/327 (31%), Positives = 160/327 (48%)

Query:   650 DNENILETFKAFIVKRGRQYANDE----EIKERFEYFKQ-----DGHK-KHERY--GTSE 697
             D+E +   ++A++V+ G++  N      E  +RFE FK      D H  K+  Y  G + 
Sbjct:    43 DSE-VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTR 101

Query:   698 FSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGP 756
             F+D + EE      G K ++R  +   +DR               +PD+ DWRK+     
Sbjct:   102 FADLTNEEYRSMYLGAKPTKRVLK--TSDRYQARVGDA-------LPDSVDWRKEGAVAD 152

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCFFEPSIEY 815
               DQ +CGSCWAFS  G +EG   I TG L+  S+ +LV+C    + GC+G   + + E+
Sbjct:   153 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 212

Query:   816 T-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGP 872
                  G+++E DYPYK A+G   +C  ++   K+ T   +  +  N   ++KK L  + P
Sbjct:   213 IIKNGGIDTEADYPYKAADG---RCDQNRKNAKVVTIDSYEDVPENSEASLKKAL-AHQP 268

Query:   873 LSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIG 930
             +SV + +       Y+       D  C   +L H V+ VGYG ++   YW+VRNSWG   
Sbjct:   269 ISVAIEAGGRAFQLYSSGVF---DGLCGT-ELDHGVVAVGYGTENGKDYWIVRNSWGNRW 324

Query:   931 PDEGFFKIERGNNA----CGIEQIAGY 953
              + G+ K+ R   A    CGI   A Y
Sbjct:   325 GESGYIKMARNIEAPTGKCGIAMEASY 351

 Score = 339 (124.4 bits), Expect = 4.3e-29, P = 4.3e-29
 Identities = 102/329 (31%), Positives = 158/329 (48%)

Query:   284 DNENILETFKAFIVKRGRQYANDE----EIKERFEYFKQ-----DGHK-KHERY--GTSE 331
             D+E +   ++A++V+ G++  N      E  +RFE FK      D H  K+  Y  G + 
Sbjct:    43 DSE-VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTR 101

Query:   332 FSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGP 390
             F+D + EE      G K ++R  +   +DR               +PD+ DWRK+     
Sbjct:   102 FADLTNEEYRSMYLGAKPTKRVLK--TSDRYQARVGDA-------LPDSVDWRKEGAVAD 152

Query:   391 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCGGCDGL-EQPI 448
               DQ +CGSCWAFS  G +EG   I TG L+  S+ +LV+C    + GC G  GL +   
Sbjct:   153 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNG--GLMDYAF 210

Query:   449 EYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKY 505
             E+     G+++E DYPY+  +G   +C  ++   K+ T   +  +  N   ++KK L  +
Sbjct:   211 EFIIKNGGIDTEADYPYKAADG---RCDQNRKNAKVVTIDSYEDVPENSEASLKKAL-AH 266

Query:   506 GPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGP 563
              P+SV + +       Y+       D  C   +L H V+ VGYG ++   YW+ RNSWG 
Sbjct:   267 QPISVAIEAGGRAFQLYSSGVF---DGLCGT-ELDHGVVAVGYGTENGKDYWIVRNSWGN 322

Query:   564 IGPDEGFFKIERGNNA----CGIEQIAGY 588
                + G+ K+ R   A    CGI   A Y
Sbjct:   323 RWGESGYIKMARNIEAPTGKCGIAMEASY 351

 Score = 329 (120.9 bits), Expect = 1.3e-28, Sum P(2) = 1.3e-28
 Identities = 77/222 (34%), Positives = 115/222 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD+ DWRK+       DQ  CGSCWAFS  G +EG   I TG L+  S+ +LV+C    
Sbjct:   138 LPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 197

Query:    71 S-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHF 126
             + GC+G   + + E+     G+++E DYPYK A+G   +C  ++   K+ T   +  +  
Sbjct:   198 NQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADG---RCDQNRKNAKVVTIDSYEDVPE 254

Query:   127 NGSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N   ++KK L  + P+SV + +       Y+       D  C   +L H V+ VGYG ++
Sbjct:   255 NSEASLKKAL-AHQPISVAIEAGGRAFQLYSSGVF---DGLCGT-ELDHGVVAVGYGTEN 309

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 222
                YW+VRNSWG    + G+ K+ R   A    CGI   A Y
Sbjct:   310 GKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASY 351

 Score = 51 (23.0 bits), Expect = 1.3e-28, Sum P(2) = 1.3e-28
 Identities = 19/62 (30%), Positives = 25/62 (40%)

Query:   451 THQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKI-LYKYGPLS 509
             T + G+  E  YP + G           S +K  T  D  YF+  E+     LYKYG   
Sbjct:   340 TGKCGIAMEASYPIKKGQNPPNPGPSPPSPIKPPTTCD-KYFSCPESNTCCCLYKYGKYC 398

Query:   510 VG 511
              G
Sbjct:   399 FG 400


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 353 (129.3 bits), Expect = 4.5e-31, P = 4.5e-31
 Identities = 84/229 (36%), Positives = 124/229 (54%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C+   
Sbjct:   114 IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCS-HA 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++   +Y  +  GL+SE+ YPY   +G    C Y +++  +     F+  
Sbjct:   173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEFAVANDTGFVDI 228

Query:   493 NGSE-TMKKILYKYGPLSVGLN-SH-LIHFYN-GTPIRKNDETCSPYDLGHAVLLVGYGK 548
                E  + K +   GP+SV ++ SH  + FY+ G     N   CS  +L H VLLVGYG 
Sbjct:   229 PQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKNLDHGVLLVGYGY 285

Query:   549 Q----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 592
             +    +   YWL +NSWG     EG+ KI +  +N CG+   A Y  ++
Sbjct:   286 EGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVVN 334

 Score = 352 (129.0 bits), Expect = 5.8e-31, P = 5.8e-31
 Identities = 82/228 (35%), Positives = 124/228 (54%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             +P + DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C  A+
Sbjct:   114 IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQ 173

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + + +Y  +  GL+SE+ YPY+  +G    C Y +++  +     F+   
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEFAVANDTGFVDIP 229

Query:   128 GSE-TMKKILYKYGPLSVLLNSD--LIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
               E  + K +   GP+SV +++    +  Y+ G     N   CS  +L H VLLVGYG +
Sbjct:   230 QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKNLDHGVLLVGYGYE 286

Query:   184 ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 226
                 +   YWLV+NSWG     EG+ KI +  +N CG+   A Y  ++
Sbjct:   287 GTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVVN 334

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 82/228 (35%), Positives = 124/228 (54%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             +P + DWR+K    P  +Q  CGSCWAFS +G LEGQ  +KTGKL+  S+  LV+C  A+
Sbjct:   114 IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQ 173

Query:   800 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + + +Y  +  GL+SE+ YPY+  +G    C Y +++  +     F+   
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS---CKY-RAEFAVANDTGFVDIP 229

Query:   859 GSE-TMKKILYKYGPLSVLLNSD--LIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
               E  + K +   GP+SV +++    +  Y+ G     N   CS  +L H VLLVGYG +
Sbjct:   230 QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPN---CSSKNLDHGVLLVGYGYE 286

Query:   915 ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATID 957
                 +   YWLV+NSWG     EG+ KI +  +N CG+   A Y  ++
Sbjct:   287 GTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVVN 334

 Score = 130 (50.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 28/69 (40%), Positives = 39/69 (56%)

Query:   962 DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIE 1016
             +  CS  +L H VLLVGYG +    +   YWLV+NSWG     EG+ KI +  +N CG+ 
Sbjct:   266 EPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLA 325

Query:  1017 QIAGYATID 1025
               A Y  ++
Sbjct:   326 TAASYPVVN 334

 Score = 37 (18.1 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query:   281 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRS 336
             ++   +N+++   A    +G Q  N   +   F+Y K++G    E     E  D S
Sbjct:   159 ISLSEQNLVDCSHA----QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS 210

 Score = 37 (18.1 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 13/56 (23%), Positives = 24/56 (42%)

Query:   647 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRS 702
             ++   +N+++   A    +G Q  N   +   F+Y K++G    E     E  D S
Sbjct:   159 ISLSEQNLVDCSHA----QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS 210


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 352 (129.0 bits), Expect = 5.8e-31, P = 5.8e-31
 Identities = 94/312 (30%), Positives = 155/312 (49%)

Query:   662 IVKRGRQYANDEEIKERFEYFKQDGHK--KHERYGTSEFSDRSPEEILCKTGFKWSERTY 719
             +VK  + Y N++E  +RF+ F QD +    + R    E  +    E    T  +++++ +
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIF-QDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFF 59

Query:   720 ERIVADRXXXXXXXXXXX-----XDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGM 774
             E++V +                  +  +P ++DWR     G   +Q +C SCW+FS  G 
Sbjct:    60 EKLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGA 119

Query:   775 LEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKN 831
             LEG Y IK G+L++ S+  LV+CA      GC   +   + +Y     G+  E  YPY  
Sbjct:   120 LEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPY-- 177

Query:   832 ANGEKFKCAYDKS-KVKLFTGKDFL-HFNGSETMKKILYKYGPLSVLLNS---DLIHDYN 886
               G+   C +++S K    +G   +  F+ S  M+ I   YGP++V +++   +  H   
Sbjct:   178 -TGKDEVCKFNQSEKEAKVSGFVMIPKFDESALMEAIAL-YGPVAVPIDTSTKEFQHLSG 235

Query:   887 GTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERG-NNA 944
             G  I  +D +C P++  HAVL +GYG  +N + Y+L++NSWG      GFFK++RG    
Sbjct:   236 G--IYYSD-SCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGK 292

Query:   945 CGIEQIAGYATI 956
             CGI   A Y  +
Sbjct:   293 CGIVTAASYPIV 304

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 94/310 (30%), Positives = 152/310 (49%)

Query:   296 IVKRGRQYANDEEIKERFEYFKQDGHK--KHERYGTSEFSDRSPEEILCKTGFKWSERTY 353
             +VK  + Y N++E  +RF+ F QD +    + R    E  +    E    T  +++++ +
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIF-QDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFF 59

Query:   354 ERIVADRXXXXXXXXXXX-----XDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGM 408
             E++V +                  +  +P ++DWR     G   +Q +C SCW+FS  G 
Sbjct:    60 EKLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGA 119

Query:   409 LEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCD-G-LEQPIEYT-HQAGLESEKDYPYR 465
             LEG Y IK G+L++ S+  LV+CA    G  GC  G +    +Y     G+  E  YPY 
Sbjct:   120 LEGHYYIKYGELLDLSEQNLVDCATPF-GPKGCKTGWMHDAFKYIISSGGVNLESQYPY- 177

Query:   466 NGNGE--KFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGT 523
              G  E  KF  +  ++KV  F       F+ S  M+ I   YGP++V +++    F + +
Sbjct:   178 TGKDEVCKFNQSEKEAKVSGFVM--IPKFDESALMEAIAL-YGPVAVPIDTSTKEFQHLS 234

Query:   524 PIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERG-NNACG 581
                   ++C P++  HAVL +GYG  ++ + Y+L +NSWG      GFFK++RG    CG
Sbjct:   235 GGIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGKCG 294

Query:   582 IEQIAGYATI 591
             I   A Y  +
Sbjct:   295 IVTAASYPIV 304

 Score = 325 (119.5 bits), Expect = 4.5e-28, P = 4.5e-28
 Identities = 78/225 (34%), Positives = 119/225 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P ++DWR     G   +Q  C SCW+FS  G LEG Y IK G+L++ S+  LV+CA   
Sbjct:    87 IPKSFDWRDHGAVGKVKNQGSCASCWSFSALGALEGHYYIKYGELLDLSEQNLVDCATPF 146

Query:    71 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFL-H 125
                GC   +   + +Y     G+  E  YPY    G+   C +++S K    +G   +  
Sbjct:   147 GPKGCKTGWMHDAFKYIISSGGVNLESQYPY---TGKDEVCKFNQSEKEAKVSGFVMIPK 203

Query:   126 FNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
             F+ S  M+ I   YGP++V +++   +  H   G  I  +D +C P++  HAVL +GYG 
Sbjct:   204 FDESALMEAIAL-YGPVAVPIDTSTKEFQHLSGG--IYYSD-SCDPWNTIHAVLAIGYGT 259

Query:   183 QDN-IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              +N + Y+L++NSWG      GFFK++RG    CGI   A Y  +
Sbjct:   260 DENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGKCGIVTAASYPIV 304

 Score = 142 (55.0 bits), Expect = 1.8e-06, P = 1.8e-06
 Identities = 26/64 (40%), Positives = 40/64 (62%)

Query:   963 ETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             ++C P++  HAVL +GYG  ++ + Y+L++NSWG      GFFK++RG    CGI   A 
Sbjct:   241 DSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGKCGIVTAAS 300

Query:  1021 YATI 1024
             Y  +
Sbjct:   301 YPIV 304


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 96/317 (30%), Positives = 149/317 (47%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKK--HERY--GTSEFSDRSPEEILC 708
             ++ ++V+  + Y    E + RFE FK      + H    +  Y  G + F+D + +E   
Sbjct:    43 YERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRA 102

Query:   709 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
                    ERT   +  ++               +PDA DWR K    P  DQ +CGSCWA
Sbjct:   103 IYLRSKMERTRVPVKGEKYLYKVGDS-------LPDAIDWRAKGAVNPVKDQGSCGSCWA 155

Query:   769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCFFEPSIEYT-HQAGLESEKD 826
             FS  G +EG   IKTG+L+  S+ +LV+C    + GC G   + + ++     G+++E+D
Sbjct:   156 FSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEED 215

Query:   827 YPYKNANGEKFKCAYDKSKVKLFT--GKDFLHFNGSETMKKILYKYGPLSVLLNSD--LI 882
             YPY   +     C  DK   ++ T  G + +  N  +++KK L    P+SV + +     
Sbjct:   216 YPYIATDVNV--CNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQ-PISVAIEAGGRAF 272

Query:   883 HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG- 941
               Y          TC    L H V+ VGYG +    YW+VRNSWG    + G+FK+ER  
Sbjct:   273 QLYTSGVFTG---TCGT-SLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNI 328

Query:   942 ---NNACGIEQIAGYAT 955
                +  CG+  +A Y T
Sbjct:   329 KESSGKCGVAMMASYPT 345

 Score = 349 (127.9 bits), Expect = 1.2e-30, P = 1.2e-30
 Identities = 98/319 (30%), Positives = 150/319 (47%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKK--HERY--GTSEFSDRSPEEILC 342
             ++ ++V+  + Y    E + RFE FK      + H    +  Y  G + F+D + +E   
Sbjct:    43 YERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRA 102

Query:   343 KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
                    ERT   +  ++               +PDA DWR K    P  DQ +CGSCWA
Sbjct:   103 IYLRSKMERTRVPVKGEKYLYKVGDS-------LPDAIDWRAKGAVNPVKDQGSCGSCWA 155

Query:   403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCGGCDGL-EQPIEYT-HQAGLESE 459
             FS  G +EG   IKTG+L+  S+ +LV+C    + GCGG  GL +   ++     G+++E
Sbjct:   156 FSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGG--GLMDYAFKFIIENGGIDTE 213

Query:   460 KDYPYRNGNGEKFKCAYDKSKVKLFT--GKDFLYFNGSETMKKILYKYGPLSVGLNS--H 515
             +DYPY   +     C  DK   ++ T  G + +  N  +++KK L    P+SV + +   
Sbjct:   214 EDYPYIATDVNV--CNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQ-PISVAIEAGGR 270

Query:   516 LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIER 575
                 Y          TC    L H V+ VGYG +    YW+ RNSWG    + G+FK+ER
Sbjct:   271 AFQLYTSGVFTG---TCGT-SLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLER 326

Query:   576 G----NNACGIEQIAGYAT 590
                  +  CG+  +A Y T
Sbjct:   327 NIKESSGKCGVAMMASYPT 345

 Score = 344 (126.2 bits), Expect = 4.2e-30, P = 4.2e-30
 Identities = 79/224 (35%), Positives = 115/224 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PDA DWR K    P  DQ  CGSCWAFS  G +EG   IKTG+L+  S+ +LV+C    
Sbjct:   129 LPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSY 188

Query:    71 S-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT--GKDFLHF 126
             + GC G   + + ++     G+++E+DYPY   +     C  DK   ++ T  G + +  
Sbjct:   189 NDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNV--CNSDKKNTRVVTIDGYEDVPQ 246

Query:   127 NGSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N  +++KK L    P+SV + +       Y          TC    L H V+ VGYG + 
Sbjct:   247 NDEKSLKKALANQ-PISVAIEAGGRAFQLYTSGVFTG---TCGT-SLDHGVVAVGYGSEG 301

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGYAT 224
                YW+VRNSWG    + G+FK+ER     +  CG+  +A Y T
Sbjct:   302 GQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345

 Score = 138 (53.6 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 27/64 (42%), Positives = 36/64 (56%)

Query:   964 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIA 1019
             TC    L H V+ VGYG +    YW+VRNSWG    + G+FK+ER     +  CG+  +A
Sbjct:   283 TCGT-SLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMA 341

Query:  1020 GYAT 1023
              Y T
Sbjct:   342 SYPT 345


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 351 (128.6 bits), Expect = 7.4e-31, P = 7.4e-31
 Identities = 89/229 (38%), Positives = 126/229 (55%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP+  DWR+K    P  DQ  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   116 VPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP- 174

Query:   436 SGCGGCDG--LEQPIEYTH-QAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLY 491
              G  GC+G  ++Q  +Y   Q GL+SE+ YPY  G  ++  C +D K+     TG  F+ 
Sbjct:   175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYL-GTDDQ-PCHFDPKNSAANDTG--FVD 230

Query:   492 F-NGSE-TMKKILYKYGPLSVGLNS-H-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG 547
               +G E  + K +   GP+SV +++ H    FY  + I    E CS  +L H VL VGYG
Sbjct:   231 IPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQ-SGIYYEKE-CSSEELDHGVLAVGYG 288

Query:   548 KQ-DDIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              + +D+    YW+ +NSW     D+G+  + +  +N CGI   A Y  +
Sbjct:   289 FEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 337

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 85/229 (37%), Positives = 123/229 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ- 69
             VP+  DWR+K    P  DQ +CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   116 VPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPE 175

Query:    70 -CSGCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 126
                GC+G   + + +Y   Q GL+SE+ YPY   + +   C +D K+     TG  F+  
Sbjct:   176 GNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQP--CHFDPKNSAANDTG--FVDI 231

Query:   127 -NGSE-TMKKILYKYGPLSVLLNSDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYG 181
              +G E  + K +   GP+SV +++   H+   +  + I    E CS  +L H VL VGYG
Sbjct:   232 PSGKERALMKAIAAVGPVSVAIDAG--HESFQFYQSGIYYEKE-CSSEELDHGVLAVGYG 288

Query:   182 KQ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              +    D   YW+V+NSW     D+G+  + +  +N CGI   A Y  +
Sbjct:   289 FEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 337

 Score = 344 (126.2 bits), Expect = 4.2e-30, P = 4.2e-30
 Identities = 85/229 (37%), Positives = 122/229 (53%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ- 800
             VP+  DWR+K    P  DQ  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   116 VPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPE 175

Query:   801 -CSGCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 857
                GC+G   + + +Y   Q GL+SE+ YPY   + +   C +D K+     TG  F+  
Sbjct:   176 GNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQP--CHFDPKNSAANDTG--FVDI 231

Query:   858 -NGSE-TMKKILYKYGPLSVLLNSDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYG 912
              +G E  + K +   GP+SV +++   H+   +  + I    E CS  +L H VL VGYG
Sbjct:   232 PSGKERALMKAIAAVGPVSVAIDAG--HESFQFYQSGIYYEKE-CSSEELDHGVLAVGYG 288

Query:   913 KQ----DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              +    D   YW+V+NSW     D+G+  + +  +N CGI   A Y  +
Sbjct:   289 FEGEDVDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPLV 337


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 76/209 (36%), Positives = 115/209 (55%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P  +DWR   V GP  +Q +CG CWAFSI   +E   A    KL + S  Q+++C+ Q  
Sbjct:   122 PPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSYQNQ 181

Query:   803 GCDGCFFEPSIEYTHQAGLE--SEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHFN 858
             GC+G     ++ +  Q+ L+  SE +YP+K A+G  + F  A+    V+ ++  DF    
Sbjct:   182 GCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFS--G 239

Query:   859 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 918
               E M   L  +GPL V++++    DY G  I+ +   CS +   HAVL+ GY     +P
Sbjct:   240 QEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHH---CSSHKANHAVLITGYDTTGEVP 296

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGI 947
             YW+VRNSWG    D+G+  I+ GN+ CG+
Sbjct:   297 YWIVRNSWGTSWGDDGYAYIKIGNDVCGV 325

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 77/217 (35%), Positives = 117/217 (53%)

Query:     4 EVEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             E++     P  +DWR   V GP  +Q  CG CWAFSI   +E   A    KL + S  Q+
Sbjct:   114 EIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQV 173

Query:    64 VECAKQCSGCDGCFFEPSIEYTHQAGLE--SEKDYPYKNANG--EKFKCAYDKSKVKLFT 119
             ++C+ Q  GC+G     ++ +  Q+ L+  SE +YP+K A+G  + F  A+    V+ ++
Sbjct:   174 IDCSYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYS 233

Query:   120 GKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVG 179
               DF      E M   L  +GPL V++++    DY G  I+ +   CS +   HAVL+ G
Sbjct:   234 AYDFS--GQEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHH---CSSHKANHAVLITG 288

Query:   180 YGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             Y     +PYW+VRNSWG    D+G+  I+ GN+ CG+
Sbjct:   289 YDTTGEVPYWIVRNSWGTSWGDDGYAYIKIGNDVCGV 325

 Score = 327 (120.2 bits), Expect = 2.8e-28, P = 2.8e-28
 Identities = 73/209 (34%), Positives = 109/209 (52%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P  +DWR   V GP  +Q +CG CWAFSI   +E   A    KL + S  Q+++C+ Q  
Sbjct:   122 PPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSYQNQ 181

Query:   437 GCGGCDGLEQPIEYTH-QAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTGKDFLYFN 493
             GC G   +E     T  +  L SE +YP++  +G  + F  A+    V+ ++  DF    
Sbjct:   182 GCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFS--G 239

Query:   494 GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP 553
               E M   L  +GPL V +++     Y G  I+ +   CS +   HAVL+ GY    ++P
Sbjct:   240 QEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHH---CSSHKANHAVLITGYDTTGEVP 296

Query:   554 YWLARNSWGPIGPDEGFFKIERGNNACGI 582
             YW+ RNSWG    D+G+  I+ GN+ CG+
Sbjct:   297 YWIVRNSWGTSWGDDGYAYIKIGNDVCGV 325

 Score = 150 (57.9 bits), Expect = 2.9e-07, P = 2.9e-07
 Identities = 24/51 (47%), Positives = 34/51 (66%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS +   HAVL+ GY    ++PYW+VRNSWG    D+G+  I+ GN+ CG+
Sbjct:   275 CSSHKANHAVLITGYDTTGEVPYWIVRNSWGTSWGDDGYAYIKIGNDVCGV 325


>TAIR|locus:2152445
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 350 (128.3 bits), Expect = 9.5e-31, P = 9.5e-31
 Identities = 97/332 (29%), Positives = 154/332 (46%)

Query:   646 SLTFDNENILETFKA-FIVKRGRQYANDEEIKERFEYFKQDGHK-KHE---------RYG 694
             S   DNE I++     ++ K GR YA+ +E   R+  FK +  + +H          +  
Sbjct:    25 SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLA 84

Query:   695 TSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 753
              ++F+D + +E     TGFK       +    +             G +P + DWRKK  
Sbjct:    85 VNQFADLTNDEFRSMYTGFKGVSALSSQ---SQTKMSPFRYQNVSSGALPVSVDWRKKGA 141

Query:   754 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSI 813
               P  +Q +CG CWAFS    +EG   IK GKL+  S+ QLV+C     GC+G   + + 
Sbjct:   142 VTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAF 201

Query:   814 EYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKL--FTGKDFLHFNGSETMKKILYKY 870
             E+    G L +E +YPYK   GE   C   K+  K    TG + +  N  + + K +  +
Sbjct:   202 EHIKATGGLTTESNYPYK---GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAV-AH 257

Query:   871 GPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGP 928
              P+SV +        +  + +   +  C+ Y L HAV  +GYG+  N   YW+++NSWG 
Sbjct:   258 QPVSVGIEGGGFDFQFYSSGVFTGE--CTTY-LDHAVTAIGYGESTNGSKYWIIKNSWGT 314

Query:   929 IGPDEGFFKIERG----NNACGIEQIAGYATI 956
                + G+ +I++        CG+   A Y TI
Sbjct:   315 KWGESGYMRIQKDVKDKQGLCGLAMKASYPTI 346

 Score = 338 (124.0 bits), Expect = 1.8e-29, P = 1.8e-29
 Identities = 100/334 (29%), Positives = 156/334 (46%)

Query:   280 SLTFDNENILETFKA-FIVKRGRQYANDEEIKERFEYFKQDGHK-KHE---------RYG 328
             S   DNE I++     ++ K GR YA+ +E   R+  FK +  + +H          +  
Sbjct:    25 SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLA 84

Query:   329 TSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 387
              ++F+D + +E     TGFK       +    +             G +P + DWRKK  
Sbjct:    85 VNQFADLTNDEFRSMYTGFKGVSALSSQ---SQTKMSPFRYQNVSSGALPVSVDWRKKGA 141

Query:   388 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGC-DGLE 445
               P  +Q +CG CWAFS    +EG   IK GKL+  S+ QLV+C     GC GG  D   
Sbjct:   142 VTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAF 201

Query:   446 QPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKL--FTGKDFLYFNGSETMKKILY 503
             + I+ T   GL +E +YPY+   GE   C   K+  K    TG + +  N  + + K + 
Sbjct:   202 EHIKAT--GGLTTESNYPYK---GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAV- 255

Query:   504 KYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSW 561
              + P+SVG+      F +  + +   +  C+ Y L HAV  +GYG+  +   YW+ +NSW
Sbjct:   256 AHQPVSVGIEGGGFDFQFYSSGVFTGE--CTTY-LDHAVTAIGYGESTNGSKYWIIKNSW 312

Query:   562 GPIGPDEGFFKIERG----NNACGIEQIAGYATI 591
             G    + G+ +I++        CG+   A Y TI
Sbjct:   313 GTKWGESGYMRIQKDVKDKQGLCGLAMKASYPTI 346

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 75/226 (33%), Positives = 113/226 (50%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 68
             G +P + DWRKK    P  +Q  CG CWAFS    +EG   IK GKL+  S+ QLV+C  
Sbjct:   128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query:    69 QCSGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKL--FTGKDFLH 125
                GC+G   + + E+    G L +E +YPYK   GE   C   K+  K    TG + + 
Sbjct:   188 NDFGCEGGLMDTAFEHIKATGGLTTESNYPYK---GEDATCNSKKTNPKATSITGYEDVP 244

Query:   126 FNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              N  + + K +  + P+SV +        +  + +   +  C+ Y L HAV  +GYG+  
Sbjct:   245 VNDEQALMKAV-AHQPVSVGIEGGGFDFQFYSSGVFTGE--CTTY-LDHAVTAIGYGEST 300

Query:   185 N-IPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGYATI 225
             N   YW+++NSWG    + G+ +I++        CG+   A Y TI
Sbjct:   301 NGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPTI 346


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 338 (124.0 bits), Expect = 9.6e-31, Sum P(2) = 9.6e-31
 Identities = 91/288 (31%), Positives = 141/288 (48%)

Query:   667 RQYANDEEIKERFEYFKQDGHKKHER-YGTSEFSDRSPEEILCKTGFKWSERTYERIVAD 725
             R+ A   E   R  Y     H+     YG ++FS   PEE   K  +  S+  +    A 
Sbjct:    31 REAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEF--KALYLGSKYAW----AP 84

Query:   726 RXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGK 785
             R              P+   +DWR K+V  P  +Q  CG CWAFS+   +E   AI+ GK
Sbjct:    85 RYPAEGQRPIPNVSLPL--RFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQ-GK 141

Query:   786 LVEF-SKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLE--SEKDYPYKNANGEKFKCAYD 842
              +++ S  Q+++C+   SGC G     ++ + ++  L+  ++  YP+K  NG+   C + 
Sbjct:   142 SLDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQ---CRHF 198

Query:   843 KSKVKLFTGKDF--LHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSP 899
                    + KDF   +F G E  M + L  +GPL V++++    DY G  I+ +   CS 
Sbjct:   199 PQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH---CSS 255

Query:   900 YDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 947
              +  HAVL+ G+ +  N PYW+VRNSWG     EG+  ++ G N CGI
Sbjct:   256 GEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVKMGGNVCGI 303

 Score = 331 (121.6 bits), Expect = 1.0e-28, P = 1.0e-28
 Identities = 74/212 (34%), Positives = 117/212 (55%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEF-SKSQLVECAKQ 69
             +P  +DWR K+V  P  +Q  CG CWAFS+   +E   AI+ GK +++ S  Q+++C+  
Sbjct:    99 LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQ-GKSLDYLSVQQVIDCSFN 157

Query:    70 CSGCDGCFFEPSIEYTHQAGLE--SEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LH 125
              SGC G     ++ + ++  L+  ++  YP+K  NG+   C +        + KDF   +
Sbjct:   158 NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQ---CRHFPQSQAGVSVKDFSAYN 214

Query:   126 FNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             F G E  M + L  +GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ +  
Sbjct:   215 FRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH---CSSGEANHAVLITGFDRTG 271

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             N PYW+VRNSWG     EG+  ++ G N CGI
Sbjct:   272 NTPYWMVRNSWGSSWGVEGYAHVKMGGNVCGI 303

 Score = 308 (113.5 bits), Expect = 3.0e-26, P = 3.0e-26
 Identities = 88/289 (30%), Positives = 138/289 (47%)

Query:   301 RQYANDEEIKERFEYFKQDGHKKHER-YGTSEFSDRSPEEILCKTGFKWSERTYERIVAD 359
             R+ A   E   R  Y     H+     YG ++FS   PEE   K  +  S+  +    A 
Sbjct:    31 REAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEF--KALYLGSKYAW----AP 84

Query:   360 RXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGK 419
             R              P+   +DWR K+V  P  +Q  CG CWAFS+   +E   AI+ GK
Sbjct:    85 RYPAEGQRPIPNVSLPL--RFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQ-GK 141

Query:   420 LVEF-SKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLE--SEKDYPYRNGNGEKFKCAY 476
              +++ S  Q+++C+   SGC G   L   + + ++  L+  ++  YP++  NG+   C +
Sbjct:   142 SLDYLSVQQVIDCSFNNSGCLGGSPL-CALRWLNETQLKLVADSQYPFKAVNGQ---CRH 197

Query:   477 DKSKVKLFTGKDFLYFN--GSET-MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCS 533
                     + KDF  +N  G E  M + L  +GPL V +++     Y G  I+ +   CS
Sbjct:   198 FPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHH---CS 254

Query:   534 PYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGI 582
               +  HAVL+ G+ +  + PYW+ RNSWG     EG+  ++ G N CGI
Sbjct:   255 SGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVKMGGNVCGI 303

 Score = 138 (53.6 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS  +  HAVL+ G+ +  + PYW+VRNSWG     EG+  ++ G N CGI
Sbjct:   253 CSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVKMGGNVCGI 303

 Score = 39 (18.8 bits), Expect = 9.6e-31, Sum P(2) = 9.6e-31
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query:   425 KSQLVECAKQCSGCGGCDGLEQPIEYTHQ 453
             K QLV     C  C G  G+     ++HQ
Sbjct:     2 KPQLVNLLLLCCCCLGRHGVAGTWSWSHQ 30


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 349 (127.9 bits), Expect = 1.2e-30, P = 1.2e-30
 Identities = 84/223 (37%), Positives = 121/223 (54%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+ ++  
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLHF 126
                GC+G   + + +Y  +  GL+SE+ YPY+  +    +K  Y  +K    TG  F+  
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKD---TG--FVDI 115

Query:   127 NGSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
                E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG + 
Sbjct:   116 PQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEG 174

Query:   185 -NIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              N  +W+V+NSWGP   ++G+ K+ +  NN CGI   A Y T+
Sbjct:   175 TNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217

 Score = 348 (127.6 bits), Expect = 1.6e-30, P = 1.6e-30
 Identities = 84/223 (37%), Positives = 121/223 (54%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+ ++  
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLHF 857
                GC+G   + + +Y  +  GL+SE+ YPY+  +    +K  Y  +K    TG  F+  
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKD---TG--FVDI 115

Query:   858 NGSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
                E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG + 
Sbjct:   116 PQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEG 174

Query:   916 -NIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              N  +W+V+NSWGP   ++G+ K+ +  NN CGI   A Y T+
Sbjct:   175 TNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 85/225 (37%), Positives = 120/225 (53%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+ ++  
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRP- 59

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGE-KFKCAYDKSKVKLFTGKDFLY 491
              G  GC+G  ++   +Y  +  GL+SE+ YPY   +    +K  Y  +K    TG  F+ 
Sbjct:    60 QGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKD---TG--FVD 114

Query:   492 FNGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
                 E  + K +   GP+SV +++ H    FY        D  CS  DL H VL+VGYG 
Sbjct:   115 IPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY--DPDCSSKDLDHGVLVVGYGF 172

Query:   549 QD-DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
             +  +  +W+ +NSWGP   ++G+ K+ +  NN CGI   A Y T+
Sbjct:   173 EGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217

 Score = 152 (58.6 bits), Expect = 4.1e-10, Sum P(2) = 4.1e-10
 Identities = 32/80 (40%), Positives = 48/80 (60%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFK 1005
             AG+++    K+    D  CS  DL H VL+VGYG +  +  +W+V+NSWGP   ++G+ K
Sbjct:   138 AGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPEWGNKGYVK 197

Query:  1006 IERG-NNACGIEQIAGYATI 1024
             + +  NN CGI   A Y T+
Sbjct:   198 MAKDQNNHCGIATAASYPTV 217

 Score = 49 (22.3 bits), Expect = 4.1e-10, Sum P(2) = 4.1e-10
 Identities = 28/103 (27%), Positives = 44/103 (42%)

Query:   249 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 308
             CG  SC    S T  +  Q+  +   L    SL+   +N++++ +     +G Q  N   
Sbjct:    22 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDSSRP----QGNQGCNGGL 69

Query:   309 IKERFEYFKQDGHKKHERYGTSEFSDRS----PEEILCK-TGF 346
             +   F+Y K++G    E     E +D S    PE    K TGF
Sbjct:    70 MDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGF 112

 Score = 49 (22.3 bits), Expect = 4.1e-10, Sum P(2) = 4.1e-10
 Identities = 28/103 (27%), Positives = 44/103 (42%)

Query:   615 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 674
             CG  SC    S T  +  Q+  +   L    SL+   +N++++ +     +G Q  N   
Sbjct:    22 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDSSRP----QGNQGCNGGL 69

Query:   675 IKERFEYFKQDGHKKHERYGTSEFSDRS----PEEILCK-TGF 712
             +   F+Y K++G    E     E +D S    PE    K TGF
Sbjct:    70 MDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGF 112


>UNIPROTKB|P09648
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 349 (127.9 bits), Expect = 1.2e-30, P = 1.2e-30
 Identities = 82/223 (36%), Positives = 116/223 (52%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+    GKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP-E 60

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF- 492
             G  GC+G  ++Q  +Y     G++SE+ YPY   + E   C Y K++        F+   
Sbjct:    61 GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIP 117

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
              G E  + K +   GP+SV +++ H    FY      + D  CS  DL H VL+VGYG +
Sbjct:   118 QGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFE 175

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
                 YW+ +NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   176 GGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 79/222 (35%), Positives = 113/222 (50%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-- 69
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+    GKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 127
               GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQ 118

Query:   128 GSE-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             G E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG + 
Sbjct:   119 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFEG 176

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 342 (125.4 bits), Expect = 6.8e-30, P = 6.8e-30
 Identities = 79/222 (35%), Positives = 113/222 (50%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-- 800
             P + DWR+K    P  DQ  CGSCWAFS  G LEGQ+    GKLV  S+  LV+C++   
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   801 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-N 858
               GC+G   + + +Y     G++SE+ YPY   + E   C Y K++        F+    
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDED--CRY-KAEYNAANDTGFVDIPQ 118

Query:   859 GSE-TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             G E  + K +   GP+SV +++       Y      + D  CS  DL H VL+VGYG + 
Sbjct:   119 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD--CSSEDLDHGVLVVGYGFEG 176

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218

 Score = 136 (52.9 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 31/102 (30%), Positives = 48/102 (47%)

Query:   928 PIGPDEGFFKIERGNNACGIEQIAGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQD 983
             P G +    K         +   AG+++    ++    +  CS  DL H VL+VGYG + 
Sbjct:   117 PQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEG 176

Query:   984 DIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 1024
                YW+V+NSWG    D+G+  + +   N CGI   A Y  +
Sbjct:   177 GKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 218


>TAIR|locus:2117979
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 348 (127.6 bits), Expect = 1.6e-30, P = 1.6e-30
 Identities = 99/331 (29%), Positives = 154/331 (46%)

Query:   272 VDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKER-FEYFKQ-----DGHK-KH 324
             +D  A  G     NE +   F+ ++ K G+ Y N    KER F+ FK      D H  K+
Sbjct:    27 MDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKN 86

Query:   325 ERY--GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDW 382
               Y  G + F+D + +E   +  F  S +  +R                 D  +P++ DW
Sbjct:    87 LSYQLGLTRFADLTVQEY--RDLFPGSPKPKQR----NLKTSRRYVPLAGD-QLPESVDW 139

Query:   383 RKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCD 442
             R++       DQ  C SCWAFS    +EG   I TG+L+  S+ +LV+C    +GC G  
Sbjct:   140 RQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNGCYGSG 199

Query:   443 GLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKI 501
              ++   ++  +  GL+SEKDYPY+   G   +     +KV      + +  N   +++K 
Sbjct:   200 LMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKA 259

Query:   502 LYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSW 561
             +  + P+SVG++     F        N   C   +L HA+++VGYG ++   YW+ RNSW
Sbjct:   260 V-AHQPVSVGVDKKSQEFMLYRSCIYNGP-CGT-NLDHALVIVGYGSENGQDYWIVRNSW 316

Query:   562 GPIGPDEGFFKIERG----NNACGIEQIAGY 588
             G    D G+ KI R        CGI  +A Y
Sbjct:   317 GTTWGDAGYIKIARNFEDPKGLCGIAMLASY 347

 Score = 333 (122.3 bits), Expect = 6.3e-29, P = 6.3e-29
 Identities = 98/331 (29%), Positives = 153/331 (46%)

Query:   638 VDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKER-FEYFKQ-----DGHK-KH 690
             +D  A  G     NE +   F+ ++ K G+ Y N    KER F+ FK      D H  K+
Sbjct:    27 MDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKN 86

Query:   691 ERY--GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDW 748
               Y  G + F+D + +E   +  F  S +  +R                 D  +P++ DW
Sbjct:    87 LSYQLGLTRFADLTVQEY--RDLFPGSPKPKQR----NLKTSRRYVPLAGD-QLPESVDW 139

Query:   749 RKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGC- 807
             R++       DQ  C SCWAFS    +EG   I TG+L+  S+ +LV+C    +GC G  
Sbjct:   140 RQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNGCYGSG 199

Query:   808 FFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKI 866
               + + ++  +  GL+SEKDYPY+   G   +     +KV      + +  N   +++K 
Sbjct:   200 LMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKA 259

Query:   867 LYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 926
             +  + P+SV ++              N   C   +L HA+++VGYG ++   YW+VRNSW
Sbjct:   260 V-AHQPVSVGVDKKSQEFMLYRSCIYNGP-CGT-NLDHALVIVGYGSENGQDYWIVRNSW 316

Query:   927 GPIGPDEGFFKIERG----NNACGIEQIAGY 953
             G    D G+ KI R        CGI  +A Y
Sbjct:   317 GTTWGDAGYIKIARNFEDPKGLCGIAMLASY 347

 Score = 300 (110.7 bits), Expect = 2.1e-25, P = 2.1e-25
 Identities = 70/218 (32%), Positives = 110/218 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P++ DWR++       DQ  C SCWAFS    +EG   I TG+L+  S+ +LV+C    
Sbjct:   133 LPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVN 192

Query:    71 SGCDGC-FFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
             +GC G    + + ++  +  GL+SEKDYPY+   G   +     +KV      + +  N 
Sbjct:   193 NGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPAND 252

Query:   129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
               +++K +  + P+SV ++              N   C   +L HA+++VGYG ++   Y
Sbjct:   253 EISLQKAV-AHQPVSVGVDKKSQEFMLYRSCIYNGP-CGT-NLDHALVIVGYGSENGQDY 309

Query:   189 WLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             W+VRNSWG    D G+ KI R        CGI  +A Y
Sbjct:   310 WIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASY 347

 Score = 133 (51.9 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 25/57 (43%), Positives = 34/57 (59%)

Query:   969 DLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 1021
             +L HA+++VGYG ++   YW+VRNSWG    D G+ KI R        CGI  +A Y
Sbjct:   291 NLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASY 347


>UNIPROTKB|P25975
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 84/225 (37%), Positives = 116/225 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + + +Y     GL+SE+ YPY   +     C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDIP 230

Query:   128 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 183
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   184 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               +N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 346 (126.9 bits), Expect = 2.5e-30, P = 2.5e-30
 Identities = 84/225 (37%), Positives = 116/225 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:   800 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + + +Y     GL+SE+ YPY   +     C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDIP 230

Query:   859 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 914
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   915 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               +N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 342 (125.4 bits), Expect = 6.8e-30, P = 6.8e-30
 Identities = 84/227 (37%), Positives = 116/227 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-A 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++   +Y     GL+SE+ YPY   +     C Y K +        F+  
Sbjct:   173 QGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDI 229

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
                E  + K +   GP+SV +++ H    FY        D  CS  DL H VL+VGYG +
Sbjct:   230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY--DPDCSSKDLDHGVLVVGYGFE 287

Query:   550 ----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                 ++  +W+ +NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 141 (54.7 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 32/83 (38%), Positives = 46/83 (55%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEG 1002
             AG+ +    K+    D  CS  DL H VL+VGYG +    ++  +W+V+NSWGP     G
Sbjct:   252 AGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNG 311

Query:  1003 FFKIERG-NNACGIEQIAGYATI 1024
             + K+ +  NN CGI   A Y T+
Sbjct:   312 YVKMAKDQNNHCGIATAASYPTV 334

 Score = 41 (19.5 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query:   249 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 308
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +A    +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRA----QGNQGCNGGL 182

Query:   309 IKERFEYFKQDG 320
             +   F+Y K +G
Sbjct:   183 MDNAFQYIKDNG 194

 Score = 41 (19.5 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query:   615 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 674
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +A    +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRA----QGNQGCNGGL 182

Query:   675 IKERFEYFKQDG 686
             +   F+Y K +G
Sbjct:   183 MDNAFQYIKDNG 194


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 81/222 (36%), Positives = 117/222 (52%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +PD+ DWR+K        Q +CGSCWAFS  G LE Q  +KTG+LV  S   LV+C+ + 
Sbjct:   126 LPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEK 185

Query:   802 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
                 GC+G F   + +Y     G++SE  YPYK  +G   KC YD SK +  T   +  L
Sbjct:   186 YRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDG---KCKYD-SKNRAATCSRYTEL 241

Query:   856 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              F     +K+ +   GP+SV +++     +        D +C+  ++ H VL+VGYG  +
Sbjct:   242 PFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQ-NVNHGVLVVGYGNLN 300

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                YWLV+NSWG    D G+ ++ R + N CGI     Y  I
Sbjct:   301 GKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYPEI 342

 Score = 346 (126.9 bits), Expect = 2.5e-30, P = 2.5e-30
 Identities = 81/222 (36%), Positives = 116/222 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD+ DWR+K        Q  CGSCWAFS  G LE Q  +KTG+LV  S   LV+C+ + 
Sbjct:   126 LPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEK 185

Query:    71 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
                 GC+G F   + +Y     G++SE  YPYK  +G   KC YD SK +  T   +  L
Sbjct:   186 YRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDG---KCKYD-SKNRAATCSRYTEL 241

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              F     +K+ +   GP+SV +++     +        D +C+  ++ H VL+VGYG  +
Sbjct:   242 PFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQ-NVNHGVLVVGYGNLN 300

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                YWLV+NSWG    D G+ ++ R + N CGI     Y  I
Sbjct:   301 GKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYPEI 342

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 80/223 (35%), Positives = 120/223 (53%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD+ DWR+K        Q +CGSCWAFS  G LE Q  +KTG+LV  S   LV+C+ + 
Sbjct:   126 LPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEK 185

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
                 GC+G  + +  +Y     G++SE  YPY+  +G   KC YD SK +  T   +  L
Sbjct:   186 YRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDG---KCKYD-SKNRAATCSRYTEL 241

Query:   491 YFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
              F     +K+ +   GP+SV +++ H   F+  + +   D +C+  ++ H VL+VGYG  
Sbjct:   242 PFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYY-DPSCTQ-NVNHGVLVVGYGNL 299

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             +   YWL +NSWG    D G+ ++ R + N CGI     Y  I
Sbjct:   300 NGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYPEI 342

 Score = 127 (49.8 bits), Expect = 0.00011, P = 0.00011
 Identities = 26/64 (40%), Positives = 37/64 (57%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAG 1020
             D +C+  ++ H VL+VGYG  +   YWLV+NSWG    D G+ ++ R + N CGI     
Sbjct:   280 DPSCTQ-NVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPS 338

Query:  1021 YATI 1024
             Y  I
Sbjct:   339 YPEI 342


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 85/260 (32%), Positives = 129/260 (49%)

Query:   693 YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXX-XXXXXXXXDGPVPDAWDWRKK 751
             YG ++FS  SPEE      FK     Y R    R             +  +P  +DWR K
Sbjct:    62 YGINQFSYLSPEE------FK---AIYLRSKPSRSPRYPAEVRTSIRNVSLPLRFDWRDK 112

Query:   752 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEP 811
              V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+    GC G     
Sbjct:   113 RVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYNNYGCSGGSTLN 172

Query:   812 SIEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHFNGSETMKKIL 867
             ++ + +  Q  L  + +YP+K  NG    F  +Y    ++ ++  DF   +  + M K+L
Sbjct:   173 ALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFS--DQEDEMAKVL 230

Query:   868 YKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
               +GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  + PYW+VRNSWG
Sbjct:   231 LTFGPLVVVVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKIGSTPYWIVRNSWG 287

Query:   928 PIGPDEGFFKIERGNNACGI 947
                  +G+  ++ G N CGI
Sbjct:   288 SSWGVDGYAHVKMGGNICGI 307

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 72/210 (34%), Positives = 113/210 (53%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+   
Sbjct:   103 LPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYNN 162

Query:    71 SGCDGCFFEPSIEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHF 126
              GC G     ++ + +  Q  L  + +YP+K  NG    F  +Y    ++ ++  DF   
Sbjct:   163 YGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFS-- 220

Query:   127 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             +  + M K+L  +GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  + 
Sbjct:   221 DQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKIGST 277

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             PYW+VRNSWG     +G+  ++ G N CGI
Sbjct:   278 PYWIVRNSWGSSWGVDGYAHVKMGGNICGI 307

 Score = 325 (119.5 bits), Expect = 4.5e-28, P = 4.5e-28
 Identities = 83/261 (31%), Positives = 125/261 (47%)

Query:   327 YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXX-XXXXXXXXDGPVPDAWDWRKK 385
             YG ++FS  SPEE      FK     Y R    R             +  +P  +DWR K
Sbjct:    62 YGINQFSYLSPEE------FK---AIYLRSKPSRSPRYPAEVRTSIRNVSLPLRFDWRDK 112

Query:   386 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLE 445
              V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+    GC G   L 
Sbjct:   113 RVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYNNYGCSGGSTLN 172

Query:   446 QPIEYTH--QAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTGKDFLYFNGSETMKKI 501
               + + +  Q  L  + +YP++  NG    F  +Y    ++ ++  DF   +  + M K+
Sbjct:   173 A-LNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFS--DQEDEMAKV 229

Query:   502 LYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSW 561
             L  +GPL V +++     Y G  I+ +   CS  +  HAVL+ G+ K    PYW+ RNSW
Sbjct:   230 LLTFGPLVVVVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKIGSTPYWIVRNSW 286

Query:   562 GPIGPDEGFFKIERGNNACGI 582
             G     +G+  ++ G N CGI
Sbjct:   287 GSSWGVDGYAHVKMGGNICGI 307

 Score = 134 (52.2 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 23/51 (45%), Positives = 31/51 (60%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS  +  HAVL+ G+ K    PYW+VRNSWG     +G+  ++ G N CGI
Sbjct:   257 CSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWGVDGYAHVKMGGNICGI 307


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 347 (127.2 bits), Expect = 2.0e-30, P = 2.0e-30
 Identities = 99/322 (30%), Positives = 149/322 (46%)

Query:   651 NENILETFKAFIVKRGRQYANDEEIKERFEYFKQ------DGHKKHERY--GTSEFSDRS 702
             ++ ++E F+ +I    + Y   EE   RFE FK       + +KK + Y  G +EF+D S
Sbjct:    44 HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLS 103

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXD-GPVPDAWDWRKKNVTGPAGDQA 761
              EE      FK      +  +  R            D   VP + DWRKK       +Q 
Sbjct:   104 HEE------FKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQG 157

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCDGCFFEPSIEY-THQA 819
             +CGSCWAFS    +EG   I TG L   S+ +L++C     +GC+G   + + EY     
Sbjct:   158 SCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNG 217

Query:   820 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 879
             GL  E+DYPY    G   +   D+S+     G   +  N  +++ K L  + PLSV +++
Sbjct:   218 GLRKEEDYPYSMEEGT-CEMQKDESETVTINGHQDVPTNDEKSLLKAL-AHQPLSVAIDA 275

Query:   880 D--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 937
                    Y+G      D  C   DL H V  VGYG      Y +V+NSWGP   ++G+ +
Sbjct:   276 SGREFQFYSGGVF---DGRCG-VDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIR 331

Query:   938 IERGNNA----CGIEQIAGYAT 955
             ++R        CGI ++A + T
Sbjct:   332 LKRNTGKPEGLCGINKMASFPT 353

 Score = 341 (125.1 bits), Expect = 8.7e-30, P = 8.7e-30
 Identities = 101/324 (31%), Positives = 149/324 (45%)

Query:   285 NENILETFKAFIVKRGRQYANDEEIKERFEYFKQ------DGHKKHERY--GTSEFSDRS 336
             ++ ++E F+ +I    + Y   EE   RFE FK       + +KK + Y  G +EF+D S
Sbjct:    44 HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLS 103

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXD-GPVPDAWDWRKKNVTGPAGDQA 395
              EE      FK      +  +  R            D   VP + DWRKK       +Q 
Sbjct:   104 HEE------FKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQG 157

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCGGCDGL-EQPIEY-TH 452
             +CGSCWAFS    +EG   I TG L   S+ +L++C     +GC G  GL +   EY   
Sbjct:   158 SCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNG--GLMDYAFEYIVK 215

Query:   453 QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGL 512
               GL  E+DYPY    G   +   D+S+     G   +  N  +++ K L  + PLSV +
Sbjct:   216 NGGLRKEEDYPYSMEEGT-CEMQKDESETVTINGHQDVPTNDEKSLLKAL-AHQPLSVAI 273

Query:   513 NS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGF 570
             ++      FY+G      D  C   DL H V  VGYG      Y + +NSWGP   ++G+
Sbjct:   274 DASGREFQFYSGGVF---DGRCG-VDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGY 329

Query:   571 FKIERGNNA----CGIEQIAGYAT 590
              +++R        CGI ++A + T
Sbjct:   330 IRLKRNTGKPEGLCGINKMASFPT 353

 Score = 312 (114.9 bits), Expect = 1.1e-26, P = 1.1e-26
 Identities = 74/222 (33%), Positives = 109/222 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQ 69
             VP + DWRKK       +Q  CGSCWAFS    +EG   I TG L   S+ +L++C    
Sbjct:   138 VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTY 197

Query:    70 CSGCDGCFFEPSIEY-THQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              +GC+G   + + EY     GL  E+DYPY    G   +   D+S+     G   +  N 
Sbjct:   198 NNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGT-CEMQKDESETVTINGHQDVPTND 256

Query:   129 SETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
              +++ K L  + PLSV +++       Y+G      D  C   DL H V  VGYG     
Sbjct:   257 EKSLLKAL-AHQPLSVAIDASGREFQFYSGGVF---DGRCG-VDLDHGVAAVGYGSSKGS 311

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGYAT 224
              Y +V+NSWGP   ++G+ +++R        CGI ++A + T
Sbjct:   312 DYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPT 353


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 346 (126.9 bits), Expect = 2.5e-30, P = 2.5e-30
 Identities = 80/222 (36%), Positives = 112/222 (50%)

Query:   740 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 799
             G  PDA DWR+K       +Q ACG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ 
Sbjct:    28 GGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSM 87

Query:   800 QCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF-L 855
                  GC G F   + +Y     G++SE+ YPY   NG    C Y+ S       K   L
Sbjct:    88 MYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGT---CQYNVSTRAATCSKYVEL 144

Query:   856 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              +     +K  +   GP+SV +++     +       +D  C+  ++ H VL+VGYG  +
Sbjct:   145 PYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQ-EVNHGVLVVGYGTLN 203

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                +WLV+NSWG    D G+ ++ R + N CGI   A Y  I
Sbjct:   204 EKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYPQI 245

 Score = 345 (126.5 bits), Expect = 3.3e-30, P = 3.3e-30
 Identities = 80/223 (35%), Positives = 114/223 (51%)

Query:   374 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 433
             G  PDA DWR+K       +Q ACG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ 
Sbjct:    28 GGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSM 87

Query:   434 QCS--GCGGCDGLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF- 489
                  GCGG   + +  +Y     G++SE+ YPY   NG    C Y+ S       K   
Sbjct:    88 MYGNKGCGG-GFMTRAFQYIIDNNGIDSEESYPYMAQNGT---CQYNVSTRAATCSKYVE 143

Query:   490 LYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
             L +     +K  +   GP+SV +++    F+       +D  C+  ++ H VL+VGYG  
Sbjct:   144 LPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQ-EVNHGVLVVGYGTL 202

Query:   550 DDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
             ++  +WL +NSWG    D G+ ++ R + N CGI   A Y  I
Sbjct:   203 NEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYPQI 245

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 79/222 (35%), Positives = 111/222 (50%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK 68
             G  PDA DWR+K       +Q  CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ 
Sbjct:    28 GGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSM 87

Query:    69 QCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF-L 124
                  GC G F   + +Y     G++SE+ YPY   NG    C Y+ S       K   L
Sbjct:    88 MYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGT---CQYNVSTRAATCSKYVEL 144

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              +     +K  +   GP+SV +++     +       +D  C+  ++ H VL+VGYG  +
Sbjct:   145 PYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQ-EVNHGVLVVGYGTLN 203

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                +WLV+NSWG    D G+ ++ R + N CGI   A Y  I
Sbjct:   204 EKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYPQI 245

 Score = 135 (52.6 bits), Expect = 5.6e-06, P = 5.6e-06
 Identities = 27/67 (40%), Positives = 40/67 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQ 1017
             V +D  C+  ++ H VL+VGYG  ++  +WLV+NSWG    D G+ ++ R + N CGI  
Sbjct:   180 VYDDPRCTQ-EVNHGVLVVGYGTLNEKDFWLVKNSWGERFGDGGYIRMSRNHANHCGIAS 238

Query:  1018 IAGYATI 1024
              A Y  I
Sbjct:   239 YASYPQI 245


>UNIPROTKB|Q5E998
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 345 (126.5 bits), Expect = 3.3e-30, P = 3.3e-30
 Identities = 84/225 (37%), Positives = 116/225 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:    69 QCSGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + + +Y    G L+SE+ YPY   +     C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDIP 230

Query:   128 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 183
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   184 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               +N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 344 (126.2 bits), Expect = 4.2e-30, P = 4.2e-30
 Identities = 84/225 (37%), Positives = 116/225 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C  A+
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query:   800 QCSGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + + +Y    G L+SE+ YPY   +     C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDIP 230

Query:   859 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 914
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   915 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               +N  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 84/227 (37%), Positives = 116/227 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DW KK    P  +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR-A 172

Query:   436 SGCGGCDG--LEQPIEYTHQAG-LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++   +Y    G L+SE+ YPY   +     C Y K +        F+  
Sbjct:   173 QGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNS--CNY-KPECSAANDTGFVDI 229

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
                E  + K +   GP+SV +++ H    FY        D  CS  DL H VL+VGYG +
Sbjct:   230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY--DPDCSSKDLDHGVLVVGYGFE 287

Query:   550 ----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                 ++  +W+ +NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   288 GTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 141 (54.7 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 32/83 (38%), Positives = 46/83 (55%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEG 1002
             AG+ +    K+    D  CS  DL H VL+VGYG +    ++  +W+V+NSWGP     G
Sbjct:   252 AGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNG 311

Query:  1003 FFKIERG-NNACGIEQIAGYATI 1024
             + K+ +  NN CGI   A Y T+
Sbjct:   312 YVKMAKDQNNHCGIATAASYPTV 334

 Score = 41 (19.5 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query:   249 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 308
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +A    +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRA----QGNQGCNGGL 182

Query:   309 IKERFEYFKQDG 320
             +   F+Y K +G
Sbjct:   183 MDNAFQYIKDNG 194

 Score = 41 (19.5 bits), Expect = 6.5e-06, Sum P(2) = 6.5e-06
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query:   615 CGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEE 674
             CG  SC    S T  +  Q+  +   L    SL+   +N+++  +A    +G Q  N   
Sbjct:   135 CG--SCWAF-SATGALEGQMFRKTGKLV---SLS--EQNLVDCSRA----QGNQGCNGGL 182

Query:   675 IKERFEYFKQDG 686
             +   F+Y K +G
Sbjct:   183 MDNAFQYIKDNG 194


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 280 (103.6 bits), Expect = 3.8e-30, Sum P(3) = 3.8e-30
 Identities = 79/264 (29%), Positives = 119/264 (45%)

Query:   290 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 345
             E FK F ++  R Y + EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV-TPFSDLTEEE 98

Query:   346 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 404
             F      Y R                 +  VP + DWRK  +   P  DQ  C  CWA +
Sbjct:    99 FG-QLYGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMA 157

Query:   405 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPY 464
              AG +E  + I     V+ S  +L++C +   GC G    +  I   + +GL SEKDYP+
Sbjct:   158 AAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPF 217

Query:   465 RNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLIHFYNGT 523
             + G     +C + K   K+   +DF+    +E  + + L  YGP++V +N   +  Y   
Sbjct:   218 Q-GKVRAHRC-HPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKG 275

Query:   524 PIRKNDETCSPYDLGHAVLLVGYG 547
              I+    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 279 (103.3 bits), Expect = 4.8e-30, Sum P(3) = 4.8e-30
 Identities = 79/264 (29%), Positives = 120/264 (45%)

Query:   656 ETFKAFIVKRGRQYANDEEIKERFEYFK----QDGHKKHERYGTSEFSDRSPEEILCKTG 711
             E FK F ++  R Y + EE   R + F     Q    + E  GT+EF   +P   L +  
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGV-TPFSDLTEEE 98

Query:   712 FKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRK-KNVTGPAGDQAACGSCWAFS 770
             F      Y R                 +  VP + DWRK  +   P  DQ  C  CWA +
Sbjct:    99 FG-QLYGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMA 157

Query:   771 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPY 829
              AG +E  + I     V+ S  +L++C +   GC G F ++  I   + +GL SEKDYP+
Sbjct:   158 AAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPF 217

Query:   830 KNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGT 888
             +       +C + K   K+   +DF+    +E  + + L  YGP++V +N   +  Y   
Sbjct:   218 QG-KVRAHRC-HPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKG 275

Query:   889 PIRKNDETCSPYDLGHAVLLVGYG 912
              I+    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 255 (94.8 bits), Expect = 2.6e-21, Sum P(2) = 2.6e-21
 Identities = 59/179 (32%), Positives = 93/179 (51%)

Query:     6 EKDGPVPDAWDWRK-KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             E +  VP + DWRK  +   P  DQ +C  CWA + AG +E  + I     V+ S  +L+
Sbjct:   123 EPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELL 182

Query:    65 ECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 123
             +C +   GC G F ++  I   + +GL SEKDYP++       +C + K   K+   +DF
Sbjct:   183 DCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQG-KVRAHRC-HPKKYQKVAWIQDF 240

Query:   124 LHFNGSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +    +E  + + L  YGP++V +N   +  Y    I+    TC P  + H+VLLVG+G
Sbjct:   241 IMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 57 (25.1 bits), Expect = 3.8e-30, Sum P(3) = 3.8e-30
 Identities = 7/10 (70%), Positives = 10/10 (100%)

Query:   986 PYWLVRNSWG 995
             PYW+++NSWG
Sbjct:   325 PYWILKNSWG 334

 Score = 57 (25.1 bits), Expect = 3.8e-30, Sum P(3) = 3.8e-30
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query:   958 VVK-NDETCSPYDLGHAVLLVGYG 980
             V+K    TC P  + H+VLLVG+G
Sbjct:   276 VIKATPTTCDPQLVDHSVLLVGFG 299

 Score = 57 (25.1 bits), Expect = 1.9e-26, Sum P(2) = 1.9e-26
 Identities = 7/10 (70%), Positives = 10/10 (100%)

Query:   918 PYWLVRNSWG 927
             PYW+++NSWG
Sbjct:   325 PYWILKNSWG 334

 Score = 57 (25.1 bits), Expect = 2.6e-21, Sum P(2) = 2.6e-21
 Identities = 7/10 (70%), Positives = 10/10 (100%)

Query:   187 PYWLVRNSWG 196
             PYW+++NSWG
Sbjct:   325 PYWILKNSWG 334

 Score = 55 (24.4 bits), Expect = 3.1e-26, Sum P(2) = 3.1e-26
 Identities = 7/10 (70%), Positives = 9/10 (90%)

Query:   553 PYWLARNSWG 562
             PYW+ +NSWG
Sbjct:   325 PYWILKNSWG 334


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 344 (126.2 bits), Expect = 4.2e-30, P = 4.2e-30
 Identities = 84/224 (37%), Positives = 118/224 (52%)

Query:   375 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-K 433
             P    W  R K        Q +CGSCWAFS  G LEGQ  +KTGKLV  S   LV+C+ +
Sbjct:   113 PAGVKWKERTKGCWKNLVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTE 172

Query:   434 QCSGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDF 489
             +  G  GC G  + +  +Y     G++SE  YPY+    EK  C YD K++    +    
Sbjct:   173 EKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYK-AMDEK--CHYDPKNRAATCSRYIE 229

Query:   490 LYFNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
             L F   E +K+ +   GP+SVG++ SH   F   + +  +D +C+  ++ H VL+VGYG 
Sbjct:   230 LPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSGVY-DDPSCTE-NVNHGVLVVGYGT 287

Query:   549 QDDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
              D   YWL +NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   288 LDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 331

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 84/228 (36%), Positives = 116/228 (50%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             V ++ P    W  R K        Q  CGSCWAFS  G LEGQ  +KTGKLV  S   LV
Sbjct:   108 VNQNLPAGVKWKERTKGCWKNLVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLV 167

Query:    65 ECAKQ----CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLF 118
             +C+ +      GC G F   + +Y     G++SE  YPYK A  EK  C YD K++    
Sbjct:   168 DCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYK-AMDEK--CHYDPKNRAATC 224

Query:   119 TGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLV 178
             +    L F   E +K+ +   GP+SV +++     +       +D +C+  ++ H VL+V
Sbjct:   225 SRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSGVYDDPSCTE-NVNHGVLVV 283

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
             GYG  D   YWLV+NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   284 GYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 331

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 83/223 (37%), Positives = 114/223 (51%)

Query:   741 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
             P    W  R K        Q +CGSCWAFS  G LEGQ  +KTGKLV  S   LV+C+ +
Sbjct:   113 PAGVKWKERTKGCWKNLVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTE 172

Query:   801 ----CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDF 854
                   GC G F   + +Y     G++SE  YPYK A  EK  C YD K++    +    
Sbjct:   173 EKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYK-AMDEK--CHYDPKNRAATCSRYIE 229

Query:   855 LHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
             L F   E +K+ +   GP+SV +++     +       +D +C+  ++ H VL+VGYG  
Sbjct:   230 LPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSGVYDDPSCTE-NVNHGVLVVGYGTL 288

Query:   915 DNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
             D   YWLV+NSWG    D+G+ ++ R N N CGI     Y  I
Sbjct:   289 DGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSYPEI 331

 Score = 144 (55.7 bits), Expect = 1.3e-06, P = 1.3e-06
 Identities = 29/67 (43%), Positives = 40/67 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQ 1017
             V +D +C+  ++ H VL+VGYG  D   YWLV+NSWG    D+G+ ++ R N N CGI  
Sbjct:   266 VYDDPSCTE-NVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIAS 324

Query:  1018 IAGYATI 1024
                Y  I
Sbjct:   325 YCSYPEI 331


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 343 (125.8 bits), Expect = 5.3e-30, P = 5.3e-30
 Identities = 76/220 (34%), Positives = 109/220 (49%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   +W +  +  P  +Q  CGSCWAFS  G LE Q   +T  LV  S   L++C+   
Sbjct:   113 LPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSL 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLY 491
              G  GC G  L +   Y  Q  G++S   YPY +  G    C Y  S +    TG   + 
Sbjct:   173 -GNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKEGV---CRYSVSGRAGYCTGFRIVP 228

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
              +    ++  +   GP+SVG+N+ L+ F+       ND  CS   + HAVL+VGYG ++ 
Sbjct:   229 RHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIYNDPKCSSALINHAVLVVGYGSENG 288

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 591
               YWL +NSWG    + G+ ++ R  N CGI     Y TI
Sbjct:   289 QDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYPTI 328

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 76/221 (34%), Positives = 109/221 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   +W +  +  P  +Q  CGSCWAFS  G LE Q   +T  LV  S   L++C+   
Sbjct:   113 LPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSL 172

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHF 126
                GC G F   +  Y  Q  G++S   YPY++  G    C Y  S +    TG   +  
Sbjct:   173 GNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKEGV---CRYSVSGRAGYCTGFRIVPR 229

Query:   127 NGSETMKKILYKYGPLSVLLNSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             +    ++  +   GP+SV +N+ L+  H Y       ND  CS   + HAVL+VGYG ++
Sbjct:   230 HNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIY--NDPKCSSALINHAVLVVGYGSEN 287

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
                YWLV+NSWG    + G+ ++ R  N CGI     Y TI
Sbjct:   288 GQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYPTI 328

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 76/221 (34%), Positives = 109/221 (49%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   +W +  +  P  +Q  CGSCWAFS  G LE Q   +T  LV  S   L++C+   
Sbjct:   113 LPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSL 172

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHF 857
                GC G F   +  Y  Q  G++S   YPY++  G    C Y  S +    TG   +  
Sbjct:   173 GNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYEHKEGV---CRYSVSGRAGYCTGFRIVPR 229

Query:   858 NGSETMKKILYKYGPLSVLLNSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             +    ++  +   GP+SV +N+ L+  H Y       ND  CS   + HAVL+VGYG ++
Sbjct:   230 HNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIY--NDPKCSSALINHAVLVVGYGSEN 287

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 956
                YWLV+NSWG    + G+ ++ R  N CGI     Y TI
Sbjct:   288 GQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSFGIYPTI 328

 Score = 163 (62.4 bits), Expect = 9.9e-09, P = 9.9e-09
 Identities = 29/66 (43%), Positives = 39/66 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             + ND  CS   + HAVL+VGYG ++   YWLV+NSWG    + G+ ++ R  N CGI   
Sbjct:   263 IYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSF 322

Query:  1019 AGYATI 1024
               Y TI
Sbjct:   323 GIYPTI 328


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 342 (125.4 bits), Expect = 6.8e-30, P = 6.8e-30
 Identities = 80/222 (36%), Positives = 112/222 (50%)

Query:     9 GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-- 66
             G VPDA DWR +    P  +Q  CG CWAFS    +EG   IKTG LV  S+ QL++C  
Sbjct:   125 GNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184

Query:    67 AKQCSGCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 125
                  GC G   E + E+     GL +E DYPY    G    C  +KSK K+ T + +  
Sbjct:   185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGT---CDQEKSKNKVVTIQGYQK 241

Query:   126 FNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
                +E   +I     P+SV +++   I     + +  N   C   +L H V +VGYG + 
Sbjct:   242 VAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTN--YCGT-NLNHGVTVVGYGVEG 298

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             +  YW+V+NSWG    +EG+ ++ERG       CGI  +A Y
Sbjct:   299 DQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASY 340

 Score = 342 (125.4 bits), Expect = 6.8e-30, P = 6.8e-30
 Identities = 80/222 (36%), Positives = 112/222 (50%)

Query:   740 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-- 797
             G VPDA DWR +    P  +Q  CG CWAFS    +EG   IKTG LV  S+ QL++C  
Sbjct:   125 GNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184

Query:   798 AKQCSGCDGCFFEPSIEYTH-QAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 856
                  GC G   E + E+     GL +E DYPY    G    C  +KSK K+ T + +  
Sbjct:   185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGT---CDQEKSKNKVVTIQGYQK 241

Query:   857 FNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
                +E   +I     P+SV +++   I     + +  N   C   +L H V +VGYG + 
Sbjct:   242 VAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTN--YCGT-NLNHGVTVVGYGVEG 298

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 953
             +  YW+V+NSWG    +EG+ ++ERG       CGI  +A Y
Sbjct:   299 DQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASY 340

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 83/225 (36%), Positives = 115/225 (51%)

Query:   374 GPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-- 431
             G VPDA DWR +    P  +Q  CG CWAFS    +EG   IKTG LV  S+ QL++C  
Sbjct:   125 GNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184

Query:   432 AKQCSGCGGCDGL-EQPIEYTH-QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF 489
                  GC G  GL E   E+     GL +E DYPY    G    C  +KSK K+ T + +
Sbjct:   185 GTYNKGCSG--GLMETAFEFIKTNGGLATETDYPYTGIEGT---CDQEKSKNKVVTIQGY 239

Query:   490 LYFNGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG 547
                  +E   +I     P+SVG+++   +   Y+ + +  N   C   +L H V +VGYG
Sbjct:   240 QKVAQNEASLQIAAAQQPVSVGIDAGGFIFQLYS-SGVFTN--YCGT-NLNHGVTVVGYG 295

Query:   548 KQDDIPYWLARNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 588
              + D  YW+ +NSWG    +EG+ ++ERG       CGI  +A Y
Sbjct:   296 VEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASY 340

 Score = 134 (52.2 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 25/57 (43%), Positives = 35/57 (61%)

Query:   969 DLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 1021
             +L H V +VGYG + D  YW+V+NSWG    +EG+ ++ERG       CGI  +A Y
Sbjct:   284 NLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASY 340


>UNIPROTKB|P25774
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 341 (125.1 bits), Expect = 8.7e-30, P = 8.7e-30
 Identities = 77/222 (34%), Positives = 118/222 (53%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
              G  GC+G  +    +Y     G++S+  YPY+  +    KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ---KCQYD-SKYRAATCSKYTEL 230

Query:   491 YFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
              +   + +K+ +   GP+SVG+++    F+        + +C+  ++ H VL+VGYG  +
Sbjct:   231 PYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYGDLN 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
                YWL +NSWG    +EG+ ++ R   N CGI     Y  I
Sbjct:   290 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 331

 Score = 333 (122.3 bits), Expect = 6.3e-29, P = 6.3e-29
 Identities = 77/222 (34%), Positives = 117/222 (52%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +PD+ DWR+K        Q +CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:   802 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
                 GC+G F   + +Y     G++S+  YPYK  +    KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ---KCQYD-SKYRAATCSKYTEL 230

Query:   856 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              +   + +K+ +   GP+SV +++     +        + +C+  ++ H VL+VGYG  +
Sbjct:   231 PYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYGDLN 289

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                YWLV+NSWG    +EG+ ++ R   N CGI     Y  I
Sbjct:   290 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 331

 Score = 332 (121.9 bits), Expect = 8.0e-29, P = 8.0e-29
 Identities = 77/222 (34%), Positives = 116/222 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD+ DWR+K        Q  CG+CWAFS  G LE Q  +KTGKLV  S   LV+C+ + 
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:    71 ---SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
                 GC+G F   + +Y     G++S+  YPYK  +    KC YD SK +  T   +  L
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ---KCQYD-SKYRAATCSKYTEL 230

Query:   125 HFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              +   + +K+ +   GP+SV +++     +        + +C+  ++ H VL+VGYG  +
Sbjct:   231 PYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQ-NVNHGVLVVGYGDLN 289

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                YWLV+NSWG    +EG+ ++ R   N CGI     Y  I
Sbjct:   290 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 331

 Score = 126 (49.4 bits), Expect = 0.00013, P = 0.00013
 Identities = 25/64 (39%), Positives = 37/64 (57%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAG 1020
             + +C+  ++ H VL+VGYG  +   YWLV+NSWG    +EG+ ++ R   N CGI     
Sbjct:   269 EPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPS 327

Query:  1021 YATI 1024
             Y  I
Sbjct:   328 YPEI 331


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 341 (125.1 bits), Expect = 8.7e-30, P = 8.7e-30
 Identities = 96/329 (29%), Positives = 160/329 (48%)

Query:   654 ILETFKAFIVKRGRQYANDEEIKE--RFEYFKQ-----DGHKKHE---RYGTSEFSDRSP 703
             ++  ++A++VK G+  + +  +++  RFE FK      D H +     R G + F+D + 
Sbjct:    46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105

Query:   704 EEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA 762
             +E   K  G K  E+  ER  + R               +P++ DWRKK       DQ  
Sbjct:   106 DEYRSKYLGAKM-EKKGERRTSLRYEARVGDE-------LPESIDWRKKGAVAEVKDQGG 157

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCFFEPSIEYT-HQAG 820
             CGSCWAFS  G +EG   I TG L+  S+ +LV+C    + GC+G   + + E+     G
Sbjct:   158 CGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 217

Query:   821 LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD 880
             ++++KDYPYK  +G   +   +   V + + +D   ++  E++KK +  + P+S+ + + 
Sbjct:   218 IDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYS-EESLKKAV-AHQPISIAIEAG 275

Query:   881 --LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKI 938
                   Y+       D +C    L H V+ VGYG ++   YW+VRNSWG    + G+ ++
Sbjct:   276 GRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRM 331

Query:   939 ERG----NNACGIEQIAGYATIDVVKNDE 963
              R     +  CGI     Y     +KN E
Sbjct:   332 ARNIASSSGKCGIAIEPSYP----IKNGE 356

 Score = 327 (120.2 bits), Expect = 5.0e-27, P = 5.0e-27
 Identities = 92/315 (29%), Positives = 154/315 (48%)

Query:   288 ILETFKAFIVKRGRQYANDEEIKE--RFEYFKQ-----DGHKKHE---RYGTSEFSDRSP 337
             ++  ++A++VK G+  + +  +++  RFE FK      D H +     R G + F+D + 
Sbjct:    46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105

Query:   338 EEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA 396
             +E   K  G K  E+  ER  + R               +P++ DWRKK       DQ  
Sbjct:   106 DEYRSKYLGAKM-EKKGERRTSLRYEARVGDE-------LPESIDWRKKGAVAEVKDQGG 157

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCGGCDGL-EQPIEYT-HQ 453
             CGSCWAFS  G +EG   I TG L+  S+ +LV+C    + GC G  GL +   E+    
Sbjct:   158 CGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNG--GLMDYAFEFIIKN 215

Query:   454 AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLN 513
              G++++KDYPY+  +G   +   +   V + + +D   ++  E++KK +  + P+S+ + 
Sbjct:   216 GGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYS-EESLKKAV-AHQPISIAIE 273

Query:   514 S--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFF 571
             +       Y+       D +C    L H V+ VGYG ++   YW+ RNSWG    + G+ 
Sbjct:   274 AGGRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYL 329

Query:   572 KIERG----NNACGI 582
             ++ R     +  CGI
Sbjct:   330 RMARNIASSSGKCGI 344

 Score = 307 (113.1 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
 Identities = 69/214 (32%), Positives = 113/214 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P++ DWRKK       DQ  CGSCWAFS  G +EG   I TG L+  S+ +LV+C    
Sbjct:   137 LPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSY 196

Query:    71 S-GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
             + GC+G   + + E+     G++++KDYPYK  +G   +   +   V + + +D   ++ 
Sbjct:   197 NEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYS- 255

Query:   129 SETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
              E++KK +  + P+S+ + +       Y+       D +C    L H V+ VGYG ++  
Sbjct:   256 EESLKKAV-AHQPISIAIEAGGRAFQLYDSGIF---DGSCGT-QLDHGVVAVGYGTENGK 310

Query:   187 PYWLVRNSWGPIGPDEGFFKIERG----NNACGI 216
              YW+VRNSWG    + G+ ++ R     +  CGI
Sbjct:   311 DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344

 Score = 43 (20.2 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
 Identities = 14/54 (25%), Positives = 20/54 (37%)

Query:   453 QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG 506
             + G+  E  YP +NG           S +K  T  D  Y          L++YG
Sbjct:   341 KCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYG 394


>ZFIN|ZDB-GENE-050208-336
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 73/215 (33%), Positives = 112/215 (52%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GC 804
             D+R K       DQ  CGSCW+FS  G +EGQ    TG+LV  S+ QLV+C++     GC
Sbjct:   138 DYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGC 197

Query:   805 DGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETM 863
              G +   + +Y     LES   YPY + + +   C Y+K+      +   F+     + +
Sbjct:   198 SGAWMANAYDYVINNALESSDTYPYTSVDTQP--CFYEKNLAMAGISDYRFVPAGNEQAL 255

Query:   864 KKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 922
                +   GP+SV +++D     +  + I K +  C+P +L HAVL+VGYG ++   YW++
Sbjct:   256 ADAVATVGPVSVAIDADNPSFLFYSSGIYK-ESNCNPNNLNHAVLVVGYGSEEGTDYWII 314

Query:   923 RNSWGPIGPDEGFFK-IERGNNACGIEQIAGYATI 956
             +NSWG    + G+ + I  G N CGI   A Y  I
Sbjct:   315 KNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPII 349

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 73/215 (33%), Positives = 112/215 (52%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GC 73
             D+R K       DQ  CGSCW+FS  G +EGQ    TG+LV  S+ QLV+C++     GC
Sbjct:   138 DYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGC 197

Query:    74 DGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETM 132
              G +   + +Y     LES   YPY + + +   C Y+K+      +   F+     + +
Sbjct:   198 SGAWMANAYDYVINNALESSDTYPYTSVDTQP--CFYEKNLAMAGISDYRFVPAGNEQAL 255

Query:   133 KKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 191
                +   GP+SV +++D     +  + I K +  C+P +L HAVL+VGYG ++   YW++
Sbjct:   256 ADAVATVGPVSVAIDADNPSFLFYSSGIYK-ESNCNPNNLNHAVLVVGYGSEEGTDYWII 314

Query:   192 RNSWGPIGPDEGFFK-IERGNNACGIEQIAGYATI 225
             +NSWG    + G+ + I  G N CGI   A Y  I
Sbjct:   315 KNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPII 349

 Score = 328 (120.5 bits), Expect = 2.2e-28, P = 2.2e-28
 Identities = 74/216 (34%), Positives = 111/216 (51%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG 440
             D+R K       DQ  CGSCW+FS  G +EGQ    TG+LV  S+ QLV+C++   G  G
Sbjct:   138 DYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSY-GTYG 196

Query:   441 CDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKL-FTGKDFLYFNGSET 497
             C G  +    +Y     LES   YPY + + +   C Y+K+      +   F+     + 
Sbjct:   197 CSGAWMANAYDYVINNALESSDTYPYTSVDTQP--CFYEKNLAMAGISDYRFVPAGNEQA 254

Query:   498 MKKILYKYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWL 556
             +   +   GP+SV +++    F +  + I K +  C+P +L HAVL+VGYG ++   YW+
Sbjct:   255 LADAVATVGPVSVAIDADNPSFLFYSSGIYK-ESNCNPNNLNHAVLVVGYGSEEGTDYWI 313

Query:   557 ARNSWGPIGPDEGFFK-IERGNNACGIEQIAGYATI 591
              +NSWG    + G+ + I  G N CGI   A Y  I
Sbjct:   314 IKNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPII 349

 Score = 155 (59.6 bits), Expect = 9.1e-08, P = 9.1e-08
 Identities = 27/67 (40%), Positives = 40/67 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK-IERGNNACGIEQ 1017
             +  +  C+P +L HAVL+VGYG ++   YW+++NSWG    + G+ + I  G N CGI  
Sbjct:   283 IYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGEGGYMRMIRNGKNTCGIAS 342

Query:  1018 IAGYATI 1024
              A Y  I
Sbjct:   343 YALYPII 349


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 84/227 (37%), Positives = 116/227 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DWR+K       +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 VPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP- 172

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  ++   +Y     GL++E+ YPY  G  E   C Y K +        F+  
Sbjct:   173 QGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYL-GR-ETNSCTY-KPECSAANDTGFVDI 229

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-HL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
                E  + K +   GP+SV +++ H    FY        D  CS  DL H VL+VGYG +
Sbjct:   230 PQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY--DPDCSSKDLDHGVLVVGYGFE 287

Query:   550 ----DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                 +   +W+ +NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   288 GTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 81/225 (36%), Positives = 116/225 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VP + DWR+K       +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 VPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC+G   + + +Y     GL++E+ YPY     E   C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGR--ETNSCTY-KPECSAANDTGFVDIP 230

Query:   128 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 183
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   184 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               ++  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334

 Score = 338 (124.0 bits), Expect = 1.8e-29, P = 1.8e-29
 Identities = 81/225 (36%), Positives = 116/225 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP + DWR+K       +Q  CGSCWAFS  G LEGQ   KTGKLV  S+  LV+C++  
Sbjct:   114 VPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC+G   + + +Y     GL++E+ YPY     E   C Y K +        F+   
Sbjct:   174 GNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGR--ETNSCTY-KPECSAANDTGFVDIP 230

Query:   859 GSE-TMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 914
               E  + K +   GP+SV +++      +  + I   D  CS  DL H VL+VGYG +  
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY-DPDCSSKDLDHGVLVVGYGFEGT 289

Query:   915 --DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               ++  +W+V+NSWGP     G+ K+ +  NN CGI   A Y T+
Sbjct:   290 DSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334

 Score = 143 (55.4 bits), Expect = 1.8e-06, P = 1.8e-06
 Identities = 32/83 (38%), Positives = 46/83 (55%)

Query:   951 AGYATIDVVKN----DETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNSWGPIGPDEG 1002
             AG+++    K+    D  CS  DL H VL+VGYG +    +   +W+V+NSWGP     G
Sbjct:   252 AGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNG 311

Query:  1003 FFKIERG-NNACGIEQIAGYATI 1024
             + K+ +  NN CGI   A Y T+
Sbjct:   312 YVKMAKDQNNHCGISTAASYPTV 334


>MGI|MGI:1349426 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 84/227 (37%), Positives = 115/227 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +PD  DWR++    P  +Q  CGSCWAF+ AG +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTV 173

Query:   436 SGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC      Q  EY     GLE+E  YPY   +G    C Y +S+       D++  
Sbjct:   174 -GNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGP---CRY-RSENASANITDYVNL 228

Query:   493 NGSETMKKI-LYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
               +E    + +   GP+S  ++ SH    FYNG      +  CS Y + HAVL+VGYG +
Sbjct:   229 PPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYY--EPNCSSYFVNHAVLVVGYGSE 286

Query:   550 DDIP----YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              D+     YWL +NSWG      G+ +I +  NN CGI  +A Y  I
Sbjct:   287 GDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASLASYPNI 333

 Score = 334 (122.6 bits), Expect = 4.9e-29, P = 4.9e-29
 Identities = 79/226 (34%), Positives = 116/226 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +PD  DWR++    P  +Q  CGSCWAF+ AG +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTV 173

Query:    71 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC       + EY     GLE+E  YPY+  +G    C Y +S+       D+++  
Sbjct:   174 GNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGP---CRY-RSENASANITDYVNLP 229

Query:   128 GSETMKKI-LYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              +E    + +   GP+S  +++  D    YNG      +  CS Y + HAVL+VGYG + 
Sbjct:   230 PNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYY--EPNCSSYFVNHAVLVVGYGSEG 287

Query:   185 NIP----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             ++     YWL++NSWG      G+ +I +  NN CGI  +A Y  I
Sbjct:   288 DVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASLASYPNI 333

 Score = 334 (122.6 bits), Expect = 4.9e-29, P = 4.9e-29
 Identities = 79/226 (34%), Positives = 116/226 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +PD  DWR++    P  +Q  CGSCWAF+ AG +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTV 173

Query:   802 S--GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC       + EY     GLE+E  YPY+  +G    C Y +S+       D+++  
Sbjct:   174 GNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGP---CRY-RSENASANITDYVNLP 229

Query:   859 GSETMKKI-LYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              +E    + +   GP+S  +++  D    YNG      +  CS Y + HAVL+VGYG + 
Sbjct:   230 PNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYY--EPNCSSYFVNHAVLVVGYGSEG 287

Query:   916 NIP----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
             ++     YWL++NSWG      G+ +I +  NN CGI  +A Y  I
Sbjct:   288 DVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASLASYPNI 333

 Score = 148 (57.2 bits), Expect = 4.9e-07, P = 4.9e-07
 Identities = 29/68 (42%), Positives = 40/68 (58%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIP----YWLVRNSWGPIGPDEGFFKIERG-NNACGIE 1016
             +  CS Y + HAVL+VGYG + D+     YWL++NSWG      G+ +I +  NN CGI 
Sbjct:   266 EPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIA 325

Query:  1017 QIAGYATI 1024
              +A Y  I
Sbjct:   326 SLASYPNI 333


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 98/322 (30%), Positives = 151/322 (46%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----GHKKHE--RY--GTSEFSDR 701
             + + +LE F++++ +  + Y + EE   RFE F+++      + +E   Y  G +EF+D 
Sbjct:    43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADL 102

Query:   702 SPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQA 761
             + EE   +     ++  + R    R            D  +P + DWRKK    P  DQ 
Sbjct:   103 THEEFKGRY-LGLAKPQFSR---KRQPSANFRYRDITD--LPKSVDWRKKGAVAPVKDQG 156

Query:   762 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC-SGCDGCFFEPSIEYT-HQA 819
              CGSCWAFS    +EG   I TG L   S+ +L++C     SGC+G   + + +Y     
Sbjct:   157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTG 216

Query:   820 GLESEKDYPYKNANGEKFKCAYDKSKVKLFT--GKDFLHFNGSETMKKILYKYGPLSVLL 877
             GL  E DYPY    G    C   K  V+  T  G + +  N  E++ K L  + P+SV +
Sbjct:   217 GLHKEDDYPYLMEEGI---CQEQKEDVERVTISGYEDVPENDDESLVKAL-AHQPVSVAI 272

Query:   878 NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 937
              +    D+        +  C   DL H V  VGYG      Y +V+NSWGP   ++GF +
Sbjct:   273 EASG-RDFQFYKGGVFNGKCGT-DLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIR 330

Query:   938 IERGNNA----CGIEQIAGYAT 955
             ++R        CGI ++A Y T
Sbjct:   331 MKRNTGKPEGLCGINKMASYPT 352

 Score = 335 (123.0 bits), Expect = 3.8e-29, P = 3.8e-29
 Identities = 102/326 (31%), Positives = 152/326 (46%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----GHKKHE--RY--GTSEFSDR 335
             + + +LE F++++ +  + Y + EE   RFE F+++      + +E   Y  G +EF+D 
Sbjct:    43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADL 102

Query:   336 SPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQA 395
             + EE   +     ++  + R    R            D  +P + DWRKK    P  DQ 
Sbjct:   103 THEEFKGRY-LGLAKPQFSR---KRQPSANFRYRDITD--LPKSVDWRKKGAVAPVKDQG 156

Query:   396 ACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC-SGCGGCDGL-EQPIEYT-H 452
              CGSCWAFS    +EG   I TG L   S+ +L++C     SGC G  GL +   +Y   
Sbjct:   157 QCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNG--GLMDYAFQYIIS 214

Query:   453 QAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFT--GKDFLYFNGSETMKKILYKYGPLSV 510
               GL  E DYPY    G    C   K  V+  T  G + +  N  E++ K L  + P+SV
Sbjct:   215 TGGLHKEDDYPYLMEEGI---CQEQKEDVERVTISGYEDVPENDDESLVKAL-AHQPVSV 270

Query:   511 GLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDE 568
              + +      FY G     N + C   DL H V  VGYG      Y + +NSWGP   ++
Sbjct:   271 AIEASGRDFQFYKGGVF--NGK-CGT-DLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEK 326

Query:   569 GFFKIERGNNA----CGIEQIAGYAT 590
             GF +++R        CGI ++A Y T
Sbjct:   327 GFIRMKRNTGKPEGLCGINKMASYPT 352

 Score = 311 (114.5 bits), Expect = 1.4e-26, P = 1.4e-26
 Identities = 77/222 (34%), Positives = 108/222 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWRKK    P  DQ  CGSCWAFS    +EG   I TG L   S+ +L++C    
Sbjct:   137 LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF 196

Query:    71 -SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT--GKDFLHF 126
              SGC+G   + + +Y     GL  E DYPY    G    C   K  V+  T  G + +  
Sbjct:   197 NSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGI---CQEQKEDVERVTISGYEDVPE 253

Query:   127 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             N  E++ K L  + P+SV + +    D+        +  C   DL H V  VGYG     
Sbjct:   254 NDDESLVKAL-AHQPVSVAIEASG-RDFQFYKGGVFNGKCGT-DLDHGVAAVGYGSSKGS 310

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGYAT 224
              Y +V+NSWGP   ++GF +++R        CGI ++A Y T
Sbjct:   311 DYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPT 352

 Score = 120 (47.3 bits), Expect = 0.00069, P = 0.00068
 Identities = 24/59 (40%), Positives = 33/59 (55%)

Query:   969 DLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGYAT 1023
             DL H V  VGYG      Y +V+NSWGP   ++GF +++R        CGI ++A Y T
Sbjct:   294 DLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPT 352


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 340 (124.7 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 93/317 (29%), Positives = 149/317 (47%)

Query:   652 ENILETFKAFIVKRGRQYANDEEIKERFE-YFKQDGH-------KKH--ERYGTSEFSDR 701
             +NI + + A+  K  + YA  +E  +R   Y+  D +        +H    YG ++ SD 
Sbjct:    84 QNIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDW 143

Query:   702 SPEE----ILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPA 757
             + EE    +L K+ +K   +  E I  +               P PD +DWR KNV  P 
Sbjct:   144 TDEEFEKTLLPKSFYKRLHKEAEFI--EPIPESLTAKKGESSSPFPDFFDWRDKNVITPV 201

Query:   758 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH 817
               Q  CGSCWAF+    +E  +AI  G+    S+  L++C    + CDG   + +  Y H
Sbjct:   202 KAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIH 261

Query:   818 QAGLESEKDYPY--KNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV 875
             + GL +  D PY     NG      ++ +++K      FLH +  +++   L  +GP+++
Sbjct:   262 RNGLANAVDLPYVAHRQNGCAVNDHWNTTRIK---AAYFLHHD-EDSIINWLVNFGPVNI 317

Query:   876 -LLNSDLIHDYNGTPIRKNDETCSPYDLG-HAVLLVGYG-KQDNIPYWLVRNSWGPI-GP 931
              +     +  Y G     ++  C    +G HA+L+ GYG  +    YW+V+NSWG   G 
Sbjct:   318 GMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGV 377

Query:   932 DEGFFKIERGNNACGIE 948
             + G+    RG NACGIE
Sbjct:   378 EHGYIYFARGINACGIE 394

 Score = 331 (121.6 bits), Expect = 1.0e-28, P = 1.0e-28
 Identities = 92/318 (28%), Positives = 149/318 (46%)

Query:   286 ENILETFKAFIVKRGRQYANDEEIKERFE-YFKQDGH-------KKH--ERYGTSEFSDR 335
             +NI + + A+  K  + YA  +E  +R   Y+  D +        +H    YG ++ SD 
Sbjct:    84 QNIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDW 143

Query:   336 SPEE----ILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPA 391
             + EE    +L K+ +K   +  E I  +               P PD +DWR KNV  P 
Sbjct:   144 TDEEFEKTLLPKSFYKRLHKEAEFI--EPIPESLTAKKGESSSPFPDFFDWRDKNVITPV 201

Query:   392 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT 451
               Q  CGSCWAF+    +E  +AI  G+    S+  L++C    + C G D  ++   Y 
Sbjct:   202 KAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDE-DKAFRYI 260

Query:   452 HQAGLESEKDYPY--RNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLS 509
             H+ GL +  D PY     NG      ++ +++K      FL+ +  +++   L  +GP++
Sbjct:   261 HRNGLANAVDLPYVAHRQNGCAVNDHWNTTRIK---AAYFLHHD-EDSIINWLVNFGPVN 316

Query:   510 VGLNS-HLIHFYNGTPIRKNDETCSPYDLG-HAVLLVGYG-KQDDIPYWLARNSWGPI-G 565
             +G+     +  Y G     ++  C    +G HA+L+ GYG  +    YW+ +NSWG   G
Sbjct:   317 IGMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWG 376

Query:   566 PDEGFFKIERGNNACGIE 583
              + G+    RG NACGIE
Sbjct:   377 VEHGYIYFARGINACGIE 394

 Score = 317 (116.6 bits), Expect = 3.2e-27, P = 3.2e-27
 Identities = 72/218 (33%), Positives = 110/218 (50%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E   P PD +DWR KNV  P   Q  CGSCWAF+    +E  +AI  G+    S+  L++
Sbjct:   181 ESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLD 240

Query:    66 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPY--KNANGEKFKCAYDKSKVKLFTGKDF 123
             C    + CDG   + +  Y H+ GL +  D PY     NG      ++ +++K      F
Sbjct:   241 CDLVDNACDGGDEDKAFRYIHRNGLANAVDLPYVAHRQNGCAVNDHWNTTRIK---AAYF 297

Query:   124 LHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLG-HAVLLVGYG 181
             LH +  +++   L  +GP+++ +     +  Y G     ++  C    +G HA+L+ GYG
Sbjct:   298 LHHD-EDSIINWLVNFGPVNIGMAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYG 356

Query:   182 -KQDNIPYWLVRNSWGPI-GPDEGFFKIERGNNACGIE 217
               +    YW+V+NSWG   G + G+    RG NACGIE
Sbjct:   357 TSKTGEKYWIVKNSWGNTWGVEHGYIYFARGINACGIE 394


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 72/210 (34%), Positives = 114/210 (54%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+   
Sbjct:   108 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNN 167

Query:   802 SGCDGCFFEPSIEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHF 857
              GC+G     ++ + +  Q  L  + +YP+K  NG    F  ++    +K ++  DF   
Sbjct:   168 YGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFS-- 225

Query:   858 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
             +  + M K L  +GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  + 
Sbjct:   226 DQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKTGST 282

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 947
             PYW+VRNSWG     +G+  ++ G+N CGI
Sbjct:   283 PYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312

 Score = 337 (123.7 bits), Expect = 2.3e-29, P = 2.3e-29
 Identities = 72/210 (34%), Positives = 114/210 (54%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+   
Sbjct:   108 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNN 167

Query:    71 SGCDGCFFEPSIEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHF 126
              GC+G     ++ + +  Q  L  + +YP+K  NG    F  ++    +K ++  DF   
Sbjct:   168 YGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFS-- 225

Query:   127 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             +  + M K L  +GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  + 
Sbjct:   226 DQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKTGST 282

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             PYW+VRNSWG     +G+  ++ G+N CGI
Sbjct:   283 PYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312

 Score = 314 (115.6 bits), Expect = 6.8e-27, P = 6.8e-27
 Identities = 70/211 (33%), Positives = 109/211 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L + S  Q+++C+   
Sbjct:   108 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNN 167

Query:   436 SGCGGCDGLEQPIEYTH--QAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTGKDFLY 491
              GC G   L   + + +  Q  L  + +YP++  NG    F  ++    +K ++  DF  
Sbjct:   168 YGCNGGSTLNA-LNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFS- 225

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
              +  + M K L  +GPL V +++     Y G  I+ +   CS  +  HAVL+ G+ K   
Sbjct:   226 -DQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHH---CSSGEANHAVLITGFDKTGS 281

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGI 582
              PYW+ RNSWG     +G+  ++ G+N CGI
Sbjct:   282 TPYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312

 Score = 138 (53.6 bits), Expect = 5.7e-06, P = 5.7e-06
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS  +  HAVL+ G+ K    PYW+VRNSWG     +G+  ++ G+N CGI
Sbjct:   262 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 84/225 (37%), Positives = 120/225 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP-Q 174

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++Q  +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+  
Sbjct:   175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDI 230

Query:   493 -NGSE-TMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
              +G+E  +   +   GP+SV ++ SH  + FY        +  CS   L HAVL+VGYG 
Sbjct:   231 PSGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY--ERACSSSRLDHAVLVVGYGY 288

Query:   549 QD-DIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             Q  D+    YW+ +NSW     D+G+  + +  NN CG+   A Y
Sbjct:   289 QGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 333

 Score = 325 (119.5 bits), Expect = 4.5e-28, P = 4.5e-28
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:    72 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 126
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   127 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             +G+E  +   +   GP+SV ++ S     +  + I   +  CS   L HAVL+VGYG Q 
Sbjct:   232 SGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACSSSRLDHAVLVVGYGYQG 290

Query:   185 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
              ++    YW+V+NSW     D+G+  + +  NN CG+   A Y
Sbjct:   291 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 333

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   803 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 857
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   858 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             +G+E  +   +   GP+SV ++ S     +  + I   +  CS   L HAVL+VGYG Q 
Sbjct:   232 SGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACSSSRLDHAVLVVGYGYQG 290

Query:   916 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
              ++    YW+V+NSW     D+G+  + +  NN CG+   A Y
Sbjct:   291 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 333


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 339 (124.4 bits), Expect = 1.4e-29, P = 1.4e-29
 Identities = 84/225 (37%), Positives = 120/225 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   132 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP-Q 190

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++Q  +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+  
Sbjct:   191 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDI 246

Query:   493 -NGSE-TMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
              +G+E  +   +   GP+SV ++ SH  + FY        +  CS   L HAVL+VGYG 
Sbjct:   247 PSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY--ERACSSSRLDHAVLVVGYGY 304

Query:   549 QD-DIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             Q  D+    YW+ +NSW     D+G+  + +  NN CG+   A Y
Sbjct:   305 QGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 349

 Score = 325 (119.5 bits), Expect = 4.5e-28, P = 4.5e-28
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   132 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 191

Query:    72 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 126
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   192 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 247

Query:   127 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             +G+E  +   +   GP+SV ++ S     +  + I   +  CS   L HAVL+VGYG Q 
Sbjct:   248 SGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACSSSRLDHAVLVVGYGYQG 306

Query:   185 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
              ++    YW+V+NSW     D+G+  + +  NN CG+   A Y
Sbjct:   307 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 349

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   132 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 191

Query:   803 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 857
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   192 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 247

Query:   858 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             +G+E  +   +   GP+SV ++ S     +  + I   +  CS   L HAVL+VGYG Q 
Sbjct:   248 SGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACSSSRLDHAVLVVGYGYQG 306

Query:   916 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
              ++    YW+V+NSW     D+G+  + +  NN CG+   A Y
Sbjct:   307 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASY 349


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 338 (124.0 bits), Expect = 1.8e-29, P = 1.8e-29
 Identities = 86/228 (37%), Positives = 120/228 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 69
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+  + 
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 127
               GC+G   + + +Y     GL+SE+ YPY+ A  E   C Y+ K  V   TG  F+   
Sbjct:   175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYE-ATEES--CKYNPKYSVANDTG--FVDIP 229

Query:   128 GSE-TMKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYG- 181
               E  + K +   GP+SV +++   H+    Y      + D  CS  D+ H VL+VGYG 
Sbjct:   230 KQEKALMKAVATVGPISVAIDAG--HESFLFYKEGIYFEPD--CSSEDMDHGVLVVGYGF 285

Query:   182 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                + DN  YWLV+NSWG      G+ K+ +   N CGI   A Y T+
Sbjct:   286 ESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333

 Score = 337 (123.7 bits), Expect = 2.3e-29, P = 2.3e-29
 Identities = 86/228 (37%), Positives = 120/228 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 800
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+  + 
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query:   801 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 858
               GC+G   + + +Y     GL+SE+ YPY+ A  E   C Y+ K  V   TG  F+   
Sbjct:   175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYE-ATEES--CKYNPKYSVANDTG--FVDIP 229

Query:   859 GSE-TMKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYG- 912
               E  + K +   GP+SV +++   H+    Y      + D  CS  D+ H VL+VGYG 
Sbjct:   230 KQEKALMKAVATVGPISVAIDAG--HESFLFYKEGIYFEPD--CSSEDMDHGVLVVGYGF 285

Query:   913 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                + DN  YWLV+NSWG      G+ K+ +   N CGI   A Y T+
Sbjct:   286 ESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333

 Score = 329 (120.9 bits), Expect = 1.7e-28, P = 1.7e-28
 Identities = 84/227 (37%), Positives = 117/227 (51%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+    
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP-Q 173

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++   +Y     GL+SE+ YPY      +  C Y+ K  V   TG  F+  
Sbjct:   174 GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE---ATEESCKYNPKYSVANDTG--FVDI 228

Query:   493 NGSE-TMKKILYKYGPLSVGLNS-H-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 547
                E  + K +   GP+SV +++ H    FY      + D  CS  D+ H VL+VGYG  
Sbjct:   229 PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPD--CSSEDMDHGVLVVGYGFE 286

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
               + D+  YWL +NSWG      G+ K+ +   N CGI   A Y T+
Sbjct:   287 STESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333

 Score = 132 (51.5 bits), Expect = 2.9e-05, P = 2.9e-05
 Identities = 28/65 (43%), Positives = 37/65 (56%)

Query:   965 CSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIA 1019
             CS  D+ H VL+VGYG    + D+  YWLV+NSWG      G+ K+ +   N CGI   A
Sbjct:   269 CSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAA 328

Query:  1020 GYATI 1024
              Y T+
Sbjct:   329 SYPTV 333


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 338 (124.0 bits), Expect = 1.8e-29, P = 1.8e-29
 Identities = 99/317 (31%), Positives = 142/317 (44%)

Query:   661 FIVKRGRQYANDEEIKERFEYF--KQDGHKKH-ERYGTSE---------FSDRSPEEILC 708
             F  K G++YAN EE   R   F  K    ++H ERY   E         FSD + EE+L 
Sbjct:    23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82

Query:   709 -KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCW 767
              KTG   + R +   V  +              P+    DWR K    P  DQ  CGSCW
Sbjct:    83 TKTGM--TRRRHPLSVLPKSAPTT---------PMAADVDWRNKGAVTPVKDQGQCGSCW 131

Query:   768 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYT-HQAGLESE 824
             AFS    LEG + +KTG LV  S+  LV+C+      GC+G +   + +Y     G+++E
Sbjct:   132 AFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTE 191

Query:   825 KDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLNS--DL 881
               YPYK  +     C YD   +           +G E+ ++  +   GP+SV +++    
Sbjct:   192 SSYPYKAIDDN---CRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSS 248

Query:   882 IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-PYWLVRNSWGPIGPDEGFFKIER 940
                Y G      +  C  +   HAV  VGYG   N   YW+V+NSWG    + G+ K+ R
Sbjct:   249 FGSYGGGVYY--EPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMAR 306

Query:   941 G-NNACGIEQIAGYATI 956
               +N C I   + Y  +
Sbjct:   307 NRDNNCAIATYSVYPVV 323

 Score = 323 (118.8 bits), Expect = 7.4e-28, P = 7.4e-28
 Identities = 99/318 (31%), Positives = 142/318 (44%)

Query:   295 FIVKRGRQYANDEEIKERFEYF--KQDGHKKH-ERYGTSE---------FSDRSPEEILC 342
             F  K G++YAN EE   R   F  K    ++H ERY   E         FSD + EE+L 
Sbjct:    23 FKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEVLA 82

Query:   343 -KTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCW 401
              KTG   + R +   V  +              P+    DWR K    P  DQ  CGSCW
Sbjct:    83 TKTGM--TRRRHPLSVLPKSAPTT---------PMAADVDWRNKGAVTPVKDQGQCGSCW 131

Query:   402 AFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT-HQAGLES 458
             AFS    LEG + +KTG LV  S+  LV+C+    G  GC+G    Q  +Y     G+++
Sbjct:   132 AFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSY-GNQGCNGGWPYQAYQYIIANRGIDT 190

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKILYKYGPLSVGLNSHLI 517
             E  YPY+  +     C YD   +           +G E+ ++  +   GP+SV +++   
Sbjct:   191 ESSYPYKAIDDN---CRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQS 247

Query:   518 HF--YNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DDIPYWLARNSWGPIGPDEGFFKIE 574
              F  Y G      +  C  +   HAV  VGYG   +   YW+ +NSWG    + G+ K+ 
Sbjct:   248 SFGSYGGGVYY--EPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMA 305

Query:   575 RG-NNACGIEQIAGYATI 591
             R  +N C I   + Y  +
Sbjct:   306 RNRDNNCAIATYSVYPVV 323

 Score = 305 (112.4 bits), Expect = 6.2e-26, P = 6.2e-26
 Identities = 71/218 (32%), Positives = 104/218 (47%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GC 73
             DWR K    P  DQ  CGSCWAFS    LEG + +KTG LV  S+  LV+C+      GC
Sbjct:   111 DWRNKGAVTPVKDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGC 170

Query:    74 DGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET- 131
             +G +   + +Y     G+++E  YPYK  +     C YD   +           +G E+ 
Sbjct:   171 NGGWPYQAYQYIIANRGIDTESSYPYKAIDDN---CRYDAGNIGATVSSYVEPASGDESA 227

Query:   132 MKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-PY 188
             ++  +   GP+SV +++       Y G      +  C  +   HAV  VGYG   N   Y
Sbjct:   228 LQHAVQNEGPVSVCIDAGQSSFGSYGGGVYY--EPNCDSWYANHAVTAVGYGTDANGGDY 285

Query:   189 WLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             W+V+NSWG    + G+ K+ R  +N C I   + Y  +
Sbjct:   286 WIVKNSWGAWWGESGYIKMARNRDNNCAIATYSVYPVV 323


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 337 (123.7 bits), Expect = 2.3e-29, P = 2.3e-29
 Identities = 79/232 (34%), Positives = 120/232 (51%)

Query:     5 VEKDGP-VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             ++K  P +P   DWRK+    P   Q  CG+CWAFS+   +EGQ   KTGKL+  S   L
Sbjct:   105 IQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNL 164

Query:    64 VECAKQ--CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTG 120
             ++C+      GCDG     + +Y  +  GLE+E  YPY+ A  +  +   ++S VK+   
Sbjct:   165 MDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYE-AKAKHCRYRPERSVVKV--N 221

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLV 178
             + F+     E + + L  +GP++V ++      H Y G     ++  C    L H +LLV
Sbjct:   222 RFFVVPRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIY--HEPKCRKDTLDHGLLLV 279

Query:   179 GYG----KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             GYG    + +N  YWL++NS G    + G+ K+ RG NN CGI   A Y  +
Sbjct:   280 GYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 331

 Score = 332 (121.9 bits), Expect = 8.0e-29, P = 8.0e-29
 Identities = 77/225 (34%), Positives = 117/225 (52%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ- 800
             +P   DWRK+    P   Q +CG+CWAFS+   +EGQ   KTGKL+  S   L++C+   
Sbjct:   112 IPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSY 171

Query:   801 -CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GCDG     + +Y  +  GLE+E  YPY+ A  +  +   ++S VK+   + F+   
Sbjct:   172 GTKGCDGGRPYDAFQYVKNNGGLEAEATYPYE-AKAKHCRYRPERSVVKV--NRFFVVPR 228

Query:   859 GSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG---- 912
               E + + L  +GP++V ++      H Y G     ++  C    L H +LLVGYG    
Sbjct:   229 NEEALLQALVTHGPIAVAIDGSHASFHSYRGGIY--HEPKCRKDTLDHGLLLVGYGYEGH 286

Query:   913 KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
             + +N  YWL++NS G    + G+ K+ RG NN CGI   A Y  +
Sbjct:   287 ESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 331

 Score = 329 (120.9 bits), Expect = 1.7e-28, P = 1.7e-28
 Identities = 80/228 (35%), Positives = 117/228 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWRK+    P   Q +CG+CWAFS+   +EGQ   KTGKL+  S   L++C+   
Sbjct:   112 IPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSY 171

Query:   436 SGCGGCDGLE--QPIEYT-HQAGLESEKDYPYRNGNGEKFKCAY--DKSKVKLFTGKDFL 490
              G  GCDG       +Y  +  GLE+E  YPY     +   C Y  ++S VK+   + F+
Sbjct:   172 -GTKGCDGGRPYDAFQYVKNNGGLEAEATYPYE---AKAKHCRYRPERSVVKV--NRFFV 225

Query:   491 YFNGSETMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYG- 547
                  E + + L  +GP++V ++ SH   H Y G     ++  C    L H +LLVGYG 
Sbjct:   226 VPRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIY--HEPKCRKDTLDHGLLLVGYGY 283

Query:   548 ---KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                + ++  YWL +NS G    + G+ K+ RG NN CGI   A Y  +
Sbjct:   284 EGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMYPAL 331

 Score = 119 (46.9 bits), Expect = 0.00077, P = 0.00077
 Identities = 26/71 (36%), Positives = 39/71 (54%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             + ++  C    L H +LLVGYG    + ++  YWL++NS G    + G+ K+ RG NN C
Sbjct:   261 IYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNYC 320

Query:  1014 GIEQIAGYATI 1024
             GI   A Y  +
Sbjct:   321 GIASYAMYPAL 331


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 336 (123.3 bits), Expect = 3.0e-29, P = 3.0e-29
 Identities = 101/337 (29%), Positives = 160/337 (47%)

Query:   641 LAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKK-HER-- 692
             +A E  +  +   +   ++ ++V+  + Y    E + RF+ FK      D H    +R  
Sbjct:    27 VATETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTF 86

Query:   693 -YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRK 750
               G + F+D + EE       K  ERT + +  +R            +G V PD  DWR 
Sbjct:    87 EVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERYLYK--------EGDVLPDEVDWRA 138

Query:   751 KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC--SGCDGCF 808
                     DQ  CGSCWAFS  G +EG   I TG+L+  S+ +LV+C +    +GCDG  
Sbjct:   139 NGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGI 198

Query:   809 FEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS---KVKLFTGKDFLHFNGSETMK 864
                + E+  +  G+E+++DYPY NAN +   C  DK+   +V    G + +  +  +++K
Sbjct:   199 MNYAFEFIMKNGGIETDQDYPY-NAN-DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLK 256

Query:   865 KILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 922
             K +  + P+SV +  +S     Y    +     TC    L H V++VGYG      YW++
Sbjct:   257 KAV-AHQPVSVAIEASSQAFQLYKSGVMTG---TCG-ISLDHGVVVVGYGSTSGEDYWII 311

Query:   923 RNSWGPIGPDEGFFKIERGNN----ACGIEQIAGYAT 955
             RNSWG    D G+ K++R  +     CGI  +  Y T
Sbjct:   312 RNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPT 348

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 100/338 (29%), Positives = 157/338 (46%)

Query:   275 LAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKK-HER-- 326
             +A E  +  +   +   ++ ++V+  + Y    E + RF+ FK      D H    +R  
Sbjct:    27 VATETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTF 86

Query:   327 -YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRK 384
               G + F+D + EE       K  ERT + +  +R            +G V PD  DWR 
Sbjct:    87 EVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERYLYK--------EGDVLPDEVDWRA 138

Query:   385 KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG- 443
                     DQ  CGSCWAFS  G +EG   I TG+L+  S+ +LV+C +     G CDG 
Sbjct:   139 NGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAG-CDGG 197

Query:   444 -LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS---KVKLFTGKDFLYFNGSETM 498
              +    E+  +  G+E+++DYPY N N +   C  DK+   +V    G + +  +  +++
Sbjct:   198 IMNYAFEFIMKNGGIETDQDYPY-NAN-DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSL 255

Query:   499 KKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWL 556
             KK +  + P+SV +  +S     Y    +     TC    L H V++VGYG      YW+
Sbjct:   256 KKAV-AHQPVSVAIEASSQAFQLYKSGVMTG---TCG-ISLDHGVVVVGYGSTSGEDYWI 310

Query:   557 ARNSWGPIGPDEGFFKIERGNN----ACGIEQIAGYAT 590
              RNSWG    D G+ K++R  +     CGI  +  Y T
Sbjct:   311 IRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPT 348

 Score = 322 (118.4 bits), Expect = 9.4e-28, P = 9.4e-28
 Identities = 80/231 (34%), Positives = 121/231 (52%)

Query:     7 KDGPV-PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             K+G V PD  DWR         DQ +CGSCWAFS  G +EG   I TG+L+  S+ +LV+
Sbjct:   125 KEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVD 184

Query:    66 CAKQC--SGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS---KVKLFT 119
             C +    +GCDG     + E+  +  G+E+++DYPY NAN +   C  DK+   +V    
Sbjct:   185 CDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY-NAN-DLGLCNADKNNNTRVVTID 242

Query:   120 GKDFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 177
             G + +  +  +++KK +  + P+SV +  +S     Y    +     TC    L H V++
Sbjct:   243 GYEDVPRDDEKSLKKAV-AHQPVSVAIEASSQAFQLYKSGVMTG---TCG-ISLDHGVVV 297

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNN----ACGIEQIAGYAT 224
             VGYG      YW++RNSWG    D G+ K++R  +     CGI  +  Y T
Sbjct:   298 VGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPT 348

 Score = 122 (48.0 bits), Expect = 0.00043, P = 0.00043
 Identities = 25/64 (39%), Positives = 34/64 (53%)

Query:   964 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNN----ACGIEQIA 1019
             TC    L H V++VGYG      YW++RNSWG    D G+ K++R  +     CGI  + 
Sbjct:   286 TCG-ISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMP 344

Query:  1020 GYAT 1023
              Y T
Sbjct:   345 SYPT 348


>UNIPROTKB|F1S4J6
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 334 (122.6 bits), Expect = 4.9e-29, P = 4.9e-29
 Identities = 85/225 (37%), Positives = 117/225 (52%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR+K       +Q  CGSCWAFS  G LEGQ   KT KL+  S+  LV+C+    
Sbjct:   115 PHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWP-E 173

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++   +Y     GL+SE+ YPY   +G    C Y  +S     TG   +  
Sbjct:   174 GNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGS---CKYKPQSSAANDTGYVDIPK 230

Query:   493 NGSETMKKILYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ- 549
                  MK +    GP+SVG++ SH    FY+ T I    + CS  DL H VL+VGYG + 
Sbjct:   231 QEKALMKAVA-TVGPISVGIDASHESFQFYS-TGIYFEPQ-CSSEDLDHGVLVVGYGVEG 287

Query:   550 --DDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                +  YWL +NSWG     +G+ K+ +  NN CGI  +A Y  +
Sbjct:   288 AHSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPVV 332

 Score = 332 (121.9 bits), Expect = 8.0e-29, P = 8.0e-29
 Identities = 82/224 (36%), Positives = 116/224 (51%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 69
             P + DWR+K       +Q  CGSCWAFS  G LEGQ   KT KL+  S+  LV+C+  + 
Sbjct:   115 PHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEG 174

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 127
               GC+G   + + +Y     GL+SE+ YPY   +G    C Y  +S     TG  ++   
Sbjct:   175 NEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGS---CKYKPQSSAANDTG--YVDIP 229

Query:   128 GSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 183
               E  + K +   GP+SV ++ S     +  T I    + CS  DL H VL+VGYG +  
Sbjct:   230 KQEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQ-CSSEDLDHGVLVVGYGVEGA 288

Query:   184 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               N  YWLV+NSWG     +G+ K+ +  NN CGI  +A Y  +
Sbjct:   289 HSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPVV 332

 Score = 331 (121.6 bits), Expect = 1.0e-28, P = 1.0e-28
 Identities = 82/224 (36%), Positives = 116/224 (51%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 800
             P + DWR+K       +Q  CGSCWAFS  G LEGQ   KT KL+  S+  LV+C+  + 
Sbjct:   115 PHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEG 174

Query:   801 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 858
               GC+G   + + +Y     GL+SE+ YPY   +G    C Y  +S     TG  ++   
Sbjct:   175 NEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGS---CKYKPQSSAANDTG--YVDIP 229

Query:   859 GSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-- 914
               E  + K +   GP+SV ++ S     +  T I    + CS  DL H VL+VGYG +  
Sbjct:   230 KQEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQ-CSSEDLDHGVLVVGYGVEGA 288

Query:   915 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               N  YWLV+NSWG     +G+ K+ +  NN CGI  +A Y  +
Sbjct:   289 HSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPVV 332

 Score = 137 (53.3 bits), Expect = 2.9e-05, Sum P(2) = 2.9e-05
 Identities = 28/64 (43%), Positives = 38/64 (59%)

Query:   965 CSPYDLGHAVLLVGYGKQ---DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             CS  DL H VL+VGYG +    +  YWLV+NSWG     +G+ K+ +  NN CGI  +A 
Sbjct:   269 CSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMAS 328

Query:  1021 YATI 1024
             Y  +
Sbjct:   329 YPVV 332

 Score = 39 (18.8 bits), Expect = 2.9e-05, Sum P(2) = 2.9e-05
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query:   302 QYANDE---EIKERFEYFKQDGHKKHE 325
             QY  D    + +E + YF +DG  K++
Sbjct:   188 QYIKDNGGLDSEESYPYFGKDGSCKYK 214

 Score = 39 (18.8 bits), Expect = 2.9e-05, Sum P(2) = 2.9e-05
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query:   668 QYANDE---EIKERFEYFKQDGHKKHE 691
             QY  D    + +E + YF +DG  K++
Sbjct:   188 QYIKDNGGLDSEESYPYFGKDGSCKYK 214


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 334 (122.6 bits), Expect = 4.9e-29, P = 4.9e-29
 Identities = 84/225 (37%), Positives = 120/225 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP-H 174

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++Q  +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+  
Sbjct:   175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDI 230

Query:   493 -NGSE-TMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
               G+E  +   +   GP+SV ++ SH  + FY        +  C+   L HAVL+VGYG 
Sbjct:   231 PKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY--ERACTS-QLDHAVLVVGYGY 287

Query:   549 QD-DIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             Q  D+    YW+ +NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   288 QGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query:    72 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 126
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   127 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              G+E  +   +   GP+SV ++ S     +  + I   +  C+   L HAVL+VGYG Q 
Sbjct:   232 KGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACTS-QLDHAVLVVGYGYQG 289

Query:   185 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
              ++    YW+V+NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   290 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332

 Score = 320 (117.7 bits), Expect = 1.5e-27, P = 1.5e-27
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query:   803 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 857
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   858 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              G+E  +   +   GP+SV ++ S     +  + I   +  C+   L HAVL+VGYG Q 
Sbjct:   232 KGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACTS-QLDHAVLVVGYGYQG 289

Query:   916 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
              ++    YW+V+NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   290 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332


>ZFIN|ZDB-GENE-071004-74
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 333 (122.3 bits), Expect = 6.3e-29, P = 6.3e-29
 Identities = 84/225 (37%), Positives = 120/225 (53%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP-Q 174

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC+G  ++Q  +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+  
Sbjct:   175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDI 230

Query:   493 -NGSE-TMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
               G+E  +   +   GP+SV ++ SH  + FY        +  C+   L HAVL+VGYG 
Sbjct:   231 PRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY--ERACTSR-LDHAVLVVGYGY 287

Query:   549 QD-DIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             Q  D+    YW+ +NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   288 QGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332

 Score = 319 (117.4 bits), Expect = 2.0e-27, P = 2.0e-27
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:    72 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 126
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   127 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              G+E  +   +   GP+SV ++ S     +  + I   +  C+   L HAVL+VGYG Q 
Sbjct:   232 RGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACTSR-LDHAVLVVGYGYQG 289

Query:   185 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
              ++    YW+V+NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   290 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332

 Score = 318 (117.0 bits), Expect = 2.5e-27, P = 2.5e-27
 Identities = 80/223 (35%), Positives = 118/223 (52%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P   DWR++    P  DQ  CGSCW+FS  G LEGQ   KTGKL+  S+  LV+C++   
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   803 --GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 857
               GC+G   + + +Y  +  GL+SE+ YPY     +   C YD +  V   TG  F+   
Sbjct:   176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLAR--DDLPCRYDPRFNVAKITG--FVDIP 231

Query:   858 NGSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              G+E  +   +   GP+SV ++ S     +  + I   +  C+   L HAVL+VGYG Q 
Sbjct:   232 RGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYY-ERACTSR-LDHAVLVVGYGYQG 289

Query:   916 -NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
              ++    YW+V+NSW     D+G+  + +  NN CGI  +A Y
Sbjct:   290 ADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASY 332


>ZFIN|ZDB-GENE-041010-76
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 332 (121.9 bits), Expect = 8.0e-29, P = 8.0e-29
 Identities = 96/281 (34%), Positives = 136/281 (48%)

Query:   323 KHE-RYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWD 381
             KH  R G ++F D + EE      F+ +   Y R    +              P     D
Sbjct:    70 KHTFRLGMNQFGDMTNEE------FRQAMNGYNRDPNRKSKGSLFIEPSFFTAP--QQID 121

Query:   382 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGC 441
             WR+K    P  DQ  CGSCWAFS  G LEGQ   KTGKLV  S+  L++C++   G  GC
Sbjct:   122 WRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRP-QGNNGC 180

Query:   442 DG--LEQPIEYTHQA-GLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF-NGSE 496
             DG  ++Q  +Y     GL+SE+ YPY   + +   C YD +      TG  F+   +G E
Sbjct:   181 DGGLMDQAFQYVQDNNGLDSEESYPYLATDDQP--CHYDPRYSAANVTG--FVDIPSGKE 236

Query:   497 -TMKKILYKYGPLSVGLNS-H-LIHFY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQD-D 551
               + K +   GP++V +++ H    FY +G    K    CS  +L H VL+VGYG +  D
Sbjct:   237 HALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEK---ACSTEELDHGVLVVGYGYEGVD 293

Query:   552 IP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             +    YW+ +NSW     D+G+  + +   N CGI   A Y
Sbjct:   294 VAGRRYWIVKNSWTDRWGDKGYIYMAKDLKNHCGIATSASY 334

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 92/280 (32%), Positives = 135/280 (48%)

Query:   689 KHE-RYGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWD 747
             KH  R G ++F D + EE      F+ +   Y R    +              P     D
Sbjct:    70 KHTFRLGMNQFGDMTNEE------FRQAMNGYNRDPNRKSKGSLFIEPSFFTAP--QQID 121

Query:   748 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC--SGCD 805
             WR+K    P  DQ  CGSCWAFS  G LEGQ   KTGKLV  S+  L++C++    +GCD
Sbjct:   122 WRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCD 181

Query:   806 GCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF-NGSE- 861
             G   + + +Y     GL+SE+ YPY   + +   C YD +      TG  F+   +G E 
Sbjct:   182 GGLMDQAFQYVQDNNGLDSEESYPYLATDDQP--CHYDPRYSAANVTG--FVDIPSGKEH 237

Query:   862 TMKKILYKYGPLSVLLNS--DLIHDY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQD-NI 917
              + K +   GP++V +++  +    Y +G    K    CS  +L H VL+VGYG +  ++
Sbjct:   238 ALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEK---ACSTEELDHGVLVVGYGYEGVDV 294

Query:   918 P---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
                 YW+V+NSW     D+G+  + +   N CGI   A Y
Sbjct:   295 AGRRYWIVKNSWTDRWGDKGYIYMAKDLKNHCGIATSASY 334

 Score = 318 (117.0 bits), Expect = 2.5e-27, P = 2.5e-27
 Identities = 81/225 (36%), Positives = 118/225 (52%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 70
             P   DWR+K    P  DQ  CGSCWAFS  G LEGQ   KTGKLV  S+  L++C++   
Sbjct:   117 PQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQG 176

Query:    71 -SGCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF- 126
              +GCDG   + + +Y     GL+SE+ YPY   + +   C YD +      TG  F+   
Sbjct:   177 NNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATDDQP--CHYDPRYSAANVTG--FVDIP 232

Query:   127 NGSE-TMKKILYKYGPLSVLLNS--DLIHDY-NGTPIRKNDETCSPYDLGHAVLLVGYGK 182
             +G E  + K +   GP++V +++  +    Y +G    K    CS  +L H VL+VGYG 
Sbjct:   233 SGKEHALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEK---ACSTEELDHGVLVVGYGY 289

Query:   183 QD-NIP---YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             +  ++    YW+V+NSW     D+G+  + +   N CGI   A Y
Sbjct:   290 EGVDVAGRRYWIVKNSWTDRWGDKGYIYMAKDLKNHCGIATSASY 334


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 331 (121.6 bits), Expect = 1.0e-28, P = 1.0e-28
 Identities = 83/260 (31%), Positives = 130/260 (50%)

Query:   693 YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN 752
             YG ++FS   PEE       + S   + R  A+                +P  +DWR K+
Sbjct:    59 YGINQFSYLFPEEFKA-IYLRSSPSRFPRFPAEEYTSISNLS-------LPLRFDWRDKH 110

Query:   753 VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPS 812
             V     +Q  CG CWAFS+ G +E   AIK   L   S  Q+++C+    GC+G     +
Sbjct:   111 VVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLSA 170

Query:   813 IEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKIL 867
             + + +  Q  L  + +YP++  NG    F  ++  S +K ++  DF   +G E  M + L
Sbjct:   171 LYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSAYDF---SGQEDKMAEAL 227

Query:   868 YKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
                GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  +IPYW+VRNSWG
Sbjct:   228 LALGPLIVVVDAMSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGSIPYWIVRNSWG 284

Query:   928 PIGPDEGFFKIERGNNACGI 947
                  +G+ +++ G N CGI
Sbjct:   285 TSWGIDGYVRVKMGGNVCGI 304

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 73/211 (34%), Positives = 115/211 (54%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P  +DWR K+V     +Q  CG CWAFS+ G +E   AIK   L   S  Q+++C+   
Sbjct:   100 LPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSN 159

Query:    71 SGCDGCFFEPSIEYTH--QAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHF 126
              GC+G     ++ + +  Q  L  + +YP++  NG    F  ++  S +K ++  DF   
Sbjct:   160 YGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSAYDF--- 216

Query:   127 NGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
             +G E  M + L   GPL V++++    DY G  I+ +   CS  +  HAVL+ G+ K  +
Sbjct:   217 SGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGS 273

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             IPYW+VRNSWG     +G+ +++ G N CGI
Sbjct:   274 IPYWIVRNSWGTSWGIDGYVRVKMGGNVCGI 304

 Score = 313 (115.2 bits), Expect = 8.7e-27, P = 8.7e-27
 Identities = 82/260 (31%), Positives = 122/260 (46%)

Query:   327 YGTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN 386
             YG ++FS   PEE       + S   + R  A+                +P  +DWR K+
Sbjct:    59 YGINQFSYLFPEEFKA-IYLRSSPSRFPRFPAEEYTSISNLS-------LPLRFDWRDKH 110

Query:   387 VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQ 446
             V     +Q  CG CWAFS+ G +E   AIK   L   S  Q+++C+    GC G   L  
Sbjct:   111 VVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSNYGCNGGSPLSA 170

Query:   447 PIEYTH-QAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKIL 502
                    Q  L  + +YP++  NG    F  ++  S +K ++  DF   +G E  M + L
Sbjct:   171 LYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSAYDF---SGQEDKMAEAL 227

Query:   503 YKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWG 562
                GPL V +++     Y G  I+ +   CS  +  HAVL+ G+ K   IPYW+ RNSWG
Sbjct:   228 LALGPLIVVVDAMSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGSIPYWIVRNSWG 284

Query:   563 PIGPDEGFFKIERGNNACGI 582
                  +G+ +++ G N CGI
Sbjct:   285 TSWGIDGYVRVKMGGNVCGI 304

 Score = 145 (56.1 bits), Expect = 8.9e-07, P = 8.9e-07
 Identities = 24/51 (47%), Positives = 33/51 (64%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS  +  HAVL+ G+ K   IPYW+VRNSWG     +G+ +++ G N CGI
Sbjct:   254 CSSGEANHAVLVTGFDKTGSIPYWIVRNSWGTSWGIDGYVRVKMGGNVCGI 304


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 329 (120.9 bits), Expect = 1.7e-28, P = 1.7e-28
 Identities = 103/328 (31%), Positives = 149/328 (45%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE-------RYGTSEFSDRSPEE 339
             F AF  K  + Y+ +E +  +FE FK      D   K         ++G ++F+D S EE
Sbjct:    27 FIAFQNKYNKIYSAEEYLV-KFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEE 85

Query:   340 ILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK--NVTGPAG----- 392
                K  +  S+    R+  D                 P A+DWR    +   P G     
Sbjct:    86 F--KKYYLSSKEA--RLTDDLPMLPNLSDDIIS--ATPAAFDWRNTGGSTKFPQGTPVTA 139

Query:   393 --DQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC------SGCG-GCDG 443
               +Q  CGSCW+FS  G +EGQ+ + TG LV  S+  LV+C   C      + C  GCDG
Sbjct:   140 VKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDG 199

Query:   444 LEQPIEYTH---QAGLESEKDYPYRNGNGE-KFKCAYDKSKVKLFTGKDFLYFNGSETMK 499
               QP  Y +     G+++E  YPY   +GE KF  A   +K+  FT    +       + 
Sbjct:   200 GLQPNAYNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGAKISSFT----MVPQNETQIA 255

Query:   500 KILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-----PY 554
               L+  GPL++  ++    FY G      D  C    L H +L+VGYG QD I     PY
Sbjct:   256 SYLFNNGPLAIAADAEEWQFYMGGVF---DFPCGQ-TLDHGILIVGYGAQDTIVGKNTPY 311

Query:   555 WLARNSWGPIGPDEGFFKIERGNNACGI 582
             W+ +NSWG    + G+ K+ER  + CG+
Sbjct:   312 WIIKNSWGADWGEAGYLKVERNTDKCGV 339

 Score = 316 (116.3 bits), Expect = 4.1e-27, P = 4.1e-27
 Identities = 100/328 (30%), Positives = 150/328 (45%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE-------RYGTSEFSDRSPEE 705
             F AF  K  + Y+ +E +  +FE FK      D   K         ++G ++F+D S EE
Sbjct:    27 FIAFQNKYNKIYSAEEYLV-KFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEE 85

Query:   706 ILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK--NVTGPAG----- 758
                K  +  S+    R+  D                 P A+DWR    +   P G     
Sbjct:    86 F--KKYYLSSKEA--RLTDDLPMLPNLSDDIIS--ATPAAFDWRNTGGSTKFPQGTPVTA 139

Query:   759 --DQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC----------SGCDG 806
               +Q  CGSCW+FS  G +EGQ+ + TG LV  S+  LV+C   C          +GCDG
Sbjct:   140 VKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDG 199

Query:   807 CFFEPSIEYT-HQAGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLHFNGSETMK 864
                  +  Y     G+++E  YPY   +GE KF  A   +K+  FT    +  N ++ + 
Sbjct:   200 GLQPNAYNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGAKISSFT---MVPQNETQ-IA 255

Query:   865 KILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-----NIPY 919
               L+  GPL++  +++    Y G      D  C    L H +L+VGYG QD     N PY
Sbjct:   256 SYLFNNGPLAIAADAEEWQFYMGGVF---DFPCGQ-TLDHGILIVGYGAQDTIVGKNTPY 311

Query:   920 WLVRNSWGPIGPDEGFFKIERGNNACGI 947
             W+++NSWG    + G+ K+ER  + CG+
Sbjct:   312 WIIKNSWGADWGEAGYLKVERNTDKCGV 339

 Score = 312 (114.9 bits), Expect = 1.1e-26, P = 1.1e-26
 Identities = 81/244 (33%), Positives = 119/244 (48%)

Query:     1 MLMEVEKD--GPVPDAWDWRKK--NVTGPAG-------DQADCGSCWAFSIAGMLEGQYA 49
             ML  +  D     P A+DWR    +   P G       +Q  CGSCW+FS  G +EGQ+ 
Sbjct:   104 MLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHY 163

Query:    50 IKTGKLVEFSKSQLVECAKQC----------SGCDGCFFEPSIEYT-HQAGLESEKDYPY 98
             + TG LV  S+  LV+C   C          +GCDG     +  Y     G+++E  YPY
Sbjct:   164 LSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPY 223

Query:    99 KNANGE-KFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGT 157
                +GE KF  A   +K+  FT    +  N ++ +   L+  GPL++  +++    Y G 
Sbjct:   224 TAVDGECKFNSAQVGAKISSFT---MVPQNETQ-IASYLFNNGPLAIAADAEEWQFYMGG 279

Query:   158 PIRKNDETCSPYDLGHAVLLVGYGKQD-----NIPYWLVRNSWGPIGPDEGFFKIERGNN 212
                  D  C    L H +L+VGYG QD     N PYW+++NSWG    + G+ K+ER  +
Sbjct:   280 VF---DFPCGQ-TLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTD 335

Query:   213 ACGI 216
              CG+
Sbjct:   336 KCGV 339

 Score = 136 (52.9 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 23/51 (45%), Positives = 33/51 (64%)

Query:   970 LGHAVLLVGYGKQDDI-----PYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             L H +L+VGYG QD I     PYW+++NSWG    + G+ K+ER  + CG+
Sbjct:   289 LDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTDKCGV 339


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 317 (116.6 bits), Expect = 2.1e-28, Sum P(2) = 2.1e-28
 Identities = 86/317 (27%), Positives = 144/317 (45%)

Query:   287 NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGH--KKH-ERYGTSEFSDRSPEEILCK 343
             N    F+ F     R+Y    +    ++ F+++    ++H + Y   + S R    I   
Sbjct:    31 NCKSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFAD 90

Query:   344 TGFKWSERTYERIVADRXXXXXXXXXXXXDGP----VPDAWDWRKKNVTGPAGDQAACGS 399
                    + + R++                 P    VP++ DWR K    P  +Q +CGS
Sbjct:    91 MSTDGYLKGFLRLLKSNIEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGS 150

Query:   400 CWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYTHQAG-L 456
             C+AFSIA  + GQ   +TGK++  SK Q+V+C+    G  GC G  L   + Y    G +
Sbjct:   151 CYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVS-HGNQGCVGGSLRNTLSYLQSTGGI 209

Query:   457 ESEKDYPYRNGNGEKFKCAY--DKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS 514
               ++DYPY    G   KC +  D S V + T    L     + ++  +   GP+++ +N+
Sbjct:   210 MRDQDYPYVARKG---KCQFVPDLSVVNV-TSWAILPVRDEQAIQAAVTHIGPVAISINA 265

Query:   515 HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIE 574
                 F   +    +D  CS   + HA++++G+GK     YW+ +N WG    + G+ +I 
Sbjct:   266 SPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGKD----YWILKNWWGQNWGENGYIRIR 321

Query:   575 RGNNACGIEQIAGYATI 591
             +G N CGI   A YA +
Sbjct:   322 KGVNMCGIANYAAYAIV 338

 Score = 312 (114.9 bits), Expect = 7.3e-28, Sum P(2) = 7.3e-28
 Identities = 85/317 (26%), Positives = 143/317 (45%)

Query:   653 NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGH--KKH-ERYGTSEFSDRSPEEILCK 709
             N    F+ F     R+Y    +    ++ F+++    ++H + Y   + S R    I   
Sbjct:    31 NCKSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFAD 90

Query:   710 TGFKWSERTYERIVADRXXXXXXXXXXXXDGP----VPDAWDWRKKNVTGPAGDQAACGS 765
                    + + R++                 P    VP++ DWR K    P  +Q +CGS
Sbjct:    91 MSTDGYLKGFLRLLKSNIEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGS 150

Query:   766 CWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYTHQAG-LE 822
             C+AFSIA  + GQ   +TGK++  SK Q+V+C+      GC G     ++ Y    G + 
Sbjct:   151 CYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIM 210

Query:   823 SEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN-S 879
              ++DYPY    G   KC +  D S V + T    L     + ++  +   GP+++ +N S
Sbjct:   211 RDQDYPYVARKG---KCQFVPDLSVVNV-TSWAILPVRDEQAIQAAVTHIGPVAISINAS 266

Query:   880 DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIE 939
                       I  +D  CS   + HA++++G+GK     YW+++N WG    + G+ +I 
Sbjct:   267 PKTFQLYSDGIY-DDPLCSSASVNHAMVVIGFGKD----YWILKNWWGQNWGENGYIRIR 321

Query:   940 RGNNACGIEQIAGYATI 956
             +G N CGI   A YA +
Sbjct:   322 KGVNMCGIANYAAYAIV 338

 Score = 310 (114.2 bits), Expect = 1.8e-26, P = 1.8e-26
 Identities = 72/221 (32%), Positives = 113/221 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VP++ DWR K    P  +Q  CGSC+AFSIA  + GQ   +TGK++  SK Q+V+C+   
Sbjct:   127 VPESLDWRSKGFITPPYNQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSH 186

Query:    71 S--GCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFLH 125
                GC G     ++ Y    G +  ++DYPY    G   KC +  D S V + T    L 
Sbjct:   187 GNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVARKG---KCQFVPDLSVVNV-TSWAILP 242

Query:   126 FNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
                 + ++  +   GP+++ +N S          I  +D  CS   + HA++++G+GK  
Sbjct:   243 VRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIY-DDPLCSSASVNHAMVVIGFGKD- 300

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
                YW+++N WG    + G+ +I +G N CGI   A YA +
Sbjct:   301 ---YWILKNWWGQNWGENGYIRIRKGVNMCGIANYAAYAIV 338

 Score = 139 (54.0 bits), Expect = 2.3e-05, Sum P(2) = 2.3e-05
 Identities = 24/68 (35%), Positives = 40/68 (58%)

Query:   957 DVVKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             D + +D  CS   + HA++++G+GK     YW+++N WG    + G+ +I +G N CGI 
Sbjct:   275 DGIYDDPLCSSASVNHAMVVIGFGKD----YWILKNWWGQNWGENGYIRIRKGVNMCGIA 330

Query:  1017 QIAGYATI 1024
               A YA +
Sbjct:   331 NYAAYAIV 338

 Score = 38 (18.4 bits), Expect = 2.1e-28, Sum P(2) = 2.1e-28
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query:    88 AGLESEKDYPYKNANGEKFKCAYDKSK 114
             A  +SE +  +KN N  K+   YD+ +
Sbjct:    30 ANCKSEFE-KFKNNNNRKYLRTYDEMR 55


>UNIPROTKB|F1PMM9
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 314 (115.6 bits), Expect = 6.8e-27, P = 6.8e-27
 Identities = 81/224 (36%), Positives = 112/224 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQ 69
             VP + DWR++    P  DQ  C  CWAFS  G LEGQ   KTGKLV  S+  LV+C+  Q
Sbjct:   122 VPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQ 181

Query:    70 CS-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
              + GC+G   E + +Y     GL+SE+ YPY  A  E  K   +KS   +      L  N
Sbjct:   182 GNRGCNGGLMEYAFQYVKDNGGLDSEESYPYL-ARNEPCKYRPEKSAANVTAFWPIL--N 238

Query:   128 GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG----K 182
               + +   +   GP+S  ++S      +    I   D  CS   L H VL+VGYG    +
Sbjct:   239 EEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIYY-DPKCSNKLLNHGVLVVGYGFEGAE 297

Query:   183 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              DN  YW+V+NSWG     +G+  + +  +N CGI   A Y  +
Sbjct:   298 SDNKKYWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASYPVV 341

 Score = 313 (115.2 bits), Expect = 2.2e-28, Sum P(2) = 2.2e-28
 Identities = 81/224 (36%), Positives = 112/224 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQ 800
             VP + DWR++    P  DQ  C  CWAFS  G LEGQ   KTGKLV  S+  LV+C+  Q
Sbjct:   122 VPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQ 181

Query:   801 CS-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
              + GC+G   E + +Y     GL+SE+ YPY  A  E  K   +KS   +      L  N
Sbjct:   182 GNRGCNGGLMEYAFQYVKDNGGLDSEESYPYL-ARNEPCKYRPEKSAANVTAFWPIL--N 238

Query:   859 GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG----K 913
               + +   +   GP+S  ++S      +    I   D  CS   L H VL+VGYG    +
Sbjct:   239 EEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIYY-DPKCSNKLLNHGVLVVGYGFEGAE 297

Query:   914 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              DN  YW+V+NSWG     +G+  + +  +N CGI   A Y  +
Sbjct:   298 SDNKKYWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASYPVV 341

 Score = 307 (113.1 bits), Expect = 9.5e-28, Sum P(2) = 9.5e-28
 Identities = 80/226 (35%), Positives = 110/226 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DWR++    P  DQ  C  CWAFS  G LEGQ   KTGKLV  S+  LV+C+   
Sbjct:   122 VPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWS- 180

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC+G  +E   +Y     GL+SE+ YPY   N E  K   +KS   +      L  
Sbjct:   181 QGNRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARN-EPCKYRPEKSAANVTAFWPIL-- 237

Query:   493 NGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 547
             N  + +   +   GP+S  ++S      FY        D  CS   L H VL+VGYG   
Sbjct:   238 NEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIYY--DPKCSNKLLNHGVLVVGYGFEG 295

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              + D+  YW+ +NSWG     +G+  + +  +N CGI   A Y  +
Sbjct:   296 AESDNKKYWIVKNSWGTNWGMQGYMLLAKDRDNHCGIATRASYPVV 341

 Score = 42 (19.8 bits), Expect = 2.2e-28, Sum P(2) = 2.2e-28
 Identities = 15/55 (27%), Positives = 24/55 (43%)

Query:   306 DEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYERIVADR 360
             +EE K+    FK   HKK + +    F++  P  +       W E+ Y   V D+
Sbjct:    93 NEEFKQVLNDFKIQKHKKGKVFPAPLFAE-VPSSV------DWREQGYVTPVKDQ 140

 Score = 42 (19.8 bits), Expect = 2.2e-28, Sum P(2) = 2.2e-28
 Identities = 15/55 (27%), Positives = 24/55 (43%)

Query:   672 DEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFKWSERTYERIVADR 726
             +EE K+    FK   HKK + +    F++  P  +       W E+ Y   V D+
Sbjct:    93 NEEFKQVLNDFKIQKHKKGKVFPAPLFAE-VPSSV------DWREQGYVTPVKDQ 140


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 327 (120.2 bits), Expect = 2.8e-28, P = 2.8e-28
 Identities = 100/327 (30%), Positives = 157/327 (48%)

Query:   282 TFDNENILETFKAFIVKRGRQY--ANDEEIK----ERFEYFKQDGHKKHE----RY--GT 329
             + DN ++ E ++++ +   R+Y   N+E I+    E+   F +  +K++E     Y  G 
Sbjct:    20 SLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGM 79

Query:   330 SEFSDRSPEEILCKT-GFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 388
             + F D + EE+  K  G +     Y R  A+              G +P + D+RK    
Sbjct:    80 NHFGDMTLEEVAEKVMGLQMP--MY-RDPANTFVPDDRV------GKLPKSIDYRKLGYV 130

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPI 448
                 +Q +CGSCWAFS  G LEGQ     G+LV+ S   LV+C  +  GCGG   +    
Sbjct:   131 TSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTENDGCGG-GYMTNAF 189

Query:   449 EY-THQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFT-GKDFLYFNGSETMKKILYKYG 506
              Y ++  G++SE+ YPY    G   +CAY+ S V     G   +       +   +   G
Sbjct:   190 RYVSNNQGIDSEESYPYV---GTDQQCAYNTSGVAASCRGYKEIPQGNERALTAAVANVG 246

Query:   507 PLSVGLN---SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-DIPYWLARNSWG 562
             P+SVG++   S  +++ +G      D  C+  D+ HAVL VGYG       YW+ +NSWG
Sbjct:   247 PVSVGIDAMQSTFLYYKSGVYY---DPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWG 303

Query:   563 PIGPDEGFFKIERG-NNACGIEQIAGY 588
                  +G+  + R  NNACGI  +A +
Sbjct:   304 EEWGKKGYVLMARNRNNACGIANLASF 330

 Score = 325 (119.5 bits), Expect = 4.5e-28, P = 4.5e-28
 Identities = 99/326 (30%), Positives = 155/326 (47%)

Query:   648 TFDNENILETFKAFIVKRGRQY--ANDEEIK----ERFEYFKQDGHKKHE----RY--GT 695
             + DN ++ E ++++ +   R+Y   N+E I+    E+   F +  +K++E     Y  G 
Sbjct:    20 SLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGM 79

Query:   696 SEFSDRSPEEILCKT-GFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVT 754
             + F D + EE+  K  G +     Y R  A+              G +P + D+RK    
Sbjct:    80 NHFGDMTLEEVAEKVMGLQMP--MY-RDPANTFVPDDRV------GKLPKSIDYRKLGYV 130

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE 814
                 +Q +CGSCWAFS  G LEGQ     G+LV+ S   LV+C  +  GC G +   +  
Sbjct:   131 TSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTENDGCGGGYMTNAFR 190

Query:   815 Y-THQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT-GKDFLHFNGSETMKKILYKYGP 872
             Y ++  G++SE+ YPY    G   +CAY+ S V     G   +       +   +   GP
Sbjct:   191 YVSNNQGIDSEESYPYV---GTDQQCAYNTSGVAASCRGYKEIPQGNERALTAAVANVGP 247

Query:   873 LSV---LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGP 928
             +SV    + S  ++  +G      D  C+  D+ HAVL VGYG       YW+V+NSWG 
Sbjct:   248 VSVGIDAMQSTFLYYKSGVYY---DPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGE 304

Query:   929 IGPDEGFFKIERG-NNACGIEQIAGY 953
                 +G+  + R  NNACGI  +A +
Sbjct:   305 EWGKKGYVLMARNRNNACGIANLASF 330

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 76/224 (33%), Positives = 112/224 (50%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             ++ G +P + D+RK        +Q  CGSCWAFS  G LEGQ     G+LV+ S   LV+
Sbjct:   113 DRVGKLPKSIDYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVD 172

Query:    66 CAKQCSGCDGCFFEPSIEY-THQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT-GKDF 123
             C  +  GC G +   +  Y ++  G++SE+ YPY    G   +CAY+ S V     G   
Sbjct:   173 CVTENDGCGGGYMTNAFRYVSNNQGIDSEESYPYV---GTDQQCAYNTSGVAASCRGYKE 229

Query:   124 LHFNGSETMKKILYKYGPLSV---LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
             +       +   +   GP+SV    + S  ++  +G      D  C+  D+ HAVL VGY
Sbjct:   230 IPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYY---DPNCNKEDVNHAVLAVGY 286

Query:   181 GKQDN-IPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             G       YW+V+NSWG     +G+  + R  NNACGI  +A +
Sbjct:   287 GATPRGKKYWIVKNSWGEEWGKKGYVLMARNRNNACGIANLASF 330

 Score = 134 (52.2 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 27/62 (43%), Positives = 36/62 (58%)

Query:   962 DETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIA 1019
             D  C+  D+ HAVL VGYG       YW+V+NSWG     +G+  + R  NNACGI  +A
Sbjct:   269 DPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMARNRNNACGIANLA 328

Query:  1020 GY 1021
              +
Sbjct:   329 SF 330


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 327 (120.2 bits), Expect = 2.8e-28, P = 2.8e-28
 Identities = 72/210 (34%), Positives = 111/210 (52%)

Query:   741 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
             P+P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L E S  Q+++C+  
Sbjct:   106 PLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYS 165

Query:   801 CSGCDGCFFEPSIEYTHQAGLESEKD--YPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
               GC G     ++ + +Q  ++  +D  Y +K   G      +    V + TG     F+
Sbjct:   166 NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSI-TGFAAYDFS 224

Query:   859 GSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
             G E  M ++L  +GPL+V +++    DY G  I+ +   CS     HAVL+ G+     I
Sbjct:   225 GQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYH---CSSGKANHAVLITGFDTTGII 281

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGI 947
             PYW+V+NSWG     +G+ +++ G+N CGI
Sbjct:   282 PYWIVQNSWGRTWGIDGYVRVKIGSNVCGI 311

 Score = 327 (120.2 bits), Expect = 2.8e-28, P = 2.8e-28
 Identities = 72/213 (33%), Positives = 113/213 (53%)

Query:     7 KDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 66
             ++ P+P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L E S  Q+++C
Sbjct:   103 EEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDC 162

Query:    67 AKQCSGCDGCFFEPSIEYTHQAGLESEKD--YPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             +    GC G     ++ + +Q  ++  +D  Y +K   G      +    V + TG    
Sbjct:   163 SYSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSI-TGFAAY 221

Query:   125 HFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
              F+G E  M ++L  +GPL+V +++    DY G  I+ +   CS     HAVL+ G+   
Sbjct:   222 DFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYH---CSSGKANHAVLITGFDTT 278

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
               IPYW+V+NSWG     +G+ +++ G+N CGI
Sbjct:   279 GIIPYWIVQNSWGRTWGIDGYVRVKIGSNVCGI 311

 Score = 303 (111.7 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 69/211 (32%), Positives = 109/211 (51%)

Query:   375 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 434
             P+P  +DWR K V     +Q  CG CWAFS+ G +E  YAIK   L E S  Q+++C+  
Sbjct:   106 PLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYS 165

Query:   435 CSGCGGCDGLEQPIEYTHQAGLESEKD--YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
               GC G   +   + + +Q  ++  +D  Y ++   G      +    V + TG     F
Sbjct:   166 NYGCSGGSTITA-LSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSI-TGFAAYDF 223

Query:   493 NGSET-MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
             +G E  M ++L  +GPL+V +++     Y G  I+ +   CS     HAVL+ G+     
Sbjct:   224 SGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYH---CSSGKANHAVLITGFDTTGI 280

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGI 582
             IPYW+ +NSWG     +G+ +++ G+N CGI
Sbjct:   281 IPYWIVQNSWGRTWGIDGYVRVKIGSNVCGI 311

 Score = 129 (50.5 bits), Expect = 5.7e-05, P = 5.7e-05
 Identities = 22/51 (43%), Positives = 32/51 (62%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS     HAVL+ G+     IPYW+V+NSWG     +G+ +++ G+N CGI
Sbjct:   261 CSSGKANHAVLITGFDTTGIIPYWIVQNSWGRTWGIDGYVRVKIGSNVCGI 311


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 326 (119.8 bits), Expect = 3.5e-28, P = 3.5e-28
 Identities = 79/224 (35%), Positives = 118/224 (52%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQ 69
             VP + DWR+K       +Q DCGSCWAFS    +EG   I+T KLV  S+ +LV+C  ++
Sbjct:   126 VPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEE 185

Query:    70 CSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
               GC G   EP+ E+  +  G+++E+ YPY +++ +  +      +     G + +  N 
Sbjct:   186 NQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPEND 245

Query:   129 SETMKKILYKYGPLSVLLN---SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
              E + K +  + P+SV ++   SD      G  I +    C    L H V++VGYG+  N
Sbjct:   246 EEELLKAV-AHQPVSVAIDAGSSDFQLYSEGVFIGE----CGT-QLNHGVVIVGYGETKN 299

Query:   186 -IPYWLVRNSWGPIGPDEGFFKIERG---NNA-CGIEQIAGYAT 224
                YW+VRNSWGP   + G+ +IERG   N   CGI   A Y T
Sbjct:   300 GTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPT 343

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 95/322 (29%), Positives = 154/322 (47%)

Query:   652 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHERYGT--SEFSDRSP 703
             EN+ + ++ +        A+ E IK RF  F+ +       +KK++ Y    + F+D + 
Sbjct:    32 ENVWKLYERWRGHHSVSRASHEAIK-RFNVFRHNVLHVHRTNKKNKPYKLKINRFADITH 90

Query:   704 EEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAAC 763
              E   ++ +  S   + R++                  VP + DWR+K       +Q  C
Sbjct:    91 HEF--RSSYAGSNVKHHRMLRGPKRGSGGFMYENVTR-VPSSVDWREKGAVTEVKNQQDC 147

Query:   764 GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCDGCFFEPSIEYT-HQAGL 821
             GSCWAFS    +EG   I+T KLV  S+ +LV+C  ++  GC G   EP+ E+  +  G+
Sbjct:   148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGI 207

Query:   822 ESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--- 878
             ++E+ YPY +++ +  +      +     G + +  N  E + K +  + P+SV ++   
Sbjct:   208 KTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAV-AHQPVSVAIDAGS 266

Query:   879 SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFK 937
             SD      G  I +    C    L H V++VGYG+  N   YW+VRNSWGP   + G+ +
Sbjct:   267 SDFQLYSEGVFIGE----CGT-QLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVR 321

Query:   938 IERG---NNA-CGIEQIAGYAT 955
             IERG   N   CGI   A Y T
Sbjct:   322 IERGISENEGRCGIAMEASYPT 343

 Score = 303 (111.7 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 93/324 (28%), Positives = 155/324 (47%)

Query:   286 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHERYGT--SEFSDRSP 337
             EN+ + ++ +        A+ E IK RF  F+ +       +KK++ Y    + F+D + 
Sbjct:    32 ENVWKLYERWRGHHSVSRASHEAIK-RFNVFRHNVLHVHRTNKKNKPYKLKINRFADITH 90

Query:   338 EEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAAC 397
              E   ++ +  S   + R++                  VP + DWR+K       +Q  C
Sbjct:    91 HEF--RSSYAGSNVKHHRMLRGPKRGSGGFMYENVTR-VPSSVDWREKGAVTEVKNQQDC 147

Query:   398 GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCGGCDGLEQP-IEYT-HQA 454
             GSCWAFS    +EG   I+T KLV  S+ +LV+C  ++  GC G  GL +P  E+  +  
Sbjct:   148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAG--GLMEPAFEFIKNNG 205

Query:   455 GLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS 514
             G+++E+ YPY + + +  +      +     G + +  N  E + K +  + P+SV +++
Sbjct:   206 GIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAV-AHQPVSVAIDA 264

Query:   515 HLIHF--YN-GTPIRKNDETCSPYDLGHAVLLVGYGK-QDDIPYWLARNSWGPIGPDEGF 570
                 F  Y+ G  I +    C    L H V++VGYG+ ++   YW+ RNSWGP   + G+
Sbjct:   265 GSSDFQLYSEGVFIGE----CGT-QLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGY 319

Query:   571 FKIERG----NNACGIEQIAGYAT 590
              +IERG       CGI   A Y T
Sbjct:   320 VRIERGISENEGRCGIAMEASYPT 343

 Score = 133 (51.9 bits), Expect = 2.7e-05, P = 2.7e-05
 Identities = 28/59 (47%), Positives = 37/59 (62%)

Query:   970 LGHAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKIERG---NNA-CGIEQIAGYAT 1023
             L H V++VGYG+ ++   YW+VRNSWGP   + G+ +IERG   N   CGI   A Y T
Sbjct:   285 LNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPT 343


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 324 (119.1 bits), Expect = 5.8e-28, P = 5.8e-28
 Identities = 93/320 (29%), Positives = 146/320 (45%)

Query:   655 LETFKAFIVKRGRQYANDEEI-KERFEYFKQD----GHKKHE------RYGTSEFSDRSP 703
             ++ F  F+ + G+ Y+++E + +E     K       +K  +      R G +  +D + 
Sbjct:    35 VQNFDDFLRQTGKVYSDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTR 94

Query:   704 EEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA- 762
             +EI    G K SE   ER                 +  +P+ +DWR+K    P G Q   
Sbjct:    95 KEIATLLGSKISEFG-ERYTNGHINFVTARNPASAN--LPEMFDWREKGGVTPPGFQGVG 151

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIEYTHQAG 820
             CG+CW+F+  G LEG    +TG L   S+  LV+CA      GCDG F E   EY    G
Sbjct:   152 CGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHG 211

Query:   821 LESEKDYPYKNANGE--KFKCAYDKSKVKLFTGKDFLHFN-GSET-MKKILYKYGPLSVL 876
             +     YPY     +  + + A    +  L   +D+     G E  MK+++   GPL+  
Sbjct:   212 VTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACS 271

Query:   877 LNSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEG 934
             +N+D I    Y+G      DE C+  +L H+V +VGYG ++   YW+++NS+     + G
Sbjct:   272 MNADTISFEQYSGGIYE--DEECNQGELNHSVTVVGYGTENGRDYWIIKNSYSQNWGEGG 329

Query:   935 FFKIERGNNA-CGIEQIAGY 953
             F +I R     CGI     Y
Sbjct:   330 FMRILRNAGGFCGIASECSY 349

 Score = 311 (114.5 bits), Expect = 1.4e-26, P = 1.4e-26
 Identities = 74/222 (33%), Positives = 110/222 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             +P+ +DWR+K    P G Q   CG+CW+F+  G LEG    +TG L   S+  LV+CA  
Sbjct:   130 LPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADD 189

Query:    70 CS--GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGE--KFKCAYDKSKVKLFTGKDFLH 125
                 GCDG F E   EY    G+     YPY     +  + + A    +  L   +D+  
Sbjct:   190 YGNMGCDGGFQEYGFEYIRDHGVTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYAT 249

Query:   126 FN-GSET-MKKILYKYGPLSVLLNSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
                G E  MK+++   GPL+  +N+D I    Y+G      DE C+  +L H+V +VGYG
Sbjct:   250 ITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYE--DEECNQGELNHSVTVVGYG 307

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA-CGIEQIAGY 222
              ++   YW+++NS+     + GF +I R     CGI     Y
Sbjct:   308 TENGRDYWIIKNSYSQNWGEGGFMRILRNAGGFCGIASECSY 349

 Score = 306 (112.8 bits), Expect = 4.9e-26, P = 4.9e-26
 Identities = 93/321 (28%), Positives = 145/321 (45%)

Query:   289 LETFKAFIVKRGRQYANDEEI-KERFEYFKQD----GHKKHE------RYGTSEFSDRSP 337
             ++ F  F+ + G+ Y+++E + +E     K       +K  +      R G +  +D + 
Sbjct:    35 VQNFDDFLRQTGKVYSDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTR 94

Query:   338 EEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA- 396
             +EI    G K SE   ER                 +  +P+ +DWR+K    P G Q   
Sbjct:    95 KEIATLLGSKISEFG-ERYTNGHINFVTARNPASAN--LPEMFDWREKGGVTPPGFQGVG 151

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYTHQA 454
             CG+CW+F+  G LEG    +TG L   S+  LV+CA    G  GCDG   E   EY    
Sbjct:   152 CGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDY-GNMGCDGGFQEYGFEYIRDH 210

Query:   455 GLESEKDYPYRNGNGE--KFKCAYDKSKVKLFTGKDFLYFN-GSET-MKKILYKYGPLSV 510
             G+     YPY     +  + + A    +  L   +D+     G E  MK+++   GPL+ 
Sbjct:   211 GVTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLAC 270

Query:   511 GLNSHLIHF--YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDE 568
              +N+  I F  Y+G      DE C+  +L H+V +VGYG ++   YW+ +NS+     + 
Sbjct:   271 SMNADTISFEQYSGGIYE--DEECNQGELNHSVTVVGYGTENGRDYWIIKNSYSQNWGEG 328

Query:   569 GFFKIERGNNA-CGIEQIAGY 588
             GF +I R     CGI     Y
Sbjct:   329 GFMRILRNAGGFCGIASECSY 349


>MGI|MGI:1922258
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 322 (118.4 bits), Expect = 9.4e-28, P = 9.4e-28
 Identities = 83/227 (36%), Positives = 111/227 (48%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             VP   DWR      P  +Q  C S WAFS  G LEGQ   KTG+LV  S+  L++C  + 
Sbjct:   114 VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSN 173

Query:   800 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                 C G F + + +Y     GL +E+ YPY    G   KC Y          +DF+   
Sbjct:   174 VTHDCSGGFMQNAFQYVKDNGGLATEESYPYI---GPGRKCRYHAEN-SAANVRDFVQIP 229

Query:   859 G-SETMKKILYKYGPLSVLLNSDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYG-- 912
             G  E + K + K GP+SV +  D  HD   +  + I    + C    L HAVL+VGYG  
Sbjct:   230 GREEALMKAVAKVGPISVAV--DASHDSFQFYDSGIYYEPQ-CKRVHLNHAVLVVGYGFE 286

Query:   913 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               + D   YWLV+NSWG     +G+ KI +  NN CGI  +A Y  +
Sbjct:   287 GEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 83/227 (36%), Positives = 111/227 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             VP   DWR      P  +Q  C S WAFS  G LEGQ   KTG+LV  S+  L++C  + 
Sbjct:   114 VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSN 173

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                 C G F + + +Y     GL +E+ YPY    G   KC Y          +DF+   
Sbjct:   174 VTHDCSGGFMQNAFQYVKDNGGLATEESYPYI---GPGRKCRYHAEN-SAANVRDFVQIP 229

Query:   128 G-SETMKKILYKYGPLSVLLNSDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYG-- 181
             G  E + K + K GP+SV +  D  HD   +  + I    + C    L HAVL+VGYG  
Sbjct:   230 GREEALMKAVAKVGPISVAV--DASHDSFQFYDSGIYYEPQ-CKRVHLNHAVLVVGYGFE 286

Query:   182 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               + D   YWLV+NSWG     +G+ KI +  NN CGI  +A Y  +
Sbjct:   287 GEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333

 Score = 311 (114.5 bits), Expect = 1.4e-26, P = 1.4e-26
 Identities = 82/227 (36%), Positives = 114/227 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 433
             VP   DWR      P  +Q  C S WAFS  G LEGQ   KTG+LV  S+  L++C  + 
Sbjct:   114 VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSN 173

Query:   434 QCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
                 C G   ++   +Y     GL +E+ YPY  G G K +   + S   +   +DF+  
Sbjct:   174 VTHDCSG-GFMQNAFQYVKDNGGLATEESYPYI-GPGRKCRYHAENSAANV---RDFVQI 228

Query:   493 NG-SETMKKILYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 547
              G  E + K + K GP+SV ++ SH    FY+ + I    + C    L HAVL+VGYG  
Sbjct:   229 PGREEALMKAVAKVGPISVAVDASHDSFQFYD-SGIYYEPQ-CKRVHLNHAVLVVGYGFE 286

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
               + D   YWL +NSWG     +G+ KI +  NN CGI  +A Y  +
Sbjct:   287 GEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPIV 333

 Score = 133 (51.9 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 29/65 (44%), Positives = 37/65 (56%)

Query:   965 CSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIA 1019
             C    L HAVL+VGYG    + D   YWLV+NSWG     +G+ KI +  NN CGI  +A
Sbjct:   269 CKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLA 328

Query:  1020 GYATI 1024
              Y  +
Sbjct:   329 TYPIV 333


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 322 (118.4 bits), Expect = 9.4e-28, P = 9.4e-28
 Identities = 82/236 (34%), Positives = 118/236 (50%)

Query:     2 LMEVEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKS 61
             +M+ E    +P   DWRKK    P   Q DC +CWAF++ G +E Q   +TGKL   S  
Sbjct:   106 IMKREAGSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQ 165

Query:    62 QLVECAKQC--SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKL- 117
              LV+C+K    +GC G     + +Y  H  GLESE  YPY+  +G    C Y+    K  
Sbjct:   166 NLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKDGP---CRYNPKNSKAE 222

Query:   118 FTGKDFLHFNGSET-MKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHA 174
              TG  F+    SE  +   +   GP++  +++  +   +Y G     ++  CS   + H 
Sbjct:   223 ITG--FVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIY--HEPNCSSDTVTHG 278

Query:   175 VLLVGYG----KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             VL+VGYG    + D   YWL++NSWG      G+ K+ +  NN CGI   A Y TI
Sbjct:   279 VLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334

 Score = 317 (116.6 bits), Expect = 3.2e-27, P = 3.2e-27
 Identities = 81/226 (35%), Positives = 110/226 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWRKK    P   Q  C +CWAF++ G +E Q   +TGKL   S   LV+C+K  
Sbjct:   115 LPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKP- 173

Query:   436 SGCGGCDGLE--QPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKL-FTGKDFLY 491
              G  GC G +     +Y  H  GLESE  YPY   +G    C Y+    K   TG  F+ 
Sbjct:   174 QGNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKDGP---CRYNPKNSKAEITG--FVS 228

Query:   492 FNGSET-MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 547
                SE  +   +   GP++ G+++    F N      ++  CS   + H VL+VGYG   
Sbjct:   229 LPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKG 288

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              + D   YWL +NSWG      G+ K+ +  NN CGI   A Y TI
Sbjct:   289 IETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334

 Score = 309 (113.8 bits), Expect = 2.3e-26, P = 2.3e-26
 Identities = 79/227 (34%), Positives = 113/227 (49%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   DWRKK    P   Q  C +CWAF++ G +E Q   +TGKL   S   LV+C+K  
Sbjct:   115 LPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQ 174

Query:   802 --SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHF 857
               +GC G     + +Y  H  GLESE  YPY+  +G    C Y+    K   TG  F+  
Sbjct:   175 GNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKDGP---CRYNPKNSKAEITG--FVSL 229

Query:   858 NGSET-MKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 912
               SE  +   +   GP++  +++  +   +Y G     ++  CS   + H VL+VGYG  
Sbjct:   230 PQSEDILMAAVATIGPITAGIDASHESFKNYKGGIY--HEPNCSSDTVTHGVLVVGYGFK 287

Query:   913 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               + D   YWL++NSWG      G+ K+ +  NN CGI   A Y TI
Sbjct:   288 GIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHYPTI 334

 Score = 129 (50.5 bits), Expect = 6.2e-05, P = 6.2e-05
 Identities = 28/71 (39%), Positives = 39/71 (54%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             + ++  CS   + H VL+VGYG    + D   YWL++NSWG      G+ K+ +  NN C
Sbjct:   264 IYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHC 323

Query:  1014 GIEQIAGYATI 1024
             GI   A Y TI
Sbjct:   324 GIASYAHYPTI 334


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 322 (118.4 bits), Expect = 9.4e-28, P = 9.4e-28
 Identities = 107/378 (28%), Positives = 171/378 (45%)

Query:   603 KAIMLIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFI 662
             K+ MLI  V ++  +ASC         I   VV+  D   +     FD E  L  F++++
Sbjct:     5 KSAMLILLVAMV--IASC------ATAIDMSVVSYDDNNRLHS--VFDAEASL-IFESWM 53

Query:   663 VKRGRQYANDEEIKERFEYFKQD----GHKKHE----RYGTSEFSDRSPEEI--LCKTGF 712
             VK G+ Y +  E + R   F+ +     ++  E    R G + F+D S  E   +C    
Sbjct:    54 VKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGAD 113

Query:   713 KWSERTYERIVA-DRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSI 771
                 R +  + + DR            D  +P + DWR +       DQ  C SCWAFS 
Sbjct:   114 PRPPRNHVFMTSSDRYKTSA-------DDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 166

Query:   772 AGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYK 830
              G +EG   I TG+LV  S+  L+ C K+ +GC G   E + E+  +  GL ++ DYPYK
Sbjct:   167 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226

Query:   831 NANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPI 890
               NG       + +K  +  G + L  N    + K +  + P++ +++S    ++     
Sbjct:   227 AVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAV-AHQPVTAVIDSSS-REFQLYES 284

Query:   891 RKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG----NNACG 946
                D +C   +L H V++VGYG ++   YWLV+NS G    + G+ K+ R        CG
Sbjct:   285 GVFDGSCGT-NLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCG 343

Query:   947 IEQIAGYATIDVVKNDET 964
             I   A Y   +    D++
Sbjct:   344 IAMRASYPLKNSFSTDKS 361

 Score = 310 (114.2 bits), Expect = 1.8e-26, P = 1.8e-26
 Identities = 108/369 (29%), Positives = 164/369 (44%)

Query:   237 KAIMLIQAVFLLCGVASCLCLPSLTDRITDQVVARVDTLAIEGSLTFDNENILETFKAFI 296
             K+ MLI  V ++  +ASC         I   VV+  D   +     FD E  L  F++++
Sbjct:     5 KSAMLILLVAMV--IASC------ATAIDMSVVSYDDNNRLHS--VFDAEASL-IFESWM 53

Query:   297 VKRGRQYANDEEIKERFEYFKQD----GHKKHE----RYGTSEFSDRSPEEI--LCKTGF 346
             VK G+ Y +  E + R   F+ +     ++  E    R G + F+D S  E   +C    
Sbjct:    54 VKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGAD 113

Query:   347 KWSERTYERIVA-DRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSI 405
                 R +  + + DR            D  +P + DWR +       DQ  C SCWAFS 
Sbjct:   114 PRPPRNHVFMTSSDRYKTSA-------DDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 166

Query:   406 AGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPY 464
              G +EG   I TG+LV  S+  L+ C K+ +GCGG   LE   E+  +  GL ++ DYPY
Sbjct:   167 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGG-GKLETAYEFIMKNGGLGTDNDYPY 225

Query:   465 RNGNGEKFKCAYDKSKVKLFTGKDFLYFNG-SETMKKILYKYGPLSVGLNSHLIHFYNGT 523
             +  NG       + +K  +  G + L  N  S  MK + ++     +  +S     Y   
Sbjct:   226 KAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESG 285

Query:   524 PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERG----NNA 579
                  D +C   +L H V++VGYG ++   YWL +NS G    + G+ K+ R        
Sbjct:   286 VF---DGSCGT-NLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGL 341

Query:   580 CGIEQIAGY 588
             CGI   A Y
Sbjct:   342 CGIAMRASY 350

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 71/220 (32%), Positives = 109/220 (49%)

Query:     8 DGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             D  +P + DWR +       DQ  C SCWAFS  G +EG   I TG+LV  S+  L+ C 
Sbjct:   134 DDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCN 193

Query:    68 KQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF 126
             K+ +GC G   E + E+  +  GL ++ DYPYK  NG       + +K  +  G + L  
Sbjct:   194 KENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPA 253

Query:   127 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             N    + K +  + P++ +++S    ++        D +C   +L H V++VGYG ++  
Sbjct:   254 NDESALMKAV-AHQPVTAVIDSSS-REFQLYESGVFDGSCGT-NLNHGVVVVGYGTENGR 310

Query:   187 PYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
              YWLV+NS G    + G+ K+ R        CGI   A Y
Sbjct:   311 DYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASY 350


>TAIR|locus:2055440
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 321 (118.1 bits), Expect = 1.2e-27, P = 1.2e-27
 Identities = 94/333 (28%), Positives = 153/333 (45%)

Query:   642 AIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----------GHKKHE 691
             A   ++ F  +++++  + ++ +  R+Y ++ E   R + FK++          G+K + 
Sbjct:    23 ATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSY- 81

Query:   692 RYGTSEFSDRSPEEILC-KTGFKW-SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWR 749
             + G +EF+D + EE L   TG K  +E +  ++VA                 V ++ DWR
Sbjct:    82 KLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM----VVESKDWR 137

Query:   750 KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCF 808
              +    P   Q  CG CWAFS    +EG   I  G LV  S+ QL++C ++   GCDG  
Sbjct:   138 AEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGI 197

Query:   809 FEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKIL 867
                +  Y  Q  G+ SE DY Y+ ++G    C  +       +G   +  N    + + +
Sbjct:   198 MSDAFNYVVQNRGIASENDYSYQGSDGG---CRSNARPAARISGFQTVPSNNERALLEAV 254

Query:   868 YKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRN 924
              +  P+SV +++  D    Y+G      D  C      HAV  VGYG  QD   YWL +N
Sbjct:   255 SRQ-PVSVSMDATGDGFMHYSGGVY---DGPCGTSS-NHAVTFVGYGTSQDGTKYWLAKN 309

Query:   925 SWGPIGPDEGFFKIERG----NNACGIEQIAGY 953
             SWG    ++G+ +I R        CG+ Q A Y
Sbjct:   310 SWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFY 342

 Score = 313 (115.2 bits), Expect = 8.7e-27, P = 8.7e-27
 Identities = 94/336 (27%), Positives = 153/336 (45%)

Query:   276 AIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----------GHKKHE 325
             A   ++ F  +++++  + ++ +  R+Y ++ E   R + FK++          G+K + 
Sbjct:    23 ATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSY- 81

Query:   326 RYGTSEFSDRSPEEILC-KTGFKW-SERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWR 383
             + G +EF+D + EE L   TG K  +E +  ++VA                 V ++ DWR
Sbjct:    82 KLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM----VVESKDWR 137

Query:   384 KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG 443
              +    P   Q  CG CWAFS    +EG   I  G LV  S+ QL++C ++     GCDG
Sbjct:   138 AEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDR--GCDG 195

Query:   444 --LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKK 500
               +     Y  Q  G+ SE DY Y+  +G    C  +       +G   +  N    + +
Sbjct:   196 GIMSDAFNYVVQNRGIASENDYSYQGSDGG---CRSNARPAARISGFQTVPSNNERALLE 252

Query:   501 ILYKYGPLSVGLNSH---LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDDIPYWL 556
              + +  P+SV +++     +H+  G      D  C      HAV  VGYG  QD   YWL
Sbjct:   253 AVSRQ-PVSVSMDATGDGFMHYSGGV----YDGPCGTSS-NHAVTFVGYGTSQDGTKYWL 306

Query:   557 ARNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 588
             A+NSWG    ++G+ +I R        CG+ Q A Y
Sbjct:   307 AKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFY 342

 Score = 283 (104.7 bits), Expect = 1.4e-23, P = 1.4e-23
 Identities = 71/221 (32%), Positives = 103/221 (46%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             V ++ DWR +    P   Q  CG CWAFS    +EG   I  G LV  S+ QL++C ++ 
Sbjct:   130 VVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREY 189

Query:    71 S-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
               GCDG     +  Y  Q  G+ SE DY Y+ ++G    C  +       +G   +  N 
Sbjct:   190 DRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGG---CRSNARPAARISGFQTVPSNN 246

Query:   129 SETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDN 185
                + + + +  P+SV +++  D    Y+G      D  C      HAV  VGYG  QD 
Sbjct:   247 ERALLEAVSRQ-PVSVSMDATGDGFMHYSGGVY---DGPCGTSS-NHAVTFVGYGTSQDG 301

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
               YWL +NSWG    ++G+ +I R        CG+ Q A Y
Sbjct:   302 TKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFY 342


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 320 (117.7 bits), Expect = 1.5e-27, P = 1.5e-27
 Identities = 84/228 (36%), Positives = 111/228 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P+  DWRK+    P  +Q  CGSCWAF+  G +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKS- 172

Query:   436 SGCGGCD-GL-EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKL-FTGKDFLY 491
              G  GC  G   Q   Y     GLE+E  YPY   +G    C Y         TG  F+ 
Sbjct:   173 EGNNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGP---CRYHSENASANITG--FVN 227

Query:   492 FNGSETMKKI-LYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG- 547
                +E    + +   GP+S  ++ SH    FY+G     ++  CS Y + HAVL+VGYG 
Sbjct:   228 LPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVY--HEPNCSSYVVNHAVLVVGYGF 285

Query:   548 ---KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                + D   YWL +NSWG      GF KI +  NN CGI   A +  I
Sbjct:   286 EGNETDGNNYWLIKNSWGEEWGINGFMKIAKDRNNHCGIASQASFPDI 333

 Score = 319 (117.4 bits), Expect = 2.0e-27, P = 2.0e-27
 Identities = 80/228 (35%), Positives = 114/228 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P+  DWRK+    P  +Q  CGSCWAF+  G +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKS- 172

Query:    71 SGCDGCFF---EPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLH 125
              G +GC +     +  Y     GLE+E  YPY+  +G    C Y         TG  F++
Sbjct:   173 EGNNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGP---CRYHSENASANITG--FVN 227

Query:   126 FNGSETMKKI-LYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 181
                +E    + +   GP+S  +++  D    Y+G     ++  CS Y + HAVL+VGYG 
Sbjct:   228 LPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVY--HEPNCSSYVVNHAVLVVGYGF 285

Query:   182 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                + D   YWL++NSWG      GF KI +  NN CGI   A +  I
Sbjct:   286 EGNETDGNNYWLIKNSWGEEWGINGFMKIAKDRNNHCGIASQASFPDI 333

 Score = 319 (117.4 bits), Expect = 2.0e-27, P = 2.0e-27
 Identities = 80/228 (35%), Positives = 114/228 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P+  DWRK+    P  +Q  CGSCWAF+  G +EGQ   KTG L   S   L++C+K  
Sbjct:   114 LPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKS- 172

Query:   802 SGCDGCFF---EPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLH 856
              G +GC +     +  Y     GLE+E  YPY+  +G    C Y         TG  F++
Sbjct:   173 EGNNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGP---CRYHSENASANITG--FVN 227

Query:   857 FNGSETMKKI-LYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 912
                +E    + +   GP+S  +++  D    Y+G     ++  CS Y + HAVL+VGYG 
Sbjct:   228 LPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVY--HEPNCSSYVVNHAVLVVGYGF 285

Query:   913 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                + D   YWL++NSWG      GF KI +  NN CGI   A +  I
Sbjct:   286 EGNETDGNNYWLIKNSWGEEWGINGFMKIAKDRNNHCGIASQASFPDI 333

 Score = 139 (54.0 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 31/71 (43%), Positives = 40/71 (56%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             V ++  CS Y + HAVL+VGYG    + D   YWL++NSWG      GF KI +  NN C
Sbjct:   263 VYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGINGFMKIAKDRNNHC 322

Query:  1014 GIEQIAGYATI 1024
             GI   A +  I
Sbjct:   323 GIASQASFPDI 333


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 338 (124.0 bits), Expect = 1.6e-27, P = 1.6e-27
 Identities = 104/313 (33%), Positives = 147/313 (46%)

Query:   298 KRGRQYANDEEIKERFEYFKQDGHKKHE--RYGTS------EFSDRSPEEILCKTGFKWS 349
             K  RQY N+ E +ER   F  +    H   R G S        +DRS +E+    G    
Sbjct:   249 KFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLSFSLSVNHLADRSQKELSMMRG---C 305

Query:   350 ERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGML 409
             +RT++     R                P++ DWR      P  DQA CGSCW+F+  G L
Sbjct:   306 QRTHK---VHRKAQPFPSEIRSI--ATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTL 360

Query:   410 EGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQ--PIEYTHQ-AGLESEKDY-PYR 465
             EG   +KTG+L   S+  LV+C     G  GCDG E+    E+  +  G+ + + Y  Y 
Sbjct:   361 EGALFLKTGQLTSLSQQMLVDCTWGF-GNNGCDGGEEWRAFEWIMKHGGISTAESYGAYM 419

Query:   466 NGNGEKFKCAYDKSK-VKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS-HL-IHFY-N 521
               NG    C YDKS  V   TG   +       +K  ++K+GP++V +++ H    FY N
Sbjct:   420 GMNG---LCHYDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSN 476

Query:   522 GT---PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNN 578
             G    P  KN       DL HAVL VGYG  ++  YWL +NSW     ++G+  +   +N
Sbjct:   477 GVYYEPECKNGIN----DLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILMSMKDN 532

Query:   579 ACGIEQIAGYATI 591
              CG+   A YAT+
Sbjct:   533 NCGVATDAIYATL 545

 Score = 329 (120.9 bits), Expect = 1.5e-26, Sum P(2) = 1.5e-26
 Identities = 101/312 (32%), Positives = 144/312 (46%)

Query:   664 KRGRQYANDEEIKERFEYFKQDGHKKHE--RYGTS------EFSDRSPEEILCKTGFKWS 715
             K  RQY N+ E +ER   F  +    H   R G S        +DRS +E+    G    
Sbjct:   249 KFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLSFSLSVNHLADRSQKELSMMRG---C 305

Query:   716 ERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGML 775
             +RT++     R                P++ DWR      P  DQA CGSCW+F+  G L
Sbjct:   306 QRTHK---VHRKAQPFPSEIRSI--ATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTL 360

Query:   776 EGQYAIKTGKLVEFSKSQLVECAKQC--SGCDGCFFEPSIEYTHQ-AGLESEKDY-PYKN 831
             EG   +KTG+L   S+  LV+C      +GCDG     + E+  +  G+ + + Y  Y  
Sbjct:   361 EGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMG 420

Query:   832 ANGEKFKCAYDKSK-VKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS---DLIHDYNG 887
              NG    C YDKS  V   TG   +       +K  ++K+GP++V +++         NG
Sbjct:   421 MNG---LCHYDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNG 477

Query:   888 T---PIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA 944
                 P  KN       DL HAVL VGYG  +N  YWLV+NSW     ++G+  +   +N 
Sbjct:   478 VYYEPECKNGIN----DLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILMSMKDNN 533

Query:   945 CGIEQIAGYATI 956
             CG+   A YAT+
Sbjct:   534 CGVATDAIYATL 545

 Score = 315 (115.9 bits), Expect = 6.3e-25, P = 6.3e-25
 Identities = 80/225 (35%), Positives = 114/225 (50%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 70
             P++ DWR      P  DQA CGSCW+F+  G LEG   +KTG+L   S+  LV+C     
Sbjct:   328 PNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFG 387

Query:    71 -SGCDGCFFEPSIEYTHQ-AGLESEKDY-PYKNANGEKFKCAYDKSK-VKLFTGKDFLHF 126
              +GCDG     + E+  +  G+ + + Y  Y   NG    C YDKS  V   TG   +  
Sbjct:   388 NNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGMNG---LCHYDKSSMVAQLTGYTNVTS 444

Query:   127 NGSETMKKILYKYGPLSVLLNS---DLIHDYNGT---PIRKNDETCSPYDLGHAVLLVGY 180
                  +K  ++K+GP++V +++         NG    P  KN       DL HAVL VGY
Sbjct:   445 GDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGIN----DLDHAVLAVGY 500

Query:   181 GKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
             G  +N  YWLV+NSW     ++G+  +   +N CG+   A YAT+
Sbjct:   501 GIMNNESYWLVKNSWSSYWGNDGYILMSMKDNNCGVATDAIYATL 545

 Score = 133 (51.9 bits), Expect = 8.4e-05, Sum P(2) = 8.4e-05
 Identities = 25/56 (44%), Positives = 35/56 (62%)

Query:   969 DLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             DL HAVL VGYG  ++  YWLV+NSW     ++G+  +   +N CG+   A YAT+
Sbjct:   490 DLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILMSMKDNNCGVATDAIYATL 545

 Score = 46 (21.3 bits), Expect = 1.5e-26, Sum P(2) = 1.5e-26
 Identities = 20/78 (25%), Positives = 35/78 (44%)

Query:   452 HQAGLESEKDYPYRNGNGEKFKCAYD----KSKVKLFTGKDFLYFNGSETMKKILYKYGP 507
             H  GL S    PY     E F+  YD    +S++  + G+   +F G++     +YK  P
Sbjct:    32 HVKGLLS---LPYAEIK-EPFEAWYDLTGKRSRIDYYHGQVCTFFVGNDLDYGAVYKITP 87

Query:   508 LSVGLNSHLIHFY--NGT 523
             ++     + +  +  NGT
Sbjct:    88 VTTETEFNTMKCFQLNGT 105


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 319 (117.4 bits), Expect = 2.0e-27, P = 2.0e-27
 Identities = 80/228 (35%), Positives = 117/228 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 68
             VP   DWR+     P  +Q  C S WAFS  G LEGQ   KT +L+  S+  L++C  + 
Sbjct:   114 VPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSN 173

Query:    69 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
                GC G F + + +Y     GL +E+ YPY+   G+  +C Y          +DF+   
Sbjct:   174 VTHGCSGGFMQYAFQYVKDNGGLATEESYPYR---GQGRECRYHAEN-SAANVRDFVQIP 229

Query:   128 GSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG---- 181
             GSE  + K + K GP+SV ++ S     + G+ I    + C    L HAVL+VGYG    
Sbjct:   230 GSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQ-CKRVHLNHAVLVVGYGFEGE 288

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATIDVV 228
             + D   +WLV+NSWG     +G+ K+ +  +N CGI   A Y+T  +V
Sbjct:   289 ESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGI---ATYSTYPIV 333

 Score = 318 (117.0 bits), Expect = 2.5e-27, P = 2.5e-27
 Identities = 80/228 (35%), Positives = 117/228 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 799
             VP   DWR+     P  +Q  C S WAFS  G LEGQ   KT +L+  S+  L++C  + 
Sbjct:   114 VPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSN 173

Query:   800 QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 858
                GC G F + + +Y     GL +E+ YPY+   G+  +C Y          +DF+   
Sbjct:   174 VTHGCSGGFMQYAFQYVKDNGGLATEESYPYR---GQGRECRYHAEN-SAANVRDFVQIP 229

Query:   859 GSE-TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG---- 912
             GSE  + K + K GP+SV ++ S     + G+ I    + C    L HAVL+VGYG    
Sbjct:   230 GSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQ-CKRVHLNHAVLVVGYGFEGE 288

Query:   913 KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATIDVV 959
             + D   +WLV+NSWG     +G+ K+ +  +N CGI   A Y+T  +V
Sbjct:   289 ESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGI---ATYSTYPIV 333

 Score = 312 (114.9 bits), Expect = 1.1e-26, P = 1.1e-26
 Identities = 82/230 (35%), Positives = 119/230 (51%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC--AK 433
             VP   DWR+     P  +Q  C S WAFS  G LEGQ   KT +L+  S+  L++C  + 
Sbjct:   114 VPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSN 173

Query:   434 QCSGCGGCDGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
                GC G   ++   +Y     GL +E+ YPYR G G + +   + S   +   +DF+  
Sbjct:   174 VTHGCSG-GFMQYAFQYVKDNGGLATEESYPYR-GQGRECRYHAENSAANV---RDFVQI 228

Query:   493 NGSE-TMKKILYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 547
              GSE  + K + K GP+SV ++ SH    FY G+ I    + C    L HAVL+VGYG  
Sbjct:   229 PGSEEALMKAVAKVGPISVAVDASHGSFQFY-GSGIYYEPQ-CKRVHLNHAVLVVGYGFE 286

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATIDVV 594
               + D   +WL +NSWG     +G+ K+ +  +N CGI   A Y+T  +V
Sbjct:   287 GEESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGI---ATYSTYPIV 333


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 318 (117.0 bits), Expect = 2.5e-27, P = 2.5e-27
 Identities = 82/227 (36%), Positives = 114/227 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   DWRKK       +Q  C SCWAF++ G +EGQ   KTG+L   S   LV+C K  
Sbjct:   115 LPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKS- 173

Query:   802 SGCDGCFF-EPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLH 856
              G +GC + +P I Y +     GLE+E  YPYK   G    C Y+    K   TG  F+ 
Sbjct:   174 QGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGV---CRYNPKHSKAEITG--FVS 228

Query:   857 FNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDE-TCSPYDLGHAVLLVGYG-- 912
                SE  + + +   GP+SV +++   + +        DE  CS   + H+VL+VGYG  
Sbjct:   229 LPESEDILMEAVATIGPISVAVDASF-NSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFE 287

Query:   913 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               + D   YWL++NSWG      G+ KI +  NN C I   A Y T+
Sbjct:   288 GNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIASYAHYPTV 334

 Score = 317 (116.6 bits), Expect = 3.2e-27, P = 3.2e-27
 Identities = 82/227 (36%), Positives = 114/227 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   DWRKK       +Q  C SCWAF++ G +EGQ   KTG+L   S   LV+C K  
Sbjct:   115 LPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKS- 173

Query:    71 SGCDGCFF-EPSIEYTH---QAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLH 125
              G +GC + +P I Y +     GLE+E  YPYK   G    C Y+    K   TG  F+ 
Sbjct:   174 QGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGV---CRYNPKHSKAEITG--FVS 228

Query:   126 FNGSET-MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDE-TCSPYDLGHAVLLVGYG-- 181
                SE  + + +   GP+SV +++   + +        DE  CS   + H+VL+VGYG  
Sbjct:   229 LPESEDILMEAVATIGPISVAVDASF-NSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFE 287

Query:   182 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               + D   YWL++NSWG      G+ KI +  NN C I   A Y T+
Sbjct:   288 GNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIASYAHYPTV 334

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 79/226 (34%), Positives = 111/226 (49%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWRKK       +Q  C SCWAF++ G +EGQ   KTG+L   S   LV+C K  
Sbjct:   115 LPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKS- 173

Query:   436 SGCGGCDGLEQPIEYTH---QAGLESEKDYPYRNGNGEKFKCAYDKSKVKL-FTGKDFLY 491
              G  GC   +  I Y +     GLE+E  YPY+   G++  C Y+    K   TG  F+ 
Sbjct:   174 QGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYK---GKEGVCRYNPKHSKAEITG--FVS 228

Query:   492 FNGSET-MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 547
                SE  + + +   GP+SV +++    F        ++  CS   + H+VL+VGYG   
Sbjct:   229 LPESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEG 288

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              + D   YWL +NSWG      G+ KI +  NN C I   A Y T+
Sbjct:   289 NETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAIASYAHYPTV 334

 Score = 125 (49.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 29/69 (42%), Positives = 38/69 (55%)

Query:   962 DE-TCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGI 1015
             DE  CS   + H+VL+VGYG    + D   YWL++NSWG      G+ KI +  NN C I
Sbjct:   266 DEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNNFCAI 325

Query:  1016 EQIAGYATI 1024
                A Y T+
Sbjct:   326 ASYAHYPTV 334


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 318 (117.0 bits), Expect = 2.5e-27, P = 2.5e-27
 Identities = 94/309 (30%), Positives = 154/309 (49%)

Query:   651 NEN-ILETFKAFIVKRGRQYANDEEIKERFEYFKQ----------DGHKKHERYGTSEFS 699
             NE  +L  ++ ++V+ G+ Y    E + RF+ FK           D ++ +ER G ++FS
Sbjct:    33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYER-GLNKFS 91

Query:   700 DRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGP- 756
             D + +E      G K  +++    VA+R            +G V PD  DWR++    P 
Sbjct:    92 DLTADEFQASYLGGKMEKKSLSD-VAERYQYK--------EGDVLPDEVDWRERGAVVPR 142

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIE 814
                Q  CGSCWAF+  G +EG   I TG+LV  S+ +L++C +     GC G     + E
Sbjct:   143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202

Query:   815 YTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFNGSETMKKILYKYGP 872
             +  +  G+ S++ Y Y   +    K    K+ +V    G + +  N   ++KK +  Y P
Sbjct:   203 FIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAV-AYQP 261

Query:   873 LSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRNSWGPIGP 931
             +SV++++  + DY  + + K    CS     H VL+VGYG   D   YWL+RNSWGP   
Sbjct:   262 ISVMISAANMSDYK-SGVYKG--ACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWG 318

Query:   932 DEGFFKIER 940
             + G+ +++R
Sbjct:   319 EGGYLRLQR 327

 Score = 305 (112.4 bits), Expect = 6.2e-26, P = 6.2e-26
 Identities = 94/310 (30%), Positives = 152/310 (49%)

Query:   285 NEN-ILETFKAFIVKRGRQYANDEEIKERFEYFKQ----------DGHKKHERYGTSEFS 333
             NE  +L  ++ ++V+ G+ Y    E + RF+ FK           D ++ +ER G ++FS
Sbjct:    33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYER-GLNKFS 91

Query:   334 DRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGP- 390
             D + +E      G K  +++    VA+R            +G V PD  DWR++    P 
Sbjct:    92 DLTADEFQASYLGGKMEKKSLSD-VAERYQYK--------EGDVLPDEVDWRERGAVVPR 142

Query:   391 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGLEQPI 448
                Q  CGSCWAF+  G +EG   I TG+LV  S+ +L++C +     GC G  G     
Sbjct:   143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAG-GGAVWAF 201

Query:   449 EYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKS-KVKLFTGKDFLYFNGSETMKKILYKYG 506
             E+  +  G+ S++ Y Y   +    K    K+ +V    G + +  N   ++KK +  Y 
Sbjct:   202 EFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAV-AYQ 260

Query:   507 PLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDDIPYWLARNSWGPIG 565
             P+SV +++  +  Y  + + K    CS     H VL+VGYG   D+  YWL RNSWGP  
Sbjct:   261 PISVMISAANMSDYK-SGVYKG--ACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEW 317

Query:   566 PDEGFFKIER 575
              + G+ +++R
Sbjct:   318 GEGGYLRLQR 327

 Score = 290 (107.1 bits), Expect = 2.5e-24, P = 2.5e-24
 Identities = 71/210 (33%), Positives = 112/210 (53%)

Query:     7 KDGPV-PDAWDWRKKNVTGP-AGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             K+G V PD  DWR++    P    Q +CGSCWAF+  G +EG   I TG+LV  S+ +L+
Sbjct:   122 KEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELI 181

Query:    65 ECAKQCS--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKS-KVKLFTG 120
             +C +     GC G     + E+  +  G+ S++ Y Y   +    K    K+ +V    G
Sbjct:   182 DCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTING 241

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
              + +  N   ++KK +  Y P+SV++++  + DY  + + K    CS     H VL+VGY
Sbjct:   242 HEVVPVNDEMSLKKAV-AYQPISVMISAANMSDYK-SGVYKG--ACSNLWGDHNVLIVGY 297

Query:   181 G-KQDNIPYWLVRNSWGPIGPDEGFFKIER 209
             G   D   YWL+RNSWGP   + G+ +++R
Sbjct:   298 GTSSDEGDYWLIRNSWGPEWGEGGYLRLQR 327


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 317 (116.6 bits), Expect = 3.2e-27, P = 3.2e-27
 Identities = 94/321 (29%), Positives = 148/321 (46%)

Query:   649 FDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----GHKKHE----RYGTSEFSD 700
             FD E  L  F++++VK G+ Y +  E + R   F+ +     ++  E    R G + F+D
Sbjct:    48 FDAEATL-MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFAD 106

Query:   701 RSPEEI--LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGPA 757
              S  E   +C        R +  + +              DG V P + DWR +      
Sbjct:   107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTS-------DGDVLPKSVDWRNEGAVTEV 159

Query:   758 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYT- 816
              DQ  C SCWAFS  G +EG   I TG+LV  S+  L+ C K+ +GC G   E + E+  
Sbjct:   160 KDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIM 219

Query:   817 HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVL 876
             +  GL ++ DYPYK  NG       + +K  +  G + L  N    + K +  + P++ +
Sbjct:   220 NNGGLGTDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAV-AHQPVTAV 278

Query:   877 LNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFF 936
             ++S    ++        D TC   +L H V++VGYG ++   YW+V+NS G    + G+ 
Sbjct:   279 VDSSS-REFQLYESGVFDGTCGT-NLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYM 336

Query:   937 KIERG----NNACGIEQIAGY 953
             K+ R        CGI   A Y
Sbjct:   337 KMARNIANPRGLCGIAMRASY 357

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 95/323 (29%), Positives = 144/323 (44%)

Query:   283 FDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD----GHKKHE----RYGTSEFSD 334
             FD E  L  F++++VK G+ Y +  E + R   F+ +     ++  E    R G + F+D
Sbjct:    48 FDAEATL-MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFAD 106

Query:   335 RSPEEI--LCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPV-PDAWDWRKKNVTGPA 391
              S  E   +C        R +  + +              DG V P + DWR +      
Sbjct:   107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTS-------DGDVLPKSVDWRNEGAVTEV 159

Query:   392 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT 451
              DQ  C SCWAFS  G +EG   I TG+LV  S+  L+ C K+ +GCGG   +E   E+ 
Sbjct:   160 KDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGG-GKVETAYEFI 218

Query:   452 -HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKILYKYGPLS 509
              +  GL ++ DYPY+  NG       + +K  +  G + L  N     MK + ++     
Sbjct:   219 MNNGGLGTDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAV 278

Query:   510 VGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEG 569
             V  +S     Y        D TC   +L H V++VGYG ++   YW+ +NS G    + G
Sbjct:   279 VDSSSREFQLYESGVF---DGTCGT-NLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAG 334

Query:   570 FFKIERG----NNACGIEQIAGY 588
             + K+ R        CGI   A Y
Sbjct:   335 YMKMARNIANPRGLCGIAMRASY 357

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 73/221 (33%), Positives = 110/221 (49%)

Query:     8 DGPV-PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 66
             DG V P + DWR +       DQ  C SCWAFS  G +EG   I TG+LV  S+  L+ C
Sbjct:   140 DGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINC 199

Query:    67 AKQCSGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 125
              K+ +GC G   E + E+  +  GL ++ DYPYK  NG       + +K  +  G + L 
Sbjct:   200 NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLP 259

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
              N    + K +  + P++ +++S    ++        D TC   +L H V++VGYG ++ 
Sbjct:   260 ANDEAALMKAV-AHQPVTAVVDSSS-REFQLYESGVFDGTCGT-NLNHGVVVVGYGTENG 316

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
               YW+V+NS G    + G+ K+ R        CGI   A Y
Sbjct:   317 RDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASY 357


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 316 (116.3 bits), Expect = 4.1e-27, P = 4.1e-27
 Identities = 99/320 (30%), Positives = 143/320 (44%)

Query:   297 VKRGRQYANDEEIKERF---EYFKQ-DGHKKHERYGTS-------EFSDRSPEEILCKT- 344
             +K  + Y+ +EE+ +R    E  K+ + H +    G +       +F+D + EE      
Sbjct:    34 IKYEKLYSPEEEVLKRVVWEENVKKIELHNRENSLGKNTYTMEINDFADMTDEEFKDMII 93

Query:   345 GFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFS 404
             GF+      E+ +  R               +P   DWR +        Q  C SCWAF 
Sbjct:    94 GFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGGCSSCWAFP 153

Query:   405 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGC--DGLEQPIEYT-HQAGLESEKD 461
             + G +EGQ   KTGKL+  S   L++C+K   G  GC         +Y  H  GLE+E  
Sbjct:   154 VTGAIEGQMFKKTGKLIPLSVQNLIDCSKP-QGNRGCLWGNTYNAFQYVLHNGGLEAEAT 212

Query:   462 YPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYFNGSET--MKKILYKYGPLSVGLN--SHL 516
             YPY    G    C Y+ K+     TG  F+    SE   M  +  K GP++ G++  S  
Sbjct:   213 YPYERKEGV---CRYNPKNSSAKITG--FVVLPESEDVLMDAVATK-GPIATGVHVISSS 266

Query:   517 IHFYNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLARNSWGPIGPDEGFFK 572
               FY       ++  CS Y + HAVL+VGYG    + D   YWL +NSWG      G+ K
Sbjct:   267 FRFYQKGVY--HEPKCSSY-VNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMK 323

Query:   573 IERG-NNACGIEQIAGYATI 591
             I +  NN C I  +A Y T+
Sbjct:   324 IAKDRNNHCAIASLAQYPTV 343

 Score = 309 (113.8 bits), Expect = 2.3e-26, P = 2.3e-26
 Identities = 95/319 (29%), Positives = 144/319 (45%)

Query:   663 VKRGRQYANDEEIKERF---EYFKQ-DGHKKHERYGTS-------EFSDRSPEEILCKT- 710
             +K  + Y+ +EE+ +R    E  K+ + H +    G +       +F+D + EE      
Sbjct:    34 IKYEKLYSPEEEVLKRVVWEENVKKIELHNRENSLGKNTYTMEINDFADMTDEEFKDMII 93

Query:   711 GFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFS 770
             GF+      E+ +  R               +P   DWR +        Q  C SCWAF 
Sbjct:    94 GFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGGCSSCWAFP 153

Query:   771 IAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPS---IEYT-HQAGLESEKD 826
             + G +EGQ   KTGKL+  S   L++C+K   G  GC +  +    +Y  H  GLE+E  
Sbjct:   154 VTGAIEGQMFKKTGKLIPLSVQNLIDCSKP-QGNRGCLWGNTYNAFQYVLHNGGLEAEAT 212

Query:   827 YPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV---LLNSDLI 882
             YPY+   G    C Y+ K+     TG   L  +    M  +  K GP++    +++S   
Sbjct:   213 YPYERKEGV---CRYNPKNSSAKITGFVVLPESEDVLMDAVATK-GPIATGVHVISSSFR 268

Query:   883 HDYNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDNIPYWLVRNSWGPIGPDEGFFKI 938
                 G     ++  CS Y + HAVL+VGYG    + D   YWL++NSWG      G+ KI
Sbjct:   269 FYQKGV---YHEPKCSSY-VNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMKI 324

Query:   939 ERG-NNACGIEQIAGYATI 956
              +  NN C I  +A Y T+
Sbjct:   325 AKDRNNHCAIASLAQYPTV 343

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 78/228 (34%), Positives = 111/228 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   DWR +        Q  C SCWAF + G +EGQ   KTGKL+  S   L++C+K  
Sbjct:   125 LPKFVDWRNEGYVTRVRKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKP- 183

Query:    71 SGCDGCFFEPS---IEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 125
              G  GC +  +    +Y  H  GLE+E  YPY+   G    C Y+ K+     TG   L 
Sbjct:   184 QGNRGCLWGNTYNAFQYVLHNGGLEAEATYPYERKEGV---CRYNPKNSSAKITGFVVLP 240

Query:   126 FNGSETMKKILYKYGPLSV---LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 181
              +    M  +  K GP++    +++S       G     ++  CS Y + HAVL+VGYG 
Sbjct:   241 ESEDVLMDAVATK-GPIATGVHVISSSFRFYQKGV---YHEPKCSSY-VNHAVLVVGYGF 295

Query:   182 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                + D   YWL++NSWG      G+ KI +  NN C I  +A Y T+
Sbjct:   296 EGNETDGNNYWLIKNSWGKRWGLRGYMKIAKDRNNHCAIASLAQYPTV 343

 Score = 134 (52.2 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 30/71 (42%), Positives = 41/71 (57%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             V ++  CS Y + HAVL+VGYG    + D   YWL++NSWG      G+ KI +  NN C
Sbjct:   274 VYHEPKCSSY-VNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMKIAKDRNNHC 332

Query:  1014 GIEQIAGYATI 1024
              I  +A Y T+
Sbjct:   333 AIASLAQYPTV 343


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 315 (115.9 bits), Expect = 5.3e-27, P = 5.3e-27
 Identities = 84/227 (37%), Positives = 113/227 (49%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWRKK    P   Q  C +CWAFS+ G +E Q   ++GKL+  S   LV+C+K   
Sbjct:   116 PKFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKP-Q 174

Query:   437 GCGGCDGLE--QPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYF 492
             G  GC G +     +Y  H  GL+SE  YPY   +G    C Y+ K+     TG  F+  
Sbjct:   175 GNNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGP---CRYNPKNSSAEITG--FVSL 229

Query:   493 NGSETMKKI-LYKYGPLSVGLN-SH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-K 548
               SE +  + +   GP+S G++ SH    FY       ++  CS   + H VL+VGYG K
Sbjct:   230 PESEDILMVAVATIGPISAGIDASHESFKFYKKGIY--HEPNCSSNSVTHGVLVVGYGFK 287

Query:   549 QDDIP---YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              +D     YWL +NSWG      G+ KI +  NN C I   A Y TI
Sbjct:   288 GNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYPTI 334

 Score = 303 (111.7 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 80/227 (35%), Positives = 115/227 (50%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 70
             P   DWRKK    P   Q +C +CWAFS+ G +E Q   ++GKL+  S   LV+C+K   
Sbjct:   116 PKFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQG 175

Query:    71 -SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 127
              +GC G     + +Y  H  GL+SE  YPY+  +G    C Y+ K+     TG  F+   
Sbjct:   176 NNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGP---CRYNPKNSSAEITG--FVSLP 230

Query:   128 GSETMKKI-LYKYGPLSVLLNSDLIHDYNGTPIRK---NDETCSPYDLGHAVLLVGYGKQ 183
              SE +  + +   GP+S  +  D  H+ +    +K   ++  CS   + H VL+VGYG +
Sbjct:   231 ESEDILMVAVATIGPISAGI--DASHE-SFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFK 287

Query:   184 DNIP----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              N      YWL++NSWG      G+ KI +  NN C I   A Y TI
Sbjct:   288 GNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYPTI 334

 Score = 300 (110.7 bits), Expect = 2.1e-25, P = 2.1e-25
 Identities = 80/227 (35%), Positives = 114/227 (50%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 801
             P   DWRKK    P   Q  C +CWAFS+ G +E Q   ++GKL+  S   LV+C+K   
Sbjct:   116 PKFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQG 175

Query:   802 -SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFN 858
              +GC G     + +Y  H  GL+SE  YPY+  +G    C Y+ K+     TG  F+   
Sbjct:   176 NNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGP---CRYNPKNSSAEITG--FVSLP 230

Query:   859 GSETMKKI-LYKYGPLSVLLNSDLIHDYNGTPIRK---NDETCSPYDLGHAVLLVGYGKQ 914
              SE +  + +   GP+S  +  D  H+ +    +K   ++  CS   + H VL+VGYG +
Sbjct:   231 ESEDILMVAVATIGPISAGI--DASHE-SFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFK 287

Query:   915 DNIP----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              N      YWL++NSWG      G+ KI +  NN C I   A Y TI
Sbjct:   288 GNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYPTI 334

 Score = 127 (49.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 29/71 (40%), Positives = 39/71 (54%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG-KQDDIP---YWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             + ++  CS   + H VL+VGYG K +D     YWL++NSWG      G+ KI +  NN C
Sbjct:   264 IYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNHC 323

Query:  1014 GIEQIAGYATI 1024
              I   A Y TI
Sbjct:   324 AIASYAHYPTI 334


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 314 (115.6 bits), Expect = 6.8e-27, P = 6.8e-27
 Identities = 73/234 (31%), Positives = 122/234 (52%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQL 63
             +  D    +  DWR+K + GP  DQ  C +  AF+I   +E  YA  T G L+ FS+ QL
Sbjct:    76 IHMDRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQL 135

Query:    64 VECAKQ-CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKD 122
             ++C  Q   GC+  F   +I Y    G+E+E DYPY +   EK  C +D +K K+   K 
Sbjct:   136 IDCNDQGYKGCEEQFAMNAIGYLATHGIETEADYPYVDKTNEK--CTFDSTKSKIHLKKG 193

Query:   123 FLHFNGSETMKKI-LYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC-SPYDLGHAVLLVG 179
              +   G+E + K+ +  YGP    + +   ++DY       + E C S +++  ++++VG
Sbjct:   194 VVA-EGNEVLGKVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVG 251

Query:   180 YGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVIQRLV 233
             YG +    YW+V+ S+G    ++G+ K+ R  NAC +       T ++ ++ LV
Sbjct:   252 YGIEGEQKYWIVKGSFGTSWGEQGYMKLARDVNACAMATTIAVLT-EIFLRVLV 304

 Score = 310 (114.2 bits), Expect = 1.8e-26, P = 1.8e-26
 Identities = 69/206 (33%), Positives = 112/206 (54%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQ-CSGC 804
             DWR+K + GP  DQ  C +  AF+I   +E  YA  T G L+ FS+ QL++C  Q   GC
Sbjct:    87 DWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGYKGC 146

Query:   805 DGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMK 864
             +  F   +I Y    G+E+E DYPY +   EK  C +D +K K+   K  +   G+E + 
Sbjct:   147 EEQFAMNAIGYLATHGIETEADYPYVDKTNEK--CTFDSTKSKIHLKKGVVA-EGNEVLG 203

Query:   865 KI-LYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC-SPYDLGHAVLLVGYGKQDNIPYWL 921
             K+ +  YGP    + +   ++DY       + E C S +++  ++++VGYG +    YW+
Sbjct:   204 KVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVGYGIEGEQKYWI 262

Query:   922 VRNSWGPIGPDEGFFKIERGNNACGI 947
             V+ S+G    ++G+ K+ R  NAC +
Sbjct:   263 VKGSFGTSWGEQGYMKLARDVNACAM 288

 Score = 287 (106.1 bits), Expect = 5.2e-24, P = 5.2e-24
 Identities = 70/225 (31%), Positives = 116/225 (51%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQCSGCG 439
             DWR+K + GP  DQ  C +  AF+I   +E  YA  T G L+ FS+ QL++C  Q  G  
Sbjct:    87 DWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQ--GYK 144

Query:   440 GCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET 497
             GC+       I Y    G+E+E DYPY +   EK  C +D +K K+   K  +   G+E 
Sbjct:   145 GCEEQFAMNAIGYLATHGIETEADYPYVDKTNEK--CTFDSTKSKIHLKKGVVA-EGNEV 201

Query:   498 MKKI-LYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPY 554
             + K+ +  YGP    + +   L  +  G      +E  S +++  ++++VGYG + +  Y
Sbjct:   202 LGKVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVGYGIEGEQKY 260

Query:   555 WLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVIQRLV 599
             W+ + S+G    ++G+ K+ R  NAC +       T ++ ++ LV
Sbjct:   261 WIVKGSFGTSWGEQGYMKLARDVNACAMATTIAVLT-EIFLRVLV 304


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 313 (115.2 bits), Expect = 8.7e-27, P = 8.7e-27
 Identities = 96/324 (29%), Positives = 151/324 (46%)

Query:   284 DNENILET-FKAFIVKRGRQYANDEEIKERFEYFKQDG---HKKHER---YGTS--EFSD 334
             D E++    F  +  + G++Y+++EE + R   F  +    H K+     Y  +    +D
Sbjct:    17 DTEHVHHRLFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLAD 76

Query:   335 RSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ 394
             R+P+E+    G + S         D                +P++ DWR      P  DQ
Sbjct:    77 RTPQEMAALRGRRRS--------GDPKSGQPFSMQLYASLVLPESLDWRLYGAVTPVKDQ 128

Query:   395 AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT--- 451
             A CGSCW+F+  G +EG   +KTG L   S+  L++C+    G   CDG E+   Y    
Sbjct:   129 AVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGF-GNYACDGGEEWRAYEWIK 187

Query:   452 HQAGLESEKDY-PYRNGNGEKFKCAYDKSK-VKLFTGKDFLYFNGSETMKKILYKYGPLS 509
                G+ S + Y PY   NG    C Y++S+ V    G   +    +E +K  L+K+GP++
Sbjct:   188 KHGGIASTESYGPYLGQNGY---CHYNQSELVAPLAGYVTVESGNAEALKAALFKHGPVA 244

Query:   510 VGLN-SHL-IHFY-NGTPIRKN--DETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPI 564
             V ++ SH    FY NG     +  +ET    +L HAVL VGYG      YWL +NSW   
Sbjct:   245 VNIDASHKSFTFYANGVYEEPHCGNETS---ELDHAVLAVGYGVLHGKSYWLIKNSWSTY 301

Query:   565 GPDEGFFKIERGNNACGIEQIAGY 588
               ++G+  +   +N CG+   A +
Sbjct:   302 WGNDGYILMAMKDNNCGVATAASF 325

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 94/325 (28%), Positives = 150/325 (46%)

Query:   650 DNENILET-FKAFIVKRGRQYANDEEIKERFEYFKQDG---HKKHER---YGTS--EFSD 700
             D E++    F  +  + G++Y+++EE + R   F  +    H K+     Y  +    +D
Sbjct:    17 DTEHVHHRLFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLAD 76

Query:   701 RSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ 760
             R+P+E+    G + S         D                +P++ DWR      P  DQ
Sbjct:    77 RTPQEMAALRGRRRS--------GDPKSGQPFSMQLYASLVLPESLDWRLYGAVTPVKDQ 128

Query:   761 AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSG--CDGCFFEPSIEYTHQ 818
             A CGSCW+F+  G +EG   +KTG L   S+  L++C+       CDG     + E+  +
Sbjct:   129 AVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKK 188

Query:   819 -AGLESEKDY-PYKNANGEKFKCAYDKSK-VKLFTGKDFLHFNGSETMKKILYKYGPLSV 875
               G+ S + Y PY   NG    C Y++S+ V    G   +    +E +K  L+K+GP++V
Sbjct:   189 HGGIASTESYGPYLGQNGY---CHYNQSELVAPLAGYVTVESGNAEALKAALFKHGPVAV 245

Query:   876 LLNSDLIHD----Y-NGTPIRKN--DETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGP 928
               N D  H     Y NG     +  +ET    +L HAVL VGYG      YWL++NSW  
Sbjct:   246 --NIDASHKSFTFYANGVYEEPHCGNETS---ELDHAVLAVGYGVLHGKSYWLIKNSWST 300

Query:   929 IGPDEGFFKIERGNNACGIEQIAGY 953
                ++G+  +   +N CG+   A +
Sbjct:   301 YWGNDGYILMAMKDNNCGVATAASF 325

 Score = 292 (107.8 bits), Expect = 1.5e-24, P = 1.5e-24
 Identities = 75/224 (33%), Positives = 113/224 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P++ DWR      P  DQA CGSCW+F+  G +EG   +KTG L   S+  L++C+   
Sbjct:   110 LPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGF 169

Query:    71 SG--CDGCFFEPSIEYTHQ-AGLESEKDY-PYKNANGEKFKCAYDKSK-VKLFTGKDFLH 125
                 CDG     + E+  +  G+ S + Y PY   NG    C Y++S+ V    G   + 
Sbjct:   170 GNYACDGGEEWRAYEWIKKHGGIASTESYGPYLGQNGY---CHYNQSELVAPLAGYVTVE 226

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHD----Y-NGTPIRKN--DETCSPYDLGHAVLLV 178
                +E +K  L+K+GP++V  N D  H     Y NG     +  +ET    +L HAVL V
Sbjct:   227 SGNAEALKAALFKHGPVAV--NIDASHKSFTFYANGVYEEPHCGNETS---ELDHAVLAV 281

Query:   179 GYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             GYG      YWL++NSW     ++G+  +   +N CG+   A +
Sbjct:   282 GYGVLHGKSYWLIKNSWSTYWGNDGYILMAMKDNNCGVATAASF 325


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 313 (115.2 bits), Expect = 8.7e-27, P = 8.7e-27
 Identities = 95/327 (29%), Positives = 146/327 (44%)

Query:   647 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHER-Y--GTSE 697
             +TF    + E  + ++ +  R Y+++ E + RF+ FK++       +KK +R Y  G +E
Sbjct:    36 VTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNE 95

Query:   698 FSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGP 756
             F+D + EE +   TG K           D              G   +  DWR +    P
Sbjct:    96 FADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGR--ETKDWRYEGAVTP 153

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC-SGCDGCFFEPSIEY 815
                Q  CG CWAFS    +EG   I    LV  S+ QL++C ++  +GC+G     +  Y
Sbjct:   154 VKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSY 213

Query:   816 T-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLS 874
                  G+ SE  YPY+ A G    C Y+        G   +  N    + + + K  P+S
Sbjct:   214 IIKNRGIASEASYPYQAAEGT---CRYNGKPSAWIRGFQTVPSNNERALLEAVSKQ-PVS 269

Query:   875 VLLNSD---LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DNIPYWLVRNSWGPIG 930
             V +++D    +H Y+G      DE     ++ HAV  VGYG   + I YWL +NSWG   
Sbjct:   270 VSIDADGPGFMH-YSGGVY---DEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETW 325

Query:   931 PDEGFFKIERG----NNACGIEQIAGY 953
              + G+ +I R        CG+ Q A Y
Sbjct:   326 GENGYIRIRRDVAWPQGMCGVAQYAFY 352

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 93/329 (28%), Positives = 143/329 (43%)

Query:   281 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD------GHKKHER-Y--GTSE 331
             +TF    + E  + ++ +  R Y+++ E + RF+ FK++       +KK +R Y  G +E
Sbjct:    36 VTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNE 95

Query:   332 FSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGP 390
             F+D + EE +   TG K           D              G   +  DWR +    P
Sbjct:    96 FADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGR--ETKDWRYEGAVTP 153

Query:   391 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPI 448
                Q  CG CWAFS    +EG   I    LV  S+ QL++C ++     GC+G  +    
Sbjct:   154 VKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN--GCNGGIMSDAF 211

Query:   449 EYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGP 507
              Y     G+ SE  YPY+   G    C Y+        G   +  N    + + + K  P
Sbjct:   212 SYIIKNRGIASEASYPYQAAEGT---CRYNGKPSAWIRGFQTVPSNNERALLEAVSKQ-P 267

Query:   508 LSVGLNSH---LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSWGP 563
             +SV +++     +H+  G      DE     ++ HAV  VGYG   + I YWLA+NSWG 
Sbjct:   268 VSVSIDADGPGFMHYSGGV----YDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGE 323

Query:   564 IGPDEGFFKIERG----NNACGIEQIAGY 588
                + G+ +I R        CG+ Q A Y
Sbjct:   324 TWGENGYIRIRRDVAWPQGMCGVAQYAFY 352

 Score = 277 (102.6 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 70/217 (32%), Positives = 101/217 (46%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC-SGCD 74
             DWR +    P   Q  CG CWAFS    +EG   I    LV  S+ QL++C ++  +GC+
Sbjct:   144 DWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCN 203

Query:    75 GCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMK 133
             G     +  Y     G+ SE  YPY+ A G    C Y+        G   +  N    + 
Sbjct:   204 GGIMSDAFSYIIKNRGIASEASYPYQAAEGT---CRYNGKPSAWIRGFQTVPSNNERALL 260

Query:   134 KILYKYGPLSVLLNSD---LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DNIPYW 189
             + + K  P+SV +++D    +H Y+G      DE     ++ HAV  VGYG   + I YW
Sbjct:   261 EAVSKQ-PVSVSIDADGPGFMH-YSGGVY---DEPYCGTNVNHAVTFVGYGTSPEGIKYW 315

Query:   190 LVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             L +NSWG    + G+ +I R        CG+ Q A Y
Sbjct:   316 LAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFY 352


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 90/307 (29%), Positives = 139/307 (45%)

Query:   673 EEIKERFEYFKQ------DGHKKHERYGT--SEFSDRSPEEILCKTGFKWSERTYERIVA 724
             EE  +RF  FK       + +KK + Y    ++F D + EE   +  +  S   + R+  
Sbjct:    52 EEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEF--RRTYAGSNIKHHRMFQ 109

Query:   725 DRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG 784
                           +  +P + DWRK     P  +Q  CGSCWAFS    +EG   I+T 
Sbjct:   110 GEKKATKSFMYANVN-TLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168

Query:   785 KLVEFSKSQLVEC-AKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD 842
             KL   S+ +LV+C   Q  GC+G   + + E+  +  GL SE  YPYK A+ E      +
Sbjct:   169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYK-ASDETCDTNKE 227

Query:   843 KSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDL 902
              + V    G + +  N  + + K +    P+SV +++    D+           C   +L
Sbjct:   228 NAPVVSIDGHEDVPKNSEDDLMKAVANQ-PVSVAIDAGG-SDFQFYSEGVFTGRCGT-EL 284

Query:   903 GHAVLLVGYGKQ-DNIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGYATID 957
              H V +VGYG   D   YW+V+NSWG    ++G+ +++RG       CGI   A Y    
Sbjct:   285 NHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYP--- 341

Query:   958 VVKNDET 964
              +KN  T
Sbjct:   342 -LKNSNT 347

 Score = 291 (107.5 bits), Expect = 1.1e-26, Sum P(2) = 1.1e-26
 Identities = 72/219 (32%), Positives = 108/219 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQ 69
             +P + DWRK     P  +Q  CGSCWAFS    +EG   I+T KL   S+ +LV+C   Q
Sbjct:   126 LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQ 185

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
               GC+G   + + E+  +  GL SE  YPYK A+ E      + + V    G + +  N 
Sbjct:   186 NQGCNGGLMDLAFEFIKEKGGLTSELVYPYK-ASDETCDTNKENAPVVSIDGHEDVPKNS 244

Query:   129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DNIP 187
              + + K +    P+SV +++    D+           C   +L H V +VGYG   D   
Sbjct:   245 EDDLMKAVANQ-PVSVAIDAGG-SDFQFYSEGVFTGRCGT-ELNHGVAVVGYGTTIDGTK 301

Query:   188 YWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             YW+V+NSWG    ++G+ +++RG       CGI   A Y
Sbjct:   302 YWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASY 340

 Score = 282 (104.3 bits), Expect = 1.0e-25, Sum P(2) = 1.0e-25
 Identities = 87/300 (29%), Positives = 134/300 (44%)

Query:   307 EEIKERFEYFKQ------DGHKKHERYGT--SEFSDRSPEEILCKTGFKWSERTYERIVA 358
             EE  +RF  FK       + +KK + Y    ++F D + EE   +  +  S   + R+  
Sbjct:    52 EEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEF--RRTYAGSNIKHHRMFQ 109

Query:   359 DRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG 418
                           +  +P + DWRK     P  +Q  CGSCWAFS    +EG   I+T 
Sbjct:   110 GEKKATKSFMYANVN-TLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168

Query:   419 KLVEFSKSQLVEC-AKQCSGCGGCDGL-EQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCA 475
             KL   S+ +LV+C   Q  GC G  GL +   E+  +  GL SE  YPY+  + E     
Sbjct:   169 KLTSLSEQELVDCDTNQNQGCNG--GLMDLAFEFIKEKGGLTSELVYPYK-ASDETCDTN 225

Query:   476 YDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCS 533
              + + V    G + +  N  + + K +    P+SV +++      FY+          C 
Sbjct:   226 KENAPVVSIDGHEDVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGR---CG 281

Query:   534 PYDLGHAVLLVGYGKQ-DDIPYWLARNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 588
               +L H V +VGYG   D   YW+ +NSWG    ++G+ +++RG       CGI   A Y
Sbjct:   282 T-ELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASY 340

 Score = 48 (22.0 bits), Expect = 1.1e-26, Sum P(2) = 1.1e-26
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query:   820 GLESEKDYPYKNANGEKFKCAYDKSKVKL 848
             G+  E  YP KN+N    + + D  K +L
Sbjct:   333 GIAMEASYPLKNSNTNPSRLSLDSLKDEL 361

 Score = 44 (20.5 bits), Expect = 3.0e-26, Sum P(2) = 3.0e-26
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query:   455 GLESEKDYPYRNGNGEKFKCAYDKSKVKL 483
             G+  E  YP +N N    + + D  K +L
Sbjct:   333 GIAMEASYPLKNSNTNPSRLSLDSLKDEL 361


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 298 (110.0 bits), Expect = 2.9e-26, Sum P(2) = 2.9e-26
 Identities = 79/230 (34%), Positives = 118/230 (51%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP+  +WRK+    P   Q  C  CWAFS+AG +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 VPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRP- 172

Query:   802 SGCDGCFFEPS---IEYTHQ-AGLESEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFL 855
              G  GC+   +   ++Y  +  GLESE  YPY+   G    C Y  D S   + T  +F+
Sbjct:   173 QGNLGCYLGNTYLALQYVKENGGLESEATYPYEEKEGS---CRYHPDNSTASI-TDFEFV 228

Query:   856 HFNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 912
               N    M  +    GP+SV +++     +   NG     ++  CS   + HA+LLVGYG
Sbjct:   229 PKNEDALMNAVA-TLGPISVAIDARHESFLFYRNGI---YHEPNCSSSVVTHAMLLVGYG 284

Query:   913 ----KQDNIPYWLVRNSWGPIGPDEGFFKI--ERGNNACGIEQIAGYATI 956
                 + D   YW+++NS G    + G+ KI  ++GN+ CGI   A Y  +
Sbjct:   285 FVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNH-CGIATYALYPRV 333

 Score = 297 (109.6 bits), Expect = 4.5e-25, P = 4.5e-25
 Identities = 79/230 (34%), Positives = 118/230 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VP+  +WRK+    P   Q  C  CWAFS+AG +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 VPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRP- 172

Query:    71 SGCDGCFFEPS---IEYTHQ-AGLESEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFL 124
              G  GC+   +   ++Y  +  GLESE  YPY+   G    C Y  D S   + T  +F+
Sbjct:   173 QGNLGCYLGNTYLALQYVKENGGLESEATYPYEEKEGS---CRYHPDNSTASI-TDFEFV 228

Query:   125 HFNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
               N    M  +    GP+SV +++     +   NG     ++  CS   + HA+LLVGYG
Sbjct:   229 PKNEDALMNAVA-TLGPISVAIDARHESFLFYRNGI---YHEPNCSSSVVTHAMLLVGYG 284

Query:   182 ----KQDNIPYWLVRNSWGPIGPDEGFFKI--ERGNNACGIEQIAGYATI 225
                 + D   YW+++NS G    + G+ KI  ++GN+ CGI   A Y  +
Sbjct:   285 FVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNH-CGIATYALYPRV 333

 Score = 295 (108.9 bits), Expect = 6.0e-26, Sum P(2) = 6.0e-26
 Identities = 80/230 (34%), Positives = 116/230 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP+  +WRK+    P   Q  C  CWAFS+AG +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 VPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRP- 172

Query:   436 SGCGGC--DGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAY--DKSKVKLFTGKDFL 490
              G  GC        ++Y  +  GLESE  YPY    G    C Y  D S   + T  +F+
Sbjct:   173 QGNLGCYLGNTYLALQYVKENGGLESEATYPYEEKEGS---CRYHPDNSTASI-TDFEFV 228

Query:   491 YFNGSETMKKILYKYGPLSVGLNS-H--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG 547
               N    M  +    GP+SV +++ H   + + NG     ++  CS   + HA+LLVGYG
Sbjct:   229 PKNEDALMNAVA-TLGPISVAIDARHESFLFYRNGI---YHEPNCSSSVVTHAMLLVGYG 284

Query:   548 ----KQDDIPYWLARNSWGPIGPDEGFFKI--ERGNNACGIEQIAGYATI 591
                 + D   YW+ +NS G    + G+ KI  ++GN+ CGI   A Y  +
Sbjct:   285 FVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNH-CGIATYALYPRV 333

 Score = 37 (18.1 bits), Expect = 2.9e-26, Sum P(2) = 2.9e-26
 Identities = 14/57 (24%), Positives = 33/57 (57%)

Query:    90 LESEKDYPYKNANGEKFKCAYDKS--KVKLFTGKDFLHFNGSETMKKILYKYGPLSV 144
             ++ EK Y  +   G+K +  ++++  K+KL  G++ L  +G  TM+  +  +G +++
Sbjct:    34 IKYEKTYSLEE-EGQK-RAVWEENMKKIKLHNGENGLGKHGF-TME--MNAFGDMTI 85


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 308 (113.5 bits), Expect = 3.0e-26, P = 3.0e-26
 Identities = 74/214 (34%), Positives = 110/214 (51%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GC 73
             DWR+K+   P  DQ  CGSC++FS  G +EG  AIKTGKLV  S+  +++C+      GC
Sbjct:   126 DWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGC 185

Query:    74 DGCFFEPSIEYT-HQAGLESEKDYPYK-NANGEKFKCAYDKSKV--KLFTGKDFLHFNGS 129
             +G     + EY     GL SE+ YPY+   N E   C + +  V  K+ + K+    + +
Sbjct:   186 NGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDE---CKFQEGSVAAKITSYKEIEAGDEN 242

Query:   130 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 189
             +    +L    P+SV +++        T     +  CS  DL H VL VG G  +   Y+
Sbjct:   243 DLQNALLLN--PVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYY 300

Query:   190 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
             +V+NSWGP     G+  + R  +N CGI  +A Y
Sbjct:   301 IVKNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334

 Score = 307 (113.1 bits), Expect = 3.8e-26, P = 3.8e-26
 Identities = 74/214 (34%), Positives = 110/214 (51%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GC 804
             DWR+K+   P  DQ  CGSC++FS  G +EG  AIKTGKLV  S+  +++C+      GC
Sbjct:   126 DWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGC 185

Query:   805 DGCFFEPSIEYT-HQAGLESEKDYPYK-NANGEKFKCAYDKSKV--KLFTGKDFLHFNGS 860
             +G     + EY     GL SE+ YPY+   N E   C + +  V  K+ + K+    + +
Sbjct:   186 NGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDE---CKFQEGSVAAKITSYKEIEAGDEN 242

Query:   861 ETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYW 920
             +    +L    P+SV +++        T     +  CS  DL H VL VG G  +   Y+
Sbjct:   243 DLQNALLLN--PVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYY 300

Query:   921 LVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
             +V+NSWGP     G+  + R  +N CGI  +A Y
Sbjct:   301 IVKNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 72/212 (33%), Positives = 107/212 (50%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG 440
             DWR+K+   P  DQ  CGSC++FS  G +EG  AIKTGKLV  S+  +++C+    G  G
Sbjct:   126 DWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSF-GNEG 184

Query:   441 CDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET 497
             C+G  +    EY     GL SE+ YPY     ++ K        K+ + K+    + ++ 
Sbjct:   185 CNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDL 244

Query:   498 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLA 557
                +L    P+SV +++    F   T     +  CS  DL H VL VG G  +   Y++ 
Sbjct:   245 QNALLLN--PVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIV 302

Query:   558 RNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             +NSWGP     G+  + R  +N CGI  +A Y
Sbjct:   303 KNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334

 Score = 125 (49.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 25/61 (40%), Positives = 34/61 (55%)

Query:   962 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAG 1020
             +  CS  DL H VL VG G  +   Y++V+NSWGP     G+  + R  +N CGI  +A 
Sbjct:   274 EPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMAS 333

Query:  1021 Y 1021
             Y
Sbjct:   334 Y 334


>UNIPROTKB|F1NWG2 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 321 (118.1 bits), Expect = 3.3e-26, P = 3.3e-26
 Identities = 83/242 (34%), Positives = 130/242 (53%)

Query:     5 VEKDGPVPDAWDWRKKN---VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FS 59
             ++K   +P++WDWR  N      P  +QA CGSC+AF+  GMLE +  I T    +  FS
Sbjct:   225 LKKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFS 284

Query:    60 KSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF 118
               Q+V C++   GCDG F +  + +Y    G+  E  +PY     +   C + +S    +
Sbjct:   285 PQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAKDTPCLFKRSCYHYY 341

Query:   119 TGKDFLHFNG------SETMKKI-LYKYGPLSV---LLNSDLIHD---YNGTPIRKNDET 165
             T +   H+ G      +E + K+ L   GP++V   + N  + +    Y+ T ++  DE 
Sbjct:   342 TSE--YHYVGGFYGACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLK--DEF 397

Query:   166 CSPYDL-GHAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
              +P++L  HAVLLVGYGK  +    +W+V+NSWG    ++G+F+I RG + C IE IA  
Sbjct:   398 -NPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVA 456

Query:   223 AT 224
             AT
Sbjct:   457 AT 458

 Score = 320 (117.7 bits), Expect = 1.0e-25, Sum P(2) = 1.0e-25
 Identities = 82/236 (34%), Positives = 128/236 (54%)

Query:   742 VPDAWDWRKKN---VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P++WDWR  N      P  +QA+CGSC+AF+  GMLE +  I T    +  FS  Q+V 
Sbjct:   231 LPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSPQQVVS 290

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             C++   GCDG F +  + +Y    G+  E  +PY     +   C + +S    +T +   
Sbjct:   291 CSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPY---TAKDTPCLFKRSCYHYYTSE--Y 345

Query:   856 HFNG------SETMKKI-LYKYGPLSV---LLNSDLIHD---YNGTPIRKNDETCSPYDL 902
             H+ G      +E + K+ L   GP++V   + N  + +    Y+ T ++  DE  +P++L
Sbjct:   346 HYVGGFYGACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLK--DEF-NPFEL 402

Query:   903 -GHAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
               HAVLLVGYGK  +    +W+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAAT 458

 Score = 311 (114.5 bits), Expect = 1.8e-24, Sum P(2) = 1.8e-24
 Identities = 81/237 (34%), Positives = 124/237 (52%)

Query:   376 VPDAWDWRKKN---VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P++WDWR  N      P  +QA+CGSC+AF+  GMLE +  I T    +  FS  Q+V 
Sbjct:   231 LPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSPQQVVS 290

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C++   GC G  G    I  +Y    G+  E  +PY     +   C + +S    +T  +
Sbjct:   291 CSQYSQGCDG--GFPYLIAGKYVQDFGVVEEDCFPY---TAKDTPCLFKRSCYHYYTS-E 344

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGL---NSHLIH---FYNGTPIRKNDETCSPYD 536
             + Y  G     +E + K+ L   GP++V     N  + +    Y+ T ++  DE  +P++
Sbjct:   345 YHYVGGFYGACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLK--DEF-NPFE 401

Query:   537 L-GHAVLLVGYGKQDDI--PYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L  HAVLLVGYGK  +    +W+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   402 LTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAAT 458

 Score = 155 (59.6 bits), Expect = 1.8e-06, Sum P(2) = 1.8e-06
 Identities = 30/61 (49%), Positives = 42/61 (68%)

Query:   966 SPYDL-GHAVLLVGYGKQDDI--PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYGK  +    +W+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAA 457

Query:  1023 T 1023
             T
Sbjct:   458 T 458

 Score = 37 (18.1 bits), Expect = 1.0e-25, Sum P(2) = 1.0e-25
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   210 GNN-ACGIEQIAGYATIDVVIQRLVLEKKAIMLIQAVFL 247
             G N AC   Q    ++ DV +++L L+K  + L    F+
Sbjct:   131 GRNWACFTGQKISSSSSDVHVRQLPLQKPRVGLSSRRFV 169

 Score = 37 (18.1 bits), Expect = 1.0e-25, Sum P(2) = 1.0e-25
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   576 GNN-ACGIEQIAGYATIDVVIQRLVLEKKAIMLIQAVFL 613
             G N AC   Q    ++ DV +++L L+K  + L    F+
Sbjct:   131 GRNWACFTGQKISSSSSDVHVRQLPLQKPRVGLSSRRFV 169


>FB|FBgn0037396 [details] [associations]
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 306 (112.8 bits), Expect = 4.9e-26, P = 4.9e-26
 Identities = 74/224 (33%), Positives = 115/224 (51%)

Query:    11 VPDAWDWRKKNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             + +  DWR+     P GDQ  +C SCWAFS +G+LE   A K G LV  S   LV+C   
Sbjct:   118 ITEGIDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPY 177

Query:    70 CS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFLHF 126
              + GC G +   +  YT   G+ +++ YPY+  +GE   C +  D+S   L +G   L  
Sbjct:   178 PNNGCSGGWVSVAFNYTRDHGIATKESYPYEPVSGE---CLWKSDRSAGTL-SGYVTLGN 233

Query:   127 NGSETMKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYGK 182
                  + +++Y  GP++V +  D +H+    Y+G  +          DL H+VLLVG+G 
Sbjct:   234 YDERELAEVVYNIGPVAVSI--DHLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGT 291

Query:   183 QDNI-PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYAT 224
                   YW+++NS+G    + G+ K+ R  NN CG+  +  Y T
Sbjct:   292 HRKWGDYWIIKNSYGTDWGESGYLKLARNANNMCGVASLPQYPT 335

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 74/224 (33%), Positives = 114/224 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
             + +  DWR+     P GDQ   C SCWAFS +G+LE   A K G LV  S   LV+C   
Sbjct:   118 ITEGIDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPY 177

Query:   801 CS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAY--DKSKVKLFTGKDFLHF 857
              + GC G +   +  YT   G+ +++ YPY+  +GE   C +  D+S   L +G   L  
Sbjct:   178 PNNGCSGGWVSVAFNYTRDHGIATKESYPYEPVSGE---CLWKSDRSAGTL-SGYVTLGN 233

Query:   858 NGSETMKKILYKYGPLSVLLNSDLIHD----YNGTPIRKNDETCSPYDLGHAVLLVGYGK 913
                  + +++Y  GP++V +  D +H+    Y+G  +          DL H+VLLVG+G 
Sbjct:   234 YDERELAEVVYNIGPVAVSI--DHLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGT 291

Query:   914 QDNI-PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYAT 955
                   YW+++NS+G    + G+ K+ R  NN CG+  +  Y T
Sbjct:   292 HRKWGDYWIIKNSYGTDWGESGYLKLARNANNMCGVASLPQYPT 335

 Score = 285 (105.4 bits), Expect = 8.6e-24, P = 8.6e-24
 Identities = 77/226 (34%), Positives = 113/226 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 434
             + +  DWR+     P GDQ   C SCWAFS +G+LE   A K G LV  S   LV+C   
Sbjct:   118 ITEGIDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPY 177

Query:   435 CS-GCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAY--DKSKVKLFTGKDFLY 491
              + GC G   +     YT   G+ +++ YPY   +GE   C +  D+S   L +G   L 
Sbjct:   178 PNNGCSG-GWVSVAFNYTRDHGIATKESYPYEPVSGE---CLWKSDRSAGTL-SGYVTLG 232

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIH-F--YNGTPIRKNDETCSPYDLGHAVLLVGYG- 547
                   + +++Y  GP++V ++ HL   F  Y+G  +          DL H+VLLVG+G 
Sbjct:   233 NYDERELAEVVYNIGPVAVSID-HLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGT 291

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYAT 590
               K  D  YW+ +NS+G    + G+ K+ R  NN CG+  +  Y T
Sbjct:   292 HRKWGD--YWIIKNSYGTDWGESGYLKLARNANNMCGVASLPQYPT 335


>RGD|2445 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 319 (117.4 bits), Expect = 5.7e-26, P = 5.7e-26
 Identities = 79/234 (33%), Positives = 120/234 (51%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P++WDWR     N   P  +Q +CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGE---KFKCAYDKSKVKLFTGK 852
             C+    GCDG F +  + +Y    G+  E  +PY   +     K  C    S    + G 
Sbjct:   290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEYYYVGG 349

Query:   853 DFLHFNGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKN-DETCSPYDL-GHAVLL 908
              +   N +  MK  L K+GP++V   ++ D +H ++G        +  +P++L  HAVLL
Sbjct:   350 FYGGCNEA-LMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLL 408

Query:   909 VGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVK 960
             VGYGK     + YW+V+NSWG    + G+F+I RG + C IE IA  A I + K
Sbjct:   409 VGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIA-MAAIPIPK 461

 Score = 315 (115.9 bits), Expect = 1.9e-25, P = 1.9e-25
 Identities = 76/225 (33%), Positives = 115/225 (51%)

Query:    11 VPDAWDWRKK---NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P++WDWR     N   P  +Q  CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:    66 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGE---KFKCAYDKSKVKLFTGK 121
             C+    GCDG F +  + +Y    G+  E  +PY   +     K  C    S    + G 
Sbjct:   290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEYYYVGG 349

Query:   122 DFLHFNGSETMKKILYKYGPLSVL--LNSDLIHDYNGTPIRKN-DETCSPYDL-GHAVLL 177
              +   N +  MK  L K+GP++V   ++ D +H ++G        +  +P++L  HAVLL
Sbjct:   350 FYGGCNEA-LMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLL 408

Query:   178 VGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 220
             VGYGK     + YW+V+NSWG    + G+F+I RG + C IE IA
Sbjct:   409 VGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIA 453

 Score = 305 (112.4 bits), Expect = 3.0e-24, P = 3.0e-24
 Identities = 72/229 (31%), Positives = 119/229 (51%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P++WDWR     N   P  +Q +CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C+    GC G  G    I  +Y    G+  E  +PY   +     C   ++ ++ ++ + 
Sbjct:   290 CSPYAQGCDG--GFPYLIAGKYAQDFGVVEENCFPYTATDAP---CKPKENCLRYYSSEY 344

Query:   489 FL---YFNG-SETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-GH 539
             +    ++ G +E + K+ L K+GP++V    H   +H+++G        +  +P++L  H
Sbjct:   345 YYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNH 404

Query:   540 AVLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
             AVLLVGYGK     + YW+ +NSWG    + G+F+I RG + C IE IA
Sbjct:   405 AVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIA 453

 Score = 153 (58.9 bits), Expect = 2.7e-07, P = 2.7e-07
 Identities = 29/57 (50%), Positives = 39/57 (68%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 1019
             +P++L  HAVLLVGYGK     + YW+V+NSWG    + G+F+I RG + C IE IA
Sbjct:   397 NPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIA 453


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 247 (92.0 bits), Expect = 6.3e-26, Sum P(3) = 6.3e-26
 Identities = 60/169 (35%), Positives = 84/169 (49%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDG 75
             DWR +    P  +Q  CG CW+FS  G  EG +    G+LV  S+  L++C+ + SGCDG
Sbjct:   117 DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENSGCDG 176

Query:    76 CFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMK 133
                  + EY  +  G+++E  YPYK  NG   KC Y KS+    T   +     GSE+  
Sbjct:   177 GLMTYAFEYIINNNGIDTESSYPYKAENG---KCEY-KSENSGATLSSYKTVTAGSESSL 232

Query:   134 KILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +      P+SV ++ S        + I    E CS  +L H VL VGYG
Sbjct:   233 ESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE-CSSENLDHGVLAVGYG 280

 Score = 246 (91.7 bits), Expect = 1.1e-25, Sum P(3) = 1.1e-25
 Identities = 60/169 (35%), Positives = 84/169 (49%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDG 806
             DWR +    P  +Q  CG CW+FS  G  EG +    G+LV  S+  L++C+ + SGCDG
Sbjct:   117 DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENSGCDG 176

Query:   807 CFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMK 864
                  + EY  +  G+++E  YPYK  NG   KC Y KS+    T   +     GSE+  
Sbjct:   177 GLMTYAFEYIINNNGIDTESSYPYKAENG---KCEY-KSENSGATLSSYKTVTAGSESSL 232

Query:   865 KILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 912
             +      P+SV ++ S        + I    E CS  +L H VL VGYG
Sbjct:   233 ESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE-CSSENLDHGVLAVGYG 280

 Score = 245 (91.3 bits), Expect = 1.8e-18, P = 1.8e-18
 Identities = 68/220 (30%), Positives = 98/220 (44%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG 440
             DWR +    P  +Q  CG CW+FS  G  EG +    G+LV  S+  L++C+ + SGC G
Sbjct:   117 DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENSGCDG 176

Query:   441 CDGL-EQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN-GSET 497
               GL     EY  +  G+++E  YPY+  NG   KC Y KS+    T   +     GSE+
Sbjct:   177 --GLMTYAFEYIINNNGIDTESSYPYKAENG---KCEY-KSENSGATLSSYKTVTAGSES 230

Query:   498 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK--------- 548
               +      P+SV +++    F   T     +  CS  +L H VL VGYG          
Sbjct:   231 SLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQS 290

Query:   549 ----------QDDIPYWLARNSWGPIGPDEGFFKIERGNN 578
                            YW+ +NSWG     EG+  + R  +
Sbjct:   291 SGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRD 330

 Score = 82 (33.9 bits), Expect = 6.3e-26, Sum P(3) = 6.3e-26
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query:   987 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 1024
             YW+V+NSWG     EG+  + R  +N CGI   A +  +
Sbjct:   306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344

 Score = 82 (33.9 bits), Expect = 1.2e-23, Sum P(2) = 1.2e-23
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query:   188 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
             YW+V+NSWG     EG+  + R  +N CGI   A +  +
Sbjct:   306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344

 Score = 82 (33.9 bits), Expect = 1.2e-23, Sum P(2) = 1.2e-23
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query:   919 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
             YW+V+NSWG     EG+  + R  +N CGI   A +  +
Sbjct:   306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPVV 344

 Score = 52 (23.4 bits), Expect = 6.3e-26, Sum P(3) = 6.3e-26
 Identities = 10/16 (62%), Positives = 11/16 (68%)

Query:   965 CSPYDLGHAVLLVGYG 980
             CS  +L H VL VGYG
Sbjct:   265 CSSENLDHGVLAVGYG 280


>MGI|MGI:109553 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 318 (117.0 bits), Expect = 7.6e-26, P = 7.6e-26
 Identities = 76/237 (32%), Positives = 126/237 (53%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P++WDWR     N   P  +Q +CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             C+    GCDG F +  + +Y    G+  E  +PY     +   C   ++ ++ ++  D+ 
Sbjct:   290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPY---TAKDSPCKPRENCLRYYSS-DYY 345

Query:   856 HFNG-----SETMKKI-LYKYGPLSVL--LNSDLIHDYNGTPIRKN-DETCSPYDL-GHA 905
             +  G     +E + K+ L K+GP++V   ++ D +H ++G        +  +P++L  HA
Sbjct:   346 YVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHA 405

Query:   906 VLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVK 960
             VLLVGYG+     I YW+++NSWG    + G+F+I RG + C IE IA  A I + K
Sbjct:   406 VLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA-VAAIPIPK 461

 Score = 314 (115.6 bits), Expect = 2.4e-25, P = 2.4e-25
 Identities = 73/228 (32%), Positives = 121/228 (53%)

Query:    11 VPDAWDWRKK---NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P++WDWR     N   P  +Q  CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 289

Query:    66 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             C+    GCDG F +  + +Y    G+  E  +PY     +   C   ++ ++ ++  D+ 
Sbjct:   290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPY---TAKDSPCKPRENCLRYYSS-DYY 345

Query:   125 HFNG-----SETMKKI-LYKYGPLSVL--LNSDLIHDYNGTPIRKN-DETCSPYDL-GHA 174
             +  G     +E + K+ L K+GP++V   ++ D +H ++G        +  +P++L  HA
Sbjct:   346 YVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHA 405

Query:   175 VLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 220
             VLLVGYG+     I YW+++NSWG    + G+F+I RG + C IE IA
Sbjct:   406 VLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA 453

 Score = 308 (113.5 bits), Expect = 1.3e-24, P = 1.3e-24
 Identities = 74/230 (32%), Positives = 118/230 (51%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P++WDWR     N   P  +Q +CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   230 LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C+    GC G  G    I  +Y    G+  E  +PY     +   C   ++ ++ ++  D
Sbjct:   290 CSPYAQGCDG--GFPYLIAGKYAQDFGVVEESCFPY---TAKDSPCKPRENCLRYYSS-D 343

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-G 538
             + Y  G     +E + K+ L K+GP++V    H   +H+++G        +  +P++L  
Sbjct:   344 YYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTN 403

Query:   539 HAVLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
             HAVLLVGYG+     I YW+ +NSWG    + G+F+I RG + C IE IA
Sbjct:   404 HAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA 453

 Score = 151 (58.2 bits), Expect = 4.5e-07, P = 4.5e-07
 Identities = 28/57 (49%), Positives = 39/57 (68%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 1019
             +P++L  HAVLLVGYG+     I YW+++NSWG    + G+F+I RG + C IE IA
Sbjct:   397 NPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA 453


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 71/222 (31%), Positives = 112/222 (50%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 70
             P   DWR +    P  +Q  CGSCWAFS  G LEG     TGKL   S+  L++C+ +  
Sbjct:   121 PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLG 180

Query:    71 -SGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              +GC G +   + +Y H   G+ SE  YPY+  +     C Y+ +         +L   G
Sbjct:   181 NNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDTSS--CRYNPADRAANCSTVWLVAQG 238

Query:   129 SET-MKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQD 184
             SE  +++ +   GP+SV +++     H Y       N   CS   + H +L VGYG  Q+
Sbjct:   239 SEAALEQAVATVGPVSVAVDASSFFFHFYKSGIF--NSMFCSQ-KVNHGMLAVGYGISQE 295

Query:   185 ---NIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
                N+ YW+++NSW  +  ++G+ ++ +G NN CG+   A +
Sbjct:   296 ARKNVSYWILKNSWSEVWGEKGYIRLLKGVNNHCGVANQASF 337

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 73/223 (32%), Positives = 112/223 (50%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P   DWR +    P  +Q  CGSCWAFS  G LEG     TGKL   S+  L++C+ +  
Sbjct:   121 PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKL- 179

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN 493
             G  GC G  + +  +Y H   G+ SE  YPY+  +     C Y+ +         +L   
Sbjct:   180 GNNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDTSS--CRYNPADRAANCSTVWLVAQ 237

Query:   494 GSET-MKKILYKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQ 549
             GSE  +++ +   GP+SV ++  S   HFY       N   CS   + H +L VGYG  Q
Sbjct:   238 GSEAALEQAVATVGPVSVAVDASSFFFHFYKSGIF--NSMFCSQ-KVNHGMLAVGYGISQ 294

Query:   550 D---DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             +   ++ YW+ +NSW  +  ++G+ ++ +G NN CG+   A +
Sbjct:   295 EARKNVSYWILKNSWSEVWGEKGYIRLLKGVNNHCGVANQASF 337

 Score = 303 (111.7 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 71/222 (31%), Positives = 112/222 (50%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC- 801
             P   DWR +    P  +Q  CGSCWAFS  G LEG     TGKL   S+  L++C+ +  
Sbjct:   121 PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLG 180

Query:   802 -SGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 859
              +GC G +   + +Y H   G+ SE  YPY+  +     C Y+ +         +L   G
Sbjct:   181 NNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDTSS--CRYNPADRAANCSTVWLVAQG 238

Query:   860 SET-MKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQD 915
             SE  +++ +   GP+SV +++     H Y       N   CS   + H +L VGYG  Q+
Sbjct:   239 SEAALEQAVATVGPVSVAVDASSFFFHFYKSGIF--NSMFCSQ-KVNHGMLAVGYGISQE 295

Query:   916 ---NIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
                N+ YW+++NSW  +  ++G+ ++ +G NN CG+   A +
Sbjct:   296 ARKNVSYWILKNSWSEVWGEKGYIRLLKGVNNHCGVANQASF 337


>TAIR|locus:2082881
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 304 (112.1 bits), Expect = 8.0e-26, P = 8.0e-26
 Identities = 93/337 (27%), Positives = 153/337 (45%)

Query:   636 ARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD-------GHK 688
             +R   +   G L F+  + +E  + ++ +  R Y++D E   RFE F  +          
Sbjct:    15 SRTSGVTSRGGL-FE-ASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMN 72

Query:   689 KHERY--GTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDA 745
              ++ Y    +EFSD + EE   + TG    E        D              G   ++
Sbjct:    73 TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYENV--GETGES 130

Query:   746 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCD 805
              DW ++        Q  CG CWAFS    +EG   I  G+LV  S+ QL++C+ + +GC 
Sbjct:   131 MDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCG 190

Query:   806 GCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMK 864
             G     + +Y  +  G+ +E +YPY+   G +  C  +       +G + +  N  E + 
Sbjct:   191 GGIMWKAFDYIKENQGITTEDNYPYQ---GAQQTCESNHLAAATISGYETVPQNDEEALL 247

Query:   865 KILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYW 920
             K + +  P+SV +     + IH Y+G     N E C    L HAV +VGYG  ++ I YW
Sbjct:   248 KAVSQQ-PVSVAIEGSGYEFIH-YSGGIF--NGE-CGT-QLTHAVTIVGYGVSEEGIKYW 301

Query:   921 LVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 953
             L++NSWG    + G+ +I R  ++    CG+  +A Y
Sbjct:   302 LLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYY 338

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 94/339 (27%), Positives = 153/339 (45%)

Query:   270 ARVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD-------GHK 322
             +R   +   G L F+  + +E  + ++ +  R Y++D E   RFE F  +          
Sbjct:    15 SRTSGVTSRGGL-FE-ASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMN 72

Query:   323 KHERY--GTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDA 379
              ++ Y    +EFSD + EE   + TG    E        D              G   ++
Sbjct:    73 TNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYENV--GETGES 130

Query:   380 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCG 439
              DW ++        Q  CG CWAFS    +EG   I  G+LV  S+ QL++C+ + +GCG
Sbjct:   131 MDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCG 190

Query:   440 GCDGLE-QPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET 497
             G  G+  +  +Y  +  G+ +E +YPY+   G +  C  +       +G + +  N  E 
Sbjct:   191 G--GIMWKAFDYIKENQGITTEDNYPYQ---GAQQTCESNHLAAATISGYETVPQNDEEA 245

Query:   498 MKKILYKYGPLSVGLNS---HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDDIP 553
             + K + +  P+SV +       IH+  G     N E C    L HAV +VGYG  ++ I 
Sbjct:   246 LLKAVSQQ-PVSVAIEGSGYEFIHYSGGI---FNGE-CGT-QLTHAVTIVGYGVSEEGIK 299

Query:   554 YWLARNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 588
             YWL +NSWG    + G+ +I R  ++    CG+  +A Y
Sbjct:   300 YWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYY 338

 Score = 284 (105.0 bits), Expect = 1.1e-23, P = 1.1e-23
 Identities = 70/226 (30%), Positives = 113/226 (50%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E  G   ++ DW ++        Q  CG CWAFS    +EG   I  G+LV  S+ QL++
Sbjct:   122 ENVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLD 181

Query:    66 CAKQCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             C+ + +GC G     + +Y  +  G+ +E +YPY+   G +  C  +       +G + +
Sbjct:   182 CSTENNGCGGGIMWKAFDYIKENQGITTEDNYPYQ---GAQQTCESNHLAAATISGYETV 238

Query:   125 HFNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
               N  E + K + +  P+SV +     + IH Y+G     N E C    L HAV +VGYG
Sbjct:   239 PQNDEEALLKAVSQQ-PVSVAIEGSGYEFIH-YSGGIF--NGE-CGT-QLTHAVTIVGYG 292

Query:   182 -KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 222
               ++ I YWL++NSWG    + G+ +I R  ++    CG+  +A Y
Sbjct:   293 VSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYY 338


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 77/229 (33%), Positives = 112/229 (48%)

Query:   742 VPDAWDWRK---KNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 795
             +P +WDWR     N   P  +QAA CGSC+AF+   MLE +  I T        S  ++V
Sbjct:   173 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 232

Query:   796 ECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGK 852
              C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S    + G 
Sbjct:   233 SCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVG- 291

Query:   853 DFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GHAVLL 908
              F        MK  L ++GP++V      D  H   G        +  +P++L  HAVLL
Sbjct:   292 GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLL 351

Query:   909 VGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             VGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   352 VGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 400

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 77/234 (32%), Positives = 113/234 (48%)

Query:     6 EKDGPVPDAWDWRK---KNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVE--FS 59
             E+   +P +WDWR     N   P  +QA  CGSC+AF+   MLE +  I T        S
Sbjct:   168 EEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILS 227

Query:    60 KSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVK 116
               ++V C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S   
Sbjct:   228 PQEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEY 287

Query:   117 LFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-G 172
              + G  F        MK  L ++GP++V      D  H   G        +  +P++L  
Sbjct:   288 YYVG-GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN 346

Query:   173 HAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   347 HAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 400

 Score = 288 (106.4 bits), Expect = 5.7e-23, P = 5.7e-23
 Identities = 76/232 (32%), Positives = 112/232 (48%)

Query:   376 VPDAWDWRK---KNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 429
             +P +WDWR     N   P  +QAA CGSC+AF+   MLE +  I T        S  ++V
Sbjct:   173 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 232

Query:   430 ECAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFT 485
              C++   GC G  G    I  +Y    GL  E  +PY   +   +   C    S    + 
Sbjct:   233 SCSQYAQGCEG--GFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYV 290

Query:   486 GKDFLYFNGSETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-GHA 540
             G    Y   +E + K+ L ++GP++V    +    H+  G        +  +P++L  HA
Sbjct:   291 GG--FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHA 348

Query:   541 VLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             VLLVGYG      + YW+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   349 VLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 400

 Score = 153 (58.9 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 30/61 (49%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   340 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAA 399

Query:  1023 T 1023
             T
Sbjct:   400 T 400


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 77/229 (33%), Positives = 112/229 (48%)

Query:   742 VPDAWDWRK---KNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 795
             +P +WDWR     N   P  +QAA CGSC+AF+   MLE +  I T        S  ++V
Sbjct:   174 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 233

Query:   796 ECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGK 852
              C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S    + G 
Sbjct:   234 SCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVG- 292

Query:   853 DFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GHAVLL 908
              F        MK  L ++GP++V      D  H   G        +  +P++L  HAVLL
Sbjct:   293 GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLL 352

Query:   909 VGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             VGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   353 VGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 401

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 77/234 (32%), Positives = 113/234 (48%)

Query:     6 EKDGPVPDAWDWRK---KNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVE--FS 59
             E+   +P +WDWR     N   P  +QA  CGSC+AF+   MLE +  I T        S
Sbjct:   169 EEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILS 228

Query:    60 KSQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVK 116
               ++V C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S   
Sbjct:   229 PQEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEY 288

Query:   117 LFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-G 172
              + G  F        MK  L ++GP++V      D  H   G        +  +P++L  
Sbjct:   289 YYVG-GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTN 347

Query:   173 HAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   348 HAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 401

 Score = 288 (106.4 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 76/232 (32%), Positives = 112/232 (48%)

Query:   376 VPDAWDWRK---KNVTGPAGDQAA-CGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 429
             +P +WDWR     N   P  +QAA CGSC+AF+   MLE +  I T        S  ++V
Sbjct:   174 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 233

Query:   430 ECAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFT 485
              C++   GC G  G    I  +Y    GL  E  +PY   +   +   C    S    + 
Sbjct:   234 SCSQYAQGCEG--GFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYV 291

Query:   486 GKDFLYFNGSETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-GHA 540
             G    Y   +E + K+ L ++GP++V    +    H+  G        +  +P++L  HA
Sbjct:   292 GG--FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHA 349

Query:   541 VLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             VLLVGYG      + YW+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   350 VLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 401

 Score = 153 (58.9 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 30/61 (49%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   341 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAA 400

Query:  1023 T 1023
             T
Sbjct:   401 T 401


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 67/190 (35%), Positives = 103/190 (54%)

Query:    32 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH--QAG 89
             CG CWAFS+   +E  YAIK   L   S  Q+++C+    GC+G     ++ + +  Q  
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVK 61

Query:    90 LESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLL 146
             + S+ +YP+K  NG    F C++    +K ++  DF   +G E  M K L   GPL V++
Sbjct:    62 VVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDF---SGQEDEMAKTLLTLGPLIVIV 118

Query:   147 NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 206
             ++    DY G  I+ +   CS  +  HAVL+ G+ K  + PYW+VRNSWG     +G+  
Sbjct:   119 DAVSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYAL 175

Query:   207 IERGNNACGI 216
             ++ G N CGI
Sbjct:   176 VKMGGNICGI 185

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 67/190 (35%), Positives = 103/190 (54%)

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTH--QAG 820
             CG CWAFS+   +E  YAIK   L   S  Q+++C+    GC+G     ++ + +  Q  
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVK 61

Query:   821 LESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLL 877
             + S+ +YP+K  NG    F C++    +K ++  DF   +G E  M K L   GPL V++
Sbjct:    62 VVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDF---SGQEDEMAKTLLTLGPLIVIV 118

Query:   878 NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 937
             ++    DY G  I+ +   CS  +  HAVL+ G+ K  + PYW+VRNSWG     +G+  
Sbjct:   119 DAVSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYAL 175

Query:   938 IERGNNACGI 947
             ++ G N CGI
Sbjct:   176 VKMGGNICGI 185

 Score = 277 (102.6 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 65/190 (34%), Positives = 95/190 (50%)

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTH-QAG 455
             CG CWAFS+   +E  YAIK   L   S  Q+++C+    GC G   L         Q  
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVK 61

Query:   456 LESEKDYPYRNGNG--EKFKCAYDKSKVKLFTGKDFLYFNGSET-MKKILYKYGPLSVGL 512
             + S+ +YP++  NG    F C++    +K ++  DF   +G E  M K L   GPL V +
Sbjct:    62 VVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDF---SGQEDEMAKTLLTLGPLIVIV 118

Query:   513 NSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
             ++     Y G  I+ +   CS  +  HAVL+ G+ K    PYW+ RNSWG     +G+  
Sbjct:   119 DAVSWQDYLGGIIQHH---CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYAL 175

Query:   573 IERGNNACGI 582
             ++ G N CGI
Sbjct:   176 VKMGGNICGI 185

 Score = 135 (52.6 bits), Expect = 9.9e-07, P = 9.9e-07
 Identities = 23/51 (45%), Positives = 31/51 (60%)

Query:   965 CSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 1015
             CS  +  HAVL+ G+ K    PYW+VRNSWG     +G+  ++ G N CGI
Sbjct:   135 CSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYALVKMGGNICGI 185


>UNIPROTKB|E9PSK9
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 302 (111.4 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 78/226 (34%), Positives = 110/226 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKP- 183

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC G       +Y  Q  GLESE  YPY+   G++  C Y+         +     
Sbjct:   184 QGNKGCRGGTTYNAFQYVLQNGGLESEATYPYK---GKEGLCKYNPKNAYAKITRFVALP 240

Query:   493 NGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 547
                + +   L   GP++ G++      HF +G     ++  C+   + HAVL+VGYG   
Sbjct:   241 EDEDVLMDALATKGPVAAGIHVVYSYFHFVSGI---YHEPKCNNR-VNHAVLVVGYGFEG 296

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              + D   YWL +NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 342

 Score = 290 (107.1 bits), Expect = 2.5e-24, P = 2.5e-24
 Identities = 80/226 (35%), Positives = 111/226 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQ 184

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 126
                GC G     + +Y  Q  GLESE  YPYK   G    C Y+ K+     T    L  
Sbjct:   185 GNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEG---LCKYNPKNAYAKITRFVALPE 241

Query:   127 NGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 181
             +    M  +  K GP++  ++      H  +G     ++  C+   + HAVL+VGYG   
Sbjct:   242 DEDVLMDALATK-GPVAAGIHVVYSYFHFVSGI---YHEPKCNNR-VNHAVLVVGYGFEG 296

Query:   182 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              + D   YWL++NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 342

 Score = 290 (107.1 bits), Expect = 2.5e-24, P = 2.5e-24
 Identities = 80/226 (35%), Positives = 111/226 (49%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQ 184

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 857
                GC G     + +Y  Q  GLESE  YPYK   G    C Y+ K+     T    L  
Sbjct:   185 GNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEG---LCKYNPKNAYAKITRFVALPE 241

Query:   858 NGSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 912
             +    M  +  K GP++  ++      H  +G     ++  C+   + HAVL+VGYG   
Sbjct:   242 DEDVLMDALATK-GPVAAGIHVVYSYFHFVSGI---YHEPKCNNR-VNHAVLVVGYGFEG 296

Query:   913 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              + D   YWL++NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 342

 Score = 128 (50.1 bits), Expect = 8.4e-05, P = 8.4e-05
 Identities = 32/91 (35%), Positives = 48/91 (52%)

Query:   940 RGNNACGIEQIAGYAT-IDVVKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSW 994
             +G  A GI  +  Y   +  + ++  C+   + HAVL+VGYG    + D   YWL++NSW
Sbjct:   253 KGPVAAGIHVVYSYFHFVSGIYHEPKCNNR-VNHAVLVVGYGFEGNETDGNNYWLIKNSW 311

Query:   995 GPIGPDEGFFKIERG-NNACGIEQIAGYATI 1024
             G     +G+ KI +  NN CGI   A Y  +
Sbjct:   312 GKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 342


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 313 (115.2 bits), Expect = 2.6e-25, P = 2.6e-25
 Identities = 83/234 (35%), Positives = 118/234 (50%)

Query:    11 VPDAWDWRKKN---VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P  WDWR  N      P  +QA CGSC++F+  GMLE +  I+T    +  FS  Q+V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:    66 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 125
             C++   GCDG F     +Y    G+  E  +PY    G    C       K +   D+ +
Sbjct:   284 CSQYSQGCDGGFPYLIGKYIQDFGIVEEDCFPY---TGSDSPCNLPAKCTKYYAS-DYHY 339

Query:   126 FNG-----SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL-G 172
               G     SE+   + L K GP+ V L    D ++     Y+ T +R   +  +P++L  
Sbjct:   340 VGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLR---DANNPFELTN 396

Query:   173 HAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             HAVLLVGYG+  +    YW+V+NSWG    + GFF+I RG + C IE IA  AT
Sbjct:   397 HAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAAT 450

 Score = 312 (114.9 bits), Expect = 3.6e-25, P = 3.6e-25
 Identities = 83/234 (35%), Positives = 118/234 (50%)

Query:   742 VPDAWDWRKKN---VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P  WDWR  N      P  +QA CGSC++F+  GMLE +  I+T    +  FS  Q+V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   797 CAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLH 856
             C++   GCDG F     +Y    G+  E  +PY    G    C       K +   D+ +
Sbjct:   284 CSQYSQGCDGGFPYLIGKYIQDFGIVEEDCFPY---TGSDSPCNLPAKCTKYYAS-DYHY 339

Query:   857 FNG-----SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL-G 903
               G     SE+   + L K GP+ V L    D ++     Y+ T +R   +  +P++L  
Sbjct:   340 VGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLR---DANNPFELTN 396

Query:   904 HAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             HAVLLVGYG+  +    YW+V+NSWG    + GFF+I RG + C IE IA  AT
Sbjct:   397 HAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAAT 450

 Score = 304 (112.1 bits), Expect = 3.3e-24, P = 3.3e-24
 Identities = 82/236 (34%), Positives = 118/236 (50%)

Query:   376 VPDAWDWRKKN---VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P  WDWR  N      P  +QA CGSC++F+  GMLE +  I+T    +  FS  Q+V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   431 CAKQCSGCGGCDGLEQPI-EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF 489
             C++   GC G  G    I +Y    G+  E  +PY    G    C       K +   D+
Sbjct:   284 CSQYSQGCDG--GFPYLIGKYIQDFGIVEEDCFPY---TGSDSPCNLPAKCTKYYAS-DY 337

Query:   490 LYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYDL 537
              Y  G     SE+   + L K GP+ V L  +   +++    Y+ T +R   +  +P++L
Sbjct:   338 HYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLR---DANNPFEL 394

Query:   538 -GHAVLLVGYGK--QDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
               HAVLLVGYG+  +    YW+ +NSWG    + GFF+I RG + C IE IA  AT
Sbjct:   395 TNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAAT 450

 Score = 151 (58.2 bits), Expect = 4.4e-07, P = 4.4e-07
 Identities = 31/61 (50%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG+  +    YW+V+NSWG    + GFF+I RG + C IE IA  A
Sbjct:   390 NPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAA 449

Query:  1023 T 1023
             T
Sbjct:   450 T 450


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 84/286 (29%), Positives = 128/286 (44%)

Query:   680 EYFKQDGHKKHERYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXX 738
             E F   G++ + + G +EF+D + EE L   TG +    T    V +             
Sbjct:    71 ESFNNMGNQSY-KLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDV 129

Query:   739 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 798
              G   D   WR +    P   Q  CG CWAFS    +EG   I  G L+  S+ QL++C 
Sbjct:   130 LGTNKD---WRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCT 186

Query:   799 K-QCSGCDGCFFEPSIEYT--HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             + Q +GC G  F  +  Y   H+ G+ SE +YPY+   G    C  +     L  G + +
Sbjct:   187 REQNNGCKGGTFVNAFNYIIKHR-GISSENEYPYQVKEGP---CRSNARPAILIRGFENV 242

Query:   856 HFNGSETMKKILYKYGPLSVLLNSD---LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 912
               N    + + + +  P++V +++     +H Y+G     N   C    + HAV LVGYG
Sbjct:   243 PSNNERALLEAVSRQ-PVAVAIDASEAGFVH-YSGGVY--NARNCGT-SVNHAVTLVGYG 297

Query:   913 KQ-DNIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 953
                + + YWL +NSWG    + G+ +I R        CG+ Q A Y
Sbjct:   298 TSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASY 343

 Score = 289 (106.8 bits), Expect = 3.2e-24, P = 3.2e-24
 Identities = 81/285 (28%), Positives = 124/285 (43%)

Query:   314 EYFKQDGHKKHERYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXX 372
             E F   G++ + + G +EF+D + EE L   TG +    T    V +             
Sbjct:    71 ESFNNMGNQSY-KLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDV 129

Query:   373 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 432
              G   D   WR +    P   Q  CG CWAFS    +EG   I  G L+  S+ QL++C 
Sbjct:   130 LGTNKD---WRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCT 186

Query:   433 K-QCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY 491
             + Q +GC G   +          G+ SE +YPY+   G    C  +     L  G + + 
Sbjct:   187 REQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGP---CRSNARPAILIRGFENVP 243

Query:   492 FNGSETMKKILYKYGPLSVGLNSH---LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
              N    + + + +  P++V +++     +H+  G     N   C    + HAV LVGYG 
Sbjct:   244 SNNERALLEAVSRQ-PVAVAIDASEAGFVHYSGGV---YNARNCGT-SVNHAVTLVGYGT 298

Query:   549 QDD-IPYWLARNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 588
               + + YWLA+NSWG    + G+ +I R        CG+ Q A Y
Sbjct:   299 SPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASY 343

 Score = 288 (106.4 bits), Expect = 4.1e-24, P = 4.1e-24
 Identities = 69/218 (31%), Positives = 105/218 (48%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK-QCSGCD 74
             DWR +    P   Q +CG CWAFS    +EG   I  G L+  S+ QL++C + Q +GC 
Sbjct:   135 DWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCK 194

Query:    75 GCFFEPSIEYT--HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             G  F  +  Y   H+ G+ SE +YPY+   G    C  +     L  G + +  N    +
Sbjct:   195 GGTFVNAFNYIIKHR-GISSENEYPYQVKEGP---CRSNARPAILIRGFENVPSNNERAL 250

Query:   133 KKILYKYGPLSVLLNSD---LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DNIPY 188
              + + +  P++V +++     +H Y+G     N   C    + HAV LVGYG   + + Y
Sbjct:   251 LEAVSRQ-PVAVAIDASEAGFVH-YSGGVY--NARNCGT-SVNHAVTLVGYGTSPEGMKY 305

Query:   189 WLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             WL +NSWG    + G+ +I R        CG+ Q A Y
Sbjct:   306 WLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASY 343

 Score = 121 (47.7 bits), Expect = 0.00050, P = 0.00050
 Identities = 26/68 (38%), Positives = 34/68 (50%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERG----NNAC 1013
             V N   C    + HAV LVGYG   + + YWL +NSWG    + G+ +I R        C
Sbjct:   277 VYNARNCGT-SVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMC 335

Query:  1014 GIEQIAGY 1021
             G+ Q A Y
Sbjct:   336 GVAQYASY 343


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 299 (110.3 bits), Expect = 2.7e-25, P = 2.7e-25
 Identities = 77/228 (33%), Positives = 113/228 (49%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   +W+K+    P   Q  C SCWAFS+ G +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 LPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRP- 172

Query:   802 SGCDGCFFEPS---IEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 856
              G  GC+   +   + Y  +  GLESE  YPY+  +G    C Y  ++     TG +F+ 
Sbjct:   173 QGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGS---CRYSPENSTANITGFEFVP 229

Query:   857 FNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 912
              N    M  +    GP+SV +++     +    G     N   CS   + H++LLVGYG 
Sbjct:   230 KNEDALMNAVA-SIGPISVAIDARHASFLFYKRGIYYEPN---CSSCVVTHSMLLVGYGF 285

Query:   913 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 956
                + D   YWLV+NS G    ++G+ KI R   N CGI   A Y  +
Sbjct:   286 TGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALYPRV 333

 Score = 298 (110.0 bits), Expect = 3.5e-25, P = 3.5e-25
 Identities = 77/228 (33%), Positives = 113/228 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   +W+K+    P   Q  C SCWAFS+ G +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 LPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRP- 172

Query:    71 SGCDGCFFEPS---IEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLH 125
              G  GC+   +   + Y  +  GLESE  YPY+  +G    C Y  ++     TG +F+ 
Sbjct:   173 QGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGS---CRYSPENSTANITGFEFVP 229

Query:   126 FNGSETMKKILYKYGPLSVLLNS---DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 181
              N    M  +    GP+SV +++     +    G     N   CS   + H++LLVGYG 
Sbjct:   230 KNEDALMNAVA-SIGPISVAIDARHASFLFYKRGIYYEPN---CSSCVVTHSMLLVGYGF 285

Query:   182 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 225
                + D   YWLV+NS G    ++G+ KI R   N CGI   A Y  +
Sbjct:   286 TGRESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALYPRV 333

 Score = 296 (109.3 bits), Expect = 5.7e-25, P = 5.7e-25
 Identities = 76/226 (33%), Positives = 110/226 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   +W+K+    P   Q  C SCWAFS+ G +EGQ   KTG+L+  S   LV+C++  
Sbjct:   114 LPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRP- 172

Query:   436 SGCGGC--DGLEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLY 491
              G  GC        + Y  +  GLESE  YPY   +G    C Y  ++     TG +F+ 
Sbjct:   173 QGNWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGS---CRYSPENSTANITGFEFVP 229

Query:   492 FNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 547
              N    M  +    GP+SV +++ H    +    I   +  CS   + H++LLVGYG   
Sbjct:   230 KNEDALMNAVA-SIGPISVAIDARHASFLFYKRGIYY-EPNCSSCVVTHSMLLVGYGFTG 287

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERGN-NACGIEQIAGYATI 591
              + D   YWL +NS G    ++G+ KI R   N CGI   A Y  +
Sbjct:   288 RESDGRKYWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALYPRV 333


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 298 (110.0 bits), Expect = 3.5e-25, P = 3.5e-25
 Identities = 78/227 (34%), Positives = 110/227 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWR++       +Q  C SCWAFS+AG +EGQ   KTG+LV  S   LV+C++  
Sbjct:   114 LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRP- 172

Query:   436 SGCGGCD--GLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAY-DKSKVKLFTGKDFLY 491
              G  GC        ++Y     GLE+E  YPY    G++  C Y  +      TG   + 
Sbjct:   173 EGNHGCHMGSTLYALKYVWSNGGLEAESTYPYE---GKEGPCRYLPRRSAARVTGFSTVA 229

Query:   492 FNGSETMKKILYKYGPLSVGLN-SHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 547
                 E +   +   GP+SVG++ SH+   FY        +  CS   + H+VL+VGYG  
Sbjct:   230 -RSEEALMHAVATIGPISVGIDASHVSFRFYRRGIYY--EPRCSSNRINHSVLVVGYGYE 286

Query:   548 --KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
               + D   YWL +NS G      G+ K+ RG NN CGI     Y  +
Sbjct:   287 GRESDGRKYWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYGFYPRV 333

 Score = 287 (106.1 bits), Expect = 5.2e-24, P = 5.2e-24
 Identities = 75/226 (33%), Positives = 109/226 (48%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   DWR++       +Q  C SCWAFS+AG +EGQ   KTG+LV  S   LV+C++  
Sbjct:   114 LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRP- 172

Query:   802 SGCDGCFFEPSI---EYT-HQAGLESEKDYPYKNANGEKFKCAY-DKSKVKLFTGKDFLH 856
              G  GC    ++   +Y     GLE+E  YPY+   G    C Y  +      TG   + 
Sbjct:   173 EGNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKEGP---CRYLPRRSAARVTGFSTVA 229

Query:   857 FNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 912
                 E +   +   GP+SV ++ S +   +    I   +  CS   + H+VL+VGYG   
Sbjct:   230 -RSEEALMHAVATIGPISVGIDASHVSFRFYRRGIYY-EPRCSSNRINHSVLVVGYGYEG 287

Query:   913 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              + D   YWL++NS G      G+ K+ RG NN CGI     Y  +
Sbjct:   288 RESDGRKYWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYGFYPRV 333

 Score = 286 (105.7 bits), Expect = 6.7e-24, P = 6.7e-24
 Identities = 75/226 (33%), Positives = 109/226 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   DWR++       +Q  C SCWAFS+AG +EGQ   KTG+LV  S   LV+C++  
Sbjct:   114 LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRP- 172

Query:    71 SGCDGCFFEPSI---EYT-HQAGLESEKDYPYKNANGEKFKCAY-DKSKVKLFTGKDFLH 125
              G  GC    ++   +Y     GLE+E  YPY+   G    C Y  +      TG   + 
Sbjct:   173 EGNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKEGP---CRYLPRRSAARVTGFSTVA 229

Query:   126 FNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--- 181
                 E +   +   GP+SV ++ S +   +    I   +  CS   + H+VL+VGYG   
Sbjct:   230 -RSEEALMHAVATIGPISVGIDASHVSFRFYRRGIYY-EPRCSSNRINHSVLVVGYGYEG 287

Query:   182 -KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              + D   YWL++NS G      G+ K+ RG NN CGI     Y  +
Sbjct:   288 RESDGRKYWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYGFYPRV 333


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 317 (116.6 bits), Expect = 3.9e-25, P = 3.9e-25
 Identities = 98/324 (30%), Positives = 145/324 (44%)

Query:   285 NENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG---HKKHER---Y--GTSEFSDRS 336
             +E++ + F  F  K G  Y +D E + R   F+Q+    H K+     Y    +  +D++
Sbjct:   238 DEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKT 297

Query:   337 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA 396
              EE+  + G+K S   Y                      +PD +DWR      P  DQ+ 
Sbjct:   298 EEELKARRGYK-SSGIYN------TGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSV 350

Query:   397 CGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYT---H 452
             CGSCW+F   G LEG + +K G  LV  S+  L++C+    G  GCDG E    Y     
Sbjct:   351 CGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCS-WAYGNNGCDGGEDFRVYQWMLQ 409

Query:   453 QAGLESEKDY-PYRNGNGEKFKCAYDK-SKVKLFTGKDFLYFNGSETMKKILYKYGPLSV 510
               G+ +E++Y PY   +G    C  +  + V    G   +  N     K  L K+GPLSV
Sbjct:   410 SGGVPTEEEYGPYLGQDGY---CHVNNVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSV 466

Query:   511 GLNSH--LIHFYN-GT---PIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPI 564
              +++      FY+ G    P  KND       L HAVL VGYG  +   YWL +NSW   
Sbjct:   467 AIDASPKTFSFYSHGVYYEPTCKNDVD----GLDHAVLAVGYGSINGEDYWLVKNSWSTY 522

Query:   565 GPDEGFFKIERGNNACGIEQIAGY 588
               ++G+  +    N CG+  +  Y
Sbjct:   523 WGNDGYILMSAKKNNCGVMTMPTY 546

 Score = 308 (113.5 bits), Expect = 3.9e-24, P = 3.9e-24
 Identities = 98/325 (30%), Positives = 147/325 (45%)

Query:   651 NENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG---HKKHER---Y--GTSEFSDRS 702
             +E++ + F  F  K G  Y +D E + R   F+Q+    H K+     Y    +  +D++
Sbjct:   238 DEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKT 297

Query:   703 PEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAA 762
              EE+  + G+K S   Y                      +PD +DWR      P  DQ+ 
Sbjct:   298 EEELKARRGYK-SSGIYN------TGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSV 350

Query:   763 CGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVEC--AKQCSGCDGCFFEPSIEYTHQA 819
             CGSCW+F   G LEG + +K G  LV  S+  L++C  A   +GCDG       ++  Q+
Sbjct:   351 CGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQS 410

Query:   820 G-LESEKDY-PYKNANGEKFKCAYDKSKVKLFTG-KDFLHF--NGSETMKKILYKYGPLS 874
             G + +E++Y PY   +G    C  +   V L    K F++   N     K  L K+GPLS
Sbjct:   411 GGVPTEEEYGPYLGQDGY---CHVNN--VTLVAPIKGFVNVTSNDPNAFKLALLKHGPLS 465

Query:   875 VLLNSD------LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGP 928
             V +++         H     P  KND       L HAVL VGYG  +   YWLV+NSW  
Sbjct:   466 VAIDASPKTFSFYSHGVYYEPTCKNDVD----GLDHAVLAVGYGSINGEDYWLVKNSWST 521

Query:   929 IGPDEGFFKIERGNNACGIEQIAGY 953
                ++G+  +    N CG+  +  Y
Sbjct:   522 YWGNDGYILMSAKKNNCGVMTMPTY 546

 Score = 283 (104.7 bits), Expect = 2.3e-21, P = 2.3e-21
 Identities = 79/230 (34%), Positives = 112/230 (48%)

Query:     7 KDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVE 65
             KD  +PD +DWR      P  DQ+ CGSCW+F   G LEG + +K G  LV  S+  L++
Sbjct:   327 KD-EIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALID 385

Query:    66 C--AKQCSGCDGCFFEPSIEYTHQAG-LESEKDY-PYKNANGEKFKCAYDKSKVKLFTG- 120
             C  A   +GCDG       ++  Q+G + +E++Y PY   +G    C  +   V L    
Sbjct:   386 CSWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQDGY---CHVNN--VTLVAPI 440

Query:   121 KDFLHF--NGSETMKKILYKYGPLSVLLNSD------LIHDYNGTPIRKNDETCSPYDLG 172
             K F++   N     K  L K+GPLSV +++         H     P  KND       L 
Sbjct:   441 KGFVNVTSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYEPTCKNDVD----GLD 496

Query:   173 HAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             HAVL VGYG  +   YWLV+NSW     ++G+  +    N CG+  +  Y
Sbjct:   497 HAVLAVGYGSINGEDYWLVKNSWSTYWGNDGYILMSAKKNNCGVMTMPTY 546


>UNIPROTKB|O97578 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 308 (113.5 bits), Expect = 5.1e-25, P = 5.1e-25
 Identities = 76/228 (33%), Positives = 112/228 (49%)

Query:   742 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +QA+CGSC+AF+   MLE +  I T        S  ++V 
Sbjct:   204 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVS 263

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKD 853
             C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S    + G  
Sbjct:   264 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVG-G 322

Query:   854 FLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GHAVLLV 909
             F        MK  L ++GP++V      D  H   G        +  +P++L  HAVLLV
Sbjct:   323 FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLV 382

Query:   910 GYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             GYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   383 GYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 430

 Score = 308 (113.5 bits), Expect = 5.1e-25, P = 5.1e-25
 Identities = 77/233 (33%), Positives = 113/233 (48%)

Query:     6 EKDGPVPDAWDWRK---KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSK 60
             E+   +P +WDWR     N   P  +QA CGSC+AF+   MLE +  I T        S 
Sbjct:   199 EEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSP 258

Query:    61 SQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKL 117
              ++V C++   GC+G F +  + +Y    GL  E  +PY  ++   +   C    S    
Sbjct:   259 QEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYY 318

Query:   118 FTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GH 173
             + G  F        MK  L ++GP++V      D  H   G        +  +P++L  H
Sbjct:   319 YVG-GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNH 377

Query:   174 AVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             AVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   378 AVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 430

 Score = 294 (108.6 bits), Expect = 3.4e-23, P = 3.4e-23
 Identities = 75/231 (32%), Positives = 112/231 (48%)

Query:   376 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +QA+CGSC+AF+   MLE +  I T        S  ++V 
Sbjct:   204 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVS 263

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTG 486
             C++   GC G  G    I  +Y    GL  E  +PY   +   +   C    S    + G
Sbjct:   264 CSQYAQGCEG--GFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVG 321

Query:   487 KDFLYFNGSETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-GHAV 541
                 Y   +E + K+ L ++GP++V    +    H+  G        +  +P++L  HAV
Sbjct:   322 G--FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAV 379

Query:   542 LLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             LLVGYG      + YW+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   380 LLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAAT 430

 Score = 153 (58.9 bits), Expect = 2.4e-07, P = 2.4e-07
 Identities = 30/61 (49%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   370 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAVAA 429

Query:  1023 T 1023
             T
Sbjct:   430 T 430


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 296 (109.3 bits), Expect = 5.7e-25, P = 5.7e-25
 Identities = 75/225 (33%), Positives = 108/225 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P   DWR      P   Q  CG+CWAFS+A  +E Q   KTGKL+  S   L++C    
Sbjct:   112 IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTY 171

Query:   436 SGCGGCDGLEQ--PIEYT-HQAGLESEKDYPYRNGNGEKFKCAY--DKSKVKLFTGKDFL 490
              G   C G +     +Y  +  GLE+E  YPY     +   C Y  ++S VK+   + F+
Sbjct:   172 -GNNDCSGGKPYTAFQYVKNNGGLEAEATYPYE---AKLRHCRYRPERSVVKI--ARFFV 225

Query:   491 YFNGSETMKKILYKYGPLSVGLNSHLIHF--YNGTPIRKNDETCSPYDLGHAVLLVGYG- 547
                  E + + L  YGP++V ++     F  Y G     ++  C    L H +LLVGYG 
Sbjct:   226 VPRNEEALMQALVTYGPIAVAIDGSHASFKRYRGGIY--HEPKCRRDTLDHGLLLVGYGY 283

Query:   548 ---KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
                + ++  YWL +NS G    + G+ K+ R  NN CGI   A Y
Sbjct:   284 EGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYAMY 328

 Score = 296 (109.3 bits), Expect = 5.7e-25, P = 5.7e-25
 Identities = 75/225 (33%), Positives = 109/225 (48%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P   DWR      P   Q  CG+CWAFS+A  +E Q   KTGKL+  S   L++C    
Sbjct:   112 IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTY 171

Query:   802 SG--CDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFK-CAY--DKSKVKLFTGKDFL 855
                 C G     + +Y  +  GLE+E  YPY+     K + C Y  ++S VK+   + F+
Sbjct:   172 GNNDCSGGKPYTAFQYVKNNGGLEAEATYPYE----AKLRHCRYRPERSVVKI--ARFFV 225

Query:   856 HFNGSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 912
                  E + + L  YGP++V ++        Y G     ++  C    L H +LLVGYG 
Sbjct:   226 VPRNEEALMQALVTYGPIAVAIDGSHASFKRYRGGIY--HEPKCRRDTLDHGLLLVGYGY 283

Query:   913 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
                + +N  YWL++NS G    + G+ K+ R  NN CGI   A Y
Sbjct:   284 EGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYAMY 328

 Score = 295 (108.9 bits), Expect = 7.3e-25, P = 7.3e-25
 Identities = 75/225 (33%), Positives = 109/225 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P   DWR      P   Q  CG+CWAFS+A  +E Q   KTGKL+  S   L++C    
Sbjct:   112 IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTY 171

Query:    71 SG--CDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFK-CAY--DKSKVKLFTGKDFL 124
                 C G     + +Y  +  GLE+E  YPY+     K + C Y  ++S VK+   + F+
Sbjct:   172 GNNDCSGGKPYTAFQYVKNNGGLEAEATYPYE----AKLRHCRYRPERSVVKI--ARFFV 225

Query:   125 HFNGSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG- 181
                  E + + L  YGP++V ++        Y G     ++  C    L H +LLVGYG 
Sbjct:   226 VPRNEEALMQALVTYGPIAVAIDGSHASFKRYRGGIY--HEPKCRRDTLDHGLLLVGYGY 283

Query:   182 ---KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
                + +N  YWL++NS G    + G+ K+ R  NN CGI   A Y
Sbjct:   284 EGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYAMY 328


>UNIPROTKB|Q4QRC2 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 295 (108.9 bits), Expect = 7.3e-25, P = 7.3e-25
 Identities = 78/228 (34%), Positives = 111/228 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKP- 183

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
              G  GC G       +Y  Q  GLESE  YPY+   G++  C Y+         +     
Sbjct:   184 QGNKGCRGGTTYNAFQYVLQNGGLESEATYPYK---GKEGLCKYNPKNAYAKITRFVALP 240

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRK----NDETCSPYDLGHAVLLVGYG- 547
                + +   L   GP++ G+  H++  Y+     K    ++  C+   + HAVL+VGYG 
Sbjct:   241 EDEDVLMDALATKGPVAAGI--HVV--YSSLRFYKKGIYHEPKCNNR-VNHAVLVVGYGF 295

Query:   548 ---KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                + D   YWL +NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   296 EGNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 343

 Score = 286 (105.7 bits), Expect = 6.7e-24, P = 6.7e-24
 Identities = 81/227 (35%), Positives = 111/227 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQ 184

Query:    71 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 126
                GC G     + +Y  Q  GLESE  YPYK   G    C Y+ K+     T    L  
Sbjct:   185 GNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEG---LCKYNPKNAYAKITRFVALPE 241

Query:   127 NGSETMKKILYKYGPLSV---LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 181
             +    M  +  K GP++    ++ S L     G     ++  C+   + HAVL+VGYG  
Sbjct:   242 DEDVLMDALATK-GPVAAGIHVVYSSLRFYKKGI---YHEPKCNNR-VNHAVLVVGYGFE 296

Query:   182 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               + D   YWL++NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   297 GNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 343

 Score = 286 (105.7 bits), Expect = 6.7e-24, P = 6.7e-24
 Identities = 81/227 (35%), Positives = 111/227 (48%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P + DWRK+       +Q  C SCWAF +AG +EGQ   KTGKL   S   LV+C+K  
Sbjct:   125 LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQ 184

Query:   802 S--GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHF 857
                GC G     + +Y  Q  GLESE  YPYK   G    C Y+ K+     T    L  
Sbjct:   185 GNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEG---LCKYNPKNAYAKITRFVALPE 241

Query:   858 NGSETMKKILYKYGPLSV---LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-- 912
             +    M  +  K GP++    ++ S L     G     ++  C+   + HAVL+VGYG  
Sbjct:   242 DEDVLMDALATK-GPVAAGIHVVYSSLRFYKKGI---YHEPKCNNR-VNHAVLVVGYGFE 296

Query:   913 --KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               + D   YWL++NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   297 GNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 343

 Score = 121 (47.7 bits), Expect = 0.00050, P = 0.00050
 Identities = 33/94 (35%), Positives = 50/94 (53%)

Query:   940 RGNNACGIEQIAGYATIDVVK----NDETCSPYDLGHAVLLVGYG----KQDDIPYWLVR 991
             +G  A GI  +  Y+++   K    ++  C+   + HAVL+VGYG    + D   YWL++
Sbjct:   253 KGPVAAGIHVV--YSSLRFYKKGIYHEPKCNNR-VNHAVLVVGYGFEGNETDGNNYWLIK 309

Query:   992 NSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 1024
             NSWG     +G+ KI +  NN CGI   A Y  +
Sbjct:   310 NSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPIV 343


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 295 (108.9 bits), Expect = 7.3e-25, P = 7.3e-25
 Identities = 71/208 (34%), Positives = 112/208 (53%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQCSGCD 74
             DWR K + GP  DQ  C +  AF+I+  +E  YA  T G L+ FS+ QL++C     G  
Sbjct:    87 DWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDH--GFK 144

Query:    75 GCFFEPSIE---YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET 131
             GC  +P+I    Y    G+E+E DYPY  A  E  KC +D +K K+   KD      +ET
Sbjct:   145 GCEEQPAINAVSYFIFHGIETEADYPY--AGKENGKCTFDSTKSKIQL-KDAEFVVSNET 201

Query:   132 M-KKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC-SPYDLGHAVLLVGYGKQDNIPY 188
               K+++  YGP    + +   ++DY       + E C S +++  ++++VGYG +    Y
Sbjct:   202 QGKELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVGYGIEGVQKY 260

Query:   189 WLVRNSWGPIGPDEGFFKIERGNNACGI 216
             W+V+ S+G    ++G+ K+ R  NAC +
Sbjct:   261 WIVKGSFGTSWGEQGYMKLARDVNACAM 288

 Score = 295 (108.9 bits), Expect = 7.3e-25, P = 7.3e-25
 Identities = 71/208 (34%), Positives = 112/208 (53%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQCSGCD 805
             DWR K + GP  DQ  C +  AF+I+  +E  YA  T G L+ FS+ QL++C     G  
Sbjct:    87 DWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDH--GFK 144

Query:   806 GCFFEPSIE---YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET 862
             GC  +P+I    Y    G+E+E DYPY  A  E  KC +D +K K+   KD      +ET
Sbjct:   145 GCEEQPAINAVSYFIFHGIETEADYPY--AGKENGKCTFDSTKSKIQL-KDAEFVVSNET 201

Query:   863 M-KKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC-SPYDLGHAVLLVGYGKQDNIPY 919
               K+++  YGP    + +   ++DY       + E C S +++  ++++VGYG +    Y
Sbjct:   202 QGKELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVGYGIEGVQKY 260

Query:   920 WLVRNSWGPIGPDEGFFKIERGNNACGI 947
             W+V+ S+G    ++G+ K+ R  NAC +
Sbjct:   261 WIVKGSFGTSWGEQGYMKLARDVNACAM 288

 Score = 281 (104.0 bits), Expect = 2.3e-23, P = 2.3e-23
 Identities = 70/210 (33%), Positives = 111/210 (52%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQCSGCG 439
             DWR K + GP  DQ  C +  AF+I+  +E  YA  T G L+ FS+ QL++C     G  
Sbjct:    87 DWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDH--GFK 144

Query:   440 GCDGLEQP----IEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGS 495
             GC+  EQP    + Y    G+E+E DYPY  G  E  KC +D +K K+   KD  +   +
Sbjct:   145 GCE--EQPAINAVSYFIFHGIETEADYPYA-GK-ENGKCTFDSTKSKIQL-KDAEFVVSN 199

Query:   496 ETM-KKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
             ET  K+++  YGP    + +   L  +  G      +E  S +++  ++++VGYG +   
Sbjct:   200 ETQGKELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI-RSMVIVGYGIEGVQ 258

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNACGI 582
              YW+ + S+G    ++G+ K+ R  NAC +
Sbjct:   259 KYWIVKGSFGTSWGEQGYMKLARDVNACAM 288


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 293 (108.2 bits), Expect = 1.2e-24, P = 1.2e-24
 Identities = 94/338 (27%), Positives = 150/338 (44%)

Query:   637 RVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD---------GH 687
             R       GSL F+  + +E  + ++ +  R Y+++ E + RF  FK++          +
Sbjct:    16 RTSLATSRGSL-FE-ASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNN 73

Query:   688 KKHERYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAW 746
             K   +   +EFSD + EE     TG    E    RI                     ++ 
Sbjct:    74 KITYKVDINEFSDLTDEEFRATHTGLVVPE-AITRISTLSSGKNTVPFRYGNVSDNGESM 132

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCD 805
             DWR++    P   Q  CG CWAFS    +EG   I  G+LV  S+ QL++C +  + GC 
Sbjct:   133 DWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCR 192

Query:   806 GCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT--GKDFLHFNGSET 862
             G     + EY     G+ +E +YPY+ +           S  +  T  G + +  N  E 
Sbjct:   193 GGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEA 252

Query:   863 MKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPY 919
             + + + +  P+SV +         Y+G     N E C   DL HAV +VGYG  ++   Y
Sbjct:   253 LLQAVSQQ-PVSVGIEGTGAAFRHYSGGVF--NGE-CGT-DLHHAVTIVGYGMSEEGTKY 307

Query:   920 WLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 953
             W+V+NSWG    + G+ +I+R  +A    CG+  +A Y
Sbjct:   308 WVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFY 345

 Score = 288 (106.4 bits), Expect = 4.1e-24, P = 4.1e-24
 Identities = 96/340 (28%), Positives = 152/340 (44%)

Query:   271 RVDTLAIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD---------GH 321
             R       GSL F+  + +E  + ++ +  R Y+++ E + RF  FK++          +
Sbjct:    16 RTSLATSRGSL-FE-ASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNN 73

Query:   322 KKHERYGTSEFSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAW 380
             K   +   +EFSD + EE     TG    E    RI                     ++ 
Sbjct:    74 KITYKVDINEFSDLTDEEFRATHTGLVVPE-AITRISTLSSGKNTVPFRYGNVSDNGESM 132

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GC- 438
             DWR++    P   Q  CG CWAFS    +EG   I  G+LV  S+ QL++C +  + GC 
Sbjct:   133 DWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCR 192

Query:   439 GGCDGLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFT--GKDFLYFNGS 495
             GG   + +  EY     G+ +E +YPY+             S  +  T  G + +  N  
Sbjct:   193 GGI--MSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNE 250

Query:   496 ETMKKILYKYGPLSVGLNSHLIHF--YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-I 552
             E + + + +  P+SVG+      F  Y+G     N E C   DL HAV +VGYG  ++  
Sbjct:   251 EALLQAVSQQ-PVSVGIEGTGAAFRHYSGGVF--NGE-CGT-DLHHAVTIVGYGMSEEGT 305

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 588
              YW+ +NSWG    + G+ +I+R  +A    CG+  +A Y
Sbjct:   306 KYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFY 345

 Score = 270 (100.1 bits), Expect = 3.4e-22, P = 3.4e-22
 Identities = 69/221 (31%), Positives = 108/221 (48%)

Query:    13 DAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS- 71
             ++ DWR++    P   Q  CG CWAFS    +EG   I  G+LV  S+ QL++C +  + 
Sbjct:   130 ESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQ 189

Query:    72 GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT--GKDFLHFNG 128
             GC G     + EY     G+ +E +YPY+ +           S  +  T  G + +  N 
Sbjct:   190 GCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNN 249

Query:   129 SETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDN 185
              E + + + +  P+SV +         Y+G     N E C   DL HAV +VGYG  ++ 
Sbjct:   250 EEALLQAVSQQ-PVSVGIEGTGAAFRHYSGGVF--NGE-CGT-DLHHAVTIVGYGMSEEG 304

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNA----CGIEQIAGY 222
               YW+V+NSWG    + G+ +I+R  +A    CG+  +A Y
Sbjct:   305 TKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFY 345

 Score = 119 (46.9 bits), Expect = 0.00085, P = 0.00085
 Identities = 28/68 (41%), Positives = 40/68 (58%)

Query:   959 VKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNA----C 1013
             V N E C   DL HAV +VGYG  ++   YW+V+NSWG    + G+ +I+R  +A    C
Sbjct:   280 VFNGE-CGT-DLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMC 337

Query:  1014 GIEQIAGY 1021
             G+  +A Y
Sbjct:   338 GLAILAFY 345


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 275 (101.9 bits), Expect = 1.9e-24, Sum P(2) = 1.9e-24
 Identities = 83/292 (28%), Positives = 127/292 (43%)

Query:   266 DQVVARVDTLAIEGSLTFDNE-NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKH 324
             ++  + V +L  E  L+ D    +   FK F++   R Y + +E + R   F  +  +  
Sbjct:     9 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYES-KEARWRLSVFVNNMVRAQ 67

Query:   325 E---------RYGTSEFSDRSPEEILCKTGFKWSERT-YERIVADRXXXXXXXXXXXXDG 374
             +         +YG ++FSD + EE           RT Y   +  +              
Sbjct:    68 KIQALDRGTAQYGVTKFSDLTEEEF----------RTIYLNTLLRKEPGNKMKQAKSVGD 117

Query:   375 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 434
               P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K 
Sbjct:   118 LAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 177

Query:   435 CSGCGGCDGLEQPIEYT---HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY 491
                C G  GL     Y+   +  GLE+E DY Y+   G    C +   K K++       
Sbjct:   178 DKACMG--GLPSNA-YSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVEL 231

Query:   492 FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLL 543
                 + +   L K GP+SV +N+  + FY     R     CSP+ + HAVLL
Sbjct:   232 SQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLL 283

 Score = 268 (99.4 bits), Expect = 1.1e-23, Sum P(2) = 1.1e-23
 Identities = 82/291 (28%), Positives = 126/291 (43%)

Query:   632 DQVVARVDTLAIEGSLTFDNE-NILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKH 690
             ++  + V +L  E  L+ D    +   FK F++   R Y + +E + R   F  +  +  
Sbjct:     9 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYES-KEARWRLSVFVNNMVRAQ 67

Query:   691 E---------RYGTSEFSDRSPEEILCKTGFKWSERT-YERIVADRXXXXXXXXXXXXDG 740
             +         +YG ++FSD + EE           RT Y   +  +              
Sbjct:    68 KIQALDRGTAQYGVTKFSDLTEEEF----------RTIYLNTLLRKEPGNKMKQAKSVGD 117

Query:   741 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
               P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K 
Sbjct:   118 LAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 177

Query:   801 CSGCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF 857
                C G    PS  Y+   +  GLE+E DY Y+   G    C +   K K++        
Sbjct:   178 DKACMGGL--PSNAYSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVELS 232

Query:   858 NGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 908
                + +   L K GP+SV +N+  +  Y     R     CSP+ + HAVLL
Sbjct:   233 QNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLL 283

 Score = 260 (96.6 bits), Expect = 7.5e-23, Sum P(2) = 7.5e-23
 Identities = 59/169 (34%), Positives = 83/169 (49%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+ +L++C K   
Sbjct:   120 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 179

Query:    72 GCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
              C G    PS  Y+   +  GLE+E DY Y+   G    C +   K K++          
Sbjct:   180 ACMGGL--PSNAYSAIKNLGGLETEDDYSYQ---GHMQSCNFSAEKAKVYINDSVELSQN 234

Query:   129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 177
              + +   L K GP+SV +N+  +  Y     R     CSP+ + HAVLL
Sbjct:   235 EQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLL 283

 Score = 43 (20.2 bits), Expect = 1.9e-24, Sum P(2) = 1.9e-24
 Identities = 8/12 (66%), Positives = 10/12 (83%)

Query:   965 CSPYDLGHAVLL 976
             CSP+ + HAVLL
Sbjct:   272 CSPWLIDHAVLL 283


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 291 (107.5 bits), Expect = 2.0e-24, P = 2.0e-24
 Identities = 77/224 (34%), Positives = 110/224 (49%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P + DWR+K    P  +Q  CGSCWAFS  G  EGQ   KTG LV  S+  L   A+  
Sbjct:   109 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGAFEGQMFWKTGNLVPLSEQNL---AQGN 165

Query:    71 SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
              GC+G   + + +Y      L+SE+ YPY   + +   C Y K +        F+     
Sbjct:   166 EGCNGGLMDNAFQYVKDNRCLDSEESYPYLGRDTDT--CNY-KPECSAAHDSGFVDLPQR 222

Query:   130 E-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN----DETCSPYDLGHAVLLVGYGKQ- 183
             E  + K +   G ++V +++   H Y      K+    D  CS  DL H VL+VGYG + 
Sbjct:   223 EKALMKAMATLGSITVAIDAG--HQY--FQFYKSSIYFDPDCSSKDLDHGVLVVGYGFEG 278

Query:   184 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
              D+   W+V+NSW P      + K+ +G NN CGI   A Y T+
Sbjct:   279 TDSNNKWIVKNSWSPEWGWNSYVKMAKGQNNHCGITA-ASYPTV 321

 Score = 290 (107.1 bits), Expect = 2.5e-24, P = 2.5e-24
 Identities = 77/224 (34%), Positives = 110/224 (49%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P + DWR+K    P  +Q  CGSCWAFS  G  EGQ   KTG LV  S+  L   A+  
Sbjct:   109 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGAFEGQMFWKTGNLVPLSEQNL---AQGN 165

Query:   802 SGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 860
              GC+G   + + +Y      L+SE+ YPY   + +   C Y K +        F+     
Sbjct:   166 EGCNGGLMDNAFQYVKDNRCLDSEESYPYLGRDTDT--CNY-KPECSAAHDSGFVDLPQR 222

Query:   861 E-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKN----DETCSPYDLGHAVLLVGYGKQ- 914
             E  + K +   G ++V +++   H Y      K+    D  CS  DL H VL+VGYG + 
Sbjct:   223 EKALMKAMATLGSITVAIDAG--HQY--FQFYKSSIYFDPDCSSKDLDHGVLVVGYGFEG 278

Query:   915 -DNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
              D+   W+V+NSW P      + K+ +G NN CGI   A Y T+
Sbjct:   279 TDSNNKWIVKNSWSPEWGWNSYVKMAKGQNNHCGITA-ASYPTV 321

 Score = 284 (105.0 bits), Expect = 1.1e-23, P = 1.1e-23
 Identities = 76/222 (34%), Positives = 108/222 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P + DWR+K    P  +Q  CGSCWAFS  G  EGQ   KTG LV  S+  L +  + C
Sbjct:   109 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGAFEGQMFWKTGNLVPLSEQNLAQGNEGC 168

Query:   436 SGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGS 495
             +G G  D   Q ++      L+SE+ YPY   + +   C Y K +        F+     
Sbjct:   169 NG-GLMDNAFQYVK--DNRCLDSEESYPYLGRDTDT--CNY-KPECSAAHDSGFVDLPQR 222

Query:   496 E-TMKKILYKYGPLSVGLNS-H-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--D 550
             E  + K +   G ++V +++ H    FY  +     D  CS  DL H VL+VGYG +  D
Sbjct:   223 EKALMKAMATLGSITVAIDAGHQYFQFYKSSIYF--DPDCSSKDLDHGVLVVGYGFEGTD 280

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
                 W+ +NSW P      + K+ +G NN CGI   A Y T+
Sbjct:   281 SNNKWIVKNSWSPEWGWNSYVKMAKGQNNHCGITA-ASYPTV 321

 Score = 128 (50.1 bits), Expect = 7.4e-05, P = 7.4e-05
 Identities = 29/66 (43%), Positives = 37/66 (56%)

Query:   962 DETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQI 1018
             D  CS  DL H VL+VGYG +  D    W+V+NSW P      + K+ +G NN CGI   
Sbjct:   257 DPDCSSKDLDHGVLVVGYGFEGTDSNNKWIVKNSWSPEWGWNSYVKMAKGQNNHCGITA- 315

Query:  1019 AGYATI 1024
             A Y T+
Sbjct:   316 ASYPTV 321


>DICTYBASE|DDB_G0281605 [details] [associations]
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 310 (114.2 bits), Expect = 2.0e-24, P = 2.0e-24
 Identities = 95/330 (28%), Positives = 148/330 (44%)

Query:   642 AIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG-----HKKHE---RY 693
             +I  +L    E     FK +  +  ++Y++ +E  ERF  FK        H   E   + 
Sbjct:   209 SIGDNLLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKL 268

Query:   694 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 753
             G + ++D S +E    T  K       ++                   +P   DWR +N 
Sbjct:   269 GMNHYADLSNKEF--NTLVK------PKVARPSVTGADSVHDDESLRSIPSTVDWRNQNC 320

Query:   754 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQCSGCDGCFFEP 811
               P  DQ  CGSCW F   G LEG   +  G+LV  S+ QLV+CA      GC G F   
Sbjct:   321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASS 380

Query:   812 SIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF-NGSET-MKKILY 868
             + +Y  + G L +E +YPY   NG         S V + TG  +++  +GSE+ ++  + 
Sbjct:   381 AFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSI-TG--YVNVTSGSESALQNAIA 437

Query:   869 KYGPLSVLLNSDLIHD--YNGTPIRKNDETCSPY--DLGHAVLLVGYGKQDNIPYWLVRN 924
               GP+++ +++  + D  Y  + +  N+  C     DL H VL +GYG      Y+LV+N
Sbjct:   438 TTGPVAIAIDAS-VDDFRYYMSGVY-NNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKN 495

Query:   925 SWGPIGPDEGFFKIERG-NNACGIEQIAGY 953
             SW      +G+  + R  NN CG+   A Y
Sbjct:   496 SWSTNWGMDGYVYMARNDNNLCGVSSQATY 525

 Score = 301 (111.0 bits), Expect = 2.0e-23, P = 2.0e-23
 Identities = 96/332 (28%), Positives = 145/332 (43%)

Query:   276 AIEGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDG-----HKKHE---RY 327
             +I  +L    E     FK +  +  ++Y++ +E  ERF  FK        H   E   + 
Sbjct:   209 SIGDNLLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKL 268

Query:   328 GTSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV 387
             G + ++D S +E    T  K       ++                   +P   DWR +N 
Sbjct:   269 GMNHYADLSNKEF--NTLVK------PKVARPSVTGADSVHDDESLRSIPSTVDWRNQNC 320

Query:   388 TGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LE 445
               P  DQ  CGSCW F   G LEG   +  G+LV  S+ QLV+CA   +G  GC G    
Sbjct:   321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAI-LTGSQGCGGGFAS 379

Query:   446 QPIEYTHQAG-LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN---GSET-MKK 500
                +Y  + G L +E +YPY   NG         S V + TG    Y N   GSE+ ++ 
Sbjct:   380 SAFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSI-TG----YVNVTSGSESALQN 434

Query:   501 ILYKYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPY--DLGHAVLLVGYGKQDDIPYWLA 557
              +   GP+++ +++ +  F Y  + +  N+  C     DL H VL +GYG      Y+L 
Sbjct:   435 AIATTGPVAIAIDASVDDFRYYMSGVY-NNPACKNGLDDLDHEVLAIGYGTYQGQDYFLV 493

Query:   558 RNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 588
             +NSW      +G+  + R  NN CG+   A Y
Sbjct:   494 KNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525

 Score = 297 (109.6 bits), Expect = 5.7e-23, P = 5.7e-23
 Identities = 75/222 (33%), Positives = 112/222 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--K 68
             +P   DWR +N   P  DQ  CGSCW F   G LEG   +  G+LV  S+ QLV+CA   
Sbjct:   309 IPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILT 368

Query:    69 QCSGCDGCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF- 126
                GC G F   + +Y  + G L +E +YPY   NG         S V + TG  +++  
Sbjct:   369 GSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSI-TG--YVNVT 425

Query:   127 NGSET-MKKILYKYGPLSVLLNSDLIHD--YNGTPIRKNDETCSPY--DLGHAVLLVGYG 181
             +GSE+ ++  +   GP+++ +++  + D  Y  + +  N+  C     DL H VL +GYG
Sbjct:   426 SGSESALQNAIATTGPVAIAIDAS-VDDFRYYMSGVY-NNPACKNGLDDLDHEVLAIGYG 483

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGY 222
                   Y+LV+NSW      +G+  + R  NN CG+   A Y
Sbjct:   484 TYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525


>UNIPROTKB|F1STR1 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 304 (112.1 bits), Expect = 4.0e-24, P = 4.0e-24
 Identities = 79/241 (32%), Positives = 122/241 (50%)

Query:     6 EKDGPVPDAWDWRK---KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSK 60
             EK   +P +WDWR     N   P  +QA CGSC++F+  GM+E +  I T        S 
Sbjct:   226 EKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query:    61 SQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFT 119
              ++V C++   GC G F +  + +Y    GL  E  +PY    G    C   +   + ++
Sbjct:   286 QEVVSCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSPCTVKEGCFRYYS 342

Query:   120 GKDFLHFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETC 166
              +   H+ G      +E + K+ L  +GP++V      D +H     Y+ T +R   +  
Sbjct:   343 SE--YHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLR---DPF 397

Query:   167 SPYDL-GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 223
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAA 457

Query:   224 T 224
             T
Sbjct:   458 T 458

 Score = 301 (111.0 bits), Expect = 9.0e-24, P = 9.0e-24
 Identities = 77/236 (32%), Positives = 121/236 (51%)

Query:   742 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +QA+CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             C++   GC G F +  + +Y    GL  E  +PY    G    C   +   + ++ +   
Sbjct:   291 CSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPY---TGTDSPCTVKEGCFRYYSSE--Y 345

Query:   856 HFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL 902
             H+ G      +E + K+ L  +GP++V      D +H     Y+ T +R   +  +P++L
Sbjct:   346 HYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLR---DPFNPFEL 402

Query:   903 -GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
               HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAAT 458

 Score = 298 (110.0 bits), Expect = 2.1e-23, P = 2.1e-23
 Identities = 76/237 (32%), Positives = 120/237 (50%)

Query:   376 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +QA+CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C++   GC G  G    I  +Y    GL  E  +PY    G    C   +   + ++  +
Sbjct:   291 CSQYAQGCAG--GFPYLIAGKYAQDFGLVEEACFPY---TGTDSPCTVKEGCFRYYSS-E 344

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYD 536
             + Y  G     +E + K+ L  +GP++V    +   +H+    Y+ T +R   +  +P++
Sbjct:   345 YHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLR---DPFNPFE 401

Query:   537 L-GHAVLLVGYGKQ--DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L  HAVLLVGYG      + YW+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   402 LTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAAT 458

 Score = 154 (59.3 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 30/61 (49%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAA 457

Query:  1023 T 1023
             T
Sbjct:   458 T 458


>UNIPROTKB|P53634 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 304 (112.1 bits), Expect = 4.0e-24, P = 4.0e-24
 Identities = 81/233 (34%), Positives = 117/233 (50%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +QA+CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGE-KFK--C-AYDKSKVKLFTG 851
             C++   GC+G F +  + +Y    GL  E  +PY   +   K K  C  Y  S+     G
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGG 350

Query:   852 KDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL-GH 904
               F        MK  L  +GP++V      D +H     Y+ T +R   +  +P++L  H
Sbjct:   351 --FYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLR---DPFNPFELTNH 405

Query:   905 AVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             AVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   406 AVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAAT 458

 Score = 303 (111.7 bits), Expect = 5.3e-24, P = 5.3e-24
 Identities = 81/233 (34%), Positives = 116/233 (49%)

Query:    11 VPDAWDWRKK---NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P +WDWR     N   P  +QA CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:    66 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGE-KFK--C-AYDKSKVKLFTG 120
             C++   GC+G F +  + +Y    GL  E  +PY   +   K K  C  Y  S+     G
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGG 350

Query:   121 KDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL-GH 173
               F        MK  L  +GP++V      D +H     Y+ T +R   +  +P++L  H
Sbjct:   351 --FYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLR---DPFNPFELTNH 405

Query:   174 AVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             AVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   406 AVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAAT 458

 Score = 294 (108.6 bits), Expect = 6.0e-23, P = 6.0e-23
 Identities = 77/237 (32%), Positives = 119/237 (50%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +QA+CGSC++F+  GMLE +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C++   GC G  G    I  +Y    GL  E  +PY    G    C   +   + ++  +
Sbjct:   291 CSQYAQGCEG--GFPYLIAGKYAQDFGLVEEACFPY---TGTDSPCKMKEDCFRYYSS-E 344

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYD 536
             + Y  G     +E + K+ L  +GP++V    +   +H+    Y+ T +R   +  +P++
Sbjct:   345 YHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLR---DPFNPFE 401

Query:   537 L-GHAVLLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L  HAVLLVGYG      + YW+ +NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   402 LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAAT 458

 Score = 150 (57.9 bits), Expect = 5.8e-07, P = 5.8e-07
 Identities = 30/61 (49%), Positives = 40/61 (65%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAA 457

Query:  1023 T 1023
             T
Sbjct:   458 T 458


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 288 (106.4 bits), Expect = 4.1e-24, P = 4.1e-24
 Identities = 88/306 (28%), Positives = 136/306 (44%)

Query:   300 GRQYANDEEIKER---FEYFKQDGHKKHER---YGTS--EFSDRSPEEILCKTGFKWSER 351
             GR Y +  E++ R   F +  +  H K+     Y  +    +DR+P+E+    G + S  
Sbjct:    20 GRPYGSAREMEHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQEMAALRGRRRS-- 77

Query:   352 TYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEG 411
                    D                +P++ DWR      P  DQA CGSCW+F+  G +EG
Sbjct:    78 ------GDPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEG 131

Query:   412 QYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQ------PIEYTHQAGLESEKDYPYR 465
                +KTG L   S+  L++C+    G   CDG E+        ++   A  ES   +P  
Sbjct:   132 ALFLKTGVLTPLSQQVLIDCSWG-KGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLV 190

Query:   466 NGNGEKFKCAYDKSKVKL-FTGKDFLYFNGSETMKKILYKYGPLSVGLN-SH-LIHFYNG 522
               NG    C Y++S++    TG   +       +K  +YK+GP++V ++ SH    FY+ 
Sbjct:   191 LQNG---LCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSN 247

Query:   523 TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGI 582
                 +      P  L HAVL VGYG      YWL +NSW     ++G+  +   +N CG+
Sbjct:   248 GIYYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGV 307

Query:   583 EQIAGY 588
                A Y
Sbjct:   308 ATEATY 313

 Score = 279 (103.3 bits), Expect = 3.8e-23, P = 3.8e-23
 Identities = 85/305 (27%), Positives = 135/305 (44%)

Query:   666 GRQYANDEEIKER---FEYFKQDGHKKHER---YGTS--EFSDRSPEEILCKTGFKWSER 717
             GR Y +  E++ R   F +  +  H K+     Y  +    +DR+P+E+    G + S  
Sbjct:    20 GRPYGSAREMEHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQEMAALRGRRRS-- 77

Query:   718 TYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEG 777
                    D                +P++ DWR      P  DQA CGSCW+F+  G +EG
Sbjct:    78 ------GDPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEG 131

Query:   778 QYAIKTGKLVEFSKSQLVECA--KQCSGCDGCFFEPSIEYTHQAG----LESEKDYPYKN 831
                +KTG L   S+  L++C+  K    CDG     +  +  + G     ES   +P   
Sbjct:   132 ALFLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVL 191

Query:   832 ANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTP 889
              NG    C Y++S++    TG   +       +K  +YK+GP++V ++ S     +    
Sbjct:   192 QNG---LCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNG 248

Query:   890 IRKNDETCS-PYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
             I    +  + P  L HAVL VGYG      YWL++NSW     ++G+  +   +N CG+ 
Sbjct:   249 IYYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGVA 308

Query:   949 QIAGY 953
               A Y
Sbjct:   309 TEATY 313

 Score = 274 (101.5 bits), Expect = 1.3e-22, P = 1.3e-22
 Identities = 69/221 (31%), Positives = 107/221 (48%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--K 68
             +P++ DWR      P  DQA CGSCW+F+  G +EG   +KTG L   S+  L++C+  K
Sbjct:    96 LPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGK 155

Query:    69 QCSGCDGCFFEPSIEYTHQAG----LESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDF 123
                 CDG     +  +  + G     ES   +P    NG    C Y++S++    TG   
Sbjct:   156 GNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNG---LCHYNQSEMLAKITGYVN 212

Query:   124 LHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCS-PYDLGHAVLLVGYG 181
             +       +K  +YK+GP++V ++ S     +    I    +  + P  L HAVL VGYG
Sbjct:   213 VTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEPKCANKPGQLDHAVLAVGYG 272

Query:   182 KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
                   YWL++NSW     ++G+  +   +N CG+   A Y
Sbjct:   273 VLQGETYWLIKNSWSTYWGNDGYILMAMKDNNCGVATEATY 313


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 286 (105.7 bits), Expect = 6.7e-24, P = 6.7e-24
 Identities = 71/218 (32%), Positives = 110/218 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             +P+  DWRKK    P  +Q +CGSCWAFS    +E    I+TG L+  S+ +LV+C K+ 
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query:   802 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 860
              GC G  F  + +Y  +  G++++ +YPYK   G    C    SKV    G + + F   
Sbjct:    61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP---CQA-ASKVVSIDGYNGVPFCNE 116

Query:   861 ETMKK-ILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 919
               +K+ +  +   +++  +S     Y+          C    L H V +VGY  Q N  Y
Sbjct:   117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIF---SGPCGT-KLNHGVTIVGY--QAN--Y 168

Query:   920 WLVRNSWGPIGPDEGFFKIER--GNNACGIEQIAGYAT 955
             W+VRNSWG    ++G+ ++ R  G   CGI ++  Y T
Sbjct:   169 WIVRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYPT 206

 Score = 285 (105.4 bits), Expect = 8.6e-24, P = 8.6e-24
 Identities = 71/218 (32%), Positives = 109/218 (50%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             +P+  DWRKK    P  +Q  CGSCWAFS    +E    I+TG L+  S+ +LV+C K+ 
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query:    71 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
              GC G  F  + +Y  +  G++++ +YPYK   G    C    SKV    G + + F   
Sbjct:    61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP---CQA-ASKVVSIDGYNGVPFCNE 116

Query:   130 ETMKK-ILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
               +K+ +  +   +++  +S     Y+          C    L H V +VGY  Q N  Y
Sbjct:   117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIF---SGPCGT-KLNHGVTIVGY--QAN--Y 168

Query:   189 WLVRNSWGPIGPDEGFFKIER--GNNACGIEQIAGYAT 224
             W+VRNSWG    ++G+ ++ R  G   CGI ++  Y T
Sbjct:   169 WIVRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYPT 206

 Score = 262 (97.3 bits), Expect = 2.5e-21, P = 2.5e-21
 Identities = 68/219 (31%), Positives = 106/219 (48%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             +P+  DWRKK    P  +Q +CGSCWAFS    +E    I+TG L+  S+ +LV+C K+ 
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query:   436 SGCGGCDGLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNG 494
              GC G        +Y  +  G++++ +YPY+   G    C    SKV    G + + F  
Sbjct:    61 HGCLG-GAFVFAYQYIINNGGIDTQANYPYKAVQGP---CQA-ASKVVSIDGYNGVPFCN 115

Query:   495 SETMKKILYKYGPLSVGLNSHLIHFYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP 553
                +K+ +    P +V +++    F    + I      C    L H V +VGY       
Sbjct:   116 EXALKQAV-AVQPSTVAIDASSAQFQQYSSGIFSGP--CGT-KLNHGVTIVGYQAN---- 167

Query:   554 YWLARNSWGPIGPDEGFFKIER--GNNACGIEQIAGYAT 590
             YW+ RNSWG    ++G+ ++ R  G   CGI ++  Y T
Sbjct:   168 YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYPT 206


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 280 (103.6 bits), Expect = 2.9e-23, P = 2.9e-23
 Identities = 67/211 (31%), Positives = 104/211 (49%)

Query:   378 DAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYA-IKTGKLVEFSKSQLVECAKQCS 436
             D  DWR+K + GP  DQ  C + +AF+    +E  YA    GKL+ FS+ Q+++CA   +
Sbjct:    82 DFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTN 141

Query:   437 GCGGCDGLEQPIE--YTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNG 494
              C   + LE  +   +  + G+ +E DYPY  G     KC YD SK+KL      +Y N 
Sbjct:   142 PCQ--ENLENVLSNRFLKENGVGTEADYPYV-GKENVGKCEYDSSKMKLRPTYIDVYPN- 197

Query:   495 SETMKKILYKYGPLSVGLNSHLIHFYNGTPI-RKNDETCSPYDLGHAVLLVGYGKQDDIP 553
              E  +  +  +G     + S    F+  T I     E C   +   ++ +VGYGK     
Sbjct:   198 EEWARAHITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAEK 257

Query:   554 YWLARNSWGPIGPDEGFFKIERGNNACGIEQ 584
             YW+ + S+G    + G+ K+ R  NACG+ +
Sbjct:   258 YWIVKGSFGTSWGEHGYMKLARNVNACGMAE 288

 Score = 264 (98.0 bits), Expect = 1.5e-21, P = 1.5e-21
 Identities = 64/210 (30%), Positives = 99/210 (47%)

Query:    13 DAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYA-IKTGKLVEFSKSQLVECAKQCS 71
             D  DWR+K + GP  DQ  C + +AF+    +E  YA    GKL+ FS+ Q+++CA   +
Sbjct:    82 DFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTN 141

Query:    72 GCDGCFFEP-SIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE 130
              C        S  +  + G+ +E DYPY        KC YD SK+KL      ++ N  E
Sbjct:   142 PCQENLENVLSNRFLKENGVGTEADYPYVGKENVG-KCEYDSSKMKLRPTYIDVYPN-EE 199

Query:   131 TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
               +  +  +G     + S     H   G       E C   +   ++ +VGYGK     Y
Sbjct:   200 WARAHITTFGTGYFRMRSPPSFFHYKTGI-YNPTKEECGNANEARSLAIVGYGKDGAEKY 258

Query:   189 WLVRNSWGPIGPDEGFFKIERGNNACGIEQ 218
             W+V+ S+G    + G+ K+ R  NACG+ +
Sbjct:   259 WIVKGSFGTSWGEHGYMKLARNVNACGMAE 288

 Score = 264 (98.0 bits), Expect = 1.5e-21, P = 1.5e-21
 Identities = 64/210 (30%), Positives = 99/210 (47%)

Query:   744 DAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYA-IKTGKLVEFSKSQLVECAKQCS 802
             D  DWR+K + GP  DQ  C + +AF+    +E  YA    GKL+ FS+ Q+++CA   +
Sbjct:    82 DFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTN 141

Query:   803 GCDGCFFEP-SIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE 861
              C        S  +  + G+ +E DYPY        KC YD SK+KL      ++ N  E
Sbjct:   142 PCQENLENVLSNRFLKENGVGTEADYPYVGKENVG-KCEYDSSKMKLRPTYIDVYPN-EE 199

Query:   862 TMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 919
               +  +  +G     + S     H   G       E C   +   ++ +VGYGK     Y
Sbjct:   200 WARAHITTFGTGYFRMRSPPSFFHYKTGI-YNPTKEECGNANEARSLAIVGYGKDGAEKY 258

Query:   920 WLVRNSWGPIGPDEGFFKIERGNNACGIEQ 949
             W+V+ S+G    + G+ K+ R  NACG+ +
Sbjct:   259 WIVKGSFGTSWGEHGYMKLARNVNACGMAE 288


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 297 (109.6 bits), Expect = 3.5e-23, P = 3.5e-23
 Identities = 97/333 (29%), Positives = 155/333 (46%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT--SEFSD 700
             +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K   Y    + F+D
Sbjct:   155 NNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFAD 214

Query:   701 RSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKKNVTGPA 757
              +  E   K     S +  +  + + D+            +     A +DWR  +   P 
Sbjct:   215 LTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPV 274

Query:   758 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE-YT 816
              DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC+G     + E   
Sbjct:   275 KDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMI 334

Query:   817 HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVL 876
                G+ ++ DYPY   +     C  D+   K +  K++L    ++ +K+ L   GP+S+ 
Sbjct:   335 ELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-LKEALRFLGPISIS 390

Query:   877 LN-SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYGKQDNI-P---------YWLV 922
             +  SD   D+   P  K    D  C   +L HAV+LVG+G ++ + P         Y+++
Sbjct:   391 IAVSD---DF---PFYKEGIFDGECGD-ELNHAVMLVGFGMKEIVNPLTKKGEKHYYYII 443

Query:   923 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   444 KNSWGQQWGERGFINIETDES--GLMRKCGLGT 474

 Score = 287 (106.1 bits), Expect = 4.8e-22, P = 4.8e-22
 Identities = 98/333 (29%), Positives = 153/333 (45%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT--SEFSD 334
             +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K   Y    + F+D
Sbjct:   155 NNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFAD 214

Query:   335 RSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKKNVTGPA 391
              +  E   K     S +  +  + + D+            +     A +DWR  +   P 
Sbjct:   215 LTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPV 274

Query:   392 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGC--DGLEQPI 448
              DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC GG   +  E  I
Sbjct:   275 KDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMI 334

Query:   449 EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPL 508
             E     G+ ++ DYPY +       C  D+   K +  K++L    ++ +K+ L   GP+
Sbjct:   335 EL---GGICTDDDYPYVSDAPNL--CNIDRCTEK-YGIKNYLSVPDNK-LKEALRFLGPI 387

Query:   509 SVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-P---------YWLA 557
             S+ +  S    FY        D  C   +L HAV+LVG+G ++ + P         Y++ 
Sbjct:   388 SISIAVSDDFPFYKEGIF---DGECGD-ELNHAVMLVGFGMKEIVNPLTKKGEKHYYYII 443

Query:   558 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   444 KNSWGQQWGERGFINIETDES--GLMRKCGLGT 474

 Score = 269 (99.8 bits), Expect = 5.2e-20, P = 5.2e-20
 Identities = 74/226 (32%), Positives = 117/226 (51%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR  +   P  DQ +CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC
Sbjct:   262 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGC 321

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             +G     + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +
Sbjct:   322 NGGLINNAFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-L 377

Query:   133 KKILYKYGPLSVLLN-SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYGKQDNI-P 187
             K+ L   GP+S+ +  SD   D+   P  K    D  C   +L HAV+LVG+G ++ + P
Sbjct:   378 KEALRFLGPISISIAVSD---DF---PFYKEGIFDGECGD-ELNHAVMLVGFGMKEIVNP 430

Query:   188 ---------YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
                      Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   431 LTKKGEKHYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 474


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 297 (109.6 bits), Expect = 3.5e-23, P = 3.5e-23
 Identities = 97/333 (29%), Positives = 155/333 (46%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT--SEFSD 700
             +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K   Y    + F+D
Sbjct:   155 NNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFAD 214

Query:   701 RSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKKNVTGPA 757
              +  E   K     S +  +  + + D+            +     A +DWR  +   P 
Sbjct:   215 LTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPV 274

Query:   758 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE-YT 816
              DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC+G     + E   
Sbjct:   275 KDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMI 334

Query:   817 HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVL 876
                G+ ++ DYPY   +     C  D+   K +  K++L    ++ +K+ L   GP+S+ 
Sbjct:   335 ELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-LKEALRFLGPISIS 390

Query:   877 LN-SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYGKQDNI-P---------YWLV 922
             +  SD   D+   P  K    D  C   +L HAV+LVG+G ++ + P         Y+++
Sbjct:   391 IAVSD---DF---PFYKEGIFDGECGD-ELNHAVMLVGFGMKEIVNPLTKKGEKHYYYII 443

Query:   923 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   444 KNSWGQQWGERGFINIETDES--GLMRKCGLGT 474

 Score = 287 (106.1 bits), Expect = 4.8e-22, P = 4.8e-22
 Identities = 98/333 (29%), Positives = 153/333 (45%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT--SEFSD 334
             +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K   Y    + F+D
Sbjct:   155 NNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFAD 214

Query:   335 RSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKKNVTGPA 391
              +  E   K     S +  +  + + D+            +     A +DWR  +   P 
Sbjct:   215 LTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPV 274

Query:   392 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGC--DGLEQPI 448
              DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC GG   +  E  I
Sbjct:   275 KDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMI 334

Query:   449 EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPL 508
             E     G+ ++ DYPY +       C  D+   K +  K++L    ++ +K+ L   GP+
Sbjct:   335 EL---GGICTDDDYPYVSDAPNL--CNIDRCTEK-YGIKNYLSVPDNK-LKEALRFLGPI 387

Query:   509 SVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-P---------YWLA 557
             S+ +  S    FY        D  C   +L HAV+LVG+G ++ + P         Y++ 
Sbjct:   388 SISIAVSDDFPFYKEGIF---DGECGD-ELNHAVMLVGFGMKEIVNPLTKKGEKHYYYII 443

Query:   558 RNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   444 KNSWGQQWGERGFINIETDES--GLMRKCGLGT 474

 Score = 269 (99.8 bits), Expect = 5.2e-20, P = 5.2e-20
 Identities = 74/226 (32%), Positives = 117/226 (51%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR  +   P  DQ +CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC
Sbjct:   262 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGC 321

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             +G     + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +
Sbjct:   322 NGGLINNAFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-L 377

Query:   133 KKILYKYGPLSVLLN-SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYGKQDNI-P 187
             K+ L   GP+S+ +  SD   D+   P  K    D  C   +L HAV+LVG+G ++ + P
Sbjct:   378 KEALRFLGPISISIAVSD---DF---PFYKEGIFDGECGD-ELNHAVMLVGFGMKEIVNP 430

Query:   188 ---------YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
                      Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   431 LTKKGEKHYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 474


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 295 (108.9 bits), Expect = 4.2e-23, P = 4.2e-23
 Identities = 75/228 (32%), Positives = 112/228 (49%)

Query:   742 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +QA+CGSC+AF+   MLE +  I T        S  ++V 
Sbjct:   227 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEIVS 286

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKLFTGKD 853
             C++   GC+G F +  + +Y    GL  E  + Y  ++   +   C +  S    + G  
Sbjct:   287 CSQYAQGCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHYYSSEYHYVG-G 345

Query:   854 FLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GHAVLLV 909
             F        MK  L ++GP++V      D  H   G        +  +P++L  HAVLLV
Sbjct:   346 FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINPFELTNHAVLLV 405

Query:   910 GYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             GYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   406 GYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIAVAAT 453

 Score = 295 (108.9 bits), Expect = 4.2e-23, P = 4.2e-23
 Identities = 76/233 (32%), Positives = 113/233 (48%)

Query:     6 EKDGPVPDAWDWRK---KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSK 60
             E+   +P +WDWR     N   P  +QA CGSC+AF+   MLE +  I T        S 
Sbjct:   222 EEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSP 281

Query:    61 SQLVECAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANG--EKFKCAYDKSKVKL 117
              ++V C++   GC+G F +  + +Y    GL  E  + Y  ++   +   C +  S    
Sbjct:   282 QEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHYYSSEYH 341

Query:   118 FTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-DETCSPYDL-GH 173
             + G  F        MK  L ++GP++V      D  H   G        +  +P++L  H
Sbjct:   342 YVG-GFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINPFELTNH 400

Query:   174 AVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             AVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   401 AVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIAVAAT 453

 Score = 281 (104.0 bits), Expect = 1.8e-21, P = 1.8e-21
 Identities = 74/231 (32%), Positives = 112/231 (48%)

Query:   376 VPDAWDWRK---KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +QA+CGSC+AF+   MLE +  I T        S  ++V 
Sbjct:   227 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEIVS 286

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNG--EKFKCAYDKSKVKLFTG 486
             C++   GC G  G    I  +Y    GL  E  + Y   +   +   C +  S    + G
Sbjct:   287 CSQYAQGCEG--GFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHYYSSEYHYVG 344

Query:   487 KDFLYFNGSETMKKI-LYKYGPLSVGLNSH--LIHFYNGTPIRKN-DETCSPYDL-GHAV 541
                 Y   +E + K+ L ++GP++V    +    H+  G        +  +P++L  HAV
Sbjct:   345 G--FYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINPFELTNHAV 402

Query:   542 LLVGYGKQD--DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             LLVGYG      + YW+ +NSWG    ++G+F+I RG + C IE IA  AT
Sbjct:   403 LLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIAVAAT 453

 Score = 148 (57.2 bits), Expect = 9.4e-07, P = 9.4e-07
 Identities = 30/61 (49%), Positives = 41/61 (67%)

Query:   966 SPYDL-GHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    ++G+F+I RG + C IE IA  A
Sbjct:   393 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIAVAA 452

Query:  1023 T 1023
             T
Sbjct:   453 T 453


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 295 (108.9 bits), Expect = 4.6e-23, P = 4.6e-23
 Identities = 76/236 (32%), Positives = 119/236 (50%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +Q +CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             C++   GC+G F +  + +Y    GL  E  +PY    G    C   +   + ++ +   
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSSE--Y 345

Query:   856 HFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL 902
             H+ G      +E + K+ L   GP++V      D +H     Y+ T +R   +  +P++L
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFEL 402

Query:   903 -GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
               HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 294 (108.6 bits), Expect = 6.0e-23, P = 6.0e-23
 Identities = 76/236 (32%), Positives = 118/236 (50%)

Query:    11 VPDAWDWRKK---NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P +WDWR     N   P  +Q  CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:    66 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             C++   GC+G F +  + +Y    GL  E  +PY    G    C   +   + ++ +   
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSSE--Y 345

Query:   125 HFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL 171
             H+ G      +E + K+ L   GP++V      D +H     Y+ T +R   +  +P++L
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFEL 402

Query:   172 -GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
               HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 286 (105.7 bits), Expect = 5.0e-22, P = 5.0e-22
 Identities = 75/237 (31%), Positives = 117/237 (49%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +Q +CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C++   GC G  G    I  +Y    GL  E  +PY    G    C   +   + ++  +
Sbjct:   291 CSQYAQGCEG--GFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSS-E 344

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYD 536
             + Y  G     +E + K+ L   GP++V    +   +H+    Y+ T +R   +  +P++
Sbjct:   345 YHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFE 401

Query:   537 L-GHAVLLVGYGKQ--DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L  HAVLLVGYG      + YW+ +NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   402 LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 152 (58.6 bits), Expect = 3.5e-07, P = 3.5e-07
 Identities = 30/61 (49%), Positives = 40/61 (65%)

Query:   966 SPYDL-GHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAA 457

Query:  1023 T 1023
             T
Sbjct:   458 T 458


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 295 (108.9 bits), Expect = 4.6e-23, P = 4.6e-23
 Identities = 76/236 (32%), Positives = 119/236 (50%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 796
             +P +WDWR     N   P  +Q +CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   797 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 855
             C++   GC+G F +  + +Y    GL  E  +PY    G    C   +   + ++ +   
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSSE--Y 345

Query:   856 HFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL 902
             H+ G      +E + K+ L   GP++V      D +H     Y+ T +R   +  +P++L
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFEL 402

Query:   903 -GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
               HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 294 (108.6 bits), Expect = 6.0e-23, P = 6.0e-23
 Identities = 76/236 (32%), Positives = 118/236 (50%)

Query:    11 VPDAWDWRKK---NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 65
             +P +WDWR     N   P  +Q  CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:    66 CAKQCSGCDGCF-FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFL 124
             C++   GC+G F +  + +Y    GL  E  +PY    G    C   +   + ++ +   
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSSE--Y 345

Query:   125 HFNG------SETMKKI-LYKYGPLSVLLN--SDLIHD----YNGTPIRKNDETCSPYDL 171
             H+ G      +E + K+ L   GP++V      D +H     Y+ T +R   +  +P++L
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFEL 402

Query:   172 -GHAVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
               HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   403 TNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 286 (105.7 bits), Expect = 5.0e-22, P = 5.0e-22
 Identities = 75/237 (31%), Positives = 117/237 (49%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVE 430
             +P +WDWR     N   P  +Q +CGSC++F+  GM+E +  I T        S  ++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   431 CAKQCSGCGGCDGLEQPI--EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKD 488
             C++   GC G  G    I  +Y    GL  E  +PY    G    C   +   + ++  +
Sbjct:   291 CSQYAQGCEG--GFPYLIAGKYAQDFGLVEEDCFPY---TGTDSPCRLKEGCFRYYSS-E 344

Query:   489 FLYFNG-----SETMKKI-LYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYD 536
             + Y  G     +E + K+ L   GP++V    +   +H+    Y+ T +R   +  +P++
Sbjct:   345 YHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLR---DPFNPFE 401

Query:   537 L-GHAVLLVGYGKQ--DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L  HAVLLVGYG      + YW+ +NSWG    + G+F+I RG + C IE IA  AT
Sbjct:   402 LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAAT 458

 Score = 152 (58.6 bits), Expect = 3.5e-07, P = 3.5e-07
 Identities = 30/61 (49%), Positives = 40/61 (65%)

Query:   966 SPYDL-GHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 1022
             +P++L  HAVLLVGYG      + YW+V+NSWG    + G+F+I RG + C IE IA  A
Sbjct:   398 NPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIALAA 457

Query:  1023 T 1023
             T
Sbjct:   458 T 458


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 293 (108.2 bits), Expect = 1.0e-22, P = 1.0e-22
 Identities = 95/337 (28%), Positives = 153/337 (45%)

Query:   644 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT- 695
             +     +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K+  Y   
Sbjct:   151 DNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKE 210

Query:   696 -SEFSDRSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKK 751
              + F+D +  E   K     S +  +  + + D+            +     A +DWR  
Sbjct:   211 LNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLH 270

Query:   752 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEP 811
             +   P  DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC+G     
Sbjct:   271 SGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINN 330

Query:   812 SIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKY 870
             + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +K+ L   
Sbjct:   331 AFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-LKEALRFL 386

Query:   871 GPLS--VLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-P--------- 918
             GP+S  V ++ D      G      D  C    L HAV+LVG+G ++ + P         
Sbjct:   387 GPISISVAVSDDFAFYKEGI----FDGECGD-QLNHAVMLVGFGMKEIVNPLTKKGEKHY 441

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   442 YYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476

 Score = 284 (105.0 bits), Expect = 1.1e-21, P = 1.1e-21
 Identities = 98/339 (28%), Positives = 154/339 (45%)

Query:   278 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT- 329
             +     +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K+  Y   
Sbjct:   151 DNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKE 210

Query:   330 -SEFSDRSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKK 385
              + F+D +  E   K     S +  +  + + D+            +     A +DWR  
Sbjct:   211 LNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLH 270

Query:   386 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGC--D 442
             +   P  DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC GG   +
Sbjct:   271 SGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINN 330

Query:   443 GLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKIL 502
               E  IE     G+ ++ DYPY +       C  D+   K +  K++L    ++ +K+ L
Sbjct:   331 AFEDMIEL---GGICTDDDYPYVSDAPNL--CNIDRCTEK-YGIKNYLSVPDNK-LKEAL 383

Query:   503 YKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-P------- 553
                GP+S+ +  S    FY        D  C    L HAV+LVG+G ++ + P       
Sbjct:   384 RFLGPISISVAVSDDFAFYKEGIF---DGECGD-QLNHAVMLVGFGMKEIVNPLTKKGEK 439

Query:   554 --YWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
               Y++ +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   440 HYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476

 Score = 265 (98.3 bits), Expect = 1.5e-19, P = 1.5e-19
 Identities = 72/224 (32%), Positives = 113/224 (50%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR  +   P  DQ +CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC
Sbjct:   264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGC 323

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             +G     + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +
Sbjct:   324 NGGLINNAFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-L 379

Query:   133 KKILYKYGPLS--VLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-P-- 187
             K+ L   GP+S  V ++ D      G      D  C    L HAV+LVG+G ++ + P  
Sbjct:   380 KEALRFLGPISISVAVSDDFAFYKEGI----FDGECGD-QLNHAVMLVGFGMKEIVNPLT 434

Query:   188 -------YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
                    Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   435 KKGEKHYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 293 (108.2 bits), Expect = 1.0e-22, P = 1.0e-22
 Identities = 95/337 (28%), Positives = 153/337 (45%)

Query:   644 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT- 695
             +     +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K+  Y   
Sbjct:   151 DNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKE 210

Query:   696 -SEFSDRSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKK 751
              + F+D +  E   K     S +  +  + + D+            +     A +DWR  
Sbjct:   211 LNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLH 270

Query:   752 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEP 811
             +   P  DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC+G     
Sbjct:   271 SGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINN 330

Query:   812 SIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKY 870
             + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +K+ L   
Sbjct:   331 AFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-LKEALRFL 386

Query:   871 GPLS--VLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-P--------- 918
             GP+S  V ++ D      G      D  C    L HAV+LVG+G ++ + P         
Sbjct:   387 GPISISVAVSDDFAFYKEGI----FDGECGD-QLNHAVMLVGFGMKEIVNPLTKKGEKHY 441

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   442 YYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476

 Score = 284 (105.0 bits), Expect = 1.1e-21, P = 1.1e-21
 Identities = 98/339 (28%), Positives = 154/339 (45%)

Query:   278 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK-------KHERYGT- 329
             +     +N   +  F  FI    +QY +  E+KERF+ F Q+ HK       K+  Y   
Sbjct:   151 DNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKE 210

Query:   330 -SEFSDRSPEEILCKTGFKWSERTYE--RIVADRXXXXXXXXXXXXDGPVPDA-WDWRKK 385
              + F+D +  E   K     S +  +  + + D+            +     A +DWR  
Sbjct:   211 LNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLH 270

Query:   386 NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GGC--D 442
             +   P  DQ  CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC GG   +
Sbjct:   271 SGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINN 330

Query:   443 GLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKIL 502
               E  IE     G+ ++ DYPY +       C  D+   K +  K++L    ++ +K+ L
Sbjct:   331 AFEDMIEL---GGICTDDDYPYVSDAPNL--CNIDRCTEK-YGIKNYLSVPDNK-LKEAL 383

Query:   503 YKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-P------- 553
                GP+S+ +  S    FY        D  C    L HAV+LVG+G ++ + P       
Sbjct:   384 RFLGPISISVAVSDDFAFYKEGIF---DGECGD-QLNHAVMLVGFGMKEIVNPLTKKGEK 439

Query:   554 --YWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
               Y++ +NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   440 HYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476

 Score = 265 (98.3 bits), Expect = 1.5e-19, P = 1.5e-19
 Identities = 72/224 (32%), Positives = 113/224 (50%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR  +   P  DQ +CGSCWAFS  G +E QYAI+  KL+  S+ +LV+C+ +  GC
Sbjct:   264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGC 323

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             +G     + E      G+ ++ DYPY   +     C  D+   K +  K++L    ++ +
Sbjct:   324 NGGLINNAFEDMIELGGICTDDDYPY--VSDAPNLCNIDRCTEK-YGIKNYLSVPDNK-L 379

Query:   133 KKILYKYGPLS--VLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-P-- 187
             K+ L   GP+S  V ++ D      G      D  C    L HAV+LVG+G ++ + P  
Sbjct:   380 KEALRFLGPISISVAVSDDFAFYKEGI----FDGECGD-QLNHAVMLVGFGMKEIVNPLT 434

Query:   188 -------YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
                    Y++++NSWG    + GF  IE   +  G+ +  G  T
Sbjct:   435 KKGEKHYYYIIKNSWGQQWGERGFINIETDES--GLMRKCGLGT 476


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 294 (108.6 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 96/320 (30%), Positives = 145/320 (45%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERF-----EYFKQDGHKK--HERY--GTSEFSD 700
             DN   +  F  F+ +  ++Y   EE+++RF      Y K + H K  +  Y  G ++F D
Sbjct:   163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222

Query:   701 RSPEEILCK-TGFKWSE--RTYERIVA-DRXXXXXXXXXXXXDGPVPD-AWDWRKKNVTG 755
              SPEE   K    K     +T    V+ +             D  +   A+DWR      
Sbjct:   223 LSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVT 282

Query:   756 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE- 814
             P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC G +   + + 
Sbjct:   283 PVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDD 342

Query:   815 YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLS 874
                  GL S+ DYPY +   E   C   +   + +T K ++     +  K+ L   GP+S
Sbjct:   343 MIDLGGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKFKEALRYLGPIS 398

Query:   875 V-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----------NIPYWLVR 923
             + +  SD    Y G      D  C      HAV+LVGYG +D             Y++++
Sbjct:   399 ISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTGRMEKFYYYIIK 454

Query:   924 NSWGPIGPDEGFFKIERGNN 943
             NSWG    + G+  +E   N
Sbjct:   455 NSWGSDWGEGGYINLETDEN 474

 Score = 287 (106.1 bits), Expect = 2.4e-21, Sum P(2) = 2.4e-21
 Identities = 99/323 (30%), Positives = 147/323 (45%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERF-----EYFKQDGHKK--HERY--GTSEFSD 334
             DN   +  F  F+ +  ++Y   EE+++RF      Y K + H K  +  Y  G ++F D
Sbjct:   163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222

Query:   335 RSPEEILCK-TGFKWSE--RTYERIVA-DRXXXXXXXXXXXXDGPVPD-AWDWRKKNVTG 389
              SPEE   K    K     +T    V+ +             D  +   A+DWR      
Sbjct:   223 LSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVT 282

Query:   390 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GG--CDGLEQ 446
             P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC GG   +  + 
Sbjct:   283 PVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDD 342

Query:   447 PIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG 506
              I+     GL S+ DYPY +   E   C   +   + +T K ++     +  K+ L   G
Sbjct:   343 MIDL---GGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKFKEALRYLG 395

Query:   507 PLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----DIP------YW 555
             P+S+ +  S    FY G      D  C      HAV+LVGYG +D    D        Y+
Sbjct:   396 PISISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTGRMEKFYYY 451

Query:   556 LARNSWGPIGPDEGFFKIERGNN 578
             + +NSWG    + G+  +E   N
Sbjct:   452 IIKNSWGSDWGEGGYINLETDEN 474

 Score = 257 (95.5 bits), Expect = 1.2e-18, P = 1.2e-18
 Identities = 70/211 (33%), Positives = 103/211 (48%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR      P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC
Sbjct:   272 AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGC 331

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
              G +   + +      GL S+ DYPY +   E   C   +   + +T K ++     +  
Sbjct:   332 YGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKF 387

Query:   133 KKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD------- 184
             K+ L   GP+S+ +  SD    Y G      D  C      HAV+LVGYG +D       
Sbjct:   388 KEALRYLGPISISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTG 443

Query:   185 ---NIPYWLVRNSWGPIGPDEGFFKIERGNN 212
                   Y++++NSWG    + G+  +E   N
Sbjct:   444 RMEKFYYYIIKNSWGSDWGEGGYINLETDEN 474

 Score = 39 (18.8 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTG--KDFLHFNG 128
             +Y   + L+SE    +  +  E+   +YDK K    TG  ++ ++ NG
Sbjct:    76 DYIINSLLKSESGKKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNG 123

 Score = 39 (18.8 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 20/80 (25%), Positives = 36/80 (45%)

Query:   449 EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTG--KDFLYFNGSETM--KKILY- 503
             +Y   + L+SE    +     E+   +YDK K    TG  ++ +  NG +    K + + 
Sbjct:    76 DYIINSLLKSESGKKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFV 135

Query:   504 --KYGPLSVGLNSHLIHFYN 521
               K G L V  N++ + + N
Sbjct:   136 NKKNGNLKVN-NNNQVSYSN 154


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 294 (108.6 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 96/320 (30%), Positives = 145/320 (45%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERF-----EYFKQDGHKK--HERY--GTSEFSD 700
             DN   +  F  F+ +  ++Y   EE+++RF      Y K + H K  +  Y  G ++F D
Sbjct:   163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222

Query:   701 RSPEEILCK-TGFKWSE--RTYERIVA-DRXXXXXXXXXXXXDGPVPD-AWDWRKKNVTG 755
              SPEE   K    K     +T    V+ +             D  +   A+DWR      
Sbjct:   223 LSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVT 282

Query:   756 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIE- 814
             P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC G +   + + 
Sbjct:   283 PVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDD 342

Query:   815 YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLS 874
                  GL S+ DYPY +   E   C   +   + +T K ++     +  K+ L   GP+S
Sbjct:   343 MIDLGGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKFKEALRYLGPIS 398

Query:   875 V-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----------NIPYWLVR 923
             + +  SD    Y G      D  C      HAV+LVGYG +D             Y++++
Sbjct:   399 ISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTGRMEKFYYYIIK 454

Query:   924 NSWGPIGPDEGFFKIERGNN 943
             NSWG    + G+  +E   N
Sbjct:   455 NSWGSDWGEGGYINLETDEN 474

 Score = 287 (106.1 bits), Expect = 2.4e-21, Sum P(2) = 2.4e-21
 Identities = 99/323 (30%), Positives = 147/323 (45%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERF-----EYFKQDGHKK--HERY--GTSEFSD 334
             DN   +  F  F+ +  ++Y   EE+++RF      Y K + H K  +  Y  G ++F D
Sbjct:   163 DNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGD 222

Query:   335 RSPEEILCK-TGFKWSE--RTYERIVA-DRXXXXXXXXXXXXDGPVPD-AWDWRKKNVTG 389
              SPEE   K    K     +T    V+ +             D  +   A+DWR      
Sbjct:   223 LSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVT 282

Query:   390 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-GG--CDGLEQ 446
             P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC GG   +  + 
Sbjct:   283 PVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDD 342

Query:   447 PIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG 506
              I+     GL S+ DYPY +   E   C   +   + +T K ++     +  K+ L   G
Sbjct:   343 MIDL---GGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKFKEALRYLG 395

Query:   507 PLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----DIP------YW 555
             P+S+ +  S    FY G      D  C      HAV+LVGYG +D    D        Y+
Sbjct:   396 PISISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTGRMEKFYYY 451

Query:   556 LARNSWGPIGPDEGFFKIERGNN 578
             + +NSWG    + G+  +E   N
Sbjct:   452 IIKNSWGSDWGEGGYINLETDEN 474

 Score = 257 (95.5 bits), Expect = 1.2e-18, P = 1.2e-18
 Identities = 70/211 (33%), Positives = 103/211 (48%)

Query:    14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
             A+DWR      P  DQA CGSCWAFS  G +E QYAI+   L  FS+ +LV+C+ + +GC
Sbjct:   272 AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGC 331

Query:    74 DGCFFEPSIE-YTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
              G +   + +      GL S+ DYPY +   E   C   +   + +T K ++     +  
Sbjct:   332 YGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET--CNLKRCNER-YTIKSYVSIP-DDKF 387

Query:   133 KKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD------- 184
             K+ L   GP+S+ +  SD    Y G      D  C      HAV+LVGYG +D       
Sbjct:   388 KEALRYLGPISISIAASDDFAFYRGGFY---DGECGAAP-NHAVILVGYGMKDIYNEDTG 443

Query:   185 ---NIPYWLVRNSWGPIGPDEGFFKIERGNN 212
                   Y++++NSWG    + G+  +E   N
Sbjct:   444 RMEKFYYYIIKNSWGSDWGEGGYINLETDEN 474

 Score = 39 (18.8 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTG--KDFLHFNG 128
             +Y   + L+SE    +  +  E+   +YDK K    TG  ++ ++ NG
Sbjct:    76 DYIINSLLKSESGKKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNG 123

 Score = 39 (18.8 bits), Expect = 3.6e-22, Sum P(2) = 3.6e-22
 Identities = 20/80 (25%), Positives = 36/80 (45%)

Query:   449 EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTG--KDFLYFNGSETM--KKILY- 503
             +Y   + L+SE    +     E+   +YDK K    TG  ++ +  NG +    K + + 
Sbjct:    76 DYIINSLLKSESGKKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFV 135

Query:   504 --KYGPLSVGLNSHLIHFYN 521
               K G L V  N++ + + N
Sbjct:   136 NKKNGNLKVN-NNNQVSYSN 154


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 269 (99.8 bits), Expect = 4.4e-22, P = 4.4e-22
 Identities = 78/214 (36%), Positives = 103/214 (48%)

Query:   391 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPI 448
             A  Q  C SCWAF + G +EGQ   KTGKL   S   LV+C+K   G  GC G       
Sbjct:   136 ASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKP-QGNKGCRGGTTYNAF 194

Query:   449 EYTHQ-AGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYFNGSETMKKILYKYG 506
             +Y  Q  GLESE  YPY    G++  C Y+  S  K+         N    M  +  K  
Sbjct:   195 QYVLQNGGLESEATYPYE---GKEGLCRYNPNSSAKITXICAPPQKNEDVLMDAVATK-- 249

Query:   507 PLSVGLNSHLIH----FYNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLAR 558
             P++ G+  H++H    FY       ++  C+ Y + HAVL+VGYG    + D   YWL +
Sbjct:   250 PVAAGI--HVVHSSLRFYKKGIY--HEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQ 304

Query:   559 NSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
             NSWG      G+ KI +  NN CGI   A Y  +
Sbjct:   305 NSWGERWGLNGYMKIAKDRNNHCGIATFAQYPIV 338

 Score = 259 (96.2 bits), Expect = 5.2e-21, P = 5.2e-21
 Identities = 74/210 (35%), Positives = 99/210 (47%)

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIE 814
             A  Q  C SCWAF + G +EGQ   KTGKL   S   LV+C+K     GC G     + +
Sbjct:   136 ASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQ 195

Query:   815 YTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGSETMKKILYKYGP 872
             Y  Q  GLESE  YPY+   G    C Y+  S  K+         N    M  +  K   
Sbjct:   196 YVLQNGGLESEATYPYEGKEG---LCRYNPNSSAKITXICAPPQKNEDVLMDAVATKPVA 252

Query:   873 LSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDNIPYWLVRNSWG 927
               + +++S L     G     ++  C+ Y + HAVL+VGYG    + D   YWL++NSWG
Sbjct:   253 AGIHVVHSSLRFYKKGI---YHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWG 308

Query:   928 PIGPDEGFFKIERG-NNACGIEQIAGYATI 956
                   G+ KI +  NN CGI   A Y  +
Sbjct:   309 ERWGLNGYMKIAKDRNNHCGIATFAQYPIV 338

 Score = 258 (95.9 bits), Expect = 6.6e-21, P = 6.6e-21
 Identities = 74/210 (35%), Positives = 99/210 (47%)

Query:    26 AGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPSIE 83
             A  Q  C SCWAF + G +EGQ   KTGKL   S   LV+C+K     GC G     + +
Sbjct:   136 ASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQ 195

Query:    84 YTHQ-AGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGSETMKKILYKYGP 141
             Y  Q  GLESE  YPY+   G    C Y+  S  K+         N    M  +  K   
Sbjct:   196 YVLQNGGLESEATYPYEGKEG---LCRYNPNSSAKITXICAPPQKNEDVLMDAVATKPVA 252

Query:   142 LSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDNIPYWLVRNSWG 196
               + +++S L     G     ++  C+ Y + HAVL+VGYG    + D   YWL++NSWG
Sbjct:   253 AGIHVVHSSLRFYKKGI---YHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWG 308

Query:   197 PIGPDEGFFKIERG-NNACGIEQIAGYATI 225
                   G+ KI +  NN CGI   A Y  +
Sbjct:   309 ERWGLNGYMKIAKDRNNHCGIATFAQYPIV 338

 Score = 125 (49.1 bits), Expect = 0.00018, P = 0.00018
 Identities = 28/71 (39%), Positives = 40/71 (56%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNSWGPIGPDEGFFKIERG-NNAC 1013
             + ++  C+ Y + HAVL+VGYG    + D   YWL++NSWG      G+ KI +  NN C
Sbjct:   269 IYHEPKCNNY-VNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNNHC 327

Query:  1014 GIEQIAGYATI 1024
             GI   A Y  +
Sbjct:   328 GIATFAQYPIV 338


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 263 (97.6 bits), Expect = 1.9e-21, P = 1.9e-21
 Identities = 96/361 (26%), Positives = 159/361 (44%)

Query:   239 IMLIQAVFLLCGVASCLCLPSLTDRITDQVVAR--VDTLAIEGSLTFDNENILETFKAFI 296
             I L+   F+  G A    LPS      DQ++ R  + T  ++ +  F N         F+
Sbjct:     5 IWLLAIFFVHFGCAKPNLLPSYQISDLDQILQRHHIPTPDVKYTNAFQN---------FL 55

Query:   297 VKRGRQYANDEEIKERFEYFKQ-----DGHKKHER----YGTSEFSDRSPEEILCKTGFK 347
             VK  R+Y N+ EI +RF  F +     + + K +     Y  ++FSD + EE        
Sbjct:    56 VKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEE-------- 107

Query:   348 WSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN----VTGPAGDQAACGSCWAF 403
             W +        D                +P++ DWR  N    VTG    Q  CGSCWAF
Sbjct:   108 WKKYLMTP-KPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTG-IKYQGPCGSCWAF 165

Query:   404 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYP 463
             + A  +E   +I  G L   S  QL++C      CGG + +E  ++Y    G+ +  +YP
Sbjct:   166 ATAAAIESAVSISGGGLQSLSSQQLLDCTVVSDKCGGGEPVEA-LKYAQSHGITTAHNYP 224

Query:   464 YRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLN--SHLIHFYN 521
             Y        KC      V   +   ++     + M +I+   GP+ V  N  ++   FY+
Sbjct:   225 YYFWTT---KCRETVPTVARISS--WMKAESEDEMAQIVALNGPMIVCANFATNKNRFYH 279

Query:   522 GTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACG 581
              + I + D  C   +  HA++++GYG      YW+ +N++  +  ++G+ +++R  N CG
Sbjct:   280 -SGIAE-DPDCGT-EPTHALIVIGYGPD----YWILKNTYSKVWGEKGYMRVKRDVNWCG 332

Query:   582 I 582
             I
Sbjct:   333 I 333

 Score = 247 (92.0 bits), Expect = 7.9e-19, P = 7.9e-19
 Identities = 92/359 (25%), Positives = 155/359 (43%)

Query:   605 IMLIQAVFLLCGVASCLCLPSLTDRITDQVVAR--VDTLAIEGSLTFDNENILETFKAFI 662
             I L+   F+  G A    LPS      DQ++ R  + T  ++ +  F N         F+
Sbjct:     5 IWLLAIFFVHFGCAKPNLLPSYQISDLDQILQRHHIPTPDVKYTNAFQN---------FL 55

Query:   663 VKRGRQYANDEEIKERFEYFKQ-----DGHKKHER----YGTSEFSDRSPEEILCKTGFK 713
             VK  R+Y N+ EI +RF  F +     + + K +     Y  ++FSD + EE        
Sbjct:    56 VKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEE-------- 107

Query:   714 WSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKN----VTGPAGDQAACGSCWAF 769
             W +        D                +P++ DWR  N    VTG    Q  CGSCWAF
Sbjct:   108 WKKYLMTP-KPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTG-IKYQGPCGSCWAF 165

Query:   770 SIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPY 829
             + A  +E   +I  G L   S  QL++C      C G     +++Y    G+ +  +YPY
Sbjct:   166 ATAAAIESAVSISGGGLQSLSSQQLLDCTVVSDKCGGGEPVEALKYAQSHGITTAHNYPY 225

Query:   830 KNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNGT 888
                     KC      V   +   ++     + M +I+   GP+ V  N +   + +  +
Sbjct:   226 YFWTT---KCRETVPTVARISS--WMKAESEDEMAQIVALNGPMIVCANFATNKNRFYHS 280

Query:   889 PIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGI 947
              I + D  C   +  HA++++GYG      YW+++N++  +  ++G+ +++R  N CGI
Sbjct:   281 GIAE-DPDCGT-EPTHALIVIGYGPD----YWILKNTYSKVWGEKGYMRVKRDVNWCGI 333

 Score = 214 (80.4 bits), Expect = 1.7e-14, P = 1.7e-14
 Identities = 58/211 (27%), Positives = 101/211 (47%)

Query:    11 VPDAWDWRKKN----VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 66
             +P++ DWR  N    VTG    Q  CGSCWAF+ A  +E   +I  G L   S  QL++C
Sbjct:   135 LPNSVDWRNVNGTNHVTG-IKYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDC 193

Query:    67 AKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF 126
                   C G     +++Y    G+ +  +YPY        KC      V   +   ++  
Sbjct:   194 TVVSDKCGGGEPVEALKYAQSHGITTAHNYPYYFWTT---KCRETVPTVARISS--WMKA 248

Query:   127 NGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
                + M +I+   GP+ V  N +   + +  + I + D  C   +  HA++++GYG    
Sbjct:   249 ESEDEMAQIVALNGPMIVCANFATNKNRFYHSGIAE-DPDCGT-EPTHALIVIGYGPD-- 304

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
               YW+++N++  +  ++G+ +++R  N CGI
Sbjct:   305 --YWILKNTYSKVWGEKGYMRVKRDVNWCGI 333


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 221 (82.9 bits), Expect = 5.0e-21, Sum P(3) = 5.0e-21
 Identities = 69/258 (26%), Positives = 111/258 (43%)

Query:   301 RQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGF-----KWSERTYER 355
             R Y++ EE   R++ FK +    H+      ++ +  E +L    F     +    TY  
Sbjct:    39 RTYSS-EEFNARYQIFKSNMDYVHQ------WNSKGGETVLGLNVFADITNQEYRTTYLG 91

Query:   356 IVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAI 415
                D               P P   DWR +    P  +Q  CG CW+FS  G  EG + I
Sbjct:    92 TPFDGSALIGTEEEKIFSTPAPTV-DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFI 150

Query:   416 KTGK---LVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNG 469
              +G    LV  S+  L++C+K   G  GC+G  +    EY  +  G+++E  YPY   +G
Sbjct:   151 ASGTKKDLVSLSEQNLIDCSKSY-GNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDG 209

Query:   470 EKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKND 529
             ++ K        ++ + ++    +GSE   +      P+SV +++    F         +
Sbjct:   210 KECKFKTSNIGAQIVSYQNVT--SGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYE 267

Query:   530 ETCSPYDLGHAVLLVGYG 547
               CSP  L H VL+VGYG
Sbjct:   268 PACSPTQLDHGVLVVGYG 285

 Score = 220 (82.5 bits), Expect = 6.6e-21, Sum P(3) = 6.6e-21
 Identities = 69/258 (26%), Positives = 113/258 (43%)

Query:   667 RQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGF-----KWSERTYER 721
             R Y++ EE   R++ FK +    H+      ++ +  E +L    F     +    TY  
Sbjct:    39 RTYSS-EEFNARYQIFKSNMDYVHQ------WNSKGGETVLGLNVFADITNQEYRTTYLG 91

Query:   722 IVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAI 781
                D               P P   DWR +    P  +Q  CG CW+FS  G  EG + I
Sbjct:    92 TPFDGSALIGTEEEKIFSTPAPTV-DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFI 150

Query:   782 KTGK---LVEFSKSQLVECAKQC--SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGE 835
              +G    LV  S+  L++C+K    +GC+G     + EY  +  G+++E  YPY   +G+
Sbjct:   151 ASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGK 210

Query:   836 KFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKND 894
             + K        ++ + ++    +GSE   +      P+SV ++ S+       + I   +
Sbjct:   211 ECKFKTSNIGAQIVSYQNVT--SGSEASLQSASNNAPVSVAIDASNESFQLYESGIYY-E 267

Query:   895 ETCSPYDLGHAVLLVGYG 912
               CSP  L H VL+VGYG
Sbjct:   268 PACSPTQLDHGVLVVGYG 285

 Score = 220 (82.5 bits), Expect = 9.5e-19, Sum P(2) = 9.5e-19
 Identities = 55/179 (30%), Positives = 88/179 (49%)

Query:    10 PVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGK---LVEFSKSQLVEC 66
             P P   DWR +    P  +Q  CG CW+FS  G  EG + I +G    LV  S+  L++C
Sbjct:   111 PAPTV-DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC 169

Query:    67 AKQC--SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 123
             +K    +GC+G     + EY  +  G+++E  YPY   +G++ K        ++ + ++ 
Sbjct:   170 SKSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNV 229

Query:   124 LHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
                +GSE   +      P+SV ++ S+       + I   +  CSP  L H VL+VGYG
Sbjct:   230 T--SGSEASLQSASNNAPVSVAIDASNESFQLYESGIYY-EPACSPTQLDHGVLVVGYG 285

 Score = 85 (35.0 bits), Expect = 5.0e-21, Sum P(3) = 5.0e-21
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query:   987 YWLVRNSWGPI-GPDEGFFKIERGNNACGIEQIAGYAT 1023
             YW+V+NSWG   G D   F  +  NN CGI  +A + T
Sbjct:   401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPT 438

 Score = 85 (35.0 bits), Expect = 7.3e-19, Sum P(2) = 7.3e-19
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query:   919 YWLVRNSWGPI-GPDEGFFKIERGNNACGIEQIAGYAT 955
             YW+V+NSWG   G D   F  +  NN CGI  +A + T
Sbjct:   401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPT 438

 Score = 85 (35.0 bits), Expect = 9.5e-19, Sum P(2) = 9.5e-19
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query:   188 YWLVRNSWGPI-GPDEGFFKIERGNNACGIEQIAGYAT 224
             YW+V+NSWG   G D   F  +  NN CGI  +A + T
Sbjct:   401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPT 438

 Score = 81 (33.6 bits), Expect = 1.9e-18, Sum P(2) = 1.9e-18
 Identities = 16/38 (42%), Positives = 21/38 (55%)

Query:   554 YWLARNSWGPI-GPDEGFFKIERGNNACGIEQIAGYAT 590
             YW+ +NSWG   G D   F  +  NN CGI  +A + T
Sbjct:   401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPT 438

 Score = 62 (26.9 bits), Expect = 5.0e-21, Sum P(3) = 5.0e-21
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query:   962 DETCSPYDLGHAVLLVGYG 980
             +  CSP  L H VL+VGYG
Sbjct:   267 EPACSPTQLDHGVLVVGYG 285


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 219 (82.2 bits), Expect = 5.1e-20, Sum P(3) = 5.1e-20
 Identities = 72/265 (27%), Positives = 110/265 (41%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFK-WSE 350
             F  +++   R Y++ EE   R+  FK +       Y  +E++ +  E +L    F   S 
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRYNIFKANMD-----Y-VNEWNTKGSETVLGLNVFADISN 82

Query:   351 RTYERIVADRXXXXXXXXXXXXDGPVPDAW---DWRKKNVTGPAGDQAACGSCWAFSIAG 407
               Y                   D  + DA    DWR +    P  +Q  CG CW+FS  G
Sbjct:    83 EEYRATYLGTPFDASSLEMTESD-KIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTG 141

Query:   408 MLEGQYAIKTGK--LVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT-HQAGLESEKDY 462
               EG   +  GK  LV  S+  L++C+    G  GC+G  +    EY  +  G+++E  Y
Sbjct:   142 ATEGAQYLANGKKNLVSLSEQNLIDCSGSY-GNNGCEGGLMTLAFEYIINNKGIDTESSY 200

Query:   463 PYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNG 522
             PY   +G+K  C ++   V           +GSE+        GP SV +++    F   
Sbjct:   201 PYTAEDGKK--CKFNPKNVAAQLSSYVNVTSGSESDLAAKVTQGPTSVAIDASNQSFQLY 258

Query:   523 TPIRKNDETCSPYDLGHAVLLVGYG 547
                  N+  CS   L H VL VG+G
Sbjct:   259 VSGIYNEPACSSTQLDHGVLAVGFG 283

 Score = 215 (80.7 bits), Expect = 1.5e-19, Sum P(3) = 1.5e-19
 Identities = 72/265 (27%), Positives = 112/265 (42%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFK-WSE 716
             F  +++   R Y++ EE   R+  FK +       Y  +E++ +  E +L    F   S 
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRYNIFKANMD-----Y-VNEWNTKGSETVLGLNVFADISN 82

Query:   717 RTYERIVADRXXXXXXXXXXXXDGPVPDAW---DWRKKNVTGPAGDQAACGSCWAFSIAG 773
               Y                   D  + DA    DWR +    P  +Q  CG CW+FS  G
Sbjct:    83 EEYRATYLGTPFDASSLEMTESD-KIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTG 141

Query:   774 MLEGQYAIKTGK--LVEFSKSQLVECAKQC--SGCDGCFFEPSIEYT-HQAGLESEKDYP 828
               EG   +  GK  LV  S+  L++C+     +GC+G     + EY  +  G+++E  YP
Sbjct:   142 ATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYP 201

Query:   829 YKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN-SDLIHDYNG 887
             Y   +G+K  C ++   V           +GSE+        GP SV ++ S+       
Sbjct:   202 YTAEDGKK--CKFNPKNVAAQLSSYVNVTSGSESDLAAKVTQGPTSVAIDASNQSFQLYV 259

Query:   888 TPIRKNDETCSPYDLGHAVLLVGYG 912
             + I  N+  CS   L H VL VG+G
Sbjct:   260 SGIY-NEPACSSTQLDHGVLAVGFG 283

 Score = 214 (80.4 bits), Expect = 4.5e-18, Sum P(2) = 4.5e-18
 Identities = 54/172 (31%), Positives = 81/172 (47%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECAKQC--S 71
             DWR +    P  +Q  CG CW+FS  G  EG   +  GK  LV  S+  L++C+     +
Sbjct:   115 DWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNN 174

Query:    72 GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE 130
             GC+G     + EY  +  G+++E  YPY   +G+K  C ++   V           +GSE
Sbjct:   175 GCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKK--CKFNPKNVAAQLSSYVNVTSGSE 232

Query:   131 TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +        GP SV ++ S+       + I  N+  CS   L H VL VG+G
Sbjct:   233 SDLAAKVTQGPTSVAIDASNQSFQLYVSGIY-NEPACSSTQLDHGVLAVGFG 283

 Score = 86 (35.3 bits), Expect = 5.1e-20, Sum P(3) = 5.1e-20
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query:   987 YWLVRNSWGPIGPDEGFFKIERGNN-ACGIEQIAGYAT 1023
             YW+V+NSWG     +G+  + +GNN  CGI  +A   T
Sbjct:   418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPT 455

 Score = 86 (35.3 bits), Expect = 2.8e-18, Sum P(3) = 2.8e-18
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNN-ACGIEQIAGYAT 955
             YW+V+NSWG     +G+  + +GNN  CGI  +A   T
Sbjct:   418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPT 455

 Score = 86 (35.3 bits), Expect = 4.5e-18, Sum P(2) = 4.5e-18
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNN-ACGIEQIAGYAT 224
             YW+V+NSWG     +G+  + +GNN  CGI  +A   T
Sbjct:   418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPT 455

 Score = 82 (33.9 bits), Expect = 2.1e-17, Sum P(3) = 2.1e-17
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query:   554 YWLARNSWGPIGPDEGFFKIERGNN-ACGIEQIAGYAT 590
             YW+ +NSWG     +G+  + +GNN  CGI  +A   T
Sbjct:   418 YWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPT 455

 Score = 55 (24.4 bits), Expect = 5.1e-20, Sum P(3) = 5.1e-20
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG 980
             + N+  CS   L H VL VG+G
Sbjct:   262 IYNEPACSSTQLDHGVLAVGFG 283

 Score = 39 (18.8 bits), Expect = 8.2e-18, Sum P(3) = 8.2e-18
 Identities = 13/53 (24%), Positives = 24/53 (45%)

Query:   391 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG 443
             +G Q+  GS ++ S +G   G  +  +G  V+ + +     +   SG G   G
Sbjct:   332 SGSQSFSGSLYSGSYSGSQSGSQSGNSGAAVKQTGAGSGSGSGSGSGSGSGSG 384

 Score = 38 (18.4 bits), Expect = 2.8e-18, Sum P(3) = 2.8e-18
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE 788
             +G Q+  GS ++ S +G   G  +  +G  V+
Sbjct:   332 SGSQSFSGSLYSGSYSGSQSGSQSGNSGAAVK 363


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 247 (92.0 bits), Expect = 9.9e-20, P = 9.9e-20
 Identities = 62/222 (27%), Positives = 102/222 (45%)

Query:    11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
             VP + DWR         +Q  CG CWAF+    +EG Y I+ G LV  S+ ++++CA   
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY 61

Query:    71 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
              GC G +   + ++     G+ ++++YPY+   G      Y  +   + TG  ++  N  
Sbjct:    62 -GCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGT-CNANYFPNSAYI-TGYSYVRRNDE 118

Query:   130 ETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 187
               M   +    P++ L+++  D    Y G         C  + L HA+ ++GYG+     
Sbjct:   119 SHMMYAVSNQ-PIAALIDASGDNFQYYKGGVY---SGPCG-FSLNHAITIIGYGRDS--- 170

Query:   188 YWLVRNSWGPIGPDEGFFKIER----GNNACGIEQIAGYATI 225
             YW+VRNSWG      G+ +I R        CGI     + T+
Sbjct:   171 YWIVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPTL 212

 Score = 247 (92.0 bits), Expect = 9.9e-20, P = 9.9e-20
 Identities = 62/222 (27%), Positives = 102/222 (45%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP + DWR         +Q  CG CWAF+    +EG Y I+ G LV  S+ ++++CA   
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY 61

Query:   802 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 860
              GC G +   + ++     G+ ++++YPY+   G      Y  +   + TG  ++  N  
Sbjct:    62 -GCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGT-CNANYFPNSAYI-TGYSYVRRNDE 118

Query:   861 ETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 918
               M   +    P++ L+++  D    Y G         C  + L HA+ ++GYG+     
Sbjct:   119 SHMMYAVSNQ-PIAALIDASGDNFQYYKGGVY---SGPCG-FSLNHAITIIGYGRDS--- 170

Query:   919 YWLVRNSWGPIGPDEGFFKIER----GNNACGIEQIAGYATI 956
             YW+VRNSWG      G+ +I R        CGI     + T+
Sbjct:   171 YWIVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPTL 212

 Score = 228 (85.3 bits), Expect = 1.1e-17, P = 1.1e-17
 Identities = 60/223 (26%), Positives = 99/223 (44%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP + DWR         +Q  CG CWAF+    +EG Y I+ G LV  S+ ++++CA   
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY 61

Query:   436 SGCGGCDGLEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNG 494
              GC G   + +  ++     G+ ++++YPYR   G      Y  +   + TG  ++  N 
Sbjct:    62 -GCKG-GWVNRAYDFIISNNGVTTDENYPYRAYQGT-CNANYFPNSAYI-TGYSYVRRND 117

Query:   495 SETMKKILYKYGPLS--VGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
                M   +    P++  +  +     +Y G         C  + L HA+ ++GYG+    
Sbjct:   118 ESHMMYAVSNQ-PIAALIDASGDNFQYYKGGVY---SGPCG-FSLNHAITIIGYGRDS-- 170

Query:   553 PYWLARNSWGPIGPDEGFFKIER----GNNACGIEQIAGYATI 591
              YW+ RNSWG      G+ +I R        CGI     + T+
Sbjct:   171 -YWIVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPTL 212


>DICTYBASE|DDB_G0279185
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 229 (85.7 bits), Expect = 1.4e-19, Sum P(3) = 1.4e-19
 Identities = 76/266 (28%), Positives = 114/266 (42%)

Query:   658 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFK--WS 715
             F  +++   R Y++ EE   RF  FK +       Y  +E++ +  E +L    F    +
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRFNIFKANMD-----Y-INEWNTKGSETVLGLNVFADITN 82

Query:   716 ER---TYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIA 772
             E    TY     D              G   ++ DWR K    P  +Q  CG CW+FS  
Sbjct:    83 EEYRATYLGTPFDASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSAT 142

Query:   773 GMLEGQYAIKTGK--LVEFSKSQLVECAKQC--SGCDGCFFEPSIEYT-HQAGLESEKDY 827
             G  EG   I  G   L   S+ QL++C+     +GC+G     + EY  +  G+++E  Y
Sbjct:   143 GATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSY 202

Query:   828 PYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN-SDLIHDYN 886
             P+  AN EK  C Y+ S +           +GSE+        GP SV ++ S     + 
Sbjct:   203 PF-TANTEK--CKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTSVAIDASQPSFQFY 259

Query:   887 GTPIRKNDETCSPYDLGHAVLLVGYG 912
              + I  N+  CS   L H VL VG+G
Sbjct:   260 SSGIY-NEPACSSTQLDHGVLAVGFG 284

 Score = 227 (85.0 bits), Expect = 2.4e-19, Sum P(3) = 2.4e-19
 Identities = 75/266 (28%), Positives = 112/266 (42%)

Query:   292 FKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCKTGFK--WS 349
             F  +++   R Y++ EE   RF  FK +       Y  +E++ +  E +L    F    +
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRFNIFKANMD-----Y-INEWNTKGSETVLGLNVFADITN 82

Query:   350 ER---TYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIA 406
             E    TY     D              G   ++ DWR K    P  +Q  CG CW+FS  
Sbjct:    83 EEYRATYLGTPFDASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSAT 142

Query:   407 GMLEGQYAIKTGK--LVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT-HQAGLESEKD 461
             G  EG   I  G   L   S+ QL++C+    G  GC+G  +    EY  +  G+++E  
Sbjct:   143 GATEGAQYIANGDSDLTSVSEQQLIDCSGSY-GNNGCEGGLMTLAFEYIINNGGIDTESS 201

Query:   462 YPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYN 521
             YP+   N EK  C Y+ S +           +GSE+        GP SV +++    F  
Sbjct:   202 YPF-TANTEK--CKYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTSVAIDASQPSFQF 258

Query:   522 GTPIRKNDETCSPYDLGHAVLLVGYG 547
              +    N+  CS   L H VL VG+G
Sbjct:   259 YSSGIYNEPACSSTQLDHGVLAVGFG 284

 Score = 224 (83.9 bits), Expect = 1.8e-17, Sum P(2) = 1.8e-17
 Identities = 57/172 (33%), Positives = 82/172 (47%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECAKQC--S 71
             DWR K    P  +Q +CG CW+FS  G  EG   I  G   L   S+ QL++C+     +
Sbjct:   117 DWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNN 176

Query:    72 GCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE 130
             GC+G     + EY  +  G+++E  YP+  AN EK  C Y+ S +           +GSE
Sbjct:   177 GCEGGLMTLAFEYIINNGGIDTESSYPF-TANTEK--CKYNPSNIGAELSSYVNVTSGSE 233

Query:   131 TMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 181
             +        GP SV ++ S     +  + I  N+  CS   L H VL VG+G
Sbjct:   234 SDLAAKVTQGPTSVAIDASQPSFQFYSSGIY-NEPACSSTQLDHGVLAVGFG 284

 Score = 68 (29.0 bits), Expect = 1.4e-19, Sum P(3) = 1.4e-19
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query:   987 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIA 1019
             YW+V+NSWG      G+  + +  +N CGI  +A
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDNQCGIATMA 422

 Score = 68 (29.0 bits), Expect = 4.6e-18, Sum P(2) = 4.6e-18
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query:   919 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIA 951
             YW+V+NSWG      G+  + +  +N CGI  +A
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDNQCGIATMA 422

 Score = 68 (29.0 bits), Expect = 1.8e-17, Sum P(2) = 1.8e-17
 Identities = 13/34 (38%), Positives = 20/34 (58%)

Query:   188 YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIA 220
             YW+V+NSWG      G+  + +  +N CGI  +A
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDNQCGIATMA 422

 Score = 64 (27.6 bits), Expect = 2.1e-17, Sum P(2) = 2.1e-17
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query:   554 YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIA 586
             YW+ +NSWG      G+  + +  +N CGI  +A
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDNQCGIATMA 422

 Score = 55 (24.4 bits), Expect = 1.4e-19, Sum P(3) = 1.4e-19
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query:   959 VKNDETCSPYDLGHAVLLVGYG 980
             + N+  CS   L H VL VG+G
Sbjct:   263 IYNEPACSSTQLDHGVLAVGFG 284


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 242 (90.2 bits), Expect = 7.5e-19, Sum P(2) = 7.5e-19
 Identities = 70/231 (30%), Positives = 110/231 (47%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIK---TGKLVEFSKSQLVECAKQCSG 437
             DWRKK       +Q +C  CW+FS  G  EG + +    T +LV  S+  L++C+    G
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPF-G 175

Query:   438 CGGCDG--LEQPIEYT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFN- 493
               GC+G  +    EY     G+++EK YP+   +G    C Y KS+    T   ++    
Sbjct:   176 NTGCNGGVITYAFEYIISNGGIDTEKSYPFEGTDGT---CRY-KSENSGATISSYVNVTF 231

Query:   494 GSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG----- 547
             GSE+  +      P++  ++ SH    +  + I   +  CS  +L H VL+VGYG     
Sbjct:   232 GSESSLESAVNVNPVACSIDASHSSFLFYKSGIYF-EPACSRTNLDHGVLVVGYGTENSQ 290

Query:   548 KQDDI--P----YWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 591
              QD    P    YW+A+NSWG      G+  + +  +N CGI  +A +  +
Sbjct:   291 SQDSSSEPNHSNYWIAKNSWGI----NGYILMSKDRDNMCGISTLASFPIV 337

 Score = 237 (88.5 bits), Expect = 8.7e-18, Sum P(2) = 8.7e-18
 Identities = 67/232 (28%), Positives = 111/232 (47%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIK---TGKLVEFSKSQLVECAKQC-- 801
             DWRKK       +Q +C  CW+FS  G  EG + +    T +LV  S+  L++C+     
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGN 176

Query:   802 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-G 859
             +GC+G     + EY     G+++EK YP++  +G    C Y KS+    T   +++   G
Sbjct:   177 TGCNGGVITYAFEYIISNGGIDTEKSYPFEGTDGT---CRY-KSENSGATISSYVNVTFG 232

Query:   860 SETMKKILYKYGPLSVLLN---SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG---- 912
             SE+  +      P++  ++   S  +   +G      +  CS  +L H VL+VGYG    
Sbjct:   233 SESSLESAVNVNPVACSIDASHSSFLFYKSGIYF---EPACSRTNLDHGVLVVGYGTENS 289

Query:   913 -KQDNI--P----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 956
               QD+   P    YW+ +NSWG      G+  + +  +N CGI  +A +  +
Sbjct:   290 QSQDSSSEPNHSNYWIAKNSWGI----NGYILMSKDRDNMCGISTLASFPIV 337

 Score = 236 (88.1 bits), Expect = 2.5e-17, P = 2.5e-17
 Identities = 67/232 (28%), Positives = 110/232 (47%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIK---TGKLVEFSKSQLVECAKQC-- 70
             DWRKK       +Q  C  CW+FS  G  EG + +    T +LV  S+  L++C+     
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGN 176

Query:    71 SGCDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-G 128
             +GC+G     + EY     G+++EK YP++  +G    C Y KS+    T   +++   G
Sbjct:   177 TGCNGGVITYAFEYIISNGGIDTEKSYPFEGTDGT---CRY-KSENSGATISSYVNVTFG 232

Query:   129 SETMKKILYKYGPLSVLLN---SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG---- 181
             SE+  +      P++  ++   S  +   +G      +  CS  +L H VL+VGYG    
Sbjct:   233 SESSLESAVNVNPVACSIDASHSSFLFYKSGIYF---EPACSRTNLDHGVLVVGYGTENS 289

Query:   182 -KQDNI--P----YWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATI 225
               QD+   P    YW+ +NSWG      G+  + +  +N CGI  +A +  +
Sbjct:   290 QSQDSSSEPNHSNYWIAKNSWGI----NGYILMSKDRDNMCGISTLASFPIV 337

 Score = 40 (19.1 bits), Expect = 7.5e-19, Sum P(2) = 7.5e-19
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query:    37 AFSIAGMLEG-QYAIKTGKLVEFSKSQLVECAKQCSGCDGCF 77
             A S+ G  E   ++ K    V++ K   V   K    C GC+
Sbjct:    96 ASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCSGCW 137

 Score = 39 (18.8 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 12/41 (29%), Positives = 17/41 (41%)

Query:   402 AFSIAGMLEG-QYAIKTGKLVEFSKSQLVECAKQCSGCGGC 441
             A S+ G  E   ++ K    V++ K   V   K    C GC
Sbjct:    96 ASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCSGC 136


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 248 (92.4 bits), Expect = 1.9e-18, P = 1.9e-18
 Identities = 73/236 (30%), Positives = 103/236 (43%)

Query:     7 KDGPVPDAWDWRK-----KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKS 61
             + G +PD +D R        V GP  DQ  CG CWAF+   + E    + +      S  
Sbjct:   127 QSGDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQ 186

Query:    62 QLVECAKQ--CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF-KCAYD-KSKVKL 117
             ++ +CA      GC G      ++  H  G  S+ DYPY+         C  D KS V  
Sbjct:   187 EICDCADSGDTPGCVGGDPRNGLKMVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQ 246

Query:   118 FTGKDFLHFN---GSETMKKILY-KYGPLSVLLN-SDLIHDYNGTPIRKND-ETCSPYDL 171
                 +   F+     E + + LY  + P +V     +    Y    ++  D    +P + 
Sbjct:   247 PETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQSEDCYQMTPAEW 306

Query:   172 GHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
              H+V +VGYG  D+ +PYWLVRNSW       G+ KI RG N C IE  A  A ID
Sbjct:   307 -HSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAATAMID 361

 Score = 246 (91.7 bits), Expect = 3.5e-18, P = 3.5e-18
 Identities = 72/235 (30%), Positives = 101/235 (42%)

Query:   374 GPVPDAWDWRK-----KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 428
             G +PD +D R        V GP  DQ  CG CWAF+   + E    + +      S  ++
Sbjct:   129 GDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEI 188

Query:   429 VECAKQ--CSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKF-KCAYDKSKVKLFT 485
              +CA      GC G D     ++  H  G  S+ DYPY          C  D+    +  
Sbjct:   189 CDCADSGDTPGCVGGDP-RNGLKMVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQP 247

Query:   486 GKDFLY-FN---GSETMKKILY-KYGPLSVGLN-SHLIHFYNGTPIRKND-ETCSPYDLG 538
                 +Y F+     E + + LY  + P +V         +Y    ++  D    +P +  
Sbjct:   248 ETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQSEDCYQMTPAEW- 306

Query:   539 HAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 592
             H+V +VGYG  DD +PYWL RNSW       G+ KI RG N C IE  A  A ID
Sbjct:   307 HSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAATAMID 361

 Score = 246 (91.7 bits), Expect = 3.5e-18, P = 3.5e-18
 Identities = 73/234 (31%), Positives = 102/234 (43%)

Query:   740 GPVPDAWDWRK-----KNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 794
             G +PD +D R        V GP  DQ  CG CWAF+   + E    + +      S  ++
Sbjct:   129 GDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEI 188

Query:   795 VECAKQ--CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF-KCAYD-KSKVKLFT 850
              +CA      GC G      ++  H  G  S+ DYPY+         C  D KS V    
Sbjct:   189 CDCADSGDTPGCVGGDPRNGLKMVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPE 248

Query:   851 GKDFLHFN---GSETMKKILY-KYGPLSVLLN-SDLIHDYNGTPIRKND-ETCSPYDLGH 904
               +   F+     E + + LY  + P +V     +    Y    ++  D    +P +  H
Sbjct:   249 TLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQSEDCYQMTPAEW-H 307

Query:   905 AVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 957
             +V +VGYG  D+ +PYWLVRNSW       G+ KI RG N C IE  A  A ID
Sbjct:   308 SVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAATAMID 361

 Score = 143 (55.4 bits), Expect = 2.1e-06, P = 2.1e-06
 Identities = 30/55 (54%), Positives = 34/55 (61%)

Query:   972 HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 1025
             H+V +VGYG  DD +PYWLVRNSW       G+ KI RG N C IE  A  A ID
Sbjct:   307 HSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAATAMID 361


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 238 (88.8 bits), Expect = 8.2e-18, Sum P(2) = 8.2e-18
 Identities = 77/322 (23%), Positives = 138/322 (42%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFE---YFKQDGHKKHERYGTSEFSDRSPEEI 706
             ++EN  + F  F+    + Y N +++  + +   Y  Q G  K     T+EF  R    +
Sbjct:    57 ESEN-QQRFNNFV----KSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEFHGRLSNVV 111

Query:   707 LCK-TGFKWSERTYERIVAD-RXXXXXXXXXXXXDGPVPDAWDWRKKNVTG-----PAGD 759
                 TG       +++   D R                PD +D R + + G     P  D
Sbjct:   112 PSNNTGLPMLN--FDKKKPDFRAADMNKTRHKRRSTRYPDYFDLRNEKINGRYIVGPIKD 169

Query:   760 QAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS-GCDGCFFEPSIEYTHQ 818
             Q  C  CW F++  ++E  YA  +GK    S  ++ +C  + + GC G      ++Y  +
Sbjct:   170 QGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTPGCKGGSLTLGVQYVKK 229

Query:   819 AGLESEKDYPY-KNANGEKFKCAY-DKSKVKLFTGKDFLHFN---GSETMKKILYKYG-P 872
              GL  ++DYPY +N   +  +C   +  ++      +F   N     E + ++L ++  P
Sbjct:   230 YGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQVLTEWKVP 289

Query:   873 LSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI-----PYWLVRNSW 926
             ++V     D   +Y    I ++D  C      HA  +VGY   ++       YW+++NSW
Sbjct:   290 VAVYFKVGDQFKEYKEGVIIEDD--CRRATQWHAGAIVGYDTVEDSRGRSHDYWIIKNSW 347

Query:   927 GPIGPDEGFFKIERGNNACGIE 948
             G    + G+ ++ RG + C IE
Sbjct:   348 GGDWAESGYVRVVRGRDWCSIE 369

 Score = 236 (88.1 bits), Expect = 9.8e-17, P = 9.8e-17
 Identities = 59/224 (26%), Positives = 105/224 (46%)

Query:    12 PDAWDWRKKNVTG-----PAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 66
             PD +D R + + G     P  DQ  C  CW F++  ++E  YA  +GK    S  ++ +C
Sbjct:   148 PDYFDLRNEKINGRYIVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDC 207

Query:    67 AKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPY-KNANGEKFKCAY-DKSKVKLFTGKDF 123
               + + GC G      ++Y  + GL  ++DYPY +N   +  +C   +  ++      +F
Sbjct:   208 GTEGTPGCKGGSLTLGVQYVKKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNF 267

Query:   124 LHFN---GSETMKKILYKYG-PLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLV 178
                N     E + ++L ++  P++V     D   +Y    I ++D  C      HA  +V
Sbjct:   268 AVINPRRAEEQIIQVLTEWKVPVAVYFKVGDQFKEYKEGVIIEDD--CRRATQWHAGAIV 325

Query:   179 GYGKQDNI-----PYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             GY   ++       YW+++NSWG    + G+ ++ RG + C IE
Sbjct:   326 GYDTVEDSRGRSHDYWIIKNSWGGDWAESGYVRVVRGRDWCSIE 369

 Score = 229 (85.7 bits), Expect = 6.2e-16, P = 6.2e-16
 Identities = 78/325 (24%), Positives = 137/325 (42%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFE---YFKQDGHKKHERYGTSEFSDRSPEEI 340
             ++EN  + F  F+    + Y N +++  + +   Y  Q G  K     T+EF  R    +
Sbjct:    57 ESEN-QQRFNNFV----KSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEFHGRLSNVV 111

Query:   341 LCK-TGFKWSERTYERIVAD-RXXXXXXXXXXXXDGPVPDAWDWRKKNVTG-----PAGD 393
                 TG       +++   D R                PD +D R + + G     P  D
Sbjct:   112 PSNNTGLPMLN--FDKKKPDFRAADMNKTRHKRRSTRYPDYFDLRNEKINGRYIVGPIKD 169

Query:   394 QAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIEYT 451
             Q  C  CW F++  ++E  YA  +GK    S  ++ +C  +  G  GC G  L   ++Y 
Sbjct:   170 QGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTE--GTPGCKGGSLTLGVQYV 227

Query:   452 HQAGLESEKDYPY-RNGNGEKFKCAY-DKSKVKLFTGKDFLYFN---GSETMKKILYKYG 506
              + GL  ++DYPY +N   +  +C   +  ++      +F   N     E + ++L ++ 
Sbjct:   228 KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQVLTEWK 287

Query:   507 -PLSV--GLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI-----PYWLAR 558
              P++V   +      +  G  I   ++ C      HA  +VGY   +D       YW+ +
Sbjct:   288 VPVAVYFKVGDQFKEYKEGVII---EDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWIIK 344

Query:   559 NSWGPIGPDEGFFKIERGNNACGIE 583
             NSWG    + G+ ++ RG + C IE
Sbjct:   345 NSWGGDWAESGYVRVVRGRDWCSIE 369

 Score = 51 (23.0 bits), Expect = 8.2e-18, Sum P(2) = 8.2e-18
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query:   286 ENILETFKAFIVKRGRQYANDEEIKERFEYF 316
             E + + F+ F  K  R+Y ++ E ++RF  F
Sbjct:    37 EKLYKAFEDFKKKYNRKYKDESENQQRFNNF 67


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 222 (83.2 bits), Expect = 4.6e-17, P = 4.6e-17
 Identities = 52/136 (38%), Positives = 72/136 (52%)

Query:    93 EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNS 148
             E  YPYK  +G+   C Y  SK   F  KD  +   N  + M + +  Y P+S    + S
Sbjct:     3 EDSYPYKGQDGD---CKYQPSKAIAFV-KDVANITINDEQAMVEAVALYNPVSFAFEVTS 58

Query:   149 DLIHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 206
             D +    G     +  +C  +P  + HAVL VGYG+Q+ IPYW+V+NSWGP     G+F 
Sbjct:    59 DFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFL 115

Query:   207 IERGNNACGIEQIAGY 222
             +ERG N CG+   A Y
Sbjct:   116 MERGKNMCGLAACASY 131

 Score = 222 (83.2 bits), Expect = 4.6e-17, P = 4.6e-17
 Identities = 52/136 (38%), Positives = 72/136 (52%)

Query:   824 EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHF--NGSETMKKILYKYGPLSVL--LNS 879
             E  YPYK  +G+   C Y  SK   F  KD  +   N  + M + +  Y P+S    + S
Sbjct:     3 EDSYPYKGQDGD---CKYQPSKAIAFV-KDVANITINDEQAMVEAVALYNPVSFAFEVTS 58

Query:   880 DLIHDYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFK 937
             D +    G     +  +C  +P  + HAVL VGYG+Q+ IPYW+V+NSWGP     G+F 
Sbjct:    59 DFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFL 115

Query:   938 IERGNNACGIEQIAGY 953
             +ERG N CG+   A Y
Sbjct:   116 MERGKNMCGLAACASY 131

 Score = 213 (80.0 bits), Expect = 4.2e-16, P = 4.2e-16
 Identities = 49/136 (36%), Positives = 71/136 (52%)

Query:   459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--LYFNGSETMKKILYKYGPLSVG--LNS 514
             E  YPY+  +G+   C Y  SK   F  KD   +  N  + M + +  Y P+S    + S
Sbjct:     3 EDSYPYKGQDGD---CKYQPSKAIAFV-KDVANITINDEQAMVEAVALYNPVSFAFEVTS 58

Query:   515 HLIHFYNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFK 572
               + +  G     +  +C  +P  + HAVL VGYG+Q+ IPYW+ +NSWGP     G+F 
Sbjct:    59 DFMMYRKGI---YSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFL 115

Query:   573 IERGNNACGIEQIAGY 588
             +ERG N CG+   A Y
Sbjct:   116 MERGKNMCGLAACASY 131

 Score = 174 (66.3 bits), Expect = 6.2e-12, P = 6.2e-12
 Identities = 30/56 (53%), Positives = 39/56 (69%)

Query:   966 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             +P  + HAVL VGYG+Q+ IPYW+V+NSWGP     G+F +ERG N CG+   A Y
Sbjct:    76 TPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMCGLAACASY 131


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 218 (81.8 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 65/182 (35%), Positives = 92/182 (50%)

Query:     6 EKD--GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             EKD    VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++
Sbjct:   326 EKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEV 385

Query:    64 VECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKD 122
             V+C+K   GCDG     S  Y  Q  L    +Y YK A  + F   Y  K KV L +   
Sbjct:   386 VDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS--- 441

Query:   123 FLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
              +       +   L + GPLSV +  N+D +    G      + TCS  +L H+VLLVGY
Sbjct:   442 -IGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGY 495

Query:   181 GK 182
             G+
Sbjct:   496 GQ 497

 Score = 217 (81.4 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 62/175 (35%), Positives = 89/175 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++V+C+K  
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   802 SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGS 860
              GCDG     S  Y  Q  L    +Y YK A  + F   Y  K KV L +    +     
Sbjct:   393 FGCDGGHPFYSFLYVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS----IGAVKE 447

Query:   861 ETMKKILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 913
               +   L + GPLSV +  N+D +    G      + TCS  +L H+VLLVGYG+
Sbjct:   448 NQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGYGQ 497

 Score = 204 (76.9 bits), Expect = 9.1e-16, Sum P(3) = 9.1e-16
 Identities = 58/176 (32%), Positives = 88/176 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++V+C+K  
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   436 SGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYFNG 494
              GC G       + Y  Q  L    +Y Y+    + F   Y  K KV L +    +    
Sbjct:   393 FGCDGGHPFYSFL-YVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS----IGAVK 446

Query:   495 SETMKKILYKYGPLSV--GLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
                +   L + GPLSV  G+N+  + +  G      + TCS  +L H+VLLVGYG+
Sbjct:   447 ENQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGYGQ 497

 Score = 76 (31.8 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 16/42 (38%), Positives = 24/42 (57%)

Query:   184 DNIPY-WLVRNSWGPIGPDEGFFKIERGNNA----CGI-EQI 219
             DNI Y W+++NSW     + GF ++ R  N     CGI E++
Sbjct:   523 DNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV 564

 Score = 70 (29.7 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 16/43 (37%), Positives = 23/43 (53%)

Query:   550 DD--IPYWLARNSWGPIGPDEGFFKIERGNNA----CGI-EQI 585
             DD  I YW+ +NSW     + GF ++ R  N     CGI E++
Sbjct:   522 DDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV 564

 Score = 52 (23.4 bits), Expect = 9.1e-16, Sum P(3) = 9.1e-16
 Identities = 21/74 (28%), Positives = 32/74 (43%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD--GHKKHERYG--------TSEFS 333
             +N      F  F+ +  + Y N +E   +FE FK +    K H +           ++FS
Sbjct:   217 NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS 276

Query:   334 DRSPEEILCKTGFK 347
             D S EE+  K  FK
Sbjct:   277 DYSEEEL--KEYFK 288

 Score = 52 (23.4 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 21/74 (28%), Positives = 32/74 (43%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD--GHKKHERYG--------TSEFS 699
             +N      F  F+ +  + Y N +E   +FE FK +    K H +           ++FS
Sbjct:   217 NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS 276

Query:   700 DRSPEEILCKTGFK 713
             D S EE+  K  FK
Sbjct:   277 DYSEEEL--KEYFK 288


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 218 (81.8 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 65/182 (35%), Positives = 92/182 (50%)

Query:     6 EKD--GPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             EKD    VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++
Sbjct:   326 EKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEV 385

Query:    64 VECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKD 122
             V+C+K   GCDG     S  Y  Q  L    +Y YK A  + F   Y  K KV L +   
Sbjct:   386 VDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS--- 441

Query:   123 FLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY 180
              +       +   L + GPLSV +  N+D +    G      + TCS  +L H+VLLVGY
Sbjct:   442 -IGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGY 495

Query:   181 GK 182
             G+
Sbjct:   496 GQ 497

 Score = 217 (81.4 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 62/175 (35%), Positives = 89/175 (50%)

Query:   742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
             VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++V+C+K  
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   802 SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYD-KSKVKLFTGKDFLHFNGS 860
              GCDG     S  Y  Q  L    +Y YK A  + F   Y  K KV L +    +     
Sbjct:   393 FGCDGGHPFYSFLYVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS----IGAVKE 447

Query:   861 ETMKKILYKYGPLSVLL--NSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 913
               +   L + GPLSV +  N+D +    G      + TCS  +L H+VLLVGYG+
Sbjct:   448 NQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGYGQ 497

 Score = 204 (76.9 bits), Expect = 9.1e-16, Sum P(3) = 9.1e-16
 Identities = 58/176 (32%), Positives = 88/176 (50%)

Query:   376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
             VP+  D+R+K +     DQ  CGSCWAF+  G +E  +A K   ++ FS+ ++V+C+K  
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   436 SGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYD-KSKVKLFTGKDFLYFNG 494
              GC G       + Y  Q  L    +Y Y+    + F   Y  K KV L +    +    
Sbjct:   393 FGCDGGHPFYSFL-YVLQNELCLGDEYKYK-AKDDMFCLNYRCKRKVSLSS----IGAVK 446

Query:   495 SETMKKILYKYGPLSV--GLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 548
                +   L + GPLSV  G+N+  + +  G      + TCS  +L H+VLLVGYG+
Sbjct:   447 ENQLILALNEVGPLSVNVGVNNDFVAYSEGV----YNGTCSE-ELNHSVLLVGYGQ 497

 Score = 76 (31.8 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 16/42 (38%), Positives = 24/42 (57%)

Query:   184 DNIPY-WLVRNSWGPIGPDEGFFKIERGNNA----CGI-EQI 219
             DNI Y W+++NSW     + GF ++ R  N     CGI E++
Sbjct:   523 DNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV 564

 Score = 70 (29.7 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 16/43 (37%), Positives = 23/43 (53%)

Query:   550 DD--IPYWLARNSWGPIGPDEGFFKIERGNNA----CGI-EQI 585
             DD  I YW+ +NSW     + GF ++ R  N     CGI E++
Sbjct:   522 DDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV 564

 Score = 52 (23.4 bits), Expect = 9.1e-16, Sum P(3) = 9.1e-16
 Identities = 21/74 (28%), Positives = 32/74 (43%)

Query:   284 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD--GHKKHERYG--------TSEFS 333
             +N      F  F+ +  + Y N +E   +FE FK +    K H +           ++FS
Sbjct:   217 NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS 276

Query:   334 DRSPEEILCKTGFK 347
             D S EE+  K  FK
Sbjct:   277 DYSEEEL--KEYFK 288

 Score = 52 (23.4 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 21/74 (28%), Positives = 32/74 (43%)

Query:   650 DNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD--GHKKHERYG--------TSEFS 699
             +N      F  F+ +  + Y N +E   +FE FK +    K H +           ++FS
Sbjct:   217 NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS 276

Query:   700 DRSPEEILCKTGFK 713
             D S EE+  K  FK
Sbjct:   277 DYSEEEL--KEYFK 288


>DICTYBASE|DDB_G0286055
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 239 (89.2 bits), Expect = 7.9e-17, P = 7.9e-17
 Identities = 64/231 (27%), Positives = 111/231 (48%)

Query:   375 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 434
             P   ++DWR   V G   D + C S WAF+ AG+ E + A++T    ++S  QL++C   
Sbjct:   207 PTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINV 266

Query:   435 C------------SGCGGCDG-LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKV 481
             C            + C    G L + + Y    GL++   YPY   +     C+Y++S +
Sbjct:   267 CIIIFSNFSIGNYTKCSRFSGELNKALMYAQAYGLQATSTYPYVGASS--IGCSYNQSSI 324

Query:   482 KLFTGKDFLYFN-GSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGH 539
              +  G D  Y   G +++ +   K GP+ VG+  ++   +Y G     N+      ++ H
Sbjct:   325 AV-EGGDVEYSQVGRDSIVEKCRKQGPVGVGIYVTNEFLYYAGGIFECNNTLIDNANINH 383

Query:   540 AVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 589
              VLLVGY ++D+  Y++ +N++G    + GF +I    N  C I +   Y+
Sbjct:   384 NVLLVGYNEKDN--YYIIKNNFGRTWGENGFARITADVNKDCLIAKNPAYS 432

 Score = 230 (86.0 bits), Expect = 8.0e-16, P = 8.0e-16
 Identities = 63/232 (27%), Positives = 113/232 (48%)

Query:    10 PVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 69
             P   ++DWR   V G   D ++C S WAF+ AG+ E + A++T    ++S  QL++C   
Sbjct:   207 PTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINV 266

Query:    70 C------------SGCDGCFFE--PSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKV 115
             C            + C     E   ++ Y    GL++   YPY  A+     C+Y++S +
Sbjct:   267 CIIIFSNFSIGNYTKCSRFSGELNKALMYAQAYGLQATSTYPYVGASS--IGCSYNQSSI 324

Query:   116 KLFTGKDFLHFN-GSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKNDETCSPYDLG 172
              +  G D  +   G +++ +   K GP+ V   + ++ ++ Y G     N+      ++ 
Sbjct:   325 AV-EGGDVEYSQVGRDSIVEKCRKQGPVGVGIYVTNEFLY-YAGGIFECNNTLIDNANIN 382

Query:   173 HAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 223
             H VLLVGY ++DN  Y++++N++G    + GF +I    N  C I +   Y+
Sbjct:   383 HNVLLVGYNEKDN--YYIIKNNFGRTWGENGFARITADVNKDCLIAKNPAYS 432

 Score = 227 (85.0 bits), Expect = 1.7e-15, P = 1.7e-15
 Identities = 63/232 (27%), Positives = 112/232 (48%)

Query:   741 PVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ 800
             P   ++DWR   V G   D + C S WAF+ AG+ E + A++T    ++S  QL++C   
Sbjct:   207 PTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINV 266

Query:   801 C------------SGCDGCFFE--PSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKV 846
             C            + C     E   ++ Y    GL++   YPY  A+     C+Y++S +
Sbjct:   267 CIIIFSNFSIGNYTKCSRFSGELNKALMYAQAYGLQATSTYPYVGASS--IGCSYNQSSI 324

Query:   847 KLFTGKDFLHFN-GSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKNDETCSPYDLG 903
              +  G D  +   G +++ +   K GP+ V   + ++ ++ Y G     N+      ++ 
Sbjct:   325 AV-EGGDVEYSQVGRDSIVEKCRKQGPVGVGIYVTNEFLY-YAGGIFECNNTLIDNANIN 382

Query:   904 HAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 954
             H VLLVGY ++DN  Y++++N++G    + GF +I    N  C I +   Y+
Sbjct:   383 HNVLLVGYNEKDN--YYIIKNNFGRTWGENGFARITADVNKDCLIAKNPAYS 432


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 223 (83.6 bits), Expect = 2.0e-15, P = 2.0e-15
 Identities = 64/209 (30%), Positives = 98/209 (46%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVECAKQCSGCD 74
             DWRKK +  P  DQ  CGSC+ FS    +E  + IK G K +  S+ Q V+C      C 
Sbjct:   150 DWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAW-IKAGNKPILLSEQQAVDCDPYDGQCG 208

Query:    75 GCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TM 132
             G       EY  Q G + +   YPY   +G    C  + S+        ++   G E T+
Sbjct:   209 GGDPYTVYEYFSQVGGVSTNAQYPYTATDGT---CV-NMSRAVPVVSYHYVTQGGDENTL 264

Query:   133 KKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY--GKQD--N-IP 187
              K +   GP+S+ +++     Y+G  I      C   ++ H V +VG    K D  N + 
Sbjct:   265 IKTIVNDGPVSICVDASTWQSYSGGIITTG---CGK-NIDHCVQVVGLEVDKTDPSNPVQ 320

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNNACGI 216
             Y+++RNSWG     +G+  +  G++ CGI
Sbjct:   321 YYIIRNSWGTDWGIDGYIYVATGSDLCGI 349

 Score = 222 (83.2 bits), Expect = 7.9e-17, Sum P(2) = 7.9e-17
 Identities = 64/209 (30%), Positives = 98/209 (46%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVECAKQCSGCD 805
             DWRKK +  P  DQ  CGSC+ FS    +E  + IK G K +  S+ Q V+C      C 
Sbjct:   150 DWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAW-IKAGNKPILLSEQQAVDCDPYDGQCG 208

Query:   806 GCFFEPSIEYTHQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TM 863
             G       EY  Q G + +   YPY   +G    C  + S+        ++   G E T+
Sbjct:   209 GGDPYTVYEYFSQVGGVSTNAQYPYTATDGT---CV-NMSRAVPVVSYHYVTQGGDENTL 264

Query:   864 KKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGY--GKQD--N-IP 918
              K +   GP+S+ +++     Y+G  I      C   ++ H V +VG    K D  N + 
Sbjct:   265 IKTIVNDGPVSICVDASTWQSYSGGIITTG---CGK-NIDHCVQVVGLEVDKTDPSNPVQ 320

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGI 947
             Y+++RNSWG     +G+  +  G++ CGI
Sbjct:   321 YYIIRNSWGTDWGIDGYIYVATGSDLCGI 349

 Score = 221 (82.9 bits), Expect = 1.0e-16, Sum P(2) = 1.0e-16
 Identities = 65/210 (30%), Positives = 98/210 (46%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG-KLVEFSKSQLVECAKQCSGCG 439
             DWRKK +  P  DQ  CGSC+ FS    +E  + IK G K +  S+ Q V+C      CG
Sbjct:   150 DWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAW-IKAGNKPILLSEQQAVDCDPYDGQCG 208

Query:   440 GCDGLEQPIEYTHQAG-LESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-T 497
             G D      EY  Q G + +   YPY   +G    C  + S+        ++   G E T
Sbjct:   209 GGDPYTV-YEYFSQVGGVSTNAQYPYTATDGT---CV-NMSRAVPVVSYHYVTQGGDENT 263

Query:   498 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGY--GKQDD---I 552
             + K +   GP+S+ +++     Y+G  I      C   ++ H V +VG    K D    +
Sbjct:   264 LIKTIVNDGPVSICVDASTWQSYSGGIITTG---CGK-NIDHCVQVVGLEVDKTDPSNPV 319

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNACGI 582
              Y++ RNSWG     +G+  +  G++ CGI
Sbjct:   320 QYYIIRNSWGTDWGIDGYIYVATGSDLCGI 349

 Score = 57 (25.1 bits), Expect = 7.9e-17, Sum P(2) = 7.9e-17
 Identities = 19/71 (26%), Positives = 36/71 (50%)

Query:   278 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYG 328
             +G +  D+ ++ +TF  +  K  + Y +  E++ RF  FK++  K  E         ++ 
Sbjct:    31 DGIIHSDS-SMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFE 89

Query:   329 TSEFSDRSPEE 339
             ++ FSD S EE
Sbjct:    90 SNGFSDLSEEE 100

 Score = 57 (25.1 bits), Expect = 7.9e-17, Sum P(2) = 7.9e-17
 Identities = 19/71 (26%), Positives = 36/71 (50%)

Query:   644 EGSLTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHE---------RYG 694
             +G +  D+ ++ +TF  +  K  + Y +  E++ RF  FK++  K  E         ++ 
Sbjct:    31 DGIIHSDS-SMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFE 89

Query:   695 TSEFSDRSPEE 705
             ++ FSD S EE
Sbjct:    90 SNGFSDLSEEE 100


>WB|WBGene00000786
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 169 (64.5 bits), Expect = 3.1e-09, P = 3.1e-09
 Identities = 52/171 (30%), Positives = 78/171 (45%)

Query:    71 SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCA--Y-DK--SKVKLFTGKDFLH 125
             +GC    F P   ++ +   +      Y     EK KC   Y DK  S+ K F    +  
Sbjct:   201 NGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK-KCVSDYTDKTYSEDKFFGASAYGV 259

Query:   126 FNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 183
              +  E ++K L  +GPL +      D ++   G  +    +       GHAV L+G+G  
Sbjct:   260 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGG----GHAVKLIGWGID 315

Query:   184 DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATIDVVIQRL 232
             D IPYW V NSW     ++GFF+I RG + CGIE   + G   ++ +  RL
Sbjct:   316 DGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKLNSLTSRL 366

 Score = 164 (62.8 bits), Expect = 8.1e-17, Sum P(2) = 8.1e-17
 Identities = 49/154 (31%), Positives = 71/154 (46%)

Query:   802 SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCA--Y-DK--SKVKLFTGKDFLH 856
             +GC    F P   ++ +   +      Y     EK KC   Y DK  S+ K F    +  
Sbjct:   201 NGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEK-KCVSDYTDKTYSEDKFFGASAYGV 259

Query:   857 FNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 914
              +  E ++K L  +GPL +      D ++   G  +    +       GHAV L+G+G  
Sbjct:   260 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGG----GHAVKLIGWGID 315

Query:   915 DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
             D IPYW V NSW     ++GFF+I RG + CGIE
Sbjct:   316 DGIPYWTVANSWNTDWGEDGFFRILRGVDECGIE 349

 Score = 160 (61.4 bits), Expect = 2.3e-16, Sum P(2) = 2.3e-16
 Identities = 45/138 (32%), Positives = 68/138 (49%)

Query:   470 EKFKCA--Y-DK--SKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSH--LIHFYNG 522
             EK KC   Y DK  S+ K F    +   +  E ++K L  +GPL +    +   +++  G
Sbjct:   234 EK-KCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGG 292

Query:   523 TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGI 582
               +    +       GHAV L+G+G  D IPYW   NSW     ++GFF+I RG + CGI
Sbjct:   293 VYVHTGGKLGG----GHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDECGI 348

Query:   583 EQ--IAGYATIDVVIQRL 598
             E   + G   ++ +  RL
Sbjct:   349 ESGVVGGIPKLNSLTSRL 366

 Score = 144 (55.7 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             GHAV L+G+G  D IPYW V NSW     ++GFF+I RG + CGIE
Sbjct:   304 GHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIE 349

 Score = 124 (48.7 bits), Expect = 8.1e-17, Sum P(2) = 8.1e-17
 Identities = 33/102 (32%), Positives = 49/102 (48%)

Query:   373 DGPVPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 426
             D  +P+++D    W K +      DQ++CGSCWAF     +  +  I + G+L V  S  
Sbjct:   102 DLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161

Query:   427 QLVECAKQCS-GCGGCDGLEQPIEYTHQAGLESEKDYPYRNG 467
              L+ C K C  GC G D L     Y  + G+ +  +Y   NG
Sbjct:   162 DLLSCCKSCGFGCNGGDPLAA-WRYWVKDGIVTGSNYTANNG 202

 Score = 112 (44.5 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 30/100 (30%), Positives = 48/100 (48%)

Query:   739 DGPVPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 792
             D  +P+++D    W K +      DQ++CGSCWAF     +  +  I + G+L V  S  
Sbjct:   102 DLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161

Query:   793 QLVECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKN 831
              L+ C K C  GC+G     +  Y  + G+ +  +Y   N
Sbjct:   162 DLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201

 Score = 111 (44.1 bits), Expect = 1.8e-15, Sum P(2) = 1.8e-15
 Identities = 30/100 (30%), Positives = 47/100 (47%)

Query:     8 DGPVPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 61
             D  +P+++D    W K +      DQ+ CGSCWAF     +  +  I + G+L V  S  
Sbjct:   102 DLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSAD 161

Query:    62 QLVECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKN 100
              L+ C K C  GC+G     +  Y  + G+ +  +Y   N
Sbjct:   162 DLLSCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 229 (85.7 bits), Expect = 1.9e-16, P = 1.9e-16
 Identities = 78/325 (24%), Positives = 145/325 (44%)

Query:   647 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD-------GHKKHERY--GTSE 697
             +T + ++I++  + ++ +  R Y ++ E + R + FK++        +  ++ Y  G +E
Sbjct:    27 VTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNE 86

Query:   698 FSDRSPEEILCK-TGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGP 756
             F+D   EE L   TG + +  +   +  ++            D    ++ DWR +    P
Sbjct:    87 FTDWKTEEFLATHTGLRVNVTSLSELF-NKTKPSRNWNMSDIDME-DESKDWRDEGAVTP 144

Query:   757 AGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCDGCFFEPSIEY 815
                Q   G+C    I+G            L+  S+ QL++C  ++  GC+G  FE + +Y
Sbjct:   145 VKYQ---GACRLTKISGK----------NLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKY 191

Query:   816 T-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLS 874
                  G+  E +YPY+    E  +    ++      G   +  +    + + + +  P+S
Sbjct:   192 IIKNGGVSLETEYPYQ-VKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQ-PVS 249

Query:   875 VLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPD 932
             VL+++  D    Y G      D  C   D+ HAV +VGYG    + YW+++NSWG    +
Sbjct:   250 VLIDARADSFGHYKGGVYAGLD--CGT-DVNHAVTIVGYGTMSGLNYWVLKNSWGESWGE 306

Query:   933 EGFFKIERG----NNACGIEQIAGY 953
              G+ +I R        CGI Q+A Y
Sbjct:   307 NGYMRIRRDVEWPQGMCGIAQVAAY 331

 Score = 200 (75.5 bits), Expect = 6.4e-13, P = 6.4e-13
 Identities = 59/215 (27%), Positives = 97/215 (45%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQCSGCD 74
             DWR +    P   Q   G+C    I+G            L+  S+ QL++C  ++  GC+
Sbjct:   135 DWRDEGAVTPVKYQ---GACRLTKISGK----------NLLTLSEQQLIDCDIEKNGGCN 181

Query:    75 GCFFEPSIEYT-HQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMK 133
             G  FE + +Y     G+  E +YPY+    E  +    ++      G   +  +    + 
Sbjct:   182 GGEFEEAFKYIIKNGGVSLETEYPYQ-VKKESCRANARRAPHTQIRGFQMVPSHNERALL 240

Query:   134 KILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 191
             + + +  P+SVL+++  D    Y G      D  C   D+ HAV +VGYG    + YW++
Sbjct:   241 EAVRRQ-PVSVLIDARADSFGHYKGGVYAGLD--CGT-DVNHAVTIVGYGTMSGLNYWVL 296

Query:   192 RNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 222
             +NSWG    + G+ +I R        CGI Q+A Y
Sbjct:   297 KNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 331

 Score = 192 (72.6 bits), Expect = 5.4e-12, P = 5.4e-12
 Identities = 76/326 (23%), Positives = 139/326 (42%)

Query:   281 LTFDNENILETFKAFIVKRGRQYANDEEIKERFEYFKQD-------GHKKHERY--GTSE 331
             +T + ++I++  + ++ +  R Y ++ E + R + FK++        +  ++ Y  G +E
Sbjct:    27 VTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNE 86

Query:   332 FSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPA 391
             F+D   EE L          T    + ++            D    ++ DWR +    P 
Sbjct:    87 FTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDME-DESKDWRDEGAVTPV 145

Query:   392 GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG--LEQPIE 449
               Q   G+C    I+G            L+  S+ QL++C  + +G  GC+G   E+  +
Sbjct:   146 KYQ---GACRLTKISGK----------NLLTLSEQQLIDCDIEKNG--GCNGGEFEEAFK 190

Query:   450 YT-HQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPL 508
             Y     G+  E +YPY+    E  +    ++      G   +  +    + + + +  P+
Sbjct:   191 YIIKNGGVSLETEYPYQVKK-ESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQ-PV 248

Query:   509 SVGLNSHLIHF--YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGP 566
             SV +++    F  Y G      D  C   D+ HAV +VGYG    + YW+ +NSWG    
Sbjct:   249 SVLIDARADSFGHYKGGVYAGLD--CGT-DVNHAVTIVGYGTMSGLNYWVLKNSWGESWG 305

Query:   567 DEGFFKIERG----NNACGIEQIAGY 588
             + G+ +I R        CGI Q+A Y
Sbjct:   306 ENGYMRIRRDVEWPQGMCGIAQVAAY 331

 Score = 129 (50.5 bits), Expect = 6.2e-05, P = 6.2e-05
 Identities = 23/57 (40%), Positives = 33/57 (57%)

Query:   969 DLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG----NNACGIEQIAGY 1021
             D+ HAV +VGYG    + YW+++NSWG    + G+ +I R        CGI Q+A Y
Sbjct:   275 DVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 331


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 216 (81.1 bits), Expect = 2.0e-16, P = 2.0e-16
 Identities = 53/139 (38%), Positives = 71/139 (51%)

Query:   696 SEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVT 754
             ++FSD S  EI  K  + WSE   +   A +             GP P + DWRKK N  
Sbjct:     4 NQFSDMSFAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFV 53

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCDGCFFEPS 812
              P  +Q ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G     +
Sbjct:    54 SPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 113

Query:   813 IEYT-HQAGLESEKDYPYK 830
              EY  +  G+  E  YPY+
Sbjct:   114 FEYILYNKGIMGEDTYPYQ 132

 Score = 215 (80.7 bits), Expect = 2.6e-16, P = 2.6e-16
 Identities = 56/141 (39%), Positives = 73/141 (51%)

Query:   330 SEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK-NVT 388
             ++FSD S  EI  K  + WSE   +   A +             GP P + DWRKK N  
Sbjct:     4 NQFSDMSFAEI--KHKYLWSEP--QNCSATKSNYLRGT------GPYPPSVDWRKKGNFV 53

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS--GCGGCDGL-E 445
              P  +Q ACGSCW FS  G LE   AI TGK++  ++ QLV+CA+  +  GC G  GL  
Sbjct:    54 SPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQG--GLPS 111

Query:   446 QPIEYT-HQAGLESEKDYPYR 465
             Q  EY  +  G+  E  YPY+
Sbjct:   112 QAFEYILYNKGIMGEDTYPYQ 132

 Score = 206 (77.6 bits), Expect = 2.4e-15, P = 2.4e-15
 Identities = 41/95 (43%), Positives = 54/95 (56%)

Query:     9 GPVPDAWDWRKK-NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 67
             GP P + DWRKK N   P  +Q  CGSCW FS  G LE   AI TGK++  ++ QLV+CA
Sbjct:    38 GPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCA 97

Query:    68 KQCS--GCDGCFFEPSIEYT-HQAGLESEKDYPYK 99
             +  +  GC G     + EY  +  G+  E  YPY+
Sbjct:    98 QDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 132


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 225 (84.3 bits), Expect = 7.5e-16, P = 7.5e-16
 Identities = 60/218 (27%), Positives = 112/218 (51%)

Query:   747 DWRKKNVTGPAGDQAAC-GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC--SG 803
             DWR  +   P  +Q  C G+ ++FS  G++E  + IK  +L+  S+  +++C      +G
Sbjct:   119 DWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNG 178

Query:   804 CDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKF----KCAYDK--SKVKLFTGKDFLH 856
             C G     + +Y   Q G++SE +YPY+    E +    +C Y+   SK  + +  +   
Sbjct:   179 CMGGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIER 238

Query:   857 FNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--K 913
             FN +E  + ++    P+SV+++ S L      + + K D +CS   L H +L +G+G   
Sbjct:   239 FNENELTQSLIKS--PVSVMIDASQLSFMLYKSGVYK-DPSCSSTILNHGILNIGFGVTP 295

Query:   914 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQI 950
             ++   Y++++NS+G     +G+  + R  NN CGI  +
Sbjct:   296 ENGNEYYILKNSFGSKWGMKGYIYLSRNFNNHCGISSV 333

 Score = 223 (83.6 bits), Expect = 1.3e-15, P = 1.3e-15
 Identities = 64/228 (28%), Positives = 116/228 (50%)

Query:    16 DWRKKNVTGPAGDQADC-GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC--SG 72
             DWR  +   P  +Q  C G+ ++FS  G++E  + IK  +L+  S+  +++C      +G
Sbjct:   119 DWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNG 178

Query:    73 CDGCFFEPSIEYT-HQAGLESEKDYPYKNANGEKF----KCAYDK--SKVKLFTGKDFLH 125
             C G     + +Y   Q G++SE +YPY+    E +    +C Y+   SK  + +  +   
Sbjct:   179 CMGGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIER 238

Query:   126 FNGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG--K 182
             FN +E  + ++    P+SV+++ S L      + + K D +CS   L H +L +G+G   
Sbjct:   239 FNENELTQSLIKS--PVSVMIDASQLSFMLYKSGVYK-DPSCSSTILNHGILNIGFGVTP 295

Query:   183 QDNIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATIDVVI 229
             ++   Y++++NS+G     +G+  + R  NN CGI  +     I VVI
Sbjct:   296 ENGNEYYILKNSFGSKWGMKGYIYLSRNFNNHCGISSVG----ISVVI 339

 Score = 217 (81.4 bits), Expect = 7.0e-15, P = 7.0e-15
 Identities = 65/230 (28%), Positives = 115/230 (50%)

Query:   381 DWRKKNVTGPAGDQAAC-GSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCG 439
             DWR  +   P  +Q  C G+ ++FS  G++E  + IK  +L+  S+  +++C     G  
Sbjct:   119 DWRNFDAVTPVKNQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDM-GNN 177

Query:   440 GCDGLEQPIEYTH---QAGLESEKDYPYRNGNGEKF----KCAYDK--SKVKLFTGKDFL 490
             GC G    I + +   Q G++SE +YPY     E +    +C Y+   SK  + +  +  
Sbjct:   178 GCMGGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIE 237

Query:   491 YFNGSETMKKILYKYGPLSVGLNSHLIHF--YNGTPIRKNDETCSPYDLGHAVLLVGYG- 547
              FN +E  + ++    P+SV +++  + F  Y  + + K D +CS   L H +L +G+G 
Sbjct:   238 RFNENELTQSLIKS--PVSVMIDASQLSFMLYK-SGVYK-DPSCSSTILNHGILNIGFGV 293

Query:   548 -KQDDIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYATIDVVI 595
               ++   Y++ +NS+G     +G+  + R  NN CGI  +     I VVI
Sbjct:   294 TPENGNEYYILKNSFGSKWGMKGYIYLSRNFNNHCGISSVG----ISVVI 339


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 208 (78.3 bits), Expect = 1.4e-15, P = 1.4e-15
 Identities = 39/91 (42%), Positives = 55/91 (60%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 69
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+  + 
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query:    70 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYK 99
               GC+G   + + +Y     GL+SE+ YPY+
Sbjct:   175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYE 205

 Score = 207 (77.9 bits), Expect = 1.9e-15, P = 1.9e-15
 Identities = 39/91 (42%), Positives = 55/91 (60%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA--KQ 800
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+  + 
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query:   801 CSGCDGCFFEPSIEYTHQ-AGLESEKDYPYK 830
               GC+G   + + +Y     GL+SE+ YPY+
Sbjct:   175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYE 205

 Score = 204 (76.9 bits), Expect = 3.9e-15, P = 3.9e-15
 Identities = 40/91 (43%), Positives = 54/91 (59%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR+K    P  +Q  CGSCWAFS  G LEGQ   KTG+L+  S+  LV+C+    
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP-Q 173

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPY 464
             G  GC+G  ++   +Y     GL+SE+ YPY
Sbjct:   174 GNEGCNGGLMDYAFQYVQDNGGLDSEESYPY 204


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 205 (77.2 bits), Expect = 3.0e-15, P = 3.0e-15
 Identities = 39/89 (43%), Positives = 52/89 (58%)

Query:     6 EKDGPVPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVE 65
             E +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+
Sbjct:   169 EWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 228

Query:    66 CAKQCSGCDGCFFEPSIEYTHQ-AGLESE 93
             C  +  GC G +   + +Y  +  G++SE
Sbjct:   229 CVSENDGCGGGYMTNAFQYVQKNRGIDSE 257

 Score = 202 (76.2 bits), Expect = 6.3e-15, P = 6.3e-15
 Identities = 38/87 (43%), Positives = 51/87 (58%)

Query:   739 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 798
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   171 EGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 230

Query:   799 KQCSGCDGCFFEPSIEYTHQ-AGLESE 824
              +  GC G +   + +Y  +  G++SE
Sbjct:   231 SENDGCGGGYMTNAFQYVQKNRGIDSE 257

 Score = 198 (74.8 bits), Expect = 1.7e-14, P = 1.7e-14
 Identities = 39/88 (44%), Positives = 51/88 (57%)

Query:   373 DGPVPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA 432
             +G  PD+ D+RKK    P  +Q  CGSCWAFS  G LEGQ   KTGKL+  S   LV+C 
Sbjct:   171 EGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV 230

Query:   433 KQCSGCGGCDGLEQPIEYTHQ-AGLESE 459
              +  GCGG   +    +Y  +  G++SE
Sbjct:   231 SENDGCGG-GYMTNAFQYVQKNRGIDSE 257


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 210 (79.0 bits), Expect = 7.9e-14, P = 7.9e-14
 Identities = 62/216 (28%), Positives = 96/216 (44%)

Query:   753 VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCDGCFFEP 811
             + GP   Q +C  CW F+   + E    +   K +  S+ ++ +CA K   GC+G     
Sbjct:   156 IIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVD 215

Query:   812 SIEYTHQAGLESEKDYPYK-NANGEKFKCAYDKSKVKLFTGK-DFLH---FNGSETMKKI 866
              +EY  + GL   K+YP+  N + +  +C  +K   +L   + D+     FN    M   
Sbjct:   216 GLEYIKEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHH 275

Query:   867 LYKYG-PLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGH--AVLLVGYGKQDN-----I 917
             LY    P+SV   +   +  Y    +   D  C     GH  +  +VGYG   N     +
Sbjct:   276 LYLLNLPISVAFRTGASLSSYLSGILELAD--CDDEKGGHWHSGAIVGYGTTKNSAGRTV 333

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
              YW+ RNSW     D+G+ +I RG + C IE   GY
Sbjct:   334 DYWIFRNSWWTDWGDDGYARIVRGEDWCSIES-HGY 368

 Score = 209 (78.6 bits), Expect = 1.0e-13, P = 1.0e-13
 Identities = 62/216 (28%), Positives = 95/216 (43%)

Query:    22 VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCDGCFFEP 80
             + GP   Q  C  CW F+   + E    +   K +  S+ ++ +CA K   GC+G     
Sbjct:   156 IIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVD 215

Query:    81 SIEYTHQAGLESEKDYPYK-NANGEKFKCAYDKSKVKLFTGK-DFLH---FNGSETMKKI 135
              +EY  + GL   K+YP+  N + +  +C  +K   +L   + D+     FN    M   
Sbjct:   216 GLEYIKEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHH 275

Query:   136 LYKYG-PLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGH--AVLLVGYGKQDN-----I 186
             LY    P+SV   +   +  Y    +   D  C     GH  +  +VGYG   N     +
Sbjct:   276 LYLLNLPISVAFRTGASLSSYLSGILELAD--CDDEKGGHWHSGAIVGYGTTKNSAGRTV 333

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
              YW+ RNSW     D+G+ +I RG + C IE   GY
Sbjct:   334 DYWIFRNSWWTDWGDDGYARIVRGEDWCSIES-HGY 368

 Score = 208 (78.3 bits), Expect = 1.3e-13, P = 1.3e-13
 Identities = 63/218 (28%), Positives = 99/218 (45%)

Query:   387 VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCGGCDGLE 445
             + GP   Q +C  CW F+   + E    +   K +  S+ ++ +CA K   GC G D ++
Sbjct:   156 IIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVD 215

Query:   446 QPIEYTHQAGLESEKDYPYR-NGNGEKFKCAYDKSKVKLFTGKDFLY----FNGSETMKK 500
               +EY  + GL   K+YP+  N + +  +C  +K   +L   +   Y    FN    M  
Sbjct:   216 G-LEYIKEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTH 274

Query:   501 ILYKYG-PLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGH--AVLLVGYGKQDD---- 551
              LY    P+SV   +   L  + +G  +   D  C     GH  +  +VGYG   +    
Sbjct:   275 HLYLLNLPISVAFRTGASLSSYLSGI-LELAD--CDDEKGGHWHSGAIVGYGTTKNSAGR 331

Query:   552 -IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGY 588
              + YW+ RNSW     D+G+ +I RG + C IE   GY
Sbjct:   332 TVDYWIFRNSWWTDWGDDGYARIVRGEDWCSIES-HGY 368

 Score = 175 (66.7 bits), Expect = 3.5e-15, Sum P(2) = 3.5e-15
 Identities = 60/212 (28%), Positives = 96/212 (45%)

Query:   286 ENILETFKAFIVKRGRQYANDEEIKERFEYF--------KQD-GHKK--HE-RYGTSEFS 333
             E + + F+ FIVK  R Y ++ E K RF+ F        K +   KK  H+ +YG ++FS
Sbjct:    41 EKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFS 100

Query:   334 DRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV-----T 388
             D S +EI      K+        V  +            +G +P  +D R K V      
Sbjct:   101 DLSKKEIHGMYS-KFGPPKNNTNVP-KFNLKNLRVKRQMEG-LPKTFDLRNKKVGGHYII 157

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCGGCDGLEQP 447
             GP   Q +C  CW F+   + E    +   K +  S+ ++ +CA K   GC G D ++  
Sbjct:   158 GPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDG- 216

Query:   448 IEYTHQAGLESEKDYPYR-NGNGEKFKCAYDK 478
             +EY  + GL   K+YP+  N + +  +C  +K
Sbjct:   217 LEYIKEMGLTGGKEYPFNVNRSTQLGRCESEK 248

 Score = 171 (65.3 bits), Expect = 1.0e-14, Sum P(2) = 1.0e-14
 Identities = 59/211 (27%), Positives = 94/211 (44%)

Query:   652 ENILETFKAFIVKRGRQYANDEEIKERFEYF--------KQD-GHKK--HE-RYGTSEFS 699
             E + + F+ FIVK  R Y ++ E K RF+ F        K +   KK  H+ +YG ++FS
Sbjct:    41 EKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFS 100

Query:   700 DRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKKNV-----T 754
             D S +EI      K+        V  +            +G +P  +D R K V      
Sbjct:   101 DLSKKEIHGMYS-KFGPPKNNTNVP-KFNLKNLRVKRQMEG-LPKTFDLRNKKVGGHYII 157

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECA-KQCSGCDGCFFEPSI 813
             GP   Q +C  CW F+   + E    +   K +  S+ ++ +CA K   GC+G      +
Sbjct:   158 GPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGL 217

Query:   814 EYTHQAGLESEKDYPYK-NANGEKFKCAYDK 843
             EY  + GL   K+YP+  N + +  +C  +K
Sbjct:   218 EYIKEMGLTGGKEYPFNVNRSTQLGRCESEK 248

 Score = 96 (38.9 bits), Expect = 3.5e-15, Sum P(2) = 3.5e-15
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query:   972 HAVLLVGYGKQDD-----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 1021
             H+  +VGYG   +     + YW+ RNSW     D+G+ +I RG + C IE   GY
Sbjct:   315 HSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGDDGYARIVRGEDWCSIES-HGY 368


>MGI|MGI:88561
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 162 (62.1 bits), Expect = 3.6e-15, Sum P(2) = 3.6e-15
 Identities = 42/115 (36%), Positives = 63/115 (54%)

Query:   487 KDFLY--FNGSETMKKIL---YKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGH 539
             K F Y  ++ S ++K+I+   YK GP+       S  + + +G  + K++        GH
Sbjct:   223 KHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG--VYKHE--AGDMMGGH 278

Query:   540 AVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 592
             A+ ++G+G ++ +PYWLA NSW     D GFFKI RG N CGIE   +AG    D
Sbjct:   279 AIRILGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPRTD 333

 Score = 159 (61.0 bits), Expect = 7.9e-15, Sum P(2) = 7.9e-15
 Identities = 39/111 (35%), Positives = 62/111 (55%)

Query:   123 FLHFNGSETMKKIL---YKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 177
             +  ++ S ++K+I+   YK GP+  +  + SD +   +G  + K++        GHA+ +
Sbjct:   227 YTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG--VYKHE--AGDMMGGHAIRI 282

Query:   178 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 226
             +G+G ++ +PYWL  NSW     D GFFKI RG N CGIE   +AG    D
Sbjct:   283 LGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPRTD 333

 Score = 159 (61.0 bits), Expect = 7.9e-15, Sum P(2) = 7.9e-15
 Identities = 39/111 (35%), Positives = 62/111 (55%)

Query:   854 FLHFNGSETMKKIL---YKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLL 908
             +  ++ S ++K+I+   YK GP+  +  + SD +   +G  + K++        GHA+ +
Sbjct:   227 YTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG--VYKHE--AGDMMGGHAIRI 282

Query:   909 VGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 957
             +G+G ++ +PYWL  NSW     D GFFKI RG N CGIE   +AG    D
Sbjct:   283 LGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPRTD 333

 Score = 151 (58.2 bits), Expect = 6.5e-14, Sum P(2) = 6.5e-14
 Identities = 28/57 (49%), Positives = 36/57 (63%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 1025
             GHA+ ++G+G ++ +PYWL  NSW     D GFFKI RG N CGIE   +AG    D
Sbjct:   277 GHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPRTD 333

 Score = 108 (43.1 bits), Expect = 3.6e-15, Sum P(2) = 3.6e-15
 Identities = 30/94 (31%), Positives = 44/94 (46%)

Query:     7 KDGPVPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSK 60
             +D  +P+ +D    W      G   DQ  CGSCWAF     +  +  I T G++ VE S 
Sbjct:    76 EDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSA 135

Query:    61 SQLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
               L+ C   QC  GC+G +   +  +  + GL S
Sbjct:   136 EDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVS 169

 Score = 108 (43.1 bits), Expect = 7.9e-15, Sum P(2) = 7.9e-15
 Identities = 30/93 (32%), Positives = 44/93 (47%)

Query:   739 DGPVPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 792
             D  +P+ +D    W      G   DQ +CGSCWAF     +  +  I T G++ VE S  
Sbjct:    77 DIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAE 136

Query:   793 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              L+ C   QC  GC+G +   +  +  + GL S
Sbjct:   137 DLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVS 169

 Score = 104 (41.7 bits), Expect = 9.3e-15, Sum P(2) = 9.3e-15
 Identities = 27/76 (35%), Positives = 36/76 (47%)

Query:   373 DGPVPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 426
             D  +P+ +D    W      G   DQ +CGSCWAF     +  +  I T G++ VE S  
Sbjct:    77 DIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAE 136

Query:   427 QLVECAK-QCS-GCGG 440
              L+ C   QC  GC G
Sbjct:   137 DLLTCCGIQCGDGCNG 152


>UNIPROTKB|E2R6Q7
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 164 (62.8 bits), Expect = 5.4e-15, Sum P(2) = 5.4e-15
 Identities = 42/104 (40%), Positives = 52/104 (50%)

Query:   127 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N  E M +I YK GP+       SD +   +G       E       GHAV ++G+G +D
Sbjct:   235 NEKEIMAEI-YKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMG----GHAVRILGWGVED 289

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 226
               PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTD 333

 Score = 164 (62.8 bits), Expect = 8.5e-15, Sum P(3) = 8.5e-15
 Identities = 42/104 (40%), Positives = 52/104 (50%)

Query:   858 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             N  E M +I YK GP+       SD +   +G       E       GHAV ++G+G +D
Sbjct:   235 NEKEIMAEI-YKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMG----GHAVRILGWGVED 289

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 957
               PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTD 333

 Score = 159 (61.0 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 40/104 (38%), Positives = 51/104 (49%)

Query:   493 NGSETMKKILYKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
             N  E M +I YK GP+       S  + + +G       E       GHAV ++G+G +D
Sbjct:   235 NEKEIMAEI-YKNGPVEAAFTVYSDFLLYKSGVYQHVTGEMMG----GHAVRILGWGVED 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 592
               PYWL  NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTD 333

 Score = 153 (58.9 bits), Expect = 1.7e-13, Sum P(3) = 1.7e-13
 Identities = 30/57 (52%), Positives = 36/57 (63%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 1025
             GHAV ++G+G +D  PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   277 GHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESEIVAGIPCTD 333

 Score = 105 (42.0 bits), Expect = 8.5e-15, Sum P(3) = 8.5e-15
 Identities = 29/90 (32%), Positives = 46/90 (51%)

Query:   742 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 795
             +P+++D R++    P      DQ +CGSCWAF     +  +  I+T G + VE S   ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:   796 ECA-KQCS-GCDGCFFEPSIEYTHQAGLES 823
              C   QC  GC+G F   +  +  + GL S
Sbjct:   140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVS 169

 Score = 104 (41.7 bits), Expect = 5.4e-15, Sum P(2) = 5.4e-15
 Identities = 29/90 (32%), Positives = 45/90 (50%)

Query:    11 VPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 64
             +P+++D R++    P      DQ  CGSCWAF     +  +  I+T G + VE S   ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:    65 ECA-KQCS-GCDGCFFEPSIEYTHQAGLES 92
              C   QC  GC+G F   +  +  + GL S
Sbjct:   140 TCCGDQCGDGCNGGFPAEAWNFWTKQGLVS 169

 Score = 98 (39.6 bits), Expect = 4.4e-14, Sum P(3) = 4.4e-14
 Identities = 25/73 (34%), Positives = 38/73 (52%)

Query:   376 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 429
             +P+++D R++    P      DQ +CGSCWAF     +  +  I+T G + VE S   ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:   430 ECA-KQCS-GCGG 440
              C   QC  GC G
Sbjct:   140 TCCGDQCGDGCNG 152

 Score = 37 (18.1 bits), Expect = 8.5e-15, Sum P(3) = 8.5e-15
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query:   242 IQAVFLLCGVASCLCLPSLTDRITDQVVARVDT 274
             +  + +L G  S L   +L+D + D V  R  T
Sbjct:     8 LSCLVMLTGAQSRLPFRALSDELVDYVNKRNTT 40

 Score = 37 (18.1 bits), Expect = 8.5e-15, Sum P(3) = 8.5e-15
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query:   608 IQAVFLLCGVASCLCLPSLTDRITDQVVARVDT 640
             +  + +L G  S L   +L+D + D V  R  T
Sbjct:     8 LSCLVMLTGAQSRLPFRALSDELVDYVNKRNTT 40


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 220 (82.5 bits), Expect = 7.3e-15, P = 7.3e-15
 Identities = 67/205 (32%), Positives = 99/205 (48%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQC--SG 72
             DW   +   P  DQ +C SCW F     LE +Y IK G + E S   L  + A  C  SG
Sbjct:   193 DW--SDYQTPVRDQGECKSCWVFGSLAALESRYLIKNG-VSEKSTLHLSAQNAMNCITSG 249

Query:    73 CDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             C+  +     +Y   +G+  EKDYPY +A G    C    +K + ++G D +  N  +++
Sbjct:   250 CESGWPANVFDYFESSGIAFEKDYPY-DAIGSD-NCTSSSNKFE-YSGYDSVE-NTKDSL 305

Query:   133 KKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 191
              + L K GP+++ L SD     Y G      +E     D+ H VLLVGY K  +   W +
Sbjct:   306 IQEL-KNGPITIALYSDTAFQSYAGGIYDSVEEY---KDVNHIVLLVGYDKPTDS--WKI 359

Query:   192 RNSWGPIGPDEGFFKIERGNNACGI 216
             +NS G    + G+ +I   N+  GI
Sbjct:   360 KNSLGTKWGELGYARITASNDKLGI 384

 Score = 217 (81.4 bits), Expect = 1.6e-14, P = 1.6e-14
 Identities = 67/205 (32%), Positives = 98/205 (47%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQC--SG 803
             DW   +   P  DQ  C SCW F     LE +Y IK G + E S   L  + A  C  SG
Sbjct:   193 DW--SDYQTPVRDQGECKSCWVFGSLAALESRYLIKNG-VSEKSTLHLSAQNAMNCITSG 249

Query:   804 CDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 863
             C+  +     +Y   +G+  EKDYPY +A G    C    +K + ++G D +  N  +++
Sbjct:   250 CESGWPANVFDYFESSGIAFEKDYPY-DAIGSD-NCTSSSNKFE-YSGYDSVE-NTKDSL 305

Query:   864 KKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLV 922
              + L K GP+++ L SD     Y G      +E     D+ H VLLVGY K  +   W +
Sbjct:   306 IQEL-KNGPITIALYSDTAFQSYAGGIYDSVEEY---KDVNHIVLLVGYDKPTDS--WKI 359

Query:   923 RNSWGPIGPDEGFFKIERGNNACGI 947
             +NS G    + G+ +I   N+  GI
Sbjct:   360 KNSLGTKWGELGYARITASNDKLGI 384

 Score = 199 (75.1 bits), Expect = 1.7e-12, P = 1.7e-12
 Identities = 67/207 (32%), Positives = 96/207 (46%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC-- 438
             DW   +   P  DQ  C SCW F     LE +Y IK G + E  KS L   A+    C  
Sbjct:   193 DW--SDYQTPVRDQGECKSCWVFGSLAALESRYLIKNG-VSE--KSTLHLSAQNAMNCIT 247

Query:   439 GGCD-GLEQPI-EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE 496
              GC+ G    + +Y   +G+  EKDYPY +  G    C    +K + ++G D +  N  +
Sbjct:   248 SGCESGWPANVFDYFESSGIAFEKDYPY-DAIGSD-NCTSSSNKFE-YSGYDSVE-NTKD 303

Query:   497 TMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYW 555
             ++ + L K GP+++ L S      Y G      +E     D+ H VLLVGY K  D   W
Sbjct:   304 SLIQEL-KNGPITIALYSDTAFQSYAGGIYDSVEEY---KDVNHIVLLVGYDKPTDS--W 357

Query:   556 LARNSWGPIGPDEGFFKIERGNNACGI 582
               +NS G    + G+ +I   N+  GI
Sbjct:   358 KIKNSLGTKWGELGYARITASNDKLGI 384


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 163 (62.4 bits), Expect = 8.9e-15, Sum P(2) = 8.9e-15
 Identities = 46/132 (34%), Positives = 66/132 (50%)

Query:   829 YKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLIHDYN 886
             YK    +K K  Y  S  K+ T K       +E   +I Y YGP+  S  +  D  H  +
Sbjct:   218 YKTEEYKKDK-HYGASAYKVTTTKSV-----TEIQTEI-YHYGPVEASYKVYEDFYHYKS 270

Query:   887 GTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACG 946
             G     + +       GHAV ++G+G ++ + YWL+ NSWG    ++GFFKI RG N C 
Sbjct:   271 GVYHYTSGKLVG----GHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQ 326

Query:   947 IEQ--IAGYATI 956
             IE   +AG A +
Sbjct:   327 IEGNVVAGIAKL 338

 Score = 163 (62.4 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
 Identities = 46/132 (34%), Positives = 66/132 (50%)

Query:    98 YKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLIHDYN 155
             YK    +K K  Y  S  K+ T K       +E   +I Y YGP+  S  +  D  H  +
Sbjct:   218 YKTEEYKKDK-HYGASAYKVTTTKSV-----TEIQTEI-YHYGPVEASYKVYEDFYHYKS 270

Query:   156 GTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACG 215
             G     + +       GHAV ++G+G ++ + YWL+ NSWG    ++GFFKI RG N C 
Sbjct:   271 GVYHYTSGKLVG----GHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQ 326

Query:   216 IEQ--IAGYATI 225
             IE   +AG A +
Sbjct:   327 IEGNVVAGIAKL 338

 Score = 153 (58.9 bits), Expect = 3.1e-13, Sum P(3) = 3.1e-13
 Identities = 33/94 (35%), Positives = 50/94 (53%)

Query:   502 LYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARN 559
             +Y YGP+      +    H+ +G     + +       GHAV ++G+G ++ + YWL  N
Sbjct:   249 IYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVG----GHAVKIIGWGVENGVDYWLIAN 304

Query:   560 SWGPIGPDEGFFKIERGNNACGIEQ--IAGYATI 591
             SWG    ++GFFKI RG N C IE   +AG A +
Sbjct:   305 SWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 338

 Score = 150 (57.9 bits), Expect = 2.6e-13, Sum P(2) = 2.6e-13
 Identities = 27/56 (48%), Positives = 38/56 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATI 1024
             GHAV ++G+G ++ + YWL+ NSWG    ++GFFKI RG N C IE   +AG A +
Sbjct:   283 GHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 338

 Score = 105 (42.0 bits), Expect = 8.9e-15, Sum P(2) = 8.9e-15
 Identities = 27/95 (28%), Positives = 45/95 (47%)

Query:   741 PVPDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQL 794
             P+PD +D R+K    N      +QA CGSCWAF  A ++  +  I++    +   S   +
Sbjct:    91 PLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDI 150

Query:   795 VECA-KQCS-GCDGCFFEPSIEYTHQAGLESEKDY 827
             + C    C  GC G +   ++ +   +G  +  DY
Sbjct:   151 LSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDY 185

 Score = 104 (41.7 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
 Identities = 27/95 (28%), Positives = 45/95 (47%)

Query:    10 PVPDAWDWRKK----NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQL 63
             P+PD +D R+K    N      +QA CGSCWAF  A ++  +  I++    +   S   +
Sbjct:    91 PLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDI 150

Query:    64 VECA-KQCS-GCDGCFFEPSIEYTHQAGLESEKDY 96
             + C    C  GC G +   ++ +   +G  +  DY
Sbjct:   151 LSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDY 185

 Score = 100 (40.3 bits), Expect = 2.9e-14, Sum P(2) = 2.9e-14
 Identities = 28/96 (29%), Positives = 45/96 (46%)

Query:   375 PVPDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQL 428
             P+PD +D R+K    N      +QA CGSCWAF  A ++  +  I++    +   S   +
Sbjct:    91 PLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDI 150

Query:   429 VECA-KQCS-GCGGCDGLEQPIEYTHQAGLESEKDY 462
             + C    C  GC G   +E  + +   +G  +  DY
Sbjct:   151 LSCCGTTCGYGCKGGYSIEA-LRFWASSGAVTGGDY 185

 Score = 38 (18.4 bits), Expect = 3.1e-13, Sum P(3) = 3.1e-13
 Identities = 6/17 (35%), Positives = 11/17 (64%)

Query:   316 FKQDGHKKHERYGTSEF 332
             +K + +KK + YG S +
Sbjct:   218 YKTEEYKKDKHYGASAY 234

 Score = 38 (18.4 bits), Expect = 6.8e-13, Sum P(3) = 6.8e-13
 Identities = 6/17 (35%), Positives = 11/17 (64%)

Query:   682 FKQDGHKKHERYGTSEF 698
             +K + +KK + YG S +
Sbjct:   218 YKTEEYKKDKHYGASAY 234


>WB|WBGene00000781 [details] [associations]
            symbol:cpr-1 species:6239 "Caenorhabditis elegans"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008340 "determination
            of adult lifespan" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008340 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 EMBL:M74797 EMBL:Z78012 PIR:T20148
            RefSeq:NP_506002.2 ProteinModelPortal:P25807 SMR:P25807
            DIP:DIP-25619N MINT:MINT-1058393 STRING:P25807 MEROPS:C01.A32
            PaxDb:P25807 EnsemblMetazoa:C52E4.1 GeneID:179637
            KEGG:cel:CELE_C52E4.1 UCSC:C52E4.1 CTD:179637 WormBase:C52E4.1
            InParanoid:P25807 OMA:CSLSCQS NextBio:906250 Uniprot:P25807
        Length = 329

 Score = 165 (63.1 bits), Expect = 5.9e-09, P = 5.9e-09
 Identities = 59/209 (28%), Positives = 88/209 (42%)

Query:    20 KNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFE 79
             K    P     D  SC   S     EG Y I+    + +  S+ V       G  GC   
Sbjct:   131 KGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQA---LRWWDSKGVVTGGDYHGA-GCKPY 186

Query:    80 PSIEYTHQAGLESEKDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             P    T  +G   E   P  + + +  +  AY K K   F    +     + +++  +Y 
Sbjct:   187 PIAPCT--SGNCPESKTPSCSMSCQSGYSTAYAKDKH--FGVSAYAVPKNAASIQAEIYA 242

Query:   139 YGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPI 198
              GP+    +  +  D+          T   Y  GHA+ ++G+G +   PYWLV NSWG  
Sbjct:   243 NGPVEAAFS--VYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVANSWGVN 300

Query:   199 GPDEGFFKIERGNNACGIEQ--IAGYATI 225
               + GFFKI RG++ CGIE   +AG A +
Sbjct:   301 WGESGFFKIYRGDDQCGIESAVVAGKAKV 329

 Score = 163 (62.4 bits), Expect = 1.0e-08, P = 1.0e-08
 Identities = 56/195 (28%), Positives = 85/195 (43%)

Query:   765 SCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESE 824
             SC   S     EG Y I+    + +  S+ V       G  GC   P    T  +G   E
Sbjct:   145 SCCGSSCGNGCEGGYPIQA---LRWWDSKGVVTGGDYHGA-GCKPYPIAPCT--SGNCPE 198

Query:   825 KDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIH 883
                P  + + +  +  AY K K   F    +     + +++  +Y  GP+    +  +  
Sbjct:   199 SKTPSCSMSCQSGYSTAYAKDKH--FGVSAYAVPKNAASIQAEIYANGPVEAAFS--VYE 254

Query:   884 DYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNN 943
             D+          T   Y  GHA+ ++G+G +   PYWLV NSWG    + GFFKI RG++
Sbjct:   255 DFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDD 314

Query:   944 ACGIEQ--IAGYATI 956
              CGIE   +AG A +
Sbjct:   315 QCGIESAVVAGKAKV 329

 Score = 162 (62.1 bits), Expect = 9.9e-15, Sum P(2) = 9.9e-15
 Identities = 38/122 (31%), Positives = 62/122 (50%)

Query:   472 FKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDET 531
             +  AY K K   F    +     + +++  +Y  GP+    + +   +   + + K+  T
Sbjct:   212 YSTAYAKDKH--FGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKH--T 267

Query:   532 CSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYA 589
                Y  GHA+ ++G+G +   PYWL  NSWG    + GFFKI RG++ CGIE   +AG A
Sbjct:   268 AGKYLGGHAIKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKA 327

Query:   590 TI 591
              +
Sbjct:   328 KV 329

 Score = 158 (60.7 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 30/63 (47%), Positives = 40/63 (63%)

Query:   964 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGY 1021
             T   Y  GHA+ ++G+G +   PYWLV NSWG    + GFFKI RG++ CGIE   +AG 
Sbjct:   267 TAGKYLGGHAIKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGK 326

Query:  1022 ATI 1024
             A +
Sbjct:   327 AKV 329

 Score = 104 (41.7 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 24/73 (32%), Positives = 37/73 (50%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECA-KQC-SGCDGCFFEPSIE 814
             DQA CGSCWAF  A M+  +  I+T    +   S   L+ C    C +GC+G +   ++ 
Sbjct:   106 DQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALR 165

Query:   815 YTHQAGLESEKDY 827
             +    G+ +  DY
Sbjct:   166 WWDSKGVVTGGDY 178

 Score = 103 (41.3 bits), Expect = 9.9e-15, Sum P(2) = 9.9e-15
 Identities = 24/73 (32%), Positives = 37/73 (50%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECA-KQC-SGCDGCFFEPSIE 83
             DQA CGSCWAF  A M+  +  I+T    +   S   L+ C    C +GC+G +   ++ 
Sbjct:   106 DQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALR 165

Query:    84 YTHQAGLESEKDY 96
             +    G+ +  DY
Sbjct:   166 WWDSKGVVTGGDY 178

 Score = 103 (41.3 bits), Expect = 9.9e-15, Sum P(2) = 9.9e-15
 Identities = 27/75 (36%), Positives = 37/75 (49%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECAKQCSGCG-GCDGLE--QP 447
             DQA CGSCWAF  A M+  +  I+T    +   S   L+ C    S CG GC+G    Q 
Sbjct:   106 DQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCG--SSCGNGCEGGYPIQA 163

Query:   448 IEYTHQAGLESEKDY 462
             + +    G+ +  DY
Sbjct:   164 LRWWDSKGVVTGGDY 178


>DICTYBASE|DDB_G0281079 [details] [associations]
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 228 (85.3 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 56/189 (29%), Positives = 89/189 (47%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC-AKQC 435
             P + DWR   +     +Q +CGSC+AFS  G LE  Y  K  ++++ S+  LV+C A   
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
                GGC G  +     Y  +  G+  E  YPY    G+   C Y+    +    K F+  
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQ---CRYNSGDAQSRISK-FVMI 586

Query:   493 --NGSETMKKILYKYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 549
               +  E +   +   GP+SV  ++    F Y    I  +D  C+ Y   HAV++VGY  +
Sbjct:   587 KQHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDN-CNKYRTTHAVVVVGYDNE 645

Query:   550 DDIPYWLAR 558
             + + YW+ +
Sbjct:   646 NGVDYWIIK 654

 Score = 225 (84.3 bits), Expect = 2.5e-14, Sum P(2) = 2.5e-14
 Identities = 51/188 (27%), Positives = 90/188 (47%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK--- 799
             P + DWR   +     +Q +CGSC+AFS  G LE  Y  K  ++++ S+  LV+C     
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   800 -QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLH 856
              +  GC G +      Y  +  G+  E  YPY+   G+ ++     +S++  F     + 
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFV---MIK 587

Query:   857 FNGSETMKKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
              +  E +   +   GP+SV  ++      Y    I  +D  C+ Y   HAV++VGY  ++
Sbjct:   588 QHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDN-CNKYRTTHAVVVVGYDNEN 646

Query:   916 NIPYWLVR 923
              + YW+++
Sbjct:   647 GVDYWIIK 654

 Score = 224 (83.9 bits), Expect = 1.0e-14, P = 1.0e-14
 Identities = 51/188 (27%), Positives = 89/188 (47%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK--- 68
             P + DWR   +     +Q  CGSC+AFS  G LE  Y  K  ++++ S+  LV+C     
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:    69 -QCSGCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLH 125
              +  GC G +      Y  +  G+  E  YPY+   G+ ++     +S++  F     + 
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFV---MIK 587

Query:   126 FNGSETMKKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
              +  E +   +   GP+SV  ++      Y    I  +D  C+ Y   HAV++VGY  ++
Sbjct:   588 QHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDN-CNKYRTTHAVVVVGYDNEN 646

Query:   185 NIPYWLVR 192
              + YW+++
Sbjct:   647 GVDYWIIK 654

 Score = 45 (20.9 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query:   312 RF-EYFKQDGHKKHERYGTSEFSDRSPEEIL 341
             RF E +K++        G ++FSD + +E L
Sbjct:   189 RFIEQYKRENQNSTMELGLTQFSDMTHDEFL 219

 Score = 45 (20.9 bits), Expect = 2.5e-14, Sum P(2) = 2.5e-14
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query:   678 RF-EYFKQDGHKKHERYGTSEFSDRSPEEIL 707
             RF E +K++        G ++FSD + +E L
Sbjct:   189 RFIEQYKRENQNSTMELGLTQFSDMTHDEFL 219


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 146 (56.5 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
 Identities = 26/52 (50%), Positives = 37/52 (71%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             GHAV ++G+G+++  P+WLV NSW     D G+FKI RG++ CGIE   +AG
Sbjct:   271 GHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAG 322

 Score = 146 (56.5 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 26/52 (50%), Positives = 37/52 (71%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             GHAV ++G+G+++  P+WLV NSW     D G+FKI RG++ CGIE   +AG
Sbjct:   271 GHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAG 322

 Score = 145 (56.1 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 26/52 (50%), Positives = 37/52 (71%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHAV ++G+G+++  P+WLV NSW     D G+FKI RG++ CGIE   +AG
Sbjct:   271 GHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAG 322

 Score = 143 (55.4 bits), Expect = 3.2e-14, Sum P(2) = 3.2e-14
 Identities = 42/135 (31%), Positives = 65/135 (48%)

Query:   459 EKDYPYRNGNG-EKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLI 517
             E+D P   G    K+   Y + K   F  K +   +  + +   LY  GP+         
Sbjct:   195 EQDTPKCTGVCIPKYSVPYKQDKH--FGSKVYNVPSDQQQIMTELYTNGPVEAAFT---- 248

Query:   518 HFYNGTPIRKND--ETCSPYDLG-HAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIE 574
               Y   P+ K+   +  +   LG HAV ++G+G+++  P+WL  NSW     D G+FKI 
Sbjct:   249 -VYEDFPLYKSGVYQHLTGSALGGHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKIL 307

Query:   575 RGNNACGIEQ--IAG 587
             RG++ CGIE   +AG
Sbjct:   308 RGHDECGIESEMVAG 322

 Score = 120 (47.3 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
 Identities = 29/87 (33%), Positives = 41/87 (47%)

Query:   742 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLV 795
             +PD++D    W          DQ +CGSCWAF     +  +  I + GK   E S   L+
Sbjct:    75 LPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134

Query:   796 ECAKQCS-GCDGCFFEPSIEYTHQAGL 821
              C  QC  GC G F   + +Y  ++GL
Sbjct:   135 SCCDQCGFGCSGGFPAEAWDYWRRSGL 161

 Score = 119 (46.9 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 29/87 (33%), Positives = 40/87 (45%)

Query:    11 VPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLV 64
             +PD++D    W          DQ  CGSCWAF     +  +  I + GK   E S   L+
Sbjct:    75 LPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134

Query:    65 ECAKQCS-GCDGCFFEPSIEYTHQAGL 90
              C  QC  GC G F   + +Y  ++GL
Sbjct:   135 SCCDQCGFGCSGGFPAEAWDYWRRSGL 161

 Score = 108 (43.1 bits), Expect = 2.0e-13, Sum P(2) = 2.0e-13
 Identities = 29/88 (32%), Positives = 40/88 (45%)

Query:   376 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLV 429
             +PD++D    W          DQ +CGSCWAF     +  +  I + GK   E S   L+
Sbjct:    75 LPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134

Query:   430 ECAKQCS-GCGGCDGLEQPIEYTHQAGL 456
              C  QC  GC G    E   +Y  ++GL
Sbjct:   135 SCCDQCGFGCSGGFPAEA-WDYWRRSGL 161


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 161 (61.7 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 46/134 (34%), Positives = 68/134 (50%)

Query:    93 EKDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSD 149
             E D P  N   E  +  +Y + K   +T    +  +  E M +I YK GP+  +  + SD
Sbjct:   201 EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYS-VSDSEKEIMAEI-YKNGPVEGAFTVFSD 258

Query:   150 LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER 209
              +   +G    +  +       GHA+ ++G+G ++ +PYWLV NSW     D GFFKI R
Sbjct:   259 FLTYKSGVYKHEAGDVMG----GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILR 314

Query:   210 GNNACGIEQ--IAG 221
             G N CGIE   +AG
Sbjct:   315 GENHCGIESEIVAG 328

 Score = 161 (61.7 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 46/134 (34%), Positives = 68/134 (50%)

Query:   824 EKDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSD 880
             E D P  N   E  +  +Y + K   +T    +  +  E M +I YK GP+  +  + SD
Sbjct:   201 EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYS-VSDSEKEIMAEI-YKNGPVEGAFTVFSD 258

Query:   881 LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER 940
              +   +G    +  +       GHA+ ++G+G ++ +PYWLV NSW     D GFFKI R
Sbjct:   259 FLTYKSGVYKHEAGDVMG----GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILR 314

Query:   941 GNNACGIEQ--IAG 952
             G N CGIE   +AG
Sbjct:   315 GENHCGIESEIVAG 328

 Score = 153 (58.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 28/52 (53%), Positives = 36/52 (69%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++ +PYWLV NSW     D GFFKI RG N CGIE   +AG
Sbjct:   277 GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

 Score = 153 (58.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 36/103 (34%), Positives = 56/103 (54%)

Query:   492 FNGSETMKKIL---YKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGY 546
             ++ S++ K+I+   YK GP+       S  + + +G    +  +       GHA+ ++G+
Sbjct:   230 YSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMG----GHAIRILGW 285

Query:   547 GKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             G ++ +PYWL  NSW     D GFFKI RG N CGIE   +AG
Sbjct:   286 GIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

 Score = 103 (41.3 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 29/90 (32%), Positives = 46/90 (51%)

Query:   742 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 795
             +P+++D R++    P      DQ +CGSCWAF     +  +  I T G++ VE S   L+
Sbjct:    80 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 139

Query:   796 ECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              C   QC  GC+G +   +  +  + GL S
Sbjct:   140 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169

 Score = 103 (41.3 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 30/94 (31%), Positives = 47/94 (50%)

Query:     7 KDGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSK 60
             +D  +P+++D R++    P      DQ  CGSCWAF     +  +  I T G++ VE S 
Sbjct:    76 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 135

Query:    61 SQLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
               L+ C   QC  GC+G +   +  +  + GL S
Sbjct:   136 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169

 Score = 99 (39.9 bits), Expect = 4.0e-14, Sum P(2) = 4.0e-14
 Identities = 26/73 (35%), Positives = 38/73 (52%)

Query:   376 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 429
             +P+++D R++    P      DQ +CGSCWAF     +  +  I T G++ VE S   L+
Sbjct:    80 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 139

Query:   430 ECAK-QCS-GCGG 440
              C   QC  GC G
Sbjct:   140 TCCGIQCGDGCNG 152


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 161 (61.7 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 46/134 (34%), Positives = 68/134 (50%)

Query:    93 EKDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSD 149
             E D P  N   E  +  +Y + K   +T    +  +  E M +I YK GP+  +  + SD
Sbjct:   201 EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYS-VSDSEKEIMAEI-YKNGPVEGAFTVFSD 258

Query:   150 LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER 209
              +   +G    +  +       GHA+ ++G+G ++ +PYWLV NSW     D GFFKI R
Sbjct:   259 FLTYKSGVYKHEAGDVMG----GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILR 314

Query:   210 GNNACGIEQ--IAG 221
             G N CGIE   +AG
Sbjct:   315 GENHCGIESEIVAG 328

 Score = 161 (61.7 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 46/134 (34%), Positives = 68/134 (50%)

Query:   824 EKDYPYKNANGEK-FKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSD 880
             E D P  N   E  +  +Y + K   +T    +  +  E M +I YK GP+  +  + SD
Sbjct:   201 EGDTPKCNKMCEAGYSTSYKEDKHYGYTSYS-VSDSEKEIMAEI-YKNGPVEGAFTVFSD 258

Query:   881 LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER 940
              +   +G    +  +       GHA+ ++G+G ++ +PYWLV NSW     D GFFKI R
Sbjct:   259 FLTYKSGVYKHEAGDVMG----GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILR 314

Query:   941 GNNACGIEQ--IAG 952
             G N CGIE   +AG
Sbjct:   315 GENHCGIESEIVAG 328

 Score = 153 (58.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 28/52 (53%), Positives = 36/52 (69%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++ +PYWLV NSW     D GFFKI RG N CGIE   +AG
Sbjct:   277 GHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

 Score = 153 (58.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 36/103 (34%), Positives = 56/103 (54%)

Query:   492 FNGSETMKKIL---YKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGY 546
             ++ S++ K+I+   YK GP+       S  + + +G    +  +       GHA+ ++G+
Sbjct:   230 YSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMG----GHAIRILGW 285

Query:   547 GKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             G ++ +PYWL  NSW     D GFFKI RG N CGIE   +AG
Sbjct:   286 GIENGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAG 328

 Score = 103 (41.3 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 29/90 (32%), Positives = 46/90 (51%)

Query:   742 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 795
             +P+++D R++    P      DQ +CGSCWAF     +  +  I T G++ VE S   L+
Sbjct:    80 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 139

Query:   796 ECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              C   QC  GC+G +   +  +  + GL S
Sbjct:   140 TCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169

 Score = 103 (41.3 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 30/94 (31%), Positives = 47/94 (50%)

Query:     7 KDGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSK 60
             +D  +P+++D R++    P      DQ  CGSCWAF     +  +  I T G++ VE S 
Sbjct:    76 EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSA 135

Query:    61 SQLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
               L+ C   QC  GC+G +   +  +  + GL S
Sbjct:   136 EDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169

 Score = 99 (39.9 bits), Expect = 4.0e-14, Sum P(2) = 4.0e-14
 Identities = 26/73 (35%), Positives = 38/73 (52%)

Query:   376 VPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLV 429
             +P+++D R++    P      DQ +CGSCWAF     +  +  I T G++ VE S   L+
Sbjct:    80 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 139

Query:   430 ECAK-QCS-GCGG 440
              C   QC  GC G
Sbjct:   140 TCCGIQCGDGCNG 152


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 145 (56.1 bits), Expect = 1.2e-06, P = 1.2e-06
 Identities = 53/173 (30%), Positives = 80/173 (46%)

Query:    72 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF--------------KCAYDK---SK 114
             GCDG +   + +Y   +G+ +E+  PY +  G                 KC  D    S+
Sbjct:   170 GCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSE 229

Query:   115 VKLFTGKDF-LHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDL 171
              K ++   + +  N  + M ++ YK GP+ V      D  H  +G  + K+  T S    
Sbjct:   230 SKHYSVSTYTVKSNPQDIMAEV-YKNGPVEVSFTVYEDFAHYKSG--VYKHI-TGSNIG- 284

Query:   172 GHAVLLVGYG-KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             GHAV L+G+G   +   YWL+ N W     D+G+F I RG N CGIE   +AG
Sbjct:   285 GHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAG 337

 Score = 145 (56.1 bits), Expect = 1.2e-06, P = 1.2e-06
 Identities = 53/173 (30%), Positives = 80/173 (46%)

Query:   803 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF--------------KCAYDK---SK 845
             GCDG +   + +Y   +G+ +E+  PY +  G                 KC  D    S+
Sbjct:   170 GCDGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAYPTPKCSRKCVSDNKLWSE 229

Query:   846 VKLFTGKDF-LHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDL 902
              K ++   + +  N  + M ++ YK GP+ V      D  H  +G  + K+  T S    
Sbjct:   230 SKHYSVSTYTVKSNPQDIMAEV-YKNGPVEVSFTVYEDFAHYKSG--VYKHI-TGSNIG- 284

Query:   903 GHAVLLVGYG-KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             GHAV L+G+G   +   YWL+ N W     D+G+F I RG N CGIE   +AG
Sbjct:   285 GHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAG 337

 Score = 139 (54.0 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 40/123 (32%), Positives = 60/123 (48%)

Query:   473 KCAYDK---SKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRK 527
             KC  D    S+ K ++   +   +  + +   +YK GP+ V    +    H+ +G  + K
Sbjct:   219 KCVSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSG--VYK 276

Query:   528 NDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ-- 584
             +  T S    GHAV L+G+G   +   YWL  N W     D+G+F I RG N CGIE   
Sbjct:   277 HI-TGSNIG-GHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEP 334

Query:   585 IAG 587
             +AG
Sbjct:   335 VAG 337

 Score = 127 (49.8 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 31/99 (31%), Positives = 49/99 (49%)

Query:    11 VPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 66
             +P A+D    W +    G   DQ  CGSCWAF     L  ++ I+ G  +  S + L+ C
Sbjct:   103 LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLAC 162

Query:    67 AK-QCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANG 103
                +C  GCDG +   + +Y   +G+ +E+  PY +  G
Sbjct:   163 CGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTG 201

 Score = 126 (49.4 bits), Expect = 5.6e-13, Sum P(2) = 5.6e-13
 Identities = 25/53 (47%), Positives = 32/53 (60%)

Query:   971 GHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHAV L+G+G   +   YWL+ N W     D+G+F I RG N CGIE   +AG
Sbjct:   285 GHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAG 337

 Score = 126 (49.4 bits), Expect = 7.3e-13, Sum P(2) = 7.3e-13
 Identities = 31/99 (31%), Positives = 49/99 (49%)

Query:   742 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 797
             +P A+D    W +    G   DQ  CGSCWAF     L  ++ I+ G  +  S + L+ C
Sbjct:   103 LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLAC 162

Query:   798 AK-QCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANG 834
                +C  GCDG +   + +Y   +G+ +E+  PY +  G
Sbjct:   163 CGFRCGDGCDGGYPIAAWQYFSYSGVVTEECDPYFDNTG 201

 Score = 125 (49.1 bits), Expect = 3.2e-14, Sum P(2) = 3.2e-14
 Identities = 35/103 (33%), Positives = 50/103 (48%)

Query:   376 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVEC 431
             +P A+D    W +    G   DQ  CGSCWAF     L  ++ I+ G  +  S + L+ C
Sbjct:   103 LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLAC 162

Query:   432 AKQCS-GCG-GCDGLEQPI---EYTHQAGLESEKDYPYRNGNG 469
                C   CG GCDG   PI   +Y   +G+ +E+  PY +  G
Sbjct:   163 ---CGFRCGDGCDG-GYPIAAWQYFSYSGVVTEECDPYFDNTG 201


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 141 (54.7 bits), Expect = 4.0e-14, Sum P(2) = 4.0e-14
 Identities = 38/112 (33%), Positives = 60/112 (53%)

Query:   842 DKSKVKLFTGKDF-LHFNGSETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCS 898
             D +K K F  K + +  N  E  ++I+   GP+  +  +  DLI   +G    ++ +   
Sbjct:   224 DYAKDKHFGSKSYSVRRNVREIQEEIMTN-GPVEGAFTVYEDLILYKDGVYQHEHGKELG 282

Query:   899 PYDLGHAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
                 GHA+ ++G+G   ++ IPYWL+ NSW     D GFF+I RG + CGIE
Sbjct:   283 ----GHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330

 Score = 141 (54.7 bits), Expect = 5.0e-14, Sum P(2) = 5.0e-14
 Identities = 38/112 (33%), Positives = 60/112 (53%)

Query:   111 DKSKVKLFTGKDF-LHFNGSETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCS 167
             D +K K F  K + +  N  E  ++I+   GP+  +  +  DLI   +G    ++ +   
Sbjct:   224 DYAKDKHFGSKSYSVRRNVREIQEEIMTN-GPVEGAFTVYEDLILYKDGVYQHEHGKELG 282

Query:   168 PYDLGHAVLLVGYGK--QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
                 GHA+ ++G+G   ++ IPYWL+ NSW     D GFF+I RG + CGIE
Sbjct:   283 ----GHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330

 Score = 134 (52.2 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 24/48 (50%), Positives = 33/48 (68%)

Query:   971 GHAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             GHA+ ++G+G   ++ IPYWL+ NSW     D GFF+I RG + CGIE
Sbjct:   283 GHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330

 Score = 133 (51.9 bits), Expect = 4.0e-13, Sum P(2) = 4.0e-13
 Identities = 37/112 (33%), Positives = 58/112 (51%)

Query:   477 DKSKVKLFTGKDF-LYFNGSETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCS 533
             D +K K F  K + +  N  E  ++I+   GP+      +  LI + +G    ++ +   
Sbjct:   224 DYAKDKHFGSKSYSVRRNVREIQEEIMTN-GPVEGAFTVYEDLILYKDGVYQHEHGKELG 282

Query:   534 PYDLGHAVLLVGYGK--QDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
                 GHA+ ++G+G   ++ IPYWL  NSW     D GFF+I RG + CGIE
Sbjct:   283 ----GHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDHCGIE 330

 Score = 121 (47.7 bits), Expect = 4.0e-14, Sum P(2) = 4.0e-14
 Identities = 32/93 (34%), Positives = 44/93 (47%)

Query:   742 VPDAWDWRKK--N--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLV 795
             +P+ +D RK+  N    G   DQ +CGSCWAF     +  +  I +G  V F  S   LV
Sbjct:    87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query:   796 ECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDY 827
              C   C  GC+G F   +  Y  + G+ S   Y
Sbjct:   147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPY 179

 Score = 120 (47.3 bits), Expect = 5.0e-14, Sum P(2) = 5.0e-14
 Identities = 32/93 (34%), Positives = 43/93 (46%)

Query:    11 VPDAWDWRKK--N--VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLV 64
             +P+ +D RK+  N    G   DQ  CGSCWAF     +  +  I +G  V F  S   LV
Sbjct:    87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query:    65 ECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDY 96
              C   C  GC+G F   +  Y  + G+ S   Y
Sbjct:   147 SCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPY 179

 Score = 113 (44.8 bits), Expect = 2.7e-13, Sum P(2) = 2.7e-13
 Identities = 29/75 (38%), Positives = 38/75 (50%)

Query:   376 VPDAWDWRKK--N--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLV 429
             +P+ +D RK+  N    G   DQ +CGSCWAF     +  +  I +G  V F  S   LV
Sbjct:    87 LPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLV 146

Query:   430 ECAKQCSGCG-GCDG 443
              C   C  CG GC+G
Sbjct:   147 SC---CHTCGFGCNG 158


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 149 (57.5 bits), Expect = 5.4e-14, Sum P(2) = 5.4e-14
 Identities = 36/96 (37%), Positives = 52/96 (54%)

Query:   861 ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 918
             E M +I YK GP+  + ++  D +   +G     + E       GHA+ ++G+G ++  P
Sbjct:   239 EIMAEI-YKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVG----GHAIRILGWGVENGTP 293

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             YWL  NSW     D GFFKI RG + CGIE   +AG
Sbjct:   294 YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

 Score = 149 (57.5 bits), Expect = 6.9e-14, Sum P(2) = 6.9e-14
 Identities = 36/96 (37%), Positives = 52/96 (54%)

Query:   130 ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 187
             E M +I YK GP+  + ++  D +   +G     + E       GHA+ ++G+G ++  P
Sbjct:   239 EIMAEI-YKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVG----GHAIRILGWGVENGTP 293

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             YWL  NSW     D GFFKI RG + CGIE   +AG
Sbjct:   294 YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

 Score = 146 (56.5 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 27/52 (51%), Positives = 35/52 (67%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             GHA+ ++G+G ++  PYWLA NSW     D GFFKI RG + CGIE   +AG
Sbjct:   278 GHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

 Score = 142 (55.0 bits), Expect = 3.3e-13, Sum P(2) = 3.3e-13
 Identities = 26/52 (50%), Positives = 34/52 (65%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++  PYWL  NSW     D GFFKI RG + CGIE   +AG
Sbjct:   278 GHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

 Score = 111 (44.1 bits), Expect = 5.4e-14, Sum P(2) = 5.4e-14
 Identities = 31/93 (33%), Positives = 45/93 (48%)

Query:   739 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKS 792
             D  +PD +D RK+    P      DQ +CGSCWAF     +  +  + T  K+ VE S  
Sbjct:    77 DMDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query:   793 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              L+ C   +C  GC+G +   +  Y  + GL S
Sbjct:   137 DLLSCCGFECGMGCNGGYPSGAWRYWTERGLVS 169

 Score = 110 (43.8 bits), Expect = 6.9e-14, Sum P(2) = 6.9e-14
 Identities = 31/93 (33%), Positives = 44/93 (47%)

Query:     8 DGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKS 61
             D  +PD +D RK+    P      DQ  CGSCWAF     +  +  + T  K+ VE S  
Sbjct:    77 DMDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query:    62 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
              L+ C   +C  GC+G +   +  Y  + GL S
Sbjct:   137 DLLSCCGFECGMGCNGGYPSGAWRYWTERGLVS 169

 Score = 103 (41.3 bits), Expect = 3.7e-13, Sum P(2) = 3.7e-13
 Identities = 29/79 (36%), Positives = 39/79 (49%)

Query:   373 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKS 426
             D  +PD +D RK+    P      DQ +CGSCWAF     +  +  + T  K+ VE S  
Sbjct:    77 DMDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query:   427 QLVECAKQCS-GCG-GCDG 443
              L+ C   C   CG GC+G
Sbjct:   137 DLLSC---CGFECGMGCNG 152


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 149 (57.5 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
 Identities = 37/91 (40%), Positives = 50/91 (54%)

Query:   502 LYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLAR 558
             +YK GP+ V    +    H+ +G  + K+  T +    GHAV L+G+G  DD   YWL  
Sbjct:   254 VYKNGPVEVAFTVYEDFAHYKSG--VYKHI-TGTNIG-GHAVKLIGWGTSDDGEDYWLLA 309

Query:   559 NSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             N W     D+G+FKI RG N CGIE   +AG
Sbjct:   310 NQWNRSWGDDGYFKIRRGTNECGIEHGVVAG 340

 Score = 148 (57.2 bits), Expect = 5.9e-07, P = 5.9e-07
 Identities = 58/185 (31%), Positives = 86/185 (46%)

Query:   803 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFK-C--AYDKSKV--KLFTG----KD 853
             GC+G +   +  Y    G+ +E+  PY +  G     C  AY   K   K  +G    ++
Sbjct:   173 GCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRE 232

Query:   854 FLHFNGS--------ETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLG 903
               H+  S        + +   +YK GP+ V      D  H  +G  + K+  T +    G
Sbjct:   233 SKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSG--VYKHI-TGTNIG-G 288

Query:   904 HAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATI-DVV 959
             HAV L+G+G  D+   YWL+ N W     D+G+FKI RG N CGIE   +AG  +  +VV
Sbjct:   289 HAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSDRNVV 348

Query:   960 KNDET 964
             K   T
Sbjct:   349 KGITT 353

 Score = 146 (56.5 bits), Expect = 9.9e-07, P = 9.9e-07
 Identities = 54/172 (31%), Positives = 80/172 (46%)

Query:    72 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFK-C--AYDKSKV--KLFTG----KD 122
             GC+G +   +  Y    G+ +E+  PY +  G     C  AY   K   K  +G    ++
Sbjct:   173 GCNGGYPIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYPTPKCARKCVSGNQLWRE 232

Query:   123 FLHFNGS--------ETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLG 172
               H+  S        + +   +YK GP+ V      D  H  +G  + K+  T +    G
Sbjct:   233 SKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSG--VYKHI-TGTNIG-G 288

Query:   173 HAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             HAV L+G+G  D+   YWL+ N W     D+G+FKI RG N CGIE   +AG
Sbjct:   289 HAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAG 340

 Score = 144 (55.7 bits), Expect = 1.6e-13, Sum P(2) = 1.6e-13
 Identities = 28/53 (52%), Positives = 34/53 (64%)

Query:   971 GHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHAV L+G+G  DD   YWL+ N W     D+G+FKI RG N CGIE   +AG
Sbjct:   288 GHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAG 340

 Score = 113 (44.8 bits), Expect = 1.6e-13, Sum P(2) = 1.6e-13
 Identities = 53/195 (27%), Positives = 79/195 (40%)

Query:   382 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSG-CG- 439
             W +    G   DQ  CGSCWAF     L  ++ IK    V  S + L+ C   C   CG 
Sbjct:   116 WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLAC---CGFLCGQ 172

Query:   440 GCDGLEQPI---EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY-FNGS 495
             GC+G   PI    Y    G+ +E+  PY +  G    C++   +    T K      +G+
Sbjct:   173 GCNG-GYPIAAWRYFKHHGVVTEECDPYFDNTG----CSHPGCEPAYPTPKCARKCVSGN 227

Query:   496 ETMKKILYKYGPLSVGLNSH----LIHFYNGTPIRKND---ETCSPYDLG---------- 538
             +  ++  + YG  +  + SH    +   Y   P+       E  + Y  G          
Sbjct:   228 QLWRESKH-YGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNI 286

Query:   539 --HAVLLVGYGKQDD 551
               HAV L+G+G  DD
Sbjct:   287 GGHAVKLIGWGTSDD 301

 Score = 112 (44.5 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
 Identities = 28/89 (31%), Positives = 40/89 (44%)

Query:    17 WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK-QCS-GCD 74
             W +    G   DQ  CGSCWAF     L  ++ IK    V  S + L+ C    C  GC+
Sbjct:   116 WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCN 175

Query:    75 GCFFEPSIEYTHQAGLESEKDYPYKNANG 103
             G +   +  Y    G+ +E+  PY +  G
Sbjct:   176 GGYPIAAWRYFKHHGVVTEECDPYFDNTG 204

 Score = 111 (44.1 bits), Expect = 2.6e-13, Sum P(2) = 2.6e-13
 Identities = 28/89 (31%), Positives = 40/89 (44%)

Query:   748 WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAK-QCS-GCD 805
             W +    G   DQ  CGSCWAF     L  ++ IK    V  S + L+ C    C  GC+
Sbjct:   116 WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCN 175

Query:   806 GCFFEPSIEYTHQAGLESEKDYPYKNANG 834
             G +   +  Y    G+ +E+  PY +  G
Sbjct:   176 GGYPIAAWRYFKHHGVVTEECDPYFDNTG 204


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 193 (73.0 bits), Expect = 5.8e-14, P = 5.8e-14
 Identities = 59/219 (26%), Positives = 94/219 (42%)

Query:   376 VPDAWDWRKKNVTGPAG---DQAA---CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR  N    A    +Q     CGSCWA      +  +  IK  K    S    V
Sbjct:    20 LPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKR-KGAWPSTLLSV 78

Query:   430 ECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKFK----CAYDKS-- 479
             +    C+  G C+G  + P+  Y H+ G+  E   +Y  ++    KF     C   K   
Sbjct:    79 QHVLDCANAGSCEGGNDLPVWSYAHEHGIPDETCNNYQAKDQECNKFNQCGTCTEFKECH 138

Query:   480 ---KVKLFTGKDFLYFNGSETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSP 534
                   L+   D+   +G E M   +Y  GP+S G+     ++++  G      ++    
Sbjct:   139 AIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATEKMVNYTGGIHAEYQEQA--- 195

Query:   535 YDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
             Y + H + +VG+G  D   YW+ RNSWG    + G+ +I
Sbjct:   196 Y-INHVISVVGWGVSDGTEYWIVRNSWGEPWGERGWMRI 233

 Score = 171 (65.3 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 54/191 (28%), Positives = 84/191 (43%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      +  +  IK  G       S   +++CA   S C+G    P   Y H+ 
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAGS-CEGGNDLPVWSYAHEH 105

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 868
             G+  E   +Y  K+    KF     C   K         L+   D+   +G E M   +Y
Sbjct:   106 GIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAEIY 165

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
               GP+S  ++ ++ + +Y G    +  E    Y + H + +VG+G  D   YW+VRNSWG
Sbjct:   166 ANGPISCGIMATEKMVNYTGGIHAEYQEQA--Y-INHVISVVGWGVSDGTEYWIVRNSWG 222

Query:   928 PIGPDEGFFKI 938
                 + G+ +I
Sbjct:   223 EPWGERGWMRI 233

 Score = 171 (65.3 bits), Expect = 3.7e-10, P = 3.7e-10
 Identities = 54/191 (28%), Positives = 84/191 (43%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      +  +  IK  G       S   +++CA   S C+G    P   Y H+ 
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAGS-CEGGNDLPVWSYAHEH 105

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 137
             G+  E   +Y  K+    KF     C   K         L+   D+   +G E M   +Y
Sbjct:   106 GIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAEIY 165

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 196
               GP+S  ++ ++ + +Y G    +  E    Y + H + +VG+G  D   YW+VRNSWG
Sbjct:   166 ANGPISCGIMATEKMVNYTGGIHAEYQEQA--Y-INHVISVVGWGVSDGTEYWIVRNSWG 222

Query:   197 PIGPDEGFFKI 207
                 + G+ +I
Sbjct:   223 EPWGERGWMRI 233

 Score = 100 (40.3 bits), Expect = 1.1e-07, Sum P(2) = 1.1e-07
 Identities = 33/116 (28%), Positives = 53/116 (45%)

Query:   896 TCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQI 950
             TC+ +   HA+         N   W V   +G + G ++   +I   G  +CGI   E++
Sbjct:   130 TCTEFKECHAI--------QNYTLWRV-GDYGSLSGREKMMAEIYANGPISCGIMATEKM 180

Query:   951 AGYATIDVVKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 1006
               Y      +  E    Y + H + +VG+G  D   YW+VRNSWG    + G+ +I
Sbjct:   181 VNYTGGIHAEYQEQA--Y-INHVISVVGWGVSDGTEYWIVRNSWGEPWGERGWMRI 233

 Score = 99 (39.9 bits), Expect = 1.1e-07, Sum P(2) = 1.1e-07
 Identities = 32/110 (29%), Positives = 46/110 (41%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P +WDWR  N    A    +      CGSCWA      +  +  IK  G       S  
Sbjct:    20 LPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 79

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++CA   S C+G    P   Y H+ G+  E   +Y  K+    KF +C
Sbjct:    80 HVLDCANAGS-CEGGNDLPVWSYAHEHGIPDETCNNYQAKDQECNKFNQC 128

 Score = 98 (39.6 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
 Identities = 33/110 (30%), Positives = 47/110 (42%)

Query:   742 VPDAWDWRKKNVTGPAG---DQAA---CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P +WDWR  N    A    +Q     CGSCWA      +  +  IK  G       S  
Sbjct:    20 LPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 79

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++CA   S C+G    P   Y H+ G+  E   +Y  K+    KF +C
Sbjct:    80 HVLDCANAGS-CEGGNDLPVWSYAHEHGIPDETCNNYQAKDQECNKFNQC 128

 Score = 43 (20.2 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 13/56 (23%), Positives = 23/56 (41%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             EY   + L   K + ++N NG  +  A     +  + G  + H + S    +I  K
Sbjct:    13 EYLSPSDLP--KSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIK 66


>DICTYBASE|DDB_G0281077 [details] [associations]
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 221 (82.9 bits), Expect = 6.6e-14, Sum P(2) = 6.6e-14
 Identities = 52/187 (27%), Positives = 88/187 (47%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
             P + DWR   +     +Q +CGSC+AFS  G LE  Y  K  +++  S+  LV+C +   
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNY- 530

Query:   437 GCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKL-FTGKDFLYF 492
             G G C G  +     Y  +  G+  +  YPY    G    C Y+    +   +    +  
Sbjct:   531 GNGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVG---LCRYNSGDAQSRISNYVMIKQ 587

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 551
             +  E +   +   GP+SV  ++    F Y  + I  N ++C  Y   HAV++VGYG ++ 
Sbjct:   588 HDEEDLANAVASVGPVSVAYDASTREFMYYSSGIY-NSDSCDKYRTTHAVVVVGYGIENG 646

Query:   552 IPYWLAR 558
             + +W+ +
Sbjct:   647 VDFWIIK 653

 Score = 218 (81.8 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 49/186 (26%), Positives = 87/186 (46%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
             P + DWR   +     +Q +CGSC+AFS  G LE  Y  K  +++  S+  LV+C +   
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   803 G--CDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFN 858
                C G +      Y  +  G+  +  YPY+   G    C Y+    +   +    +  +
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVG---LCRYNSGDAQSRISNYVMIKQH 588

Query:   859 GSETMKKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
               E +   +   GP+SV  ++      Y  + I  N ++C  Y   HAV++VGYG ++ +
Sbjct:   589 DEEDLANAVASVGPVSVAYDASTREFMYYSSGIY-NSDSCDKYRTTHAVVVVGYGIENGV 647

Query:   918 PYWLVR 923
              +W+++
Sbjct:   648 DFWIIK 653

 Score = 217 (81.4 bits), Expect = 5.8e-14, P = 5.8e-14
 Identities = 49/186 (26%), Positives = 86/186 (46%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
             P + DWR   +     +Q  CGSC+AFS  G LE  Y  K  +++  S+  LV+C +   
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:    72 G--CDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFN 127
                C G +      Y  +  G+  +  YPY+   G    C Y+    +   +    +  +
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVG---LCRYNSGDAQSRISNYVMIKQH 588

Query:   128 GSETMKKILYKYGPLSVLLNSDLIHD-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
               E +   +   GP+SV  ++      Y  + I  N ++C  Y   HAV++VGYG ++ +
Sbjct:   589 DEEDLANAVASVGPVSVAYDASTREFMYYSSGIY-NSDSCDKYRTTHAVVVVGYGIENGV 647

Query:   187 PYWLVR 192
              +W+++
Sbjct:   648 DFWIIK 653

 Score = 45 (20.9 bits), Expect = 6.6e-14, Sum P(2) = 6.6e-14
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query:   312 RF-EYFKQDGHKKHERYGTSEFSDRSPEEIL 341
             RF E +K++        G ++FSD + +E L
Sbjct:   190 RFIEQYKRENQNSTMELGLTQFSDMTHDEFL 220

 Score = 45 (20.9 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 10/31 (32%), Positives = 17/31 (54%)

Query:   678 RF-EYFKQDGHKKHERYGTSEFSDRSPEEIL 707
             RF E +K++        G ++FSD + +E L
Sbjct:   190 RFIEQYKRENQNSTMELGLTQFSDMTHDEFL 220


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 209 (78.6 bits), Expect = 6.9e-14, P = 6.9e-14
 Identities = 81/330 (24%), Positives = 139/330 (42%)

Query:   658 FKAFIVKRGRQYANDE------EIKERFEYFKQDGHKKHERY-GTSEFSDRSPEEILCKT 710
             F A++    R YA+ E        K   ++  Q   K  +     +EF+D S EE   + 
Sbjct:    29 FTAWMTSNQRTYASSEFTNRYNTFKSNLDFINQWNSKGSKTVLALNEFADISNEEY--RK 86

Query:   711 GFKWSERTYERI----VADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ-AACGS 765
              +  ++    ++    + D+             G      DWRKK        Q   CGS
Sbjct:    87 NYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSGS--SGIDWRKKGAVPSVKSQIGGCGS 144

Query:   766 CWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECA---KQC-SGCDGCFFEPSIEYTHQA 819
              W  +  G  E  + +   K   +  S   L++C+   KQC  G     F+  IE     
Sbjct:   145 -WPITAVGATESAHFLANPKDPFISLSMQNLIDCSNLNKQCYQGTVNEAFQYIIE---NG 200

Query:   820 GLESEKDYPYKNANGEKFKCAYDKSK-VKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN 878
             G++SE+ Y +  + GE  KC Y+ S  V   T  + +  +GSE+  +      P++  ++
Sbjct:   201 GIDSEESYKF--SGGEPGKCKYNSSNSVAKITSYEKVK-SGSESSLESAVSLKPVAAYID 257

Query:   879 SDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP---------YWLVRNSWGP 928
             + L    +  + I   + +C+  DL H++L+VG+      P         YW+V+NS+G 
Sbjct:   258 ASLSSFQFYSSGIYY-EPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGK 316

Query:   929 IGPDEG--FFKIERGNNACGIEQIAGYATI 956
                + G  F   +R +N CGI ++A Y  +
Sbjct:   317 NWGENGYIFMSKDRDDN-CGISKMASYVIV 345

 Score = 200 (75.5 bits), Expect = 7.6e-13, P = 7.6e-13
 Identities = 76/328 (23%), Positives = 134/328 (40%)

Query:   292 FKAFIVKRGRQYANDE------EIKERFEYFKQDGHKKHERY-GTSEFSDRSPEEILCKT 344
             F A++    R YA+ E        K   ++  Q   K  +     +EF+D S EE   + 
Sbjct:    29 FTAWMTSNQRTYASSEFTNRYNTFKSNLDFINQWNSKGSKTVLALNEFADISNEEY--RK 86

Query:   345 GFKWSERTYERI----VADRXXXXXXXXXXXXDGPVPDAWDWRKKNVTGPAGDQ-AACGS 399
              +  ++    ++    + D+             G      DWRKK        Q   CGS
Sbjct:    87 NYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSGS--SGIDWRKKGAVPSVKSQIGGCGS 144

Query:   400 CWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECAKQCSGCGGCDG-LEQPIEYT-HQAG 455
              W  +  G  E  + +   K   +  S   L++C+     C    G + +  +Y     G
Sbjct:   145 -WPITAVGATESAHFLANPKDPFISLSMQNLIDCSNLNKQC--YQGTVNEAFQYIIENGG 201

Query:   456 LESEKDYPYRNGNGEKFKCAYDKSK-VKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNS 514
             ++SE+ Y +    GE  KC Y+ S  V   T  + +  +GSE+  +      P++  +++
Sbjct:   202 IDSEESYKF--SGGEPGKCKYNSSNSVAKITSYEKVK-SGSESSLESAVSLKPVAAYIDA 258

Query:   515 HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP---------YWLARNSWGPIG 565
              L  F   +     + +C+  DL H++L+VG+      P         YW+ +NS+G   
Sbjct:   259 SLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNW 318

Query:   566 PDEG--FFKIERGNNACGIEQIAGYATI 591
              + G  F   +R +N CGI ++A Y  +
Sbjct:   319 GENGYIFMSKDRDDN-CGISKMASYVIV 345

 Score = 198 (74.8 bits), Expect = 1.3e-12, P = 1.3e-12
 Identities = 64/230 (27%), Positives = 106/230 (46%)

Query:    16 DWRKKNVTGPAGDQ-ADCGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECA---KQ 69
             DWRKK        Q   CGS W  +  G  E  + +   K   +  S   L++C+   KQ
Sbjct:   125 DWRKKGAVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCSNLNKQ 183

Query:    70 C-SGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSK-VKLFTGKDFLHFN 127
             C  G     F+  IE     G++SE+ Y +  + GE  KC Y+ S  V   T  + +  +
Sbjct:   184 CYQGTVNEAFQYIIE---NGGIDSEESYKF--SGGEPGKCKYNSSNSVAKITSYEKVK-S 237

Query:   128 GSETMKKILYKYGPLSVLLNSDLIH-DYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             GSE+  +      P++  +++ L    +  + I   + +C+  DL H++L+VG+      
Sbjct:   238 GSESSLESAVSLKPVAAYIDASLSSFQFYSSGIYY-EPSCNSTDLNHSILIVGFSDFSTT 296

Query:   187 P---------YWLVRNSWGPIGPDEG--FFKIERGNNACGIEQIAGYATI 225
             P         YW+V+NS+G    + G  F   +R +N CGI ++A Y  +
Sbjct:   297 PTDSLKHSSNYWIVQNSFGKNWGENGYIFMSKDRDDN-CGISKMASYVIV 345


>FB|FBgn0034709
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 211 (79.3 bits), Expect = 1.0e-13, P = 1.0e-13
 Identities = 60/211 (28%), Positives = 102/211 (48%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECAKQCSGCGGCDGLEQPIEY 450
             DQ  CG+ W  S   +   ++AI++ GK  V+ S   ++ C ++  GC G   L+    Y
Sbjct:   206 DQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQQGCEG-GHLDAAWRY 264

Query:   451 THQAGLESEKDYPYRNGNGEKFKCAYDKSKVK-------LFTGKDFLYFNG--------S 495
              H+ G+  E  YPY   + +  K  ++   ++       +   +D LY  G        +
Sbjct:   265 LHKKGVVDENCYPYTQ-HRDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREA 323

Query:   496 ETMKKILYKYGPLSVGLNSHLIHF-YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IP 553
             + M +I +  GP+   +  +   F Y+G   R+           H+V LVG+G++ +   
Sbjct:   324 DIMAEIFHS-GPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEK 382

Query:   554 YWLARNSWGPIGPDEGFFKIERGNNACGIEQ 584
             YW+A NSWG    + G+F+I RG+N CGIE+
Sbjct:   383 YWIAANSWGSWWGEHGYFRILRGSNECGIEE 413

 Score = 178 (67.7 bits), Expect = 4.4e-10, P = 4.4e-10
 Identities = 52/182 (28%), Positives = 84/182 (46%)

Query:    56 VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKN--------------- 100
             V+ S   ++ C ++  GC+G   + +  Y H+ G+  E  YPY                 
Sbjct:   236 VQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQHRDTCKIRHNSRSLR 295

Query:   101 ANGEKFKCAYDKSKVKLFT-GKDFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYNGT 157
             ANG +     D+    L+T G  +     ++ M +I +  GP+   +  N D    Y+G 
Sbjct:   296 ANGCQKPVNVDRDS--LYTVGPAYSLNREADIMAEIFHS-GPVQATMRVNRDFFA-YSGG 351

Query:   158 PIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGI 216
               R+           H+V LVG+G++ N   YW+  NSWG    + G+F+I RG+N CGI
Sbjct:   352 VYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGI 411

Query:   217 EQ 218
             E+
Sbjct:   412 EE 413

 Score = 178 (67.7 bits), Expect = 4.4e-10, P = 4.4e-10
 Identities = 52/182 (28%), Positives = 84/182 (46%)

Query:   787 VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKN--------------- 831
             V+ S   ++ C ++  GC+G   + +  Y H+ G+  E  YPY                 
Sbjct:   236 VQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQHRDTCKIRHNSRSLR 295

Query:   832 ANGEKFKCAYDKSKVKLFT-GKDFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYNGT 888
             ANG +     D+    L+T G  +     ++ M +I +  GP+   +  N D    Y+G 
Sbjct:   296 ANGCQKPVNVDRDS--LYTVGPAYSLNREADIMAEIFHS-GPVQATMRVNRDFFA-YSGG 351

Query:   889 PIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERGNNACGI 947
               R+           H+V LVG+G++ N   YW+  NSWG    + G+F+I RG+N CGI
Sbjct:   352 VYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGI 411

Query:   948 EQ 949
             E+
Sbjct:   412 EE 413

 Score = 125 (49.1 bits), Expect = 2.8e-12, Sum P(2) = 2.8e-12
 Identities = 42/175 (24%), Positives = 76/175 (43%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECAKQCSGCDGCFFEPSIEYT 816
             DQ  CG+ W  S   +   ++AI++ GK  V+ S   ++ C ++  GC+G   + +  Y 
Sbjct:   206 DQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQQGCEGGHLDAAWRYL 265

Query:   817 HQAGLESEKDYPYKNA--------NGEKFK---CA--YDKSKVKLFT-GKDFLHFNGSET 862
             H+ G+  E  YPY           N    +   C    +  +  L+T G  +     ++ 
Sbjct:   266 HKKGVVDENCYPYTQHRDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREADI 325

Query:   863 MKKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
             M +I +  GP+   +  +     Y+G   R+           H+V LVG+G++ N
Sbjct:   326 MAEIFHS-GPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHN 379

 Score = 124 (48.7 bits), Expect = 2.8e-12, Sum P(2) = 2.8e-12
 Identities = 22/47 (46%), Positives = 33/47 (70%)

Query:   972 HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ 1017
             H+V LVG+G++ +   YW+  NSWG    + G+F+I RG+N CGIE+
Sbjct:   367 HSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEE 413

 Score = 124 (48.7 bits), Expect = 3.6e-12, Sum P(2) = 3.6e-12
 Identities = 42/175 (24%), Positives = 76/175 (43%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECAKQCSGCDGCFFEPSIEYT 85
             DQ  CG+ W  S   +   ++AI++ GK  V+ S   ++ C ++  GC+G   + +  Y 
Sbjct:   206 DQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSCTRRQQGCEGGHLDAAWRYL 265

Query:    86 HQAGLESEKDYPYKNA--------NGEKFK---CA--YDKSKVKLFT-GKDFLHFNGSET 131
             H+ G+  E  YPY           N    +   C    +  +  L+T G  +     ++ 
Sbjct:   266 HKKGVVDENCYPYTQHRDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNREADI 325

Query:   132 MKKILYKYGPLSVLLNSDL-IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
             M +I +  GP+   +  +     Y+G   R+           H+V LVG+G++ N
Sbjct:   326 MAEIFHS-GPVQATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHN 379


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 150 (57.9 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 26/52 (50%), Positives = 37/52 (71%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             GHA+ ++G+G+++ +PYWLA NSW     D G+FKI RG + CGIE   +AG
Sbjct:   276 GHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

 Score = 147 (56.8 bits), Expect = 2.6e-13, Sum P(2) = 2.6e-13
 Identities = 25/52 (48%), Positives = 36/52 (69%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             GHA+ ++G+G+++ +PYWL  NSW     D G+FKI RG + CGIE   +AG
Sbjct:   276 GHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

 Score = 147 (56.8 bits), Expect = 5.3e-13, Sum P(2) = 5.3e-13
 Identities = 25/52 (48%), Positives = 36/52 (69%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             GHA+ ++G+G+++ +PYWL  NSW     D G+FKI RG + CGIE   +AG
Sbjct:   276 GHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

 Score = 146 (56.5 bits), Expect = 3.4e-13, Sum P(2) = 3.4e-13
 Identities = 25/52 (48%), Positives = 36/52 (69%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G+++ +PYWL  NSW     D G+FKI RG + CGIE   +AG
Sbjct:   276 GHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

 Score = 106 (42.4 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 23/54 (42%), Positives = 31/54 (57%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKSQLVECAKQCSGCG-GCDG 443
             DQ +CGSCWAF  A  +  +  I++  K+ VE S   L+ C   C  CG GC+G
Sbjct:   100 DQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTC---CDSCGMGCNG 150

 Score = 104 (41.7 bits), Expect = 4.2e-13, Sum P(2) = 4.2e-13
 Identities = 23/66 (34%), Positives = 35/66 (53%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKSQLVECAKQCS-GCDGCFFEPSIEY 815
             DQ +CGSCWAF  A  +  +  I++  K+ VE S   L+ C   C  GC+G +   + ++
Sbjct:   100 DQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDF 159

Query:   816 THQAGL 821
                 GL
Sbjct:   160 WTTDGL 165

 Score = 103 (41.3 bits), Expect = 2.4e-13, Sum P(2) = 2.4e-13
 Identities = 23/66 (34%), Positives = 34/66 (51%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKSQLVECAKQCS-GCDGCFFEPSIEY 84
             DQ  CGSCWAF  A  +  +  I++  K+ VE S   L+ C   C  GC+G +   + ++
Sbjct:   100 DQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCGMGCNGGYPSAAWDF 159

Query:    85 THQAGL 90
                 GL
Sbjct:   160 WTTDGL 165


>UNIPROTKB|P07688
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 155 (59.6 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 39/99 (39%), Positives = 53/99 (53%)

Query:   858 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             N  E M +I YK GP+    +  SD +   +G     + E       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMG----GHAIRILGWGVEN 289

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
               PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 155 (59.6 bits), Expect = 2.9e-13, Sum P(2) = 2.9e-13
 Identities = 39/99 (39%), Positives = 53/99 (53%)

Query:   127 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N  E M +I YK GP+    +  SD +   +G     + E       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMG----GHAIRILGWGVEN 289

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
               PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 151 (58.2 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
 Identities = 37/99 (37%), Positives = 53/99 (53%)

Query:   493 NGSETMKKILYKYGPL--SVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
             N  E M +I YK GP+  +  + S  + + +G     + E       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMG----GHAIRILGWGVEN 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
               PYWL  NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 145 (56.1 bits), Expect = 2.0e-12, Sum P(2) = 2.0e-12
 Identities = 27/52 (51%), Positives = 35/52 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++  PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   277 GHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 100 (40.3 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 27/79 (34%), Positives = 42/79 (53%)

Query:   373 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 426
             D  +P+++D R++    P      DQ +CGSCWAF     +  +  I + G++ VE S  
Sbjct:    77 DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query:   427 QLVECAKQCSG-CG-GCDG 443
              ++ C   C G CG GC+G
Sbjct:   137 DMLTC---CGGECGDGCNG 152

 Score = 98 (39.6 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13
 Identities = 28/93 (30%), Positives = 47/93 (50%)

Query:   739 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 792
             D  +P+++D R++    P      DQ +CGSCWAF     +  +  I + G++ VE S  
Sbjct:    77 DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query:   793 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              ++ C   +C  GC+G F   +  +  + GL S
Sbjct:   137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVS 169

 Score = 97 (39.2 bits), Expect = 2.9e-13, Sum P(2) = 2.9e-13
 Identities = 28/93 (30%), Positives = 46/93 (49%)

Query:     8 DGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKS 61
             D  +P+++D R++    P      DQ  CGSCWAF     +  +  I + G++ VE S  
Sbjct:    77 DVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAE 136

Query:    62 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
              ++ C   +C  GC+G F   +  +  + GL S
Sbjct:   137 DMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVS 169


>UNIPROTKB|A1E295
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 155 (59.6 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13
 Identities = 38/99 (38%), Positives = 53/99 (53%)

Query:   858 NGSETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             N  E M +I YK GP+  +  + SD +   +G       +       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMG----GHAIRILGWGVEN 289

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
               PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 155 (59.6 bits), Expect = 2.9e-13, Sum P(2) = 2.9e-13
 Identities = 38/99 (38%), Positives = 53/99 (53%)

Query:   127 NGSETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N  E M +I YK GP+  +  + SD +   +G       +       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMG----GHAIRILGWGVEN 289

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
               PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 148 (57.2 bits), Expect = 1.8e-12, Sum P(2) = 1.8e-12
 Identities = 36/99 (36%), Positives = 50/99 (50%)

Query:   493 NGSETMKKILYKYGPLSVGLN--SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
             N  E M +I YK GP+       S  + + +G       +       GHA+ ++G+G ++
Sbjct:   235 NEKEIMAEI-YKNGPVEGAFTVYSDFLQYKSGVYQHVTGDLMG----GHAIRILGWGVEN 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
               PYWL  NSW     D GFFKI RG + CGIE   +AG
Sbjct:   290 GTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 145 (56.1 bits), Expect = 3.2e-12, Sum P(2) = 3.2e-12
 Identities = 27/52 (51%), Positives = 35/52 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++  PYWLV NSW     D GFFKI RG + CGIE   +AG
Sbjct:   277 GHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

 Score = 98 (39.6 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13
 Identities = 23/69 (33%), Positives = 37/69 (53%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECA-KQCS-GCDGCFFEPSIE 814
             DQ +CGSCWAF     +  +  I++ G++ VE S   ++ C   +C  GC+G F   +  
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWN 160

Query:   815 YTHQAGLES 823
             +  + GL S
Sbjct:   161 FWTKKGLVS 169

 Score = 97 (39.2 bits), Expect = 2.9e-13, Sum P(2) = 2.9e-13
 Identities = 23/69 (33%), Positives = 36/69 (52%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECA-KQCS-GCDGCFFEPSIE 83
             DQ  CGSCWAF     +  +  I++ G++ VE S   ++ C   +C  GC+G F   +  
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDGCNGGFPSGAWN 160

Query:    84 YTHQAGLES 92
             +  + GL S
Sbjct:   161 FWTKKGLVS 169

 Score = 92 (37.4 bits), Expect = 9.6e-13, Sum P(2) = 9.6e-13
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKT-GKL-VEFSKSQLVECAKQCSG-CG-GCDG 443
             DQ +CGSCWAF     +  +  I++ G++ VE S   ++ C   C   CG GC+G
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTC---CGDECGDGCNG 152


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 202 (76.2 bits), Expect = 2.5e-13, P = 2.5e-13
 Identities = 65/233 (27%), Positives = 102/233 (43%)

Query:     6 EKDGPVPDAWDWRKK--NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE---FSK 60
             E  G +P ++D R +  +   P  +Q  CGSCWAFS + +L  +  I +         S 
Sbjct:    83 ELKGSIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSP 142

Query:    61 SQLVECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKC---AYDKSKVK 116
               LV C    + GC G   + + EY    GL ++   PY   NG  + C     D     
Sbjct:   143 QTLVACDVYGNDGCSGGIPQLAWEYMELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYS 202

Query:   117 LFTGKDF-LHFNGS-ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLG 172
             L+  K F L    S + +++ +  YGP+  ++ +  D +   +G  +     +      G
Sbjct:   203 LYRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLG---G 259

Query:   173 HAVLLVGYG--KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 223
             HA+ +VG+G  +   + YW+V NSWG     +GFF I      C I   A  A
Sbjct:   260 HAIKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISM--ETCSISSDASAA 310

 Score = 200 (75.5 bits), Expect = 4.3e-13, P = 4.3e-13
 Identities = 64/232 (27%), Positives = 101/232 (43%)

Query:   374 GPVPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE---FSKSQL 428
             G +P ++D R +  +   P  +Q  CGSCWAFS + +L  +  I +         S   L
Sbjct:    86 GSIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTL 145

Query:   429 VECAKQCS-GCGGCDGLEQPI-EYTHQAGLESEKDYPYRNGNGEKFKC---AYDKSKVKL 483
             V C    + GC G  G+ Q   EY    GL ++   PY  GNG  + C     D     L
Sbjct:   146 VACDVYGNDGCSG--GIPQLAWEYMELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSL 203

Query:   484 FTGKDFLYFNGS--ETMKKILYKYGPL--SVGLNSHLIHFYNGTPIRKNDETCSPYDLGH 539
             +  K F     S  + +++ +  YGP+  ++ +    + + +G  +     +      GH
Sbjct:   204 YRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLG---GH 260

Query:   540 AVLLVGYG--KQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 589
             A+ +VG+G  +   + YW+  NSWG     +GFF I      C I   A  A
Sbjct:   261 AIKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISM--ETCSISSDASAA 310

 Score = 199 (75.1 bits), Expect = 5.8e-13, P = 5.8e-13
 Identities = 64/230 (27%), Positives = 101/230 (43%)

Query:   740 GPVPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE---FSKSQL 794
             G +P ++D R +  +   P  +Q  CGSCWAFS + +L  +  I +         S   L
Sbjct:    86 GSIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTL 145

Query:   795 VECAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKC---AYDKSKVKLFT 850
             V C    + GC G   + + EY    GL ++   PY   NG  + C     D     L+ 
Sbjct:   146 VACDVYGNDGCSGGIPQLAWEYMELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYR 205

Query:   851 GKDF-LHFNGS-ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAV 906
              K F L    S + +++ +  YGP+  ++ +  D +   +G  +     +      GHA+
Sbjct:   206 AKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLG---GHAI 262

Query:   907 LLVGYG--KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 954
              +VG+G  +   + YW+V NSWG     +GFF I      C I   A  A
Sbjct:   263 KIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISM--ETCSISSDASAA 310


>UNIPROTKB|F1MW68
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 200 (75.5 bits), Expect = 3.3e-13, P = 3.3e-13
 Identities = 63/219 (28%), Positives = 96/219 (43%)

Query:   376 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR  N      VT        CGSCWA      +  +  IK  K    S    V
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKR-KGAWPSTLLSV 121

Query:   430 ECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKFK----CAYDKS-- 479
             +    C   G C+G  + P+ EY H+ G+  E   +Y  ++   +KF     C   K   
Sbjct:   122 QHVLDCGDAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECH 181

Query:   480 KVKLFT-GK--DFLYFNGSETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSP 534
              +K +T  K  D+   +G E M   +Y  GP+S G+     + ++  G     ND+    
Sbjct:   182 VIKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAF-- 239

Query:   535 YDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
               + H V + G+G  D + YW+ RNSWG    + G+ +I
Sbjct:   240 --INHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRI 276

 Score = 177 (67.4 bits), Expect = 1.9e-10, P = 1.9e-10
 Identities = 55/192 (28%), Positives = 88/192 (45%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      +  +  IK  G       S   +++C      C+G    P  EY H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCG-DAGSCEGGNDLPVWEYAHRH 148

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT-GK--DFLHFNGSETMKKILY 137
             G+  E   +Y  K+   +KF     C   K    +K +T  K  D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAEIY 208

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 195
               GP+S  ++ ++ + +Y G    + ND+      + H V + G+G  D + YW+VRNSW
Sbjct:   209 TNGPISCGIMATEKMSNYTGGIYSEYNDQAF----INHIVSVAGWGVSDGMEYWIVRNSW 264

Query:   196 GPIGPDEGFFKI 207
             G    + G+ +I
Sbjct:   265 GEPWGEHGWMRI 276

 Score = 177 (67.4 bits), Expect = 4.0e-10, Sum P(2) = 4.0e-10
 Identities = 55/192 (28%), Positives = 88/192 (45%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      +  +  IK  G       S   +++C      C+G    P  EY H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCG-DAGSCEGGNDLPVWEYAHRH 148

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT-GK--DFLHFNGSETMKKILY 868
             G+  E   +Y  K+   +KF     C   K    +K +T  K  D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAEIY 208

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 926
               GP+S  ++ ++ + +Y G    + ND+      + H V + G+G  D + YW+VRNSW
Sbjct:   209 TNGPISCGIMATEKMSNYTGGIYSEYNDQAF----INHIVSVAGWGVSDGMEYWIVRNSW 264

Query:   927 GPIGPDEGFFKI 938
             G    + G+ +I
Sbjct:   265 GEPWGEHGWMRI 276

 Score = 101 (40.6 bits), Expect = 1.1e-07, Sum P(2) = 1.1e-07
 Identities = 32/110 (29%), Positives = 46/110 (41%)

Query:   742 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P +WDWR  N      VT        CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C      C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVLDCG-DAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQC 171

 Score = 100 (40.3 bits), Expect = 1.1e-07, Sum P(2) = 1.1e-07
 Identities = 30/97 (30%), Positives = 48/97 (49%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYAT-IDVVKNDETCSPYD 969
             N   W V   +G + G ++   +I   G  +CGI   E+++ Y   I    ND+      
Sbjct:   185 NYTLWKV-GDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAF---- 239

Query:   970 LGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 1006
             + H V + G+G  D + YW+VRNSWG    + G+ +I
Sbjct:   240 INHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRI 276

 Score = 100 (40.3 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 31/110 (28%), Positives = 46/110 (41%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P +WDWR  N    A    +      CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C      C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVLDCG-DAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQC 171

 Score = 39 (18.8 bits), Expect = 4.0e-10, Sum P(2) = 4.0e-10
 Identities = 12/56 (21%), Positives = 22/56 (39%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             EY   + L   K + ++N NG  +        +  + G  + H + S    +I  K
Sbjct:    56 EYLSPSDLP--KSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIK 109


>UNIPROTKB|P05689
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 199 (75.1 bits), Expect = 4.5e-13, P = 4.5e-13
 Identities = 63/219 (28%), Positives = 96/219 (43%)

Query:   376 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR  N      VT        CGSCWA      +  +  IK  K    S    V
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKR-KGAWPSTLLSV 121

Query:   430 ECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKFK----CAYDKS-- 479
             +    C   G C+G  + P+ EY H+ G+  E   +Y  ++   +KF     C   K   
Sbjct:   122 QHVIDCGDAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECH 181

Query:   480 KVKLFT-GK--DFLYFNGSETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSP 534
              +K +T  K  D+   +G E M   +Y  GP+S G+     + ++  G     ND+    
Sbjct:   182 VIKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAF-- 239

Query:   535 YDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
               + H V + G+G  D + YW+ RNSWG    + G+ +I
Sbjct:   240 --INHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRI 276

 Score = 179 (68.1 bits), Expect = 1.1e-10, P = 1.1e-10
 Identities = 55/192 (28%), Positives = 88/192 (45%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      +  +  IK  G       S   +++C      C+G    P  EY H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG-DAGSCEGGNDLPVWEYAHRH 148

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT-GK--DFLHFNGSETMKKILY 137
             G+  E   +Y  K+   +KF     C   K    +K +T  K  D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAEIY 208

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 195
               GP+S  ++ ++ + +Y G    + ND+      + H V + G+G  D + YW+VRNSW
Sbjct:   209 TNGPISCGIMATEKMSNYTGGIYSEYNDQAF----INHIVSVAGWGVSDGMEYWIVRNSW 264

Query:   196 GPIGPDEGFFKI 207
             G    + G+ +I
Sbjct:   265 GEPWGEHGWMRI 276

 Score = 179 (68.1 bits), Expect = 2.2e-10, Sum P(2) = 2.2e-10
 Identities = 55/192 (28%), Positives = 88/192 (45%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      +  +  IK  G       S   +++C      C+G    P  EY H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG-DAGSCEGGNDLPVWEYAHRH 148

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT-GK--DFLHFNGSETMKKILY 868
             G+  E   +Y  K+   +KF     C   K    +K +T  K  D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAEIY 208

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 926
               GP+S  ++ ++ + +Y G    + ND+      + H V + G+G  D + YW+VRNSW
Sbjct:   209 TNGPISCGIMATEKMSNYTGGIYSEYNDQAF----INHIVSVAGWGVSDGMEYWIVRNSW 264

Query:   927 GPIGPDEGFFKI 938
             G    + G+ +I
Sbjct:   265 GEPWGEHGWMRI 276

 Score = 103 (41.3 bits), Expect = 6.8e-08, Sum P(2) = 6.8e-08
 Identities = 32/110 (29%), Positives = 46/110 (41%)

Query:   742 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P +WDWR  N      VT        CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C      C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVIDCG-DAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQC 171

 Score = 102 (41.0 bits), Expect = 8.8e-08, Sum P(2) = 8.8e-08
 Identities = 31/110 (28%), Positives = 46/110 (41%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P +WDWR  N    A    +      CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C      C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVIDCG-DAGSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQECDKFNQC 171

 Score = 100 (40.3 bits), Expect = 6.8e-08, Sum P(2) = 6.8e-08
 Identities = 30/97 (30%), Positives = 48/97 (49%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYAT-IDVVKNDETCSPYD 969
             N   W V   +G + G ++   +I   G  +CGI   E+++ Y   I    ND+      
Sbjct:   185 NYTLWKV-GDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAF---- 239

Query:   970 LGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 1006
             + H V + G+G  D + YW+VRNSWG    + G+ +I
Sbjct:   240 INHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRI 276

 Score = 39 (18.8 bits), Expect = 2.2e-10, Sum P(2) = 2.2e-10
 Identities = 12/56 (21%), Positives = 22/56 (39%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             EY   + L   K + ++N NG  +        +  + G  + H + S    +I  K
Sbjct:    56 EYLSPSDLP--KSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIK 109


>UNIPROTKB|P43233
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 139 (54.0 bits), Expect = 4.5e-13, Sum P(2) = 4.5e-13
 Identities = 35/96 (36%), Positives = 51/96 (53%)

Query:   130 ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 187
             E M +I YK GP+  + ++  D +   +G     + E       GHA+ ++G+G ++  P
Sbjct:   239 EIMAEI-YKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVG----GHAIRILGWGVENGTP 293

Query:   188 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             YWL  NSW       GFFKI RG + CGIE   +AG
Sbjct:   294 YWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329

 Score = 139 (54.0 bits), Expect = 4.5e-13, Sum P(2) = 4.5e-13
 Identities = 35/96 (36%), Positives = 51/96 (53%)

Query:   861 ETMKKILYKYGPL--SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP 918
             E M +I YK GP+  + ++  D +   +G     + E       GHA+ ++G+G ++  P
Sbjct:   239 EIMAEI-YKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVG----GHAIRILGWGVENGTP 293

Query:   919 YWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             YWL  NSW       GFFKI RG + CGIE   +AG
Sbjct:   294 YWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329

 Score = 136 (52.9 bits), Expect = 9.8e-13, Sum P(2) = 9.8e-13
 Identities = 26/52 (50%), Positives = 34/52 (65%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             GHA+ ++G+G ++  PYWLA NSW       GFFKI RG + CGIE   +AG
Sbjct:   278 GHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329

 Score = 132 (51.5 bits), Expect = 2.8e-12, Sum P(2) = 2.8e-12
 Identities = 25/52 (48%), Positives = 33/52 (63%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHA+ ++G+G ++  PYWL  NSW       GFFKI RG + CGIE   +AG
Sbjct:   278 GHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329

 Score = 113 (44.8 bits), Expect = 4.5e-13, Sum P(2) = 4.5e-13
 Identities = 31/98 (31%), Positives = 47/98 (47%)

Query:     3 MEVEKDGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKTG-KL-V 56
             ++  +D  +PD +D RK+    P      DQ  CGSCWAF     +  +  + T  K+ V
Sbjct:    72 VDFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSV 131

Query:    57 EFSKSQLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 92
             E S   L+ C   +C  GC+G +   +  Y  + GL S
Sbjct:   132 EVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVS 169

 Score = 112 (44.5 bits), Expect = 5.7e-13, Sum P(2) = 5.7e-13
 Identities = 31/93 (33%), Positives = 45/93 (48%)

Query:   739 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKS 792
             D  +PD +D RK+    P      DQ +CGSCWAF     +  +  + T  K+ VE S  
Sbjct:    77 DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query:   793 QLVECAK-QCS-GCDGCFFEPSIEYTHQAGLES 823
              L+ C   +C  GC+G +   +  Y  + GL S
Sbjct:   137 DLLSCCGFECGMGCNGGYPSGAWRYWTERGLVS 169

 Score = 104 (41.7 bits), Expect = 3.9e-12, Sum P(2) = 3.9e-12
 Identities = 29/79 (36%), Positives = 39/79 (49%)

Query:   373 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTG-KL-VEFSKS 426
             D  +PD +D RK+    P      DQ +CGSCWAF     +  +  + T  K+ VE S  
Sbjct:    77 DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136

Query:   427 QLVECAKQCS-GCG-GCDG 443
              L+ C   C   CG GC+G
Sbjct:   137 DLLSC---CGFECGMGCNG 152


>WB|WBGene00000789
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 205 (77.2 bits), Expect = 5.7e-13, P = 5.7e-13
 Identities = 80/315 (25%), Positives = 130/315 (41%)

Query:   315 YFKQDGHKKHERYGTSEFSDRSPEEI--LCKTG-FKWSERTYERIVADRXXXXXXXXXXX 371
             Y++ +     +    SE S    EE     K G  K S + +E   A R           
Sbjct:   161 YYEPNDEALVDMSSESEESSEEWEEARPYLKCGCLKKSGKVFESKTAPREWESSSFKS-- 218

Query:   372 XDGPVPDAWDWRKK---NVTGPAGDQ---AACGSCWAFSIAGMLEGQYAI-KTGK--LVE 422
                 +P  WDWR     N   P  +Q     CGSCW F   G L  ++ + + G+  + +
Sbjct:   219 --NDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQ 276

Query:   423 FSKSQLVECAKQCSGCGGCDGLE--QPIEYTHQAGLESEKDYPYRNGNGE---KFKCA-- 475
              S  ++++C    +G G C G E    +E+    GL  E    YR  NGE     +C   
Sbjct:   277 LSPQEIIDC----NGKGNCQGGEIGNVLEHAKIQGLVEEGCNVYRATNGECNPYHRCGSC 332

Query:   476 -----YDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLS--VGLNSHLIHFY-NGTPIRK 527
                  +  +    +  KD+    G + +   + K GP++  +G      + Y  G    K
Sbjct:   333 WPNECFSLTNYTRYYVKDYGQVQGRDKIMSEIKKGGPIACAIGATKKFEYEYVKGVYSEK 392

Query:   528 NDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIE-------RGNNA 579
             +D      +  H + L G+G  ++ + YW+ARNSWG    + G+F++        +G+  
Sbjct:   393 SD-----LESNHIISLTGWGVDENGVEYWIARNSWGEAWGELGWFRVVTSKFKDGQGDQY 447

Query:   580 -CGIEQIAGYATIDV 593
               GIE+   YA +DV
Sbjct:   448 NMGIERDCYYADVDV 462

 Score = 174 (66.3 bits), Expect = 1.4e-09, P = 1.4e-09
 Identities = 71/307 (23%), Positives = 124/307 (40%)

Query:   681 YFKQDGHKKHERYGTSEFSDRSPEEI--LCKTG-FKWSERTYERIVADRXXXXXXXXXXX 737
             Y++ +     +    SE S    EE     K G  K S + +E   A R           
Sbjct:   161 YYEPNDEALVDMSSESEESSEEWEEARPYLKCGCLKKSGKVFESKTAPREWESSSFKSN- 219

Query:   738 XDGPVPDAW-DWRKKNVTGPAGDQ---AACGSCWAFSIAGMLEGQYAI-KTGK--LVEFS 790
              D P    W +    N   P  +Q     CGSCW F   G L  ++ + + G+  + + S
Sbjct:   220 -DLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLS 278

Query:   791 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGE---KFKCA------- 840
               ++++C  +   C G      +E+    GL  E    Y+  NGE     +C        
Sbjct:   279 PQEIIDCNGK-GNCQGGEIGNVLEHAKIQGLVEEGCNVYRATNGECNPYHRCGSCWPNEC 337

Query:   841 YDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPY 900
             +  +    +  KD+    G + +   + K GP++  + +    +Y       +++  S  
Sbjct:   338 FSLTNYTRYYVKDYGQVQGRDKIMSEIKKGGPIACAIGATKKFEYEYVKGVYSEK--SDL 395

Query:   901 DLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIE-------RGNNA-CGIEQIA 951
             +  H + L G+G  +N + YW+ RNSWG    + G+F++        +G+    GIE+  
Sbjct:   396 ESNHIISLTGWGVDENGVEYWIARNSWGEAWGELGWFRVVTSKFKDGQGDQYNMGIERDC 455

Query:   952 GYATIDV 958
              YA +DV
Sbjct:   456 YYADVDV 462

 Score = 173 (66.0 bits), Expect = 1.8e-09, P = 1.8e-09
 Identities = 52/218 (23%), Positives = 97/218 (44%)

Query:    32 CGSCWAFSIAGMLEGQYAI-KTGK--LVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCW F   G L  ++ + + G+  + + S  ++++C  +   C G      +E+    
Sbjct:   248 CGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGK-GNCQGGEIGNVLEHAKIQ 306

Query:    89 GLESEKDYPYKNANGE---KFKCA-------YDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             GL  E    Y+  NGE     +C        +  +    +  KD+    G + +   + K
Sbjct:   307 GLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDKIMSEIKK 366

Query:   139 YGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSWGP 197
              GP++  + +    +Y       +++  S  +  H + L G+G  +N + YW+ RNSWG 
Sbjct:   367 GGPIACAIGATKKFEYEYVKGVYSEK--SDLESNHIISLTGWGVDENGVEYWIARNSWGE 424

Query:   198 IGPDEGFFKIE-------RGNNA-CGIEQIAGYATIDV 227
                + G+F++        +G+    GIE+   YA +DV
Sbjct:   425 AWGELGWFRVVTSKFKDGQGDQYNMGIERDCYYADVDV 462

 Score = 122 (48.0 bits), Expect = 2.9e-09, Sum P(2) = 2.9e-09
 Identities = 30/107 (28%), Positives = 47/107 (43%)

Query:     7 KDGPVPDAWDWRKK---NVTGPAGDQ---ADCGSCWAFSIAGMLEGQYAI-KTGK--LVE 57
             K   +P  WDWR     N   P  +Q     CGSCW F   G L  ++ + + G+  + +
Sbjct:   217 KSNDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQ 276

Query:    58 FSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGE 104
              S  ++++C  +   C G      +E+    GL  E    Y+  NGE
Sbjct:   277 LSPQEIIDCNGK-GNCQGGEIGNVLEHAKIQGLVEEGCNVYRATNGE 322

 Score = 99 (39.9 bits), Expect = 2.9e-09, Sum P(2) = 2.9e-09
 Identities = 34/124 (27%), Positives = 63/124 (50%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKIERGNN-AC--GIEQIAGYATIDVVKNDETCSPYDLG 971
             N   + V++ +G + G D+   +I++G   AC  G  +   Y  +  V +++  S  +  
Sbjct:   342 NYTRYYVKD-YGQVQGRDKIMSEIKKGGPIACAIGATKKFEYEYVKGVYSEK--SDLESN 398

Query:   972 HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIE-------RGNNA-CGIEQIAGYA 1022
             H + L G+G  ++ + YW+ RNSWG    + G+F++        +G+    GIE+   YA
Sbjct:   399 HIISLTGWGVDENGVEYWIARNSWGEAWGELGWFRVVTSKFKDGQGDQYNMGIERDCYYA 458

Query:  1023 TIDV 1026
              +DV
Sbjct:   459 DVDV 462


>UNIPROTKB|P07858
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 153 (58.9 bits), Expect = 1.7e-12, Sum P(2) = 1.7e-12
 Identities = 37/104 (35%), Positives = 52/104 (50%)

Query:   858 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 915
             N  + +   +YK GP+    +  SD +   +G       E       GHA+ ++G+G ++
Sbjct:   234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG----GHAIRILGWGVEN 289

Query:   916 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 957
               PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333

 Score = 153 (58.9 bits), Expect = 2.2e-12, Sum P(2) = 2.2e-12
 Identities = 37/104 (35%), Positives = 52/104 (50%)

Query:   127 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 184
             N  + +   +YK GP+    +  SD +   +G       E       GHA+ ++G+G ++
Sbjct:   234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG----GHAIRILGWGVEN 289

Query:   185 NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 226
               PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333

 Score = 149 (57.5 bits), Expect = 5.0e-12, Sum P(2) = 5.0e-12
 Identities = 35/104 (33%), Positives = 52/104 (50%)

Query:   493 NGSETMKKILYKYGPL--SVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD 550
             N  + +   +YK GP+  +  + S  + + +G       E       GHA+ ++G+G ++
Sbjct:   234 NSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG----GHAIRILGWGVEN 289

Query:   551 DIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 592
               PYWL  NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   290 GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333

 Score = 148 (57.2 bits), Expect = 6.5e-12, Sum P(2) = 6.5e-12
 Identities = 28/57 (49%), Positives = 36/57 (63%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAGYATID 1025
             GHA+ ++G+G ++  PYWLV NSW     D GFFKI RG + CGIE   +AG    D
Sbjct:   277 GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 333

 Score = 92 (37.4 bits), Expect = 1.7e-12, Sum P(2) = 1.7e-12
 Identities = 22/54 (40%), Positives = 27/54 (50%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECAKQCSGCG-GCDG 443
             DQ +CGSCWAF     +  +  I T     VE S   L+ C    S CG GC+G
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCG--SMCGDGCNG 152

 Score = 92 (37.4 bits), Expect = 1.7e-12, Sum P(2) = 1.7e-12
 Identities = 23/69 (33%), Positives = 32/69 (46%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECA-KQCS-GCDGCFFEPSIE 814
             DQ +CGSCWAF     +  +  I T     VE S   L+ C    C  GC+G +   +  
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 160

Query:   815 YTHQAGLES 823
             +  + GL S
Sbjct:   161 FWTRKGLVS 169

 Score = 91 (37.1 bits), Expect = 2.2e-12, Sum P(2) = 2.2e-12
 Identities = 23/69 (33%), Positives = 31/69 (44%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTGK--LVEFSKSQLVECA-KQCS-GCDGCFFEPSIE 83
             DQ  CGSCWAF     +  +  I T     VE S   L+ C    C  GC+G +   +  
Sbjct:   101 DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 160

Query:    84 YTHQAGLES 92
             +  + GL S
Sbjct:   161 FWTRKGLVS 169


>UNIPROTKB|F1M8U6
            symbol:F1M8U6 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            IPI:IPI00782277 Ensembl:ENSRNOT00000055587 OMA:EREIAAW
            Uniprot:F1M8U6
        Length = 163

 Score = 178 (67.7 bits), Expect = 2.3e-12, P = 2.3e-12
 Identities = 56/175 (32%), Positives = 87/175 (49%)

Query:    60 KSQLVECAKQCSGCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVK 116
             K +L++C K    C G    PS  YT   +  GLE+E  Y Y+   G    C +     K
Sbjct:     1 KKELLDCDKMDKACLGGL--PSNAYTAIKNLGGLETEDGYGYE---GHFQACNFLAQMTK 55

Query:   117 LFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGT--PIRKNDETCSPYDLGH 173
             ++   D +  + +E+ +  +L + G +SV +     H Y GT  P+R     CSP    H
Sbjct:    56 VYIS-DSVELSQNESSIAALLAQKGLISVAIMQ--FHRY-GTVHPLRP---LCSPGFTDH 108

Query:   174 AVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 226
             +VLLVGYG +   NIPYW ++N  G    +EG + + RG+   G+  +A  A ++
Sbjct:   109 SVLLVGYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRGSGDRGVNTMASSAVVN 163

 Score = 178 (67.7 bits), Expect = 2.3e-12, P = 2.3e-12
 Identities = 56/175 (32%), Positives = 87/175 (49%)

Query:   791 KSQLVECAKQCSGCDGCFFEPSIEYT---HQAGLESEKDYPYKNANGEKFKCAYDKSKVK 847
             K +L++C K    C G    PS  YT   +  GLE+E  Y Y+   G    C +     K
Sbjct:     1 KKELLDCDKMDKACLGGL--PSNAYTAIKNLGGLETEDGYGYE---GHFQACNFLAQMTK 55

Query:   848 LFTGKDFLHFNGSET-MKKILYKYGPLSVLLNSDLIHDYNGT--PIRKNDETCSPYDLGH 904
             ++   D +  + +E+ +  +L + G +SV +     H Y GT  P+R     CSP    H
Sbjct:    56 VYIS-DSVELSQNESSIAALLAQKGLISVAIMQ--FHRY-GTVHPLRP---LCSPGFTDH 108

Query:   905 AVLLVGYGKQ--DNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 957
             +VLLVGYG +   NIPYW ++N  G    +EG + + RG+   G+  +A  A ++
Sbjct:   109 SVLLVGYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRGSGDRGVNTMASSAVVN 163

 Score = 161 (61.7 bits), Expect = 1.5e-10, P = 1.5e-10
 Identities = 53/175 (30%), Positives = 80/175 (45%)

Query:   425 KSQLVECAKQCSGCGGCDGLEQPIEYT---HQAGLESEKDYPYRNGNGEKFKCAYDKSKV 481
             K +L++C K    C G  GL     YT   +  GLE+E  Y Y    G    C +     
Sbjct:     1 KKELLDCDKMDKACLG--GLPSNA-YTAIKNLGGLETEDGYGYE---GHFQACNFLAQMT 54

Query:   482 KLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGT--PIRKNDETCSPYDLGH 539
             K++            ++  +L + G +SV +     H Y GT  P+R     CSP    H
Sbjct:    55 KVYISDSVELSQNESSIAALLAQKGLISVAIMQ--FHRY-GTVHPLRP---LCSPGFTDH 108

Query:   540 AVLLVGYGKQ--DDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATID 592
             +VLLVGYG +   +IPYW  +N  G    +EG + + RG+   G+  +A  A ++
Sbjct:   109 SVLLVGYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRGSGDRGVNTMASSAVVN 163

 Score = 128 (50.1 bits), Expect = 5.0e-07, P = 5.0e-07
 Identities = 29/89 (32%), Positives = 48/89 (53%)

Query:   939 ERGNNACGIEQIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGP 996
             ++G  +  I Q   Y T+  ++    CSP    H+VLLVGYG +   +IPYW ++N  G 
Sbjct:    77 QKGLISVAIMQFHRYGTVHPLR--PLCSPGFTDHSVLLVGYGNRPRSNIPYWAIKNIQGS 134

Query:   997 IGPDEGFFKIERGNNACGIEQIAGYATID 1025
                +EG + + RG+   G+  +A  A ++
Sbjct:   135 DWGEEGHYYLYRGSGDRGVNTMASSAVVN 163


>WB|WBGene00000785
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 143 (55.4 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
 Identities = 26/60 (43%), Positives = 38/60 (63%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKND 962
             GHAV ++G+G  +  PYWLV NSW     ++G+F+I RG N CGIE  A     D+ +++
Sbjct:   285 GHAVKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIPDLARHN 344

 Score = 141 (54.7 bits), Expect = 2.1e-11, Sum P(2) = 2.1e-11
 Identities = 25/49 (51%), Positives = 33/49 (67%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 220
             GHAV ++G+G  +  PYWLV NSW     ++G+F+I RG N CGIE  A
Sbjct:   285 GHAVKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSA 333

 Score = 140 (54.3 bits), Expect = 6.0e-12, Sum P(3) = 6.0e-12
 Identities = 25/49 (51%), Positives = 33/49 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 1019
             GHAV ++G+G  +  PYWLV NSW     ++G+F+I RG N CGIE  A
Sbjct:   285 GHAVKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSA 333

 Score = 136 (52.9 bits), Expect = 5.4e-11, Sum P(3) = 5.4e-11
 Identities = 24/49 (48%), Positives = 32/49 (65%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
             GHAV ++G+G  +  PYWL  NSW     ++G+F+I RG N CGIE  A
Sbjct:   285 GHAVKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSA 333

 Score = 98 (39.6 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
 Identities = 26/81 (32%), Positives = 36/81 (44%)

Query:   376 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 429
             +PD +D    W          DQ+ CGSCWAF+ A  +  +  I +   V    S   L+
Sbjct:    82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query:   430 ECAKQCSGCG-GCDGLEQPIE 449
              C      CG GC+G   PI+
Sbjct:   142 SCCTGMFSCGNGCEG-GYPIQ 161

 Score = 95 (38.5 bits), Expect = 5.5e-12, Sum P(3) = 5.5e-12
 Identities = 25/84 (29%), Positives = 37/84 (44%)

Query:     2 LMEVEKDGPVPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE 57
             ++  E    +PD +D    W          DQ+DCGSCWAF+ A  +  +  I +   V 
Sbjct:    73 IVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVN 132

Query:    58 --FSKSQLVECAK---QC-SGCDG 75
                S   L+ C      C +GC+G
Sbjct:   133 TLLSSEDLLSCCTGMFSCGNGCEG 156

 Score = 84 (34.6 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 23/75 (30%), Positives = 33/75 (44%)

Query:   742 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 795
             +PD +D    W          DQ+ CGSCWAF+ A  +  +  I +   V    S   L+
Sbjct:    82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query:   796 ECAK---QC-SGCDG 806
              C      C +GC+G
Sbjct:   142 SCCTGMFSCGNGCEG 156

 Score = 44 (20.5 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
 Identities = 21/82 (25%), Positives = 33/82 (40%)

Query:   739 DGPVP-DAWDWRKKN--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 795
             +G  P  AW W  K+  VTG  G       C  +SIA   E    +K     E ++    
Sbjct:   155 EGGYPIQAWKWWVKHGLVTG--GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPT-P 211

Query:   796 ECAKQCSGCDGCFFEPSIEYTH 817
             +C   C+  +  +  P ++  H
Sbjct:   212 KCVDSCTSKNN-YATPYLQDKH 232

 Score = 43 (20.2 bits), Expect = 2.8e-06, Sum P(2) = 2.8e-06
 Identities = 21/82 (25%), Positives = 33/82 (40%)

Query:     8 DGPVP-DAWDWRKKN--VTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 64
             +G  P  AW W  K+  VTG  G       C  +SIA   E    +K     E ++    
Sbjct:   155 EGGYPIQAWKWWVKHGLVTG--GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPT-P 211

Query:    65 ECAKQCSGCDGCFFEPSIEYTH 86
             +C   C+  +  +  P ++  H
Sbjct:   212 KCVDSCTSKNN-YATPYLQDKH 232

 Score = 42 (19.8 bits), Expect = 8.7e-12, Sum P(3) = 8.7e-12
 Identities = 19/67 (28%), Positives = 27/67 (40%)

Query:   373 DGPVP-DAWDWRKKN--VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +G  P  AW W  K+  VTG  G       C  +SIA   E    +K     E ++    
Sbjct:   155 EGGYPIQAWKWWVKHGLVTG--GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPT-P 211

Query:   430 ECAKQCS 436
             +C   C+
Sbjct:   212 KCVDSCT 218


>TAIR|locus:2204873
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 152 (58.6 bits), Expect = 3.0e-12, Sum P(2) = 3.0e-12
 Identities = 38/91 (41%), Positives = 50/91 (54%)

Query:   502 LYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLAR 558
             +YK GP+ V    +    H+ +G  + K   T +    GHAV L+G+G  DD   YWL  
Sbjct:   271 VYKNGPVEVAFTVYEDFAHYKSG--VYKYI-TGTKIG-GHAVKLIGWGTSDDGEDYWLLA 326

Query:   559 NSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             N W     D+G+FKI RG N CGIEQ  +AG
Sbjct:   327 NQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 357

 Score = 149 (57.5 bits), Expect = 6.5e-12, Sum P(2) = 6.5e-12
 Identities = 29/53 (54%), Positives = 35/53 (66%)

Query:   971 GHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHAV L+G+G  DD   YWL+ N W     D+G+FKI RG N CGIEQ  +AG
Sbjct:   305 GHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 357

 Score = 148 (57.2 bits), Expect = 8.4e-12, Sum P(2) = 8.4e-12
 Identities = 38/91 (41%), Positives = 50/91 (54%)

Query:   136 LYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVR 192
             +YK GP+ V      D  H  +G  + K   T +    GHAV L+G+G  D+   YWL+ 
Sbjct:   271 VYKNGPVEVAFTVYEDFAHYKSG--VYKYI-TGTKIG-GHAVKLIGWGTSDDGEDYWLLA 326

Query:   193 NSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             N W     D+G+FKI RG N CGIEQ  +AG
Sbjct:   327 NQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 357

 Score = 148 (57.2 bits), Expect = 8.4e-12, Sum P(2) = 8.4e-12
 Identities = 38/91 (41%), Positives = 50/91 (54%)

Query:   867 LYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVR 923
             +YK GP+ V      D  H  +G  + K   T +    GHAV L+G+G  D+   YWL+ 
Sbjct:   271 VYKNGPVEVAFTVYEDFAHYKSG--VYKYI-TGTKIG-GHAVKLIGWGTSDDGEDYWLLA 326

Query:   924 NSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             N W     D+G+FKI RG N CGIEQ  +AG
Sbjct:   327 NQWNRSWGDDGYFKIRRGTNECGIEQSVVAG 357

 Score = 93 (37.8 bits), Expect = 3.0e-12, Sum P(2) = 3.0e-12
 Identities = 23/74 (31%), Positives = 35/74 (47%)

Query:    32 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-CS-GCDGCFFEPSIEYTHQAG 89
             CGSCWAF     L  ++ IK    V  S + ++ C    C  GC+G F   +  Y    G
Sbjct:   148 CGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHG 207

Query:    90 LESEKDYPYKNANG 103
             + +++  PY +  G
Sbjct:   208 VVTQECDPYFDNTG 221

 Score = 93 (37.8 bits), Expect = 6.5e-12, Sum P(2) = 6.5e-12
 Identities = 23/74 (31%), Positives = 35/74 (47%)

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQ-CS-GCDGCFFEPSIEYTHQAG 820
             CGSCWAF     L  ++ IK    V  S + ++ C    C  GC+G F   +  Y    G
Sbjct:   148 CGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHG 207

Query:   821 LESEKDYPYKNANG 834
             + +++  PY +  G
Sbjct:   208 VVTQECDPYFDNTG 221

 Score = 87 (35.7 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 24/77 (31%), Positives = 36/77 (46%)

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCG-GCDGLEQPIE---YTH 452
             CGSCWAF     L  ++ IK    V  S + ++ C      CG GC+G   P+    Y  
Sbjct:   148 CGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLL--CGFGCNG-GFPMGAWLYFK 204

Query:   453 QAGLESEKDYPYRNGNG 469
               G+ +++  PY +  G
Sbjct:   205 YHGVVTQECDPYFDNTG 221


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 197 (74.4 bits), Expect = 3.2e-12, P = 3.2e-12
 Identities = 56/212 (26%), Positives = 87/212 (41%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSG 72
             DW+         +Q  CG C++F+    LE  Y IK       ++ S+   V C     G
Sbjct:   214 DWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSCVNY--G 271

Query:    73 CDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 132
             C G   +  ++     G+  E  YPYK   G            K +TG   +  N  E  
Sbjct:   272 CGGGNGQSCLDKLKSTGIMYETSYPYKAVTGSCPNVIQSPQPFK-WTGYSNIQGN-KEAF 329

Query:   133 KKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVR 192
                L K GP+   L  D       + I    ++ +P    HA+ +VGY   DN   +L++
Sbjct:   330 LNAL-KSGPIYASLYVDSGFQLYKSGIYSCSQSSTP---NHAITIVGYSSADNS--YLIK 383

Query:   193 NSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
             NSWG I  + G+ +++ G+  C +    G  T
Sbjct:   384 NSWGTIYGESGYIRLKEGS--CNLYSFTGITT 413

 Score = 196 (74.1 bits), Expect = 4.1e-12, P = 4.1e-12
 Identities = 56/212 (26%), Positives = 87/212 (41%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSG 803
             DW+         +Q  CG C++F+    LE  Y IK       ++ S+   V C     G
Sbjct:   214 DWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSCVNY--G 271

Query:   804 CDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETM 863
             C G   +  ++     G+  E  YPYK   G            K +TG   +  N  E  
Sbjct:   272 CGGGNGQSCLDKLKSTGIMYETSYPYKAVTGSCPNVIQSPQPFK-WTGYSNIQGN-KEAF 329

Query:   864 KKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVR 923
                L K GP+   L  D       + I    ++ +P    HA+ +VGY   DN   +L++
Sbjct:   330 LNAL-KSGPIYASLYVDSGFQLYKSGIYSCSQSSTP---NHAITIVGYSSADNS--YLIK 383

Query:   924 NSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
             NSWG I  + G+ +++ G+  C +    G  T
Sbjct:   384 NSWGTIYGESGYIRLKEGS--CNLYSFTGITT 413

 Score = 187 (70.9 bits), Expect = 4.1e-11, P = 4.1e-11
 Identities = 57/215 (26%), Positives = 91/215 (42%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSG 437
             DW+         +Q  CG C++F+    LE  Y IK       ++ S+   V C     G
Sbjct:   214 DWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSCVNY--G 271

Query:   438 CGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSET 497
             CGG +G +  ++     G+  E  YPY+   G            K +TG   +  N  E 
Sbjct:   272 CGGGNG-QSCLDKLKSTGIMYETSYPYKAVTGSCPNVIQSPQPFK-WTGYSNIQGN-KEA 328

Query:   498 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKND-ETCSPYDL-GHAVLLVGYGKQDDIPYW 555
                 L K GP+   L     +  +G  + K+   +CS      HA+ +VGY   D+   +
Sbjct:   329 FLNAL-KSGPIYASL-----YVDSGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNS--Y 380

Query:   556 LARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
             L +NSWG I  + G+ +++ G+  C +    G  T
Sbjct:   381 LIKNSWGTIYGESGYIRLKEGS--CNLYSFTGITT 413


>WB|WBGene00000784
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 139 (54.0 bits), Expect = 5.8e-12, Sum P(2) = 5.8e-12
 Identities = 23/46 (50%), Positives = 31/46 (67%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             GHA+ ++G+G  +  PYWLV NSW     + G+F+I RG N CGIE
Sbjct:   280 GHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325

 Score = 139 (54.0 bits), Expect = 5.8e-12, Sum P(2) = 5.8e-12
 Identities = 23/46 (50%), Positives = 31/46 (67%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
             GHA+ ++G+G  +  PYWLV NSW     + G+F+I RG N CGIE
Sbjct:   280 GHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325

 Score = 138 (53.6 bits), Expect = 7.5e-12, Sum P(2) = 7.5e-12
 Identities = 23/46 (50%), Positives = 31/46 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             GHA+ ++G+G  +  PYWLV NSW     + G+F+I RG N CGIE
Sbjct:   280 GHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325

 Score = 134 (52.2 bits), Expect = 2.1e-11, Sum P(2) = 2.1e-11
 Identities = 22/46 (47%), Positives = 30/46 (65%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             GHA+ ++G+G  +  PYWL  NSW     + G+F+I RG N CGIE
Sbjct:   280 GHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIE 325

 Score = 102 (41.0 bits), Expect = 5.8e-12, Sum P(2) = 5.8e-12
 Identities = 21/65 (32%), Positives = 34/65 (52%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECAKQCS-GCDGCFFEPSIEY 84
             DQ+DCGSCWAF+ A     ++ I +   V    S   ++ C   C  GC+G +   + +Y
Sbjct:   102 DQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKY 161

Query:    85 THQAG 89
               ++G
Sbjct:   162 LVKSG 166

 Score = 99 (39.9 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECAKQCSGCG-GCDG 443
             DQ+ CGSCWAF+ A     ++ I +   V    S   ++ C   CS CG GC+G
Sbjct:   102 DQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSNCGYGCEG 152

 Score = 94 (38.1 bits), Expect = 3.9e-11, Sum P(2) = 3.9e-11
 Identities = 20/65 (30%), Positives = 33/65 (50%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECAKQCS-GCDGCFFEPSIEY 815
             DQ+ CGSCWAF+ A     ++ I +   V    S   ++ C   C  GC+G +   + +Y
Sbjct:   102 DQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKY 161

Query:   816 THQAG 820
               ++G
Sbjct:   162 LVKSG 166


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 195 (73.7 bits), Expect = 6.5e-12, P = 6.5e-12
 Identities = 59/207 (28%), Positives = 90/207 (43%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG----KLVEFSKSQLVECAKQCS 71
             DW       P  DQ  CGSCWAF+ +  LE +Y IK G      ++ S    V C    S
Sbjct:   245 DWTSYQT--PIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCI--AS 300

Query:    72 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSE 130
             GC+G +      +    G+  EKD PYK   G    C    S  +  +T   +     + 
Sbjct:   301 GCNGGWSGNYFNFFKTPGIAYEKDDPYKAVTGTS--CITTSSVARFKYTNYGYTEKTKAA 358

Query:   131 TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYD-LGHAVLLVGYGKQDNIPYW 189
              + ++  K GP+++ +  D       + I  N  T   Y  + H VLLVGY +  +   +
Sbjct:   359 LLAEL--KKGPVTIAVYVDSAFQNYKSGIY-NSAT--KYTGINHLVLLVGYDQATDA--Y 411

Query:   190 LVRNSWGPIGPDEGFFKIERGNNACGI 216
              ++NSWG    + G+ +I   N+   I
Sbjct:   412 KIKNSWGSWWGESGYMRITASNDNLAI 438

 Score = 194 (73.4 bits), Expect = 8.4e-12, P = 8.4e-12
 Identities = 59/207 (28%), Positives = 90/207 (43%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG----KLVEFSKSQLVECAKQCS 802
             DW       P  DQ  CGSCWAF+ +  LE +Y IK G      ++ S    V C    S
Sbjct:   245 DWTSYQT--PIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCI--AS 300

Query:   803 GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSE 861
             GC+G +      +    G+  EKD PYK   G    C    S  +  +T   +     + 
Sbjct:   301 GCNGGWSGNYFNFFKTPGIAYEKDDPYKAVTGTS--CITTSSVARFKYTNYGYTEKTKAA 358

Query:   862 TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYD-LGHAVLLVGYGKQDNIPYW 920
              + ++  K GP+++ +  D       + I  N  T   Y  + H VLLVGY +  +   +
Sbjct:   359 LLAEL--KKGPVTIAVYVDSAFQNYKSGIY-NSAT--KYTGINHLVLLVGYDQATDA--Y 411

Query:   921 LVRNSWGPIGPDEGFFKIERGNNACGI 947
              ++NSWG    + G+ +I   N+   I
Sbjct:   412 KIKNSWGSWWGESGYMRITASNDNLAI 438

 Score = 188 (71.2 bits), Expect = 3.8e-11, P = 3.8e-11
 Identities = 63/210 (30%), Positives = 91/210 (43%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG----KLVEFSKSQLVECAKQCS 436
             DW       P  DQ  CGSCWAF+ +  LE +Y IK G      ++ S    V C    S
Sbjct:   245 DWTSYQT--PIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCI--AS 300

Query:   437 GC-GGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGS 495
             GC GG  G      +    G+  EKD PY+   G    C    S V  F   ++ Y   +
Sbjct:   301 GCNGGWSG--NYFNFFKTPGIAYEKDDPYKAVTGTS--CI-TTSSVARFKYTNYGYTEKT 355

Query:   496 ETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSPYD-LGHAVLLVGYGKQDDI 552
             +       K GP+++ +  +S   ++ +G     N  T   Y  + H VLLVGY +  D 
Sbjct:   356 KAALLAELKKGPVTIAVYVDSAFQNYKSGI---YNSAT--KYTGINHLVLLVGYDQATDA 410

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNACGI 582
               +  +NSWG    + G+ +I   N+   I
Sbjct:   411 --YKIKNSWGSWWGESGYMRITASNDNLAI 438


>DICTYBASE|DDB_G0292462
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 188 (71.2 bits), Expect = 1.3e-11, P = 1.3e-11
 Identities = 65/250 (26%), Positives = 110/250 (44%)

Query:   742 VPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLVEC 797
             +P ++D R    +   P  +Q +CGSCWA   +G+L  +  I++ K ++   S   L++C
Sbjct:    46 IPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDC 105

Query:   798 AKQC-----SGCD-GC---FFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDK----S 844
                C     SGC+ GC   F   ++      G+ S++   Y+ +         D     S
Sbjct:   106 DGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPIS 165

Query:   845 KVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI-HDYNGTPIRKNDETCSPYD 901
                ++       F   +  +  +   GP+  + +L SD   H ++      N +  S   
Sbjct:   166 NTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVES--- 222

Query:   902 LGHAVLLVGYGK-QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVK 960
               HAV +VG+G   D + YW+  NSWG    D+G+FKI RG++    E+  G+ T+    
Sbjct:   223 --HAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE--GFITVTADT 278

Query:   961 NDETCSPYDL 970
                  S Y L
Sbjct:   279 ASVPTSQYGL 288

 Score = 185 (70.2 bits), Expect = 3.0e-11, P = 3.0e-11
 Identities = 62/236 (26%), Positives = 106/236 (44%)

Query:    11 VPDAWDWRKK--NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLVEC 66
             +P ++D R    +   P  +Q  CGSCWA   +G+L  +  I++ K ++   S   L++C
Sbjct:    46 IPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDC 105

Query:    67 AKQC-----SGCD-GC---FFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDK----S 113
                C     SGC+ GC   F   ++      G+ S++   Y+ +         D     S
Sbjct:   106 DGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPIS 165

Query:   114 KVKLFTGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI-HDYNGTPIRKNDETCSPYD 170
                ++       F   +  +  +   GP+  + +L SD   H ++      N +  S   
Sbjct:   166 NTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYIKSSNTQVES--- 222

Query:   171 LGHAVLLVGYGK-QDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 225
               HAV +VG+G   D + YW+  NSWG    D+G+FKI RG++    E+  G+ T+
Sbjct:   223 --HAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE--GFITV 274

 Score = 182 (69.1 bits), Expect = 6.6e-11, P = 6.6e-11
 Identities = 61/236 (25%), Positives = 106/236 (44%)

Query:   376 VPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEF--SKSQLVEC 431
             +P ++D R    +   P  +Q +CGSCWA   +G+L  +  I++ K ++   S   L++C
Sbjct:    46 IPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDC 105

Query:   432 AKQC-----SGCG-GCDGLEQPIEYTH--QAGLESEKDYPYRNGNGEKFKCAYDK----S 479
                C     SGC  GC G    +  T     G+ S++   Y+           D     S
Sbjct:   106 DGSCVSDGVSGCNNGCKGGFVGLALTRLINEGIVSDECLSYQASKDSSCPTTCDDGSPIS 165

Query:   480 KVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKND---ETCSPYD 536
                ++       F   +  +  +   GP+   + + ++  Y+     K D   ++ +   
Sbjct:   166 NTTIYKATSCRAFPTVQDAQYEIMTNGPV---IATFML--YSDFKPHKWDVYIKSSNTQV 220

Query:   537 LGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 591
               HAV +VG+G   D + YW+A NSWG    D+G+FKI RG++    E+  G+ T+
Sbjct:   221 ESHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE--GFITV 274

 Score = 122 (48.0 bits), Expect = 0.00034, P = 0.00034
 Identities = 23/54 (42%), Positives = 34/54 (62%)

Query:   972 HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 1024
             HAV +VG+G   D + YW+  NSWG    D+G+FKI RG++    E+  G+ T+
Sbjct:   223 HAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEE--GFITV 274


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 185 (70.2 bits), Expect = 2.2e-11, P = 2.2e-11
 Identities = 62/220 (28%), Positives = 95/220 (43%)

Query:   376 VPDAWDWRKK---NVTGPAGDQAA---CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR     N      +Q     CGSCWA +    +  +  IK  K    S    V
Sbjct:    62 LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKR-KGAWPSTLLSV 120

Query:   430 ECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKFK----CAYDKS-- 479
             +    C   G C+G  +  + +Y HQ G+  E   +Y  ++   +KF     C   K   
Sbjct:   121 QNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH 180

Query:   480 KVKLFT----GKDFLYFNGSETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCS 533
              ++ +T    G D+   +G E M   +Y  GP+S G+     L ++  G      D T  
Sbjct:   181 AIRNYTLWRVG-DYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTT-- 237

Query:   534 PYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
              Y + H V + G+G  D   YW+ RNSWG    + G+ +I
Sbjct:   238 -Y-INHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRI 275

 Score = 169 (64.5 bits), Expect = 1.5e-09, P = 1.5e-09
 Identities = 54/192 (28%), Positives = 88/192 (45%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA +    +  +  IK  G       S   +++C    S C+G       +Y HQ 
Sbjct:    89 CGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLSVWDYAHQH 147

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT----GKDFLHFNGSETMKKIL 136
             G+  E   +Y  K+   +KF     C   K    ++ +T    G D+   +G E M   +
Sbjct:   148 GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVG-DYGSLSGREKMMAEI 206

Query:   137 YKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 195
             Y  GP+S  ++ ++ + +Y G    +  +T   Y + H V + G+G  D   YW+VRNSW
Sbjct:   207 YANGPISCGIMATERLANYTGGIYAEYQDTT--Y-INHVVSVAGWGISDGTEYWIVRNSW 263

Query:   196 GPIGPDEGFFKI 207
             G    + G+ +I
Sbjct:   264 GEPWGERGWLRI 275

 Score = 169 (64.5 bits), Expect = 1.5e-09, P = 1.5e-09
 Identities = 54/192 (28%), Positives = 88/192 (45%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA +    +  +  IK  G       S   +++C    S C+G       +Y HQ 
Sbjct:    89 CGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLSVWDYAHQH 147

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS--KVKLFT----GKDFLHFNGSETMKKIL 867
             G+  E   +Y  K+   +KF     C   K    ++ +T    G D+   +G E M   +
Sbjct:   148 GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVG-DYGSLSGREKMMAEI 206

Query:   868 YKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSW 926
             Y  GP+S  ++ ++ + +Y G    +  +T   Y + H V + G+G  D   YW+VRNSW
Sbjct:   207 YANGPISCGIMATERLANYTGGIYAEYQDTT--Y-INHVVSVAGWGISDGTEYWIVRNSW 263

Query:   927 GPIGPDEGFFKI 938
             G    + G+ +I
Sbjct:   264 GEPWGERGWLRI 275

 Score = 103 (41.3 bits), Expect = 4.5e-07, Sum P(2) = 4.5e-07
 Identities = 30/96 (31%), Positives = 47/96 (48%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYATIDVVKNDETCSPYDL 970
             N   W V   +G + G ++   +I   G  +CGI   E++A Y      +  +T   Y +
Sbjct:   184 NYTLWRV-GDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTT--Y-I 239

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 1006
              H V + G+G  D   YW+VRNSWG    + G+ +I
Sbjct:   240 NHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRI 275

 Score = 92 (37.4 bits), Expect = 4.5e-07, Sum P(2) = 4.5e-07
 Identities = 31/110 (28%), Positives = 47/110 (42%)

Query:   742 VPDAWDWRKK---NVTGPAGDQAA---CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P +WDWR     N      +Q     CGSCWA +    +  +  IK  G       S  
Sbjct:    62 LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 121

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C    S C+G       +Y HQ G+  E   +Y  K+   +KF +C
Sbjct:   122 NVIDCGNAGS-CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQC 170

 Score = 91 (37.1 bits), Expect = 5.7e-07, Sum P(2) = 5.7e-07
 Identities = 30/110 (27%), Positives = 47/110 (42%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P +WDWR  +    A    +      CGSCWA +    +  +  IK  G       S  
Sbjct:    62 LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 121

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C    S C+G       +Y HQ G+  E   +Y  K+   +KF +C
Sbjct:   122 NVIDCGNAGS-CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQC 170


>WB|WBGene00000782
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 136 (52.9 bits), Expect = 2.3e-11, Sum P(2) = 2.3e-11
 Identities = 27/52 (51%), Positives = 34/52 (65%)

Query:   538 GHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 587
             GHAV L+G+G +   PYWLA NSWG    + G F+I RG + CGIE   +AG
Sbjct:   272 GHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323

 Score = 133 (51.9 bits), Expect = 5.0e-11, Sum P(2) = 5.0e-11
 Identities = 26/52 (50%), Positives = 33/52 (63%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 221
             GHAV L+G+G +   PYWL  NSWG    + G F+I RG + CGIE   +AG
Sbjct:   272 GHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323

 Score = 133 (51.9 bits), Expect = 5.0e-11, Sum P(2) = 5.0e-11
 Identities = 26/52 (50%), Positives = 33/52 (63%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 952
             GHAV L+G+G +   PYWL  NSWG    + G F+I RG + CGIE   +AG
Sbjct:   272 GHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323

 Score = 132 (51.5 bits), Expect = 6.6e-11, Sum P(2) = 6.6e-11
 Identities = 26/52 (50%), Positives = 33/52 (63%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ--IAG 1020
             GHAV L+G+G +   PYWL  NSWG    + G F+I RG + CGIE   +AG
Sbjct:   272 GHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323

 Score = 99 (39.9 bits), Expect = 2.3e-11, Sum P(2) = 2.3e-11
 Identities = 23/73 (31%), Positives = 39/73 (53%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECA-KQCS-GCDGCFFEPSIE 83
             +Q++CGSCWAFS A ++  +  I +    +   S + L+ C    C  GCDG F   + +
Sbjct:   104 EQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCDGGFPYRAFQ 163

Query:    84 YTHQAGLESEKDY 96
             +  + G+ +  DY
Sbjct:   164 WWARRGVVTGGDY 176

 Score = 96 (38.9 bits), Expect = 1.0e-10, Sum P(2) = 1.0e-10
 Identities = 23/73 (31%), Positives = 38/73 (52%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECA-KQCS-GCDGCFFEPSIE 814
             +Q+ CGSCWAFS A ++  +  I +    +   S + L+ C    C  GCDG F   + +
Sbjct:   104 EQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCDGGFPYRAFQ 163

Query:   815 YTHQAGLESEKDY 827
             +  + G+ +  DY
Sbjct:   164 WWARRGVVTGGDY 176

 Score = 88 (36.0 bits), Expect = 3.2e-10, Sum P(2) = 3.2e-10
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLVECAKQCSGCG-GCDG 443
             +Q+ CGSCWAFS A ++  +  I +    +   S + L+ C      CG GCDG
Sbjct:   104 EQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGM--SCGEGCDG 155


>WB|WBGene00010204
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 140 (54.3 bits), Expect = 2.3e-11, Sum P(2) = 2.3e-11
 Identities = 31/91 (34%), Positives = 47/91 (51%)

Query:   495 SETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
             +E  K+I+  +GP+ V    +    H+  G  +     +      GHAV ++G+G  +  
Sbjct:   256 AEIQKEIM-THGPVEVAFTVYEDFEHYSGGVYVHTAGASLG----GHAVKMLGWGVDNGT 310

Query:   553 PYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             PYWL  NSW     + G+F+I RG N CGIE
Sbjct:   311 PYWLCANSWNEDWGENGYFRIIRGVNECGIE 341

 Score = 139 (54.0 bits), Expect = 3.0e-11, Sum P(2) = 3.0e-11
 Identities = 32/91 (35%), Positives = 46/91 (50%)

Query:   860 SETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 917
             +E  K+I+  +GP+ V      D  H   G  +     +      GHAV ++G+G  +  
Sbjct:   256 AEIQKEIM-THGPVEVAFTVYEDFEHYSGGVYVHTAGASLG----GHAVKMLGWGVDNGT 310

Query:   918 PYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
             PYWL  NSW     + G+F+I RG N CGIE
Sbjct:   311 PYWLCANSWNEDWGENGYFRIIRGVNECGIE 341

 Score = 139 (54.0 bits), Expect = 6.1e-11, Sum P(2) = 6.1e-11
 Identities = 32/91 (35%), Positives = 46/91 (50%)

Query:   129 SETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
             +E  K+I+  +GP+ V      D  H   G  +     +      GHAV ++G+G  +  
Sbjct:   256 AEIQKEIM-THGPVEVAFTVYEDFEHYSGGVYVHTAGASLG----GHAVKMLGWGVDNGT 310

Query:   187 PYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             PYWL  NSW     + G+F+I RG N CGIE
Sbjct:   311 PYWLCANSWNEDWGENGYFRIIRGVNECGIE 341

 Score = 131 (51.2 bits), Expect = 2.3e-10, Sum P(2) = 2.3e-10
 Identities = 23/46 (50%), Positives = 30/46 (65%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             GHAV ++G+G  +  PYWL  NSW     + G+F+I RG N CGIE
Sbjct:   296 GHAVKMLGWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNECGIE 341

 Score = 96 (38.9 bits), Expect = 2.3e-11, Sum P(2) = 2.3e-11
 Identities = 30/84 (35%), Positives = 40/84 (47%)

Query:   373 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 428
             D  VPD++D R      P+     DQ++CGSCWA S A  +  +  I +      S S  
Sbjct:    94 DAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSIS-- 151

Query:   429 VECAKQCSG--CG-GCDGLEQPIE 449
              +    C G  CG GC+G   PIE
Sbjct:   152 ADDINACCGMVCGNGCNG-GYPIE 174

 Score = 93 (37.8 bits), Expect = 4.7e-11, Sum P(2) = 4.7e-11
 Identities = 27/80 (33%), Positives = 39/80 (48%)

Query:     4 EVEKDGPVPDAWDWRKKNVTGPA----GDQADCGSCWAFSIAGMLEGQYAIKTGK--LVE 57
             EVE D  VPD++D R      P+     DQ+ CGSCWA S A  +  +  I +    ++ 
Sbjct:    91 EVE-DAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILS 149

Query:    58 FSKSQLVECAKQ-C-SGCDG 75
              S   +  C    C +GC+G
Sbjct:   150 ISADDINACCGMVCGNGCNG 169

 Score = 89 (36.4 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 24/76 (31%), Positives = 37/76 (48%)

Query:   739 DGPVPDAWDWRKKNVTGPA----GDQAACGSCWAFSIAGMLEGQYAIKTGK--LVEFSKS 792
             D  VPD++D R      P+     DQ++CGSCWA S A  +  +  I +    ++  S  
Sbjct:    94 DAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISAD 153

Query:   793 QLVECAKQ-C-SGCDG 806
              +  C    C +GC+G
Sbjct:   154 DINACCGMVCGNGCNG 169


>DICTYBASE|DDB_G0283401
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 183 (69.5 bits), Expect = 3.2e-11, P = 3.2e-11
 Identities = 66/242 (27%), Positives = 104/242 (42%)

Query:   376 VPDAWDWRKKN----VTGPAGDQAA--CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             VP +WDWR  +    +T          CG CWAF+    +  +  IK  +   F    + 
Sbjct:    58 VPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDR--IKIQRKAAFPDVNVA 115

Query:   430 -ECAKQCSGCGGCDGLE--QPIEYTHQAGLESEKDYPYRNGN-----GEKFK-CAYDKS- 479
              +    C+G G CDG +      + ++ G+  E   PY+  N         K C  D + 
Sbjct:   116 PQHLIDCNGGGTCDGGDPGDAFAFINENGIVDETCKPYQAKNLPDECSPACKTCNPDGTC 175

Query:   480 -KVKLFTG---KDFLYFNGSETMKKILYKYGPL--SVGLNSHLIHFYNGTPIRKNDETCS 533
               + + T     ++    G++ M   +Y  GP+  S+   S L  + +G  I K  +   
Sbjct:   176 QAIPVHTNITVTEYGSVRGAKDMMAEIYARGPIACSIDATSKLEAYTSG--IFKEFKL-D 232

Query:   534 PYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGN--NACGIEQIAGYATI 591
             P    H + ++G+G QD  PYW+ RNSWG    + GFF I +G+     GIE    +A  
Sbjct:   233 PLP-NHIISVIGWGVQDSTPYWIVRNSWGSYYGEGGFFNIVQGSLFENLGIELDCNWAVP 291

Query:   592 DV 593
              V
Sbjct:   292 SV 293

 Score = 163 (62.4 bits), Expect = 6.9e-09, P = 6.9e-09
 Identities = 56/212 (26%), Positives = 94/212 (44%)

Query:    32 CGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CG CWAF+    +  +  I+       V  +   L++C      CDG     +  + ++ 
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDC-NGGGTCDGGDPGDAFAFINEN 143

Query:    89 GLESEKDYPY--KNANGE---KFK-CAYDKS--KVKLFTG---KDFLHFNGSETMKKILY 137
             G+  E   PY  KN   E     K C  D +   + + T     ++    G++ M   +Y
Sbjct:   144 GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSVRGAKDMMAEIY 203

Query:   138 KYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGP 197
               GP++  +++    +   + I K  +   P    H + ++G+G QD+ PYW+VRNSWG 
Sbjct:   204 ARGPIACSIDATSKLEAYTSGIFKEFKL-DPLP-NHIISVIGWGVQDSTPYWIVRNSWGS 261

Query:   198 IGPDEGFFKIERGN--NACGIEQIAGYATIDV 227
                + GFF I +G+     GIE    +A   V
Sbjct:   262 YYGEGGFFNIVQGSLFENLGIELDCNWAVPSV 293

 Score = 163 (62.4 bits), Expect = 6.9e-09, P = 6.9e-09
 Identities = 56/212 (26%), Positives = 94/212 (44%)

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CG CWAF+    +  +  I+       V  +   L++C      CDG     +  + ++ 
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDC-NGGGTCDGGDPGDAFAFINEN 143

Query:   820 GLESEKDYPY--KNANGE---KFK-CAYDKS--KVKLFTG---KDFLHFNGSETMKKILY 868
             G+  E   PY  KN   E     K C  D +   + + T     ++    G++ M   +Y
Sbjct:   144 GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSVRGAKDMMAEIY 203

Query:   869 KYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGP 928
               GP++  +++    +   + I K  +   P    H + ++G+G QD+ PYW+VRNSWG 
Sbjct:   204 ARGPIACSIDATSKLEAYTSGIFKEFKL-DPLP-NHIISVIGWGVQDSTPYWIVRNSWGS 261

Query:   929 IGPDEGFFKIERGN--NACGIEQIAGYATIDV 958
                + GFF I +G+     GIE    +A   V
Sbjct:   262 YYGEGGFFNIVQGSLFENLGIELDCNWAVPSV 293

 Score = 126 (49.4 bits), Expect = 3.5e-09, Sum P(2) = 3.5e-09
 Identities = 33/92 (35%), Positives = 47/92 (51%)

Query:   940 RGNNACGIE---QIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 996
             RG  AC I+   ++  Y T  + K  +   P    H + ++G+G QD  PYW+VRNSWG 
Sbjct:   205 RGPIACSIDATSKLEAY-TSGIFKEFKL-DPLP-NHIISVIGWGVQDSTPYWIVRNSWGS 261

Query:   997 IGPDEGFFKIERGN--NACGIEQIAGYATIDV 1026
                + GFF I +G+     GIE    +A   V
Sbjct:   262 YYGEGGFFNIVQGSLFENLGIELDCNWAVPSV 293

 Score = 87 (35.7 bits), Expect = 3.5e-09, Sum P(2) = 3.5e-09
 Identities = 28/103 (27%), Positives = 44/103 (42%)

Query:   742 VPDAWDWRKKNVTGPA-----GDQAA---CGSCWAFSIAGMLEGQYAIKTGKL---VEFS 790
             VP +WDWR  NV+G        +Q     CG CWAF+    +  +  I+       V  +
Sbjct:    58 VPQSWDWR--NVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVA 115

Query:   791 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN 833
                L++C      CDG     +  + ++ G+  E   PY+  N
Sbjct:   116 PQHLIDC-NGGGTCDGGDPGDAFAFINENGIVDETCKPYQAKN 157

 Score = 86 (35.3 bits), Expect = 4.5e-09, Sum P(2) = 4.5e-09
 Identities = 28/103 (27%), Positives = 44/103 (42%)

Query:    11 VPDAWDWRKKNVTGPA-----GDQ---ADCGSCWAFSIAGMLEGQYAIKTGKL---VEFS 59
             VP +WDWR  NV+G        +Q     CG CWAF+    +  +  I+       V  +
Sbjct:    58 VPQSWDWR--NVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVA 115

Query:    60 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNAN 102
                L++C      CDG     +  + ++ G+  E   PY+  N
Sbjct:   116 PQHLIDC-NGGGTCDGGDPGDAFAFINENGIVDETCKPYQAKN 157


>WB|WBGene00016306
            symbol:C32B5.13 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 EMBL:FO080745
            PIR:T25581 RefSeq:NP_493866.1 UniGene:Cel.15740 HSSP:P00785
            ProteinModelPortal:P91110 SMR:P91110 EnsemblMetazoa:C32B5.13
            GeneID:183116 KEGG:cel:CELE_C32B5.13 UCSC:C32B5.13 CTD:183116
            WormBase:C32B5.13 eggNOG:KOG1543 HOGENOM:HOG000115376
            InParanoid:P91110 NextBio:919978 Uniprot:P91110
        Length = 150

 Score = 166 (63.5 bits), Expect = 4.4e-11, P = 4.4e-11
 Identities = 45/152 (29%), Positives = 71/152 (46%)

Query:    48 YAIKTGKLV-EFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF 106
             YA    + V  FS+ Q+++C    S C       S E+  + G+ +E DYPY     EK 
Sbjct:     2 YAKANNRTVLSFSEQQIIDCGNFTSPCQENIL--SHEFIKKNGVVTEADYPYVGKENEK- 58

Query:   107 KCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLNSD-LIHDYNGTPIRKNDE 164
              C YD++K+KL+     L  N  ET+ K+  K +GP    + +     +Y         E
Sbjct:    59 -CKYDENKIKLWPTNMLLVGNLPETLLKLFIKEHGPGYFRMKAPPSFFNYKTGIYSPTQE 117

Query:   165 TCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 196
              C       ++ +VGYG +    YW+V+ S+G
Sbjct:   118 ECGKATDARSLTIVGYGIEGGQNYWIVKGSFG 149

 Score = 166 (63.5 bits), Expect = 4.4e-11, P = 4.4e-11
 Identities = 45/152 (29%), Positives = 71/152 (46%)

Query:   779 YAIKTGKLV-EFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF 837
             YA    + V  FS+ Q+++C    S C       S E+  + G+ +E DYPY     EK 
Sbjct:     2 YAKANNRTVLSFSEQQIIDCGNFTSPCQENIL--SHEFIKKNGVVTEADYPYVGKENEK- 58

Query:   838 KCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLNSD-LIHDYNGTPIRKNDE 895
              C YD++K+KL+     L  N  ET+ K+  K +GP    + +     +Y         E
Sbjct:    59 -CKYDENKIKLWPTNMLLVGNLPETLLKLFIKEHGPGYFRMKAPPSFFNYKTGIYSPTQE 117

Query:   896 TCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
              C       ++ +VGYG +    YW+V+ S+G
Sbjct:   118 ECGKATDARSLTIVGYGIEGGQNYWIVKGSFG 149

 Score = 156 (60.0 bits), Expect = 5.1e-10, P = 5.1e-10
 Identities = 46/153 (30%), Positives = 72/153 (47%)

Query:   413 YAIKTGKLV-EFSKSQLVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEK 471
             YA    + V  FS+ Q+++C    S C   + L    E+  + G+ +E DYPY     EK
Sbjct:     2 YAKANNRTVLSFSEQQIIDCGNFTSPCQE-NILSH--EFIKKNGVVTEADYPYVGKENEK 58

Query:   472 FKCAYDKSKVKLFTGKDFLYFNGSETMKKILYK-YGPLSVGLNSHLIHFYNGTPIRK-ND 529
               C YD++K+KL+     L  N  ET+ K+  K +GP    + +    F   T I     
Sbjct:    59 --CKYDENKIKLWPTNMLLVGNLPETLLKLFIKEHGPGYFRMKAPPSFFNYKTGIYSPTQ 116

Query:   530 ETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWG 562
             E C       ++ +VGYG +    YW+ + S+G
Sbjct:   117 EECGKATDARSLTIVGYGIEGGQNYWIVKGSFG 149


>WB|WBGene00021072
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 152 (58.6 bits), Expect = 4.7e-11, Sum P(2) = 4.7e-11
 Identities = 49/178 (27%), Positives = 83/178 (46%)

Query:   412 QYAIKTGKLVEFSKSQLVECAK-QCSGCGGC-DGLEQPIEYTHQAGLESEKDYPYRNGNG 469
             +Y +K G +   S      C     + CG   DG+  P E   +   ++ K   +  GN 
Sbjct:   155 RYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWP-ECPMKIS-DTPKCEHHCTGNN 212

Query:   470 EKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKND 529
               +   YD+ K   F    +     ++ ++  +  +GP+ VG   +   +   T I  + 
Sbjct:   213 S-YPIPYDQDKH--FGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTH- 268

Query:   530 ETCSPYDLG-HAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
                +  +LG HAV ++G+G  +  PYWLA NSW  +  ++G+F+I RG + CGIE  A
Sbjct:   269 --VAGGELGGHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAA 324

 Score = 138 (53.6 bits), Expect = 2.7e-10, Sum P(2) = 2.7e-10
 Identities = 23/49 (46%), Positives = 33/49 (67%)

Query:   903 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 951
             GHAV ++G+G  +  PYWL  NSW  +  ++G+F+I RG + CGIE  A
Sbjct:   276 GHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAA 324

 Score = 138 (53.6 bits), Expect = 1.8e-09, Sum P(2) = 1.8e-09
 Identities = 23/49 (46%), Positives = 33/49 (67%)

Query:   172 GHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 220
             GHAV ++G+G  +  PYWL  NSW  +  ++G+F+I RG + CGIE  A
Sbjct:   276 GHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAA 324

 Score = 137 (53.3 bits), Expect = 3.5e-10, Sum P(2) = 3.5e-10
 Identities = 23/49 (46%), Positives = 33/49 (67%)

Query:   971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIA 1019
             GHAV ++G+G  +  PYWL  NSW  +  ++G+F+I RG + CGIE  A
Sbjct:   276 GHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAA 324

 Score = 87 (35.7 bits), Expect = 2.7e-10, Sum P(2) = 2.7e-10
 Identities = 24/81 (29%), Positives = 37/81 (45%)

Query:   376 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 429
             +PD++D    W +        DQ+ CGSCWA + A  +  +  I +   V    S   ++
Sbjct:    73 IPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDIL 132

Query:   430 ECAKQCSGCG-GCDGLEQPIE 449
              C      CG GC+G   PI+
Sbjct:   133 TCCTGKFNCGDGCEG-GYPIQ 152

 Score = 79 (32.9 bits), Expect = 4.7e-11, Sum P(2) = 4.7e-11
 Identities = 21/73 (28%), Positives = 32/73 (43%)

Query:    11 VPDAWD----WRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 64
             +PD++D    W +        DQ+ CGSCWA + A  +  +  I +   V    S   ++
Sbjct:    73 IPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDIL 132

Query:    65 ECAKQCSGC-DGC 76
              C      C DGC
Sbjct:   133 TCCTGKFNCGDGC 145

 Score = 78 (32.5 bits), Expect = 2.3e-09, Sum P(2) = 2.3e-09
 Identities = 21/73 (28%), Positives = 32/73 (43%)

Query:   742 VPDAWD----WRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE--FSKSQLV 795
             +PD++D    W +        DQ+ CGSCWA + A  +  +  I +   V    S   ++
Sbjct:    73 IPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDIL 132

Query:   796 ECAKQCSGC-DGC 807
              C      C DGC
Sbjct:   133 TCCTGKFNCGDGC 145


>UNIPROTKB|E1C4M3
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 181 (68.8 bits), Expect = 6.5e-11, P = 6.5e-11
 Identities = 62/218 (28%), Positives = 96/218 (44%)

Query:   376 VPDAWDWRKKNVTGPAG---DQAA---CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR  N    A    +Q     CGSCWA      L  +  IK  K    S    V
Sbjct:    63 LPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIKR-KGAWPSAYLSV 121

Query:   430 ECAKQCSGCGGCDGLEQP--IEYTHQAGLESE--KDYPYRNGNGEKF-KCA----YDKSK 480
             +    C+  G C+G +      Y H  G+  E   +Y  +N   +KF +C     + +  
Sbjct:   122 QNVIDCANAGSCEGGDHTGVWMYAHDHGIPDETCNNYQAKNQKCKKFNQCGTCVTFGECH 181

Query:   481 V-KLFT---GKDFLYFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPY 535
             V K +T     D+   +G E M   +Y  GP+S G+ +   +  Y G    + +   SP 
Sbjct:   182 VIKNYTLWKVADYGAVSGREKMMAEIYANGPISCGIMATEKLDAYTGGLYTEYNP--SP- 238

Query:   536 DLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
              + H V + G+G ++   YW+ RNSWG    + G+ +I
Sbjct:   239 TVNHIVSVAGWGVENGTEYWIVRNSWGEPWGERGWLRI 276

 Score = 164 (62.8 bits), Expect = 5.9e-09, P = 5.9e-09
 Identities = 55/191 (28%), Positives = 87/191 (45%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      L  +  IK  G       S   +++CA   S C+G        Y H  
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDCANAGS-CEGGDHTGVWMYAHDH 148

Query:    89 GLESE--KDYPYKNANGEKF-KCA----YDKSKV-KLFT---GKDFLHFNGSETMKKILY 137
             G+  E   +Y  KN   +KF +C     + +  V K +T     D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAVSGREKMMAEIY 208

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 196
               GP+S  ++ ++ +  Y G    + +   SP  + H V + G+G ++   YW+VRNSWG
Sbjct:   209 ANGPISCGIMATEKLDAYTGGLYTEYNP--SP-TVNHIVSVAGWGVENGTEYWIVRNSWG 265

Query:   197 PIGPDEGFFKI 207
                 + G+ +I
Sbjct:   266 EPWGERGWLRI 276

 Score = 164 (62.8 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 55/191 (28%), Positives = 87/191 (45%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      L  +  IK  G       S   +++CA   S C+G        Y H  
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDCANAGS-CEGGDHTGVWMYAHDH 148

Query:   820 GLESE--KDYPYKNANGEKF-KCA----YDKSKV-KLFT---GKDFLHFNGSETMKKILY 868
             G+  E   +Y  KN   +KF +C     + +  V K +T     D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAVSGREKMMAEIY 208

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
               GP+S  ++ ++ +  Y G    + +   SP  + H V + G+G ++   YW+VRNSWG
Sbjct:   209 ANGPISCGIMATEKLDAYTGGLYTEYNP--SP-TVNHIVSVAGWGVENGTEYWIVRNSWG 265

Query:   928 PIGPDEGFFKI 938
                 + G+ +I
Sbjct:   266 EPWGERGWLRI 276

 Score = 39 (18.8 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 13/56 (23%), Positives = 21/56 (37%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             EY   A L    D+  +N NG  +        +  + G  + H + S    +I  K
Sbjct:    56 EYLDMAELPQSWDW--RNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIK 109


>DICTYBASE|DDB_G0280187
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 158 (60.7 bits), Expect = 2.4e-08, P = 2.4e-08
 Identities = 48/179 (26%), Positives = 76/179 (42%)

Query:   428 LVECAKQCSGCGGCDGLEQPIEYTHQAGLESEKDYPYR------NGNGEKFKCAYDKSK- 480
             L+ CA   + C G D  E    Y    G+  E   PY       N  G    C +D S  
Sbjct:   110 LLNCAGPDNTCDGGDPTEA-YAYMAAKGITDETCAPYEAIDNECNAEGICKNCNFDLSNP 168

Query:   481 -VKLFTGKDFL-YF-------NGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDE 530
                 F    +  YF       NGS  M + ++  GP++ G+  +     Y       +  
Sbjct:   169 TADCFAQPTYTTYFVEEHGQVNGSVAMMQEIFARGPIACGMEVTDAFESYTSGVFTSS-- 226

Query:   531 TCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 589
               S  ++ H + ++G+G ++ + YW+ RNSWG    + GFF+I+RG +   IE    +A
Sbjct:   227 VGSTGEINHEISIIGWGTENGVDYWIGRNSWGTYFGELGFFRIQRGIDLLSIESACDWA 285

 Score = 157 (60.3 bits), Expect = 3.1e-08, P = 3.1e-08
 Identities = 50/187 (26%), Positives = 81/187 (43%)

Query:   794 LVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYK------NANGEKFKCAYDKSK-- 845
             L+ CA   + CDG     +  Y    G+  E   PY+      NA G    C +D S   
Sbjct:   110 LLNCAGPDNTCDGGDPTEAYAYMAAKGITDETCAPYEAIDNECNAEGICKNCNFDLSNPT 169

Query:   846 VKLFTGKDFLHF--------NGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDET 896
                F    +  +        NGS  M + ++  GP++  +  +D    Y       +   
Sbjct:   170 ADCFAQPTYTTYFVEEHGQVNGSVAMMQEIFARGPIACGMEVTDAFESYTSGVFTSS--V 227

Query:   897 CSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATI 956
              S  ++ H + ++G+G ++ + YW+ RNSWG    + GFF+I+RG +   IE    +A  
Sbjct:   228 GSTGEINHEISIIGWGTENGVDYWIGRNSWGTYFGELGFFRIQRGIDLLSIESACDWA-- 285

Query:   957 DVVKNDE 963
              V KN E
Sbjct:   286 -VPKNLE 291

 Score = 156 (60.0 bits), Expect = 4.1e-08, P = 4.1e-08
 Identities = 46/178 (25%), Positives = 77/178 (43%)

Query:    63 LVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYK------NANGEKFKCAYDKSK-- 114
             L+ CA   + CDG     +  Y    G+  E   PY+      NA G    C +D S   
Sbjct:   110 LLNCAGPDNTCDGGDPTEAYAYMAAKGITDETCAPYEAIDNECNAEGICKNCNFDLSNPT 169

Query:   115 VKLFTGKDFLHF--------NGSETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDET 165
                F    +  +        NGS  M + ++  GP++  +  +D    Y       +   
Sbjct:   170 ADCFAQPTYTTYFVEEHGQVNGSVAMMQEIFARGPIACGMEVTDAFESYTSGVFTSS--V 227

Query:   166 CSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 223
              S  ++ H + ++G+G ++ + YW+ RNSWG    + GFF+I+RG +   IE    +A
Sbjct:   228 GSTGEINHEISIIGWGTENGVDYWIGRNSWGTYFGELGFFRIQRGIDLLSIESACDWA 285

 Score = 135 (52.6 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 27/84 (32%), Positives = 45/84 (53%)

Query:   940 RGNNACGIEQIAGYATIDV-VKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIG 998
             RG  ACG+E    + +    V      S  ++ H + ++G+G ++ + YW+ RNSWG   
Sbjct:   202 RGPIACGMEVTDAFESYTSGVFTSSVGSTGEINHEISIIGWGTENGVDYWIGRNSWGTYF 261

Query:   999 PDEGFFKIERGNNACGIEQIAGYA 1022
              + GFF+I+RG +   IE    +A
Sbjct:   262 GELGFFRIQRGIDLLSIESACDWA 285

 Score = 91 (37.1 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 37/128 (28%), Positives = 57/128 (44%)

Query:     5 VEKDGPVPDAWDWRKKNVTGPA-----GDQ---ADCGSCWAFSIAGMLEGQYAI-KTGKL 55
             +++D  +P  +DWR  N++G +      +Q     CGSCWA      L  +  I + G  
Sbjct:    44 IDED-TLPTQYDWR--NISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTF 100

Query:    56 VE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA--GLESEKDYPYK------NANGEK 105
              E   +   L+ CA   + CDG   +P+  Y + A  G+  E   PY+      NA G  
Sbjct:   101 PEVVLAPQVLLNCAGPDNTCDGG--DPTEAYAYMAAKGITDETCAPYEAIDNECNAEGIC 158

Query:   106 FKCAYDKS 113
               C +D S
Sbjct:   159 KNCNFDLS 166

 Score = 90 (36.7 bits), Expect = 1.5e-10, Sum P(2) = 1.5e-10
 Identities = 36/122 (29%), Positives = 53/122 (43%)

Query:   742 VPDAWDWRKKNVTGPA-----GDQAA---CGSCWAFSIAGMLEGQYAI-KTGKLVE--FS 790
             +P  +DWR  N++G +      +Q     CGSCWA      L  +  I + G   E   +
Sbjct:    49 LPTQYDWR--NISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTFPEVVLA 106

Query:   791 KSQLVECAKQCSGCDGCFFEPSIEYTHQA--GLESEKDYPYK------NANGEKFKCAYD 842
                L+ CA   + CDG   +P+  Y + A  G+  E   PY+      NA G    C +D
Sbjct:   107 PQVLLNCAGPDNTCDGG--DPTEAYAYMAAKGITDETCAPYEAIDNECNAEGICKNCNFD 164

Query:   843 KS 844
              S
Sbjct:   165 LS 166

 Score = 82 (33.9 bits), Expect = 9.9e-10, Sum P(2) = 9.9e-10
 Identities = 35/122 (28%), Positives = 50/122 (40%)

Query:   376 VPDAWDWRKKNVTGPA-----GDQAA---CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQ 427
             +P  +DWR  N++G +      +Q     CGSCWA      L  +  IK G+   F +  
Sbjct:    49 LPTQYDWR--NISGSSYITITRNQHLPQYCGSCWAHGTTSALGDR--IKIGRKGTFPEVV 104

Query:   428 LV-ECAKQCSGCGG-CDGLEQPIEYTHQA--GLESEKDYPYR------NGNGEKFKCAYD 477
             L  +    C+G    CDG +    Y + A  G+  E   PY       N  G    C +D
Sbjct:   105 LAPQVLLNCAGPDNTCDGGDPTEAYAYMAAKGITDETCAPYEAIDNECNAEGICKNCNFD 164

Query:   478 KS 479
              S
Sbjct:   165 LS 166


>MGI|MGI:1891190
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 178 (67.7 bits), Expect = 1.5e-10, P = 1.5e-10
 Identities = 52/192 (27%), Positives = 89/192 (46%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKL--VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      +  +  IK  G    +  S   +++C    S C+G    P  EY H+ 
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH 149

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 137
             G+  E   +Y  K+ + +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   150 GIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIY 209

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRNSW 195
               GP+S  ++ ++++ +Y G    ++ +      + H + + G+G   D I YW+VRNSW
Sbjct:   210 ANGPISCGIMATEMMSNYTGGIYAEHQDQAV---INHIISVAGWGVSNDGIEYWIVRNSW 266

Query:   196 GPIGPDEGFFKI 207
             G    ++G+ +I
Sbjct:   267 GEPWGEKGWMRI 278

 Score = 178 (67.7 bits), Expect = 1.5e-10, Sum P(2) = 1.5e-10
 Identities = 52/192 (27%), Positives = 89/192 (46%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKL--VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      +  +  IK  G    +  S   +++C    S C+G    P  EY H+ 
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH 149

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 868
             G+  E   +Y  K+ + +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   150 GIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIY 209

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRNSW 926
               GP+S  ++ ++++ +Y G    ++ +      + H + + G+G   D I YW+VRNSW
Sbjct:   210 ANGPISCGIMATEMMSNYTGGIYAEHQDQAV---INHIISVAGWGVSNDGIEYWIVRNSW 266

Query:   927 GPIGPDEGFFKI 938
             G    ++G+ +I
Sbjct:   267 GEPWGEKGWMRI 278

 Score = 170 (64.9 bits), Expect = 1.2e-09, P = 1.2e-09
 Identities = 52/193 (26%), Positives = 87/193 (45%)

Query:   397 CGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQLVECAKQCSGCGGCDG-LEQPI-EYTHQ 453
             CGSCWA      +  +  IK  G       S  V+    C   G C+G  + P+ EY H+
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLS--VQNVIDCGNAGSCEGGNDLPVWEYAHK 148

Query:   454 AGLESE--KDYPYRNGNGEKFK----CAYDKS-----KVKLFTGKDFLYFNGSETMKKIL 502
              G+  E   +Y  ++ + +KF     C   K         L+   D+   +G E M   +
Sbjct:   149 HGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEI 208

Query:   503 YKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNS 560
             Y  GP+S G+ +  ++  Y G    ++ +      + H + + G+G  +D I YW+ RNS
Sbjct:   209 YANGPISCGIMATEMMSNYTGGIYAEHQDQAV---INHIISVAGWGVSNDGIEYWIVRNS 265

Query:   561 WGPIGPDEGFFKI 573
             WG    ++G+ +I
Sbjct:   266 WGEPWGEKGWMRI 278

 Score = 103 (41.3 bits), Expect = 3.7e-07, Sum P(2) = 3.7e-07
 Identities = 33/111 (29%), Positives = 48/111 (43%)

Query:   376 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVEFSKSQL 428
             +P  WDWR  N      VT        CGSCWA      +  +  IK  G       S  
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLS-- 121

Query:   429 VECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKF-KC 474
             V+    C   G C+G  + P+ EY H+ G+  E   +Y  ++ + +KF +C
Sbjct:   122 VQNVIDCGNAGSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQDCDKFNQC 172

 Score = 103 (41.3 bits), Expect = 3.7e-07, Sum P(2) = 3.7e-07
 Identities = 33/110 (30%), Positives = 48/110 (43%)

Query:   742 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKL--VEFSKS 792
             +P  WDWR  N      VT        CGSCWA      +  +  IK  G    +  S  
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQ 123

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C    S C+G    P  EY H+ G+  E   +Y  K+ + +KF +C
Sbjct:   124 NVIDCGNAGS-CEGGNDLPVWEYAHKHGIPDETCNNYQAKDQDCDKFNQC 172

 Score = 102 (41.0 bits), Expect = 4.7e-07, Sum P(2) = 4.7e-07
 Identities = 32/110 (29%), Positives = 48/110 (43%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKL--VEFSKS 61
             +P  WDWR  N    A    +      CGSCWA      +  +  IK  G    +  S  
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQ 123

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C    S C+G    P  EY H+ G+  E   +Y  K+ + +KF +C
Sbjct:   124 NVIDCGNAGS-CEGGNDLPVWEYAHKHGIPDETCNNYQAKDQDCDKFNQC 172

 Score = 93 (37.8 bits), Expect = 3.7e-07, Sum P(2) = 3.7e-07
 Identities = 29/98 (29%), Positives = 48/98 (48%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYAT-IDVVKNDETCSPYD 969
             N   W V   +G + G ++   +I   G  +CGI   E ++ Y   I     D+      
Sbjct:   186 NYTLWRV-GDYGSLSGREKMMAEIYANGPISCGIMATEMMSNYTGGIYAEHQDQAV---- 240

Query:   970 LGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKI 1006
             + H + + G+G  +D I YW+VRNSWG    ++G+ +I
Sbjct:   241 INHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278

 Score = 42 (19.8 bits), Expect = 1.5e-10, Sum P(2) = 1.5e-10
 Identities = 14/59 (23%), Positives = 25/59 (42%)

Query:   446 QPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYK 504
             +P EY   A L   K++ +RN NG  +        +  + G  + + + S    +I  K
Sbjct:    54 RPHEYLSPADLP--KNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIK 110


>UNIPROTKB|A5GFX7 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 175 (66.7 bits), Expect = 3.2e-10, P = 3.2e-10
 Identities = 59/219 (26%), Positives = 90/219 (41%)

Query:   376 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P +WDWR  N      VT        CGSCWA      +  +  IK  K    S    V
Sbjct:    63 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKR-KGAWPSTLLSV 121

Query:   430 ECAKQCSGCGGCDGLEQ-PI-EYTHQAGLESE--KDYPYRNGNGEKFK----CAYDKS-- 479
             +    C   G C+G +  P+  Y H+ G+  E   +Y  ++   +KF     C   K   
Sbjct:   122 QHVIDCGNAGSCEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECH 181

Query:   480 ---KVKLFTGKDFLYFNGSETMKKILYKYGPLSVGL--NSHLIHFYNGTPIRKNDETCSP 534
                   L+   D+   +G E M   +Y  GP+S G+     + ++  G      D+    
Sbjct:   182 VIQNYTLWKVGDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQA--- 238

Query:   535 YDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKI 573
             Y + H V + G+G      YW+ RNSWG    + G+ +I
Sbjct:   239 Y-INHIVSVAGWGVSGGTEYWIVRNSWGEPWGERGWMRI 276

 Score = 154 (59.3 bits), Expect = 8.0e-08, P = 8.0e-08
 Identities = 51/191 (26%), Positives = 82/191 (42%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      +  +  IK  G       S   +++C    S C+G    P   Y H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGS-CEGGDDLPVWAYAHRH 148

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 137
             G+  E   +Y  K+   +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIY 208

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 196
               GP+S  ++ ++ + +Y G    +  +    Y + H V + G+G      YW+VRNSWG
Sbjct:   209 ANGPISCGIMATEKMSNYTGGIYAEYKDQA--Y-INHIVSVAGWGVSGGTEYWIVRNSWG 265

Query:   197 PIGPDEGFFKI 207
                 + G+ +I
Sbjct:   266 EPWGERGWMRI 276

 Score = 154 (59.3 bits), Expect = 8.0e-08, P = 8.0e-08
 Identities = 51/191 (26%), Positives = 82/191 (42%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      +  +  IK  G       S   +++C    S C+G    P   Y H+ 
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGS-CEGGDDLPVWAYAHRH 148

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 868
             G+  E   +Y  K+   +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   149 GIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMAEIY 208

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWG 927
               GP+S  ++ ++ + +Y G    +  +    Y + H V + G+G      YW+VRNSWG
Sbjct:   209 ANGPISCGIMATEKMSNYTGGIYAEYKDQA--Y-INHIVSVAGWGVSGGTEYWIVRNSWG 265

Query:   928 PIGPDEGFFKI 938
                 + G+ +I
Sbjct:   266 EPWGERGWMRI 276

 Score = 93 (37.8 bits), Expect = 9.2e-06, Sum P(2) = 9.2e-06
 Identities = 32/110 (29%), Positives = 46/110 (41%)

Query:   742 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P +WDWR  N      VT        CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C    S C+G    P   Y H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVIDCGNAGS-CEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQC 171

 Score = 92 (37.4 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 31/110 (28%), Positives = 46/110 (41%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P +WDWR  N    A    +      CGSCWA      +  +  IK  G       S  
Sbjct:    63 LPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQ 122

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C    S C+G    P   Y H+ G+  E   +Y  K+   +KF +C
Sbjct:   123 HVIDCGNAGS-CEGGDDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQC 171

 Score = 90 (36.7 bits), Expect = 9.2e-06, Sum P(2) = 9.2e-06
 Identities = 29/97 (29%), Positives = 46/97 (47%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYAT-IDVVKNDETCSPYD 969
             N   W V   +G + G ++   +I   G  +CGI   E+++ Y   I     D+    Y 
Sbjct:   185 NYTLWKV-GDYGSVSGREKMMAEIYANGPISCGIMATEKMSNYTGGIYAEYKDQA---Y- 239

Query:   970 LGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 1006
             + H V + G+G      YW+VRNSWG    + G+ +I
Sbjct:   240 INHIVSVAGWGVSGGTEYWIVRNSWGEPWGERGWMRI 276


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 174 (66.3 bits), Expect = 4.3e-10, P = 4.3e-10
 Identities = 54/195 (27%), Positives = 89/195 (45%)

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQCSGCGGCDGLEQP---IEYTH 452
             CGSCWAF     L  +  IK      + ++ L V+    CSG G C    +P    +Y H
Sbjct:    92 CGSCWAFGATSALADRINIKRKNA--WPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYAH 149

Query:   453 QAGLESE--KDYPYRNGNGEKF-KCA-------YDKSKVKLFTGKDFLYFNGSETMKKIL 502
             + G+  E   +Y  R+G  + + +C        +      L+   ++   +G E MK  +
Sbjct:   150 EHGIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTVHGYEKMKAEI 209

Query:   503 YKYGPLSVGLNS-HLIHFYNGTPIRK-NDETCSPYDLGHAVLLVGYG--KQDDIPYWLAR 558
             Y  GP++ G+ +      Y G   ++  DE     D+ H + + G+G   +  + YW+ R
Sbjct:   210 YHKGPIACGIAATKAFETYAGGIYKEVTDE-----DIDHIISVHGWGVDHESGVEYWIGR 264

Query:   559 NSWGPIGPDEGFFKI 573
             NSWG    + G+FKI
Sbjct:   265 NSWGEPWGEHGWFKI 279

 Score = 155 (59.6 bits), Expect = 6.2e-08, P = 6.2e-08
 Identities = 51/195 (26%), Positives = 86/195 (44%)

Query:    32 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQCSGCDGCFF--EPS--IEYTH 86
             CGSCWAF     L  +  IK      + ++ L V+    CSG   C    EP    +Y H
Sbjct:    92 CGSCWAFGATSALADRINIKRKNA--WPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYAH 149

Query:    87 QAGLESE--KDYPYKNANGEKF-KCA-------YDKSKVKLFTGKDFLHFNGSETMKKIL 136
             + G+  E   +Y  ++   + + +C        +      L+   ++   +G E MK  +
Sbjct:   150 EHGIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTVHGYEKMKAEI 209

Query:   137 YKYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYG--KQDNIPYWLVR 192
             Y  GP++  +  +     Y G   ++  DE     D+ H + + G+G   +  + YW+ R
Sbjct:   210 YHKGPIACGIAATKAFETYAGGIYKEVTDE-----DIDHIISVHGWGVDHESGVEYWIGR 264

Query:   193 NSWGPIGPDEGFFKI 207
             NSWG    + G+FKI
Sbjct:   265 NSWGEPWGEHGWFKI 279

 Score = 155 (59.6 bits), Expect = 6.2e-08, P = 6.2e-08
 Identities = 51/195 (26%), Positives = 86/195 (44%)

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQCSGCDGCFF--EPS--IEYTH 817
             CGSCWAF     L  +  IK      + ++ L V+    CSG   C    EP    +Y H
Sbjct:    92 CGSCWAFGATSALADRINIKRKNA--WPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYAH 149

Query:   818 QAGLESE--KDYPYKNANGEKF-KCA-------YDKSKVKLFTGKDFLHFNGSETMKKIL 867
             + G+  E   +Y  ++   + + +C        +      L+   ++   +G E MK  +
Sbjct:   150 EHGIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTVHGYEKMKAEI 209

Query:   868 YKYGPLSV-LLNSDLIHDYNGTPIRK-NDETCSPYDLGHAVLLVGYG--KQDNIPYWLVR 923
             Y  GP++  +  +     Y G   ++  DE     D+ H + + G+G   +  + YW+ R
Sbjct:   210 YHKGPIACGIAATKAFETYAGGIYKEVTDE-----DIDHIISVHGWGVDHESGVEYWIGR 264

Query:   924 NSWGPIGPDEGFFKI 938
             NSWG    + G+FKI
Sbjct:   265 NSWGEPWGEHGWFKI 279

 Score = 105 (42.0 bits), Expect = 3.5e-07, Sum P(2) = 3.5e-07
 Identities = 41/155 (26%), Positives = 64/155 (41%)

Query:   693 YG-TSEFSDRSPEEILCKTGFKWSERTYERIVADRXXXXXXXXXXXXDGPVPDAWDWRKK 751
             YG   ++S+R+   +  K  +K + R +E    DR               +P  WDWR  
Sbjct:    21 YGKVRKYSNRNRYNL--KGCYKQTGRVFEHKRYDRIYETEDFDSED----LPKTWDWRDA 74

Query:   752 N-VTGPAGDQAA-----CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL-VECAKQCSGC 804
             N +   + D+       CGSCWAF     L  +  IK      + ++ L V+    CSG 
Sbjct:    75 NGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNA--WPQAYLSVQEVIDCSGA 132

Query:   805 DGCFF--EPS--IEYTHQAGLESEKDYPYKNANGE 835
               C    EP    +Y H+ G+  E    Y+  +G+
Sbjct:   133 GTCVMGGEPGGVYKYAHEHGIPHETCNNYQARDGK 167

 Score = 104 (41.7 bits), Expect = 4.6e-07, Sum P(2) = 4.6e-07
 Identities = 31/105 (29%), Positives = 46/105 (43%)

Query:    11 VPDAWDWRKKN-VTGPAGDQAD-----CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL- 63
             +P  WDWR  N +   + D+       CGSCWAF     L  +  IK      + ++ L 
Sbjct:    65 LPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNA--WPQAYLS 122

Query:    64 VECAKQCSGCDGCFF--EPS--IEYTHQAGLESEKDYPYKNANGE 104
             V+    CSG   C    EP    +Y H+ G+  E    Y+  +G+
Sbjct:   123 VQEVIDCSGAGTCVMGGEPGGVYKYAHEHGIPHETCNNYQARDGK 167

 Score = 91 (37.1 bits), Expect = 3.5e-07, Sum P(2) = 3.5e-07
 Identities = 22/69 (31%), Positives = 33/69 (47%)

Query:   940 RGNNACGIEQIAGYATIDVVKNDETCSPYDLGHAVLLVGYG--KQDDIPYWLVRNSWGPI 997
             +G  ACGI     + T       E     D+ H + + G+G   +  + YW+ RNSWG  
Sbjct:   212 KGPIACGIAATKAFETYAGGIYKEVTDE-DIDHIISVHGWGVDHESGVEYWIGRNSWGEP 270

Query:   998 GPDEGFFKI 1006
               + G+FKI
Sbjct:   271 WGEHGWFKI 279


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 117 (46.2 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 32/112 (28%), Positives = 52/112 (46%)

Query:     7 KDGPVPDAWDWRKK--NVTGPAGDQADCGSCWAFSIAGMLEGQYAI-KTGKLVE-FSKSQ 62
             K   +P+ +D R K   +  P  DQ DCGS W+ S   +   + AI   G++    S  Q
Sbjct:   180 KPRELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQ 239

Query:    63 LVECAK-QCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANG-EKFKCAYDK 112
             L+ C + +  GC+G + + +  Y  + G+  +  YPY +    E   C   K
Sbjct:   240 LLSCNQHRQKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPK 291

 Score = 108 (43.1 bits), Expect = 1.3e-08, Sum P(2) = 1.3e-08
 Identities = 30/108 (27%), Positives = 50/108 (46%)

Query:   742 VPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAI-KTGKLVE-FSKSQLVEC 797
             +P+ +D R K   +  P  DQ  CGS W+ S   +   + AI   G++    S  QL+ C
Sbjct:   184 LPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSC 243

Query:   798 AK-QCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANG-EKFKCAYDK 843
              + +  GC+G + + +  Y  + G+  +  YPY +    E   C   K
Sbjct:   244 NQHRQKGCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPK 291

 Score = 107 (42.7 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 21/49 (42%), Positives = 30/49 (61%)

Query:   173 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             H+V ++G+G   +    I YWL  NSWG    ++G+FK+ RG N C IE
Sbjct:   374 HSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422

 Score = 107 (42.7 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 21/49 (42%), Positives = 29/49 (59%)

Query:   539 HAVLLVGYGKQDD----IPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             H+V ++G+G        I YWL  NSWG    ++G+FK+ RG N C IE
Sbjct:   374 HSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422

 Score = 107 (42.7 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 21/49 (42%), Positives = 30/49 (61%)

Query:   904 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
             H+V ++G+G   +    I YWL  NSWG    ++G+FK+ RG N C IE
Sbjct:   374 HSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422

 Score = 106 (42.4 bits), Expect = 1.8e-09, Sum P(2) = 1.8e-09
 Identities = 21/49 (42%), Positives = 29/49 (59%)

Query:   972 HAVLLVGYGKQDD----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             H+V ++G+G        I YWL  NSWG    ++G+FK+ RG N C IE
Sbjct:   374 HSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEIE 422

 Score = 102 (41.0 bits), Expect = 5.5e-08, Sum P(2) = 5.5e-08
 Identities = 32/110 (29%), Positives = 50/110 (45%)

Query:   376 VPDAWDWRKK--NVTGPAGDQAACGSCWAFSIAGMLEGQYAI-KTGKLVE-FSKSQLVEC 431
             +P+ +D R K   +  P  DQ  CGS W+ S   +   + AI   G++    S  QL+ C
Sbjct:   184 LPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSC 243

Query:   432 AKQCSGCGGCDG--LEQPIEYTHQAGLESEKDYPYRNGNG-EKFKCAYDK 478
              +      GC+G  L++   Y  + G+  +  YPY +G   E   C   K
Sbjct:   244 NQHRQK--GCEGGYLDRAWWYIRKLGVVGDHCYPYVSGQSREPGHCLIPK 291


>UNIPROTKB|E2RNP9 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:27283
            OMA:WGQLTSS EMBL:AAEX03008403 RefSeq:XP_538969.2
            ProteinModelPortal:E2RNP9 Ensembl:ENSCAFT00000003638 GeneID:481848
            KEGG:cfa:481848 NextBio:20856579 Uniprot:E2RNP9
        Length = 476

 Score = 113 (44.8 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 35/104 (33%), Positives = 52/104 (50%)

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPI-----RKNDETCSPYDLG-HAVLLVGY 546
             N +E MK+I+   GP+   +  H   F+  T I     R N+E+     L  HAV L G+
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGW 418

Query:   547 G-----KQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQI 585
             G     +     +W+A NSWG    + G+F+I RG N   IE++
Sbjct:   419 GTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKL 462

 Score = 113 (44.8 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 36/117 (30%), Positives = 56/117 (47%)

Query:   858 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGTP---IRKNDETCSPYDLG-HAVLLVGY 911
             N +E MK+I+   GP+  ++  + D  H   G      R N+E+     L  HAV L G+
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGW 418

Query:   912 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE 963
             G     +     +W+  NSWG    + G+F+I RG N   IE++   A   +  +DE
Sbjct:   419 GTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKLIIAAWGHLTSSDE 475

 Score = 111 (44.1 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 29/90 (32%), Positives = 45/90 (50%)

Query:    24 GPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 80
             GP  DQ +C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDR 291

Query:    81 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 109
             +  +  + GL S   YP +K+ N   + CA
Sbjct:   292 AWWFLRKRGLVSHACYPLFKDQNATNYGCA 321

 Score = 110 (43.8 bits), Expect = 3.5e-09, Sum P(2) = 3.5e-09
 Identities = 33/104 (31%), Positives = 51/104 (49%)

Query:   127 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGTP---IRKNDETCSPYDLG-HAVLLVGY 180
             N +E MK+I+   GP+  ++  + D  H   G      R N+E+     L  HAV L G+
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLTGW 418

Query:   181 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   419 GTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKL 462

 Score = 108 (43.1 bits), Expect = 3.5e-09, Sum P(2) = 3.5e-09
 Identities = 29/90 (32%), Positives = 44/90 (48%)

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 811
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDR 291

Query:   812 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 840
             +  +  + GL S   YP +K+ N   + CA
Sbjct:   292 AWWFLRKRGLVSHACYPLFKDQNATNYGCA 321

 Score = 101 (40.6 bits), Expect = 1.8e-08, Sum P(2) = 1.8e-08
 Identities = 28/91 (30%), Positives = 44/91 (48%)

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCGGCDGLE 445
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC     ++
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNS-GSID 290

Query:   446 QPIEYTHQAGLESEKDYP-YRNGNGEKFKCA 475
             +   +  + GL S   YP +++ N   + CA
Sbjct:   291 RAWWFLRKRGLVSHACYPLFKDQNATNYGCA 321

 Score = 85 (35.0 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query:   972 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             HAV L G+G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   411 HAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKL 462


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 167 (63.8 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 53/192 (27%), Positives = 83/192 (43%)

Query:   763 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      L  +  IK  G       S   +++C    S C+G    P  EY H+ 
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH 149

Query:   820 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 868
             G+  E   +Y  K+   +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   150 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIY 209

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRNSW 926
               GP+S  ++ ++ + +Y G    +         + H + + G+G   D I YW+VRNSW
Sbjct:   210 ANGPISCGIMATERMSNYTGGIYTEYQNQAI---INHIISVAGWGVSNDGIEYWIVRNSW 266

Query:   927 GPIGPDEGFFKI 938
             G    + G+ +I
Sbjct:   267 GEPWGERGWMRI 278

 Score = 167 (63.8 bits), Expect = 2.7e-09, P = 2.7e-09
 Identities = 53/192 (27%), Positives = 83/192 (43%)

Query:    32 CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      L  +  IK  G       S   +++C    S C+G    P  EY H+ 
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH 149

Query:    89 GLESE--KDYPYKNANGEKFK----CAYDKS-----KVKLFTGKDFLHFNGSETMKKILY 137
             G+  E   +Y  K+   +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   150 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIY 209

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG-KQDNIPYWLVRNSW 195
               GP+S  ++ ++ + +Y G    +         + H + + G+G   D I YW+VRNSW
Sbjct:   210 ANGPISCGIMATERMSNYTGGIYTEYQNQAI---INHIISVAGWGVSNDGIEYWIVRNSW 266

Query:   196 GPIGPDEGFFKI 207
             G    + G+ +I
Sbjct:   267 GEPWGERGWMRI 278

 Score = 165 (63.1 bits), Expect = 4.5e-09, Sum P(2) = 4.5e-09
 Identities = 52/193 (26%), Positives = 83/193 (43%)

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDG-LEQPI-EYTHQA 454
             CGSCWA      L  +  IK  K    S    V+    C   G C+G  + P+ EY H+ 
Sbjct:    91 CGSCWAHGSTSALADRINIKR-KGAWPSTLLSVQNVIDCGNAGSCEGGNDLPVWEYAHKH 149

Query:   455 GLESE--KDYPYRNGNGEKFK----CAYDKS-----KVKLFTGKDFLYFNGSETMKKILY 503
             G+  E   +Y  ++   +KF     C   K         L+   D+   +G E M   +Y
Sbjct:   150 GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIY 209

Query:   504 KYGPLSVGL--NSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNS 560
               GP+S G+     + ++  G      ++      + H + + G+G  +D I YW+ RNS
Sbjct:   210 ANGPISCGIMATERMSNYTGGIYTEYQNQAI----INHIISVAGWGVSNDGIEYWIVRNS 265

Query:   561 WGPIGPDEGFFKI 573
             WG    + G+ +I
Sbjct:   266 WGEPWGERGWMRI 278

 Score = 105 (42.0 bits), Expect = 4.5e-07, Sum P(2) = 4.5e-07
 Identities = 34/110 (30%), Positives = 47/110 (42%)

Query:   376 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLV 429
             +P  WDWR  N      VT        CGSCWA      L  +  IK  K    S    V
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKR-KGAWPSTLLSV 122

Query:   430 ECAKQCSGCGGCDG-LEQPI-EYTHQAGLESE--KDYPYRNGNGEKF-KC 474
             +    C   G C+G  + P+ EY H+ G+  E   +Y  ++   +KF +C
Sbjct:   123 QNVIDCGNAGSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQECDKFNQC 172

 Score = 102 (41.0 bits), Expect = 9.6e-07, Sum P(2) = 9.6e-07
 Identities = 34/110 (30%), Positives = 46/110 (41%)

Query:   742 VPDAWDWRKKN------VTGPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 792
             +P  WDWR  N      VT        CGSCWA      L  +  IK  G       S  
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSVQ 123

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C    S C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   124 NVIDCGNAGS-CEGGNDLPVWEYAHKHGIPDETCNNYQAKDQECDKFNQC 172

 Score = 101 (40.6 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
 Identities = 33/110 (30%), Positives = 46/110 (41%)

Query:    11 VPDAWDWRKKNVTGPAGDQAD------CGSCWAFSIAGMLEGQYAIKT-GKLVE--FSKS 61
             +P  WDWR  N    A    +      CGSCWA      L  +  IK  G       S  
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSVQ 123

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C    S C+G    P  EY H+ G+  E   +Y  K+   +KF +C
Sbjct:   124 NVIDCGNAGS-CEGGNDLPVWEYAHKHGIPDETCNNYQAKDQECDKFNQC 172

 Score = 90 (36.7 bits), Expect = 4.5e-07, Sum P(2) = 4.5e-07
 Identities = 29/97 (29%), Positives = 49/97 (50%)

Query:   916 NIPYWLVRNSWGPI-GPDEGFFKI-ERGNNACGI---EQIAGYATIDVVKNDETCSPYDL 970
             N   W V   +G + G ++   +I   G  +CGI   E+++ Y T  +    E  +   +
Sbjct:   186 NYTLWRV-GDYGSLSGREKMMAEIYANGPISCGIMATERMSNY-TGGIYT--EYQNQAII 241

Query:   971 GHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKI 1006
              H + + G+G  +D I YW+VRNSWG    + G+ +I
Sbjct:   242 NHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI 278

 Score = 43 (20.2 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 13/56 (23%), Positives = 23/56 (41%)

Query:    83 EYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK 138
             EY   A L   K++ ++N NG  +        +  + G  + H + S    +I  K
Sbjct:    57 EYLSPADLP--KNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIK 110

 Score = 42 (19.8 bits), Expect = 3.2e-09, Sum P(2) = 3.2e-09
 Identities = 14/59 (23%), Positives = 25/59 (42%)

Query:   446 QPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYK 504
             +P EY   A L   K++ +RN NG  +        +  + G  + + + S    +I  K
Sbjct:    54 RPHEYLSPADLP--KNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIK 110


>WB|WBGene00008861 [details] [associations]
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 172 (65.6 bits), Expect = 3.8e-09, P = 3.8e-09
 Identities = 68/253 (26%), Positives = 110/253 (43%)

Query:   747 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS---- 802
             DWR      P  DQ+ CG CWAFS+  M+E  +AI+       S  QL+ C  +      
Sbjct:   228 DWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQLLTCDTKVDSTYG 285

Query:   803 ----GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDK--SKVKLFT-GKDFL 855
                 GC G +F+ +  Y   +        P+   +       +      + LF  G    
Sbjct:   286 LANVGCKGGYFQIAGSYLEVSAARDASLIPFDLEDTSCDSSFFPPVVPTILLFDDGYISG 345

Query:   856 HFNGSE--TMKKIL---YKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLG----HAV 906
             +F  ++  TM++ +    + GP++V + +       G  I K  E     D G    HAV
Sbjct:   346 NFTAAQLITMEQNIEDKVRKGPIAVGMAA-------GPDIYKYSEGVYDGDCGTIINHAV 398

Query:   907 LLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER--GNNACGIEQIAGYATIDVVKNDET 964
             ++VG+   D+  YW++RNSWG    + G+F+++R  G + C   +    AT   V  +ET
Sbjct:   399 VIVGF--TDD--YWIIRNSWGASWGEAGYFRVKRTPGKDPCQFYKYWSQAT--AVGANET 452

Query:   965 CSPYDLGHAVLLV 977
              +P   G    +V
Sbjct:   453 YAPPKAGGGEFVV 465

 Score = 167 (63.8 bits), Expect = 1.3e-08, P = 1.3e-08
 Identities = 62/227 (27%), Positives = 99/227 (43%)

Query:   381 DWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG 440
             DWR      P  DQ+ CG CWAFS+  M+E  +AI+       S  QL+ C  +     G
Sbjct:   228 DWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQLLTCDTKVDSTYG 285

Query:   441 CDGLEQPIEYTHQAG--LE--SEKDYPYRNGNGEKFKC--AYDKSKVKLFTGKDFLYFNG 494
                +     Y   AG  LE  + +D      + E   C  ++    V      D  Y +G
Sbjct:   286 LANVGCKGGYFQIAGSYLEVSAARDASLIPFDLEDTSCDSSFFPPVVPTILLFDDGYISG 345

Query:   495 SETMKKIL---------YKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVG 545
             + T  +++          + GP++VG+ +    +     +   D  C    + HAV++VG
Sbjct:   346 NFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYKYSEGVYDGD--CGTI-INHAVVIVG 402

Query:   546 YGKQDDIPYWLARNSWGPIGPDEGFFKIER--GNNACGIEQIAGYAT 590
             +   DD  YW+ RNSWG    + G+F+++R  G + C   +    AT
Sbjct:   403 F--TDD--YWIIRNSWGASWGEAGYFRVKRTPGKDPCQFYKYWSQAT 445

 Score = 163 (62.4 bits), Expect = 3.6e-08, P = 3.6e-08
 Identities = 62/231 (26%), Positives = 101/231 (43%)

Query:    16 DWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS---- 71
             DWR      P  DQ+ CG CWAFS+  M+E  +AI+       S  QL+ C  +      
Sbjct:   228 DWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQLLTCDTKVDSTYG 285

Query:    72 ----GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDK--SKVKLFT-GKDFL 124
                 GC G +F+ +  Y   +        P+   +       +      + LF  G    
Sbjct:   286 LANVGCKGGYFQIAGSYLEVSAARDASLIPFDLEDTSCDSSFFPPVVPTILLFDDGYISG 345

Query:   125 HFNGSE--TMKKIL---YKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLG----HAV 175
             +F  ++  TM++ +    + GP++V + +       G  I K  E     D G    HAV
Sbjct:   346 NFTAAQLITMEQNIEDKVRKGPIAVGMAA-------GPDIYKYSEGVYDGDCGTIINHAV 398

Query:   176 LLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIER--GNNACGIEQIAGYAT 224
             ++VG+   D+  YW++RNSWG    + G+F+++R  G + C   +    AT
Sbjct:   399 VIVGF--TDD--YWIIRNSWGASWGEAGYFRVKRTPGKDPCQFYKYWSQAT 445


>UNIPROTKB|I3L9E7 [details] [associations]
            symbol:LOC100153159 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 OMA:WGQLTSS
            Ensembl:ENSSSCT00000031207 Uniprot:I3L9E7
        Length = 358

 Score = 109 (43.4 bits), Expect = 3.9e-09, Sum P(2) = 3.9e-09
 Identities = 30/90 (33%), Positives = 44/90 (48%)

Query:    24 GPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 80
             GP  DQ +C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   115 GPL-DQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHGCNSGSIDR 173

Query:    81 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 109
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   174 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 203

 Score = 108 (43.1 bits), Expect = 3.9e-09, Sum P(2) = 3.9e-09
 Identities = 33/117 (28%), Positives = 53/117 (45%)

Query:   858 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT--PIRKNDETCSPYDL--GHAVLLVGY 911
             N +E M++I+   GP+  ++  + D  H   G    +   +E    Y     HAV L G+
Sbjct:   242 NETEIMREIMQN-GPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGW 300

Query:   912 GKQDNIP-----YWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE 963
             G           +W+  NSWG    + G+F+I RG N   IE++   A   +  +DE
Sbjct:   301 GTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTNSDE 357

 Score = 106 (42.4 bits), Expect = 6.3e-09, Sum P(2) = 6.3e-09
 Identities = 33/104 (31%), Positives = 51/104 (49%)

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRK-----NDETCSPYDLG-HAVLLVGY 546
             N +E M++I+   GP+   +  H   F+  T I +     N+E+     L  HAV L G+
Sbjct:   242 NETEIMREIMQN-GPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGW 300

Query:   547 GKQDDIP-----YWLARNSWGPIGPDEGFFKIERGNNACGIEQI 585
             G           +W+A NSWG    + G+F+I RG N   IE++
Sbjct:   301 GTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 344

 Score = 106 (42.4 bits), Expect = 1.1e-08, Sum P(3) = 1.1e-08
 Identities = 30/90 (33%), Positives = 43/90 (47%)

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 811
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   115 GPL-DQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHGCNSGSIDR 173

Query:   812 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 840
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   174 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 203

 Score = 104 (41.7 bits), Expect = 1.0e-08, Sum P(2) = 1.0e-08
 Identities = 30/104 (28%), Positives = 48/104 (46%)

Query:   127 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT--PIRKNDETCSPYDL--GHAVLLVGY 180
             N +E M++I+   GP+  ++  + D  H   G    +   +E    Y     HAV L G+
Sbjct:   242 NETEIMREIMQN-GPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLTGW 300

Query:   181 GKQDNIP-----YWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             G           +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   301 GTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 344

 Score = 99 (39.9 bits), Expect = 4.3e-08, Sum P(2) = 4.3e-08
 Identities = 29/91 (31%), Positives = 43/91 (47%)

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCGGCDGLE 445
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC     ++
Sbjct:   115 GPL-DQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHGCNS-GSID 172

Query:   446 QPIEYTHQAGLESEKDYP-YRNGNGEKFKCA 475
             +   Y  + GL S   YP +++ N     CA
Sbjct:   173 RAWWYLRKRGLVSHACYPLFKDQNATNNGCA 203

 Score = 86 (35.3 bits), Expect = 7.2e-07, Sum P(2) = 7.2e-07
 Identities = 19/52 (36%), Positives = 27/52 (51%)

Query:   972 HAVLLVGYGKQDDIP-----YWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             HAV L G+G           +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   293 HAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 344

 Score = 40 (19.1 bits), Expect = 1.1e-08, Sum P(3) = 1.1e-08
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   518 HFYNGTPIRKNDETCS----PYDLGHAVLLVGYGKQDDIPY----WLARNS---WGPIGP 566
             H+  G+ I++N  +C+     ++    V LV  G  + +      W A+N    WG +  
Sbjct:     8 HYEEGSVIKENCNSCTCSGQQWNCSQHVCLVQPGLIEHVNEGDFGWTAQNYSQFWG-MTL 66

Query:   567 DEGF 570
             +EGF
Sbjct:    67 EEGF 70


>ZFIN|ZDB-GENE-041010-139
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 165 (63.1 bits), Expect = 4.3e-09, P = 4.3e-09
 Identities = 54/192 (28%), Positives = 87/192 (45%)

Query:    32 CGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 88
             CGSCWA      L  +  IK          S   +++C      C G       EY H  
Sbjct:    81 CGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCG-DAGSCSGGDHSGVWEYAHNK 139

Query:    89 GLESE--KDYPYKNANGEKF-KCAYDKSK-----VKLFT-GK--DFLHFNGSETMKKILY 137
             G+  E   +Y  K+ + + F +C    +      VK FT  K  D+   +G + MK  +Y
Sbjct:   140 GIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAEIY 199

Query:   138 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSW 195
               GP+S  ++ +D +  Y G     ++    PY + H V + G+G  +N + +W+VRNSW
Sbjct:   200 SGGPISCGIMATDKLDAYTGGLY--SEYVQEPY-INHIVSVAGWGVDENGVEFWVVRNSW 256

Query:   196 GPIGPDEGFFKI 207
             G    ++G+ +I
Sbjct:   257 GEPWGEKGWLRI 268

 Score = 165 (63.1 bits), Expect = 4.3e-09, P = 4.3e-09
 Identities = 54/192 (28%), Positives = 87/192 (45%)

Query:   763 CGSCWAFSIAGMLEGQYAIKTGKL---VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQA 819
             CGSCWA      L  +  IK          S   +++C      C G       EY H  
Sbjct:    81 CGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCG-DAGSCSGGDHSGVWEYAHNK 139

Query:   820 GLESE--KDYPYKNANGEKF-KCAYDKSK-----VKLFT-GK--DFLHFNGSETMKKILY 868
             G+  E   +Y  K+ + + F +C    +      VK FT  K  D+   +G + MK  +Y
Sbjct:   140 GIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAEIY 199

Query:   869 KYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN-IPYWLVRNSW 926
               GP+S  ++ +D +  Y G     ++    PY + H V + G+G  +N + +W+VRNSW
Sbjct:   200 SGGPISCGIMATDKLDAYTGGLY--SEYVQEPY-INHIVSVAGWGVDENGVEFWVVRNSW 256

Query:   927 GPIGPDEGFFKI 938
             G    ++G+ +I
Sbjct:   257 GEPWGEKGWLRI 268

 Score = 155 (59.6 bits), Expect = 5.9e-08, P = 5.9e-08
 Identities = 54/192 (28%), Positives = 87/192 (45%)

Query:   397 CGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLEQP--IEYTHQA 454
             CGSCWA      L  +  IK  K    S    V+    C   G C G +     EY H  
Sbjct:    81 CGSCWAHGSTSALADRINIKR-KAAWPSAYLSVQNVIDCGDAGSCSGGDHSGVWEYAHNK 139

Query:   455 GLESE--KDYPYRNGNGEKF-KCAYDKSK-----VKLFT-GK--DFLYFNGSETMKKILY 503
             G+  E   +Y  ++ + + F +C    +      VK FT  K  D+   +G + MK  +Y
Sbjct:   140 GIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAEIY 199

Query:   504 KYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLARNSW 561
               GP+S G+ +   +  Y G     ++    PY + H V + G+G  ++ + +W+ RNSW
Sbjct:   200 SGGPISCGIMATDKLDAYTGGLY--SEYVQEPY-INHIVSVAGWGVDENGVEFWVVRNSW 256

Query:   562 GPIGPDEGFFKI 573
             G    ++G+ +I
Sbjct:   257 GEPWGEKGWLRI 268

 Score = 97 (39.2 bits), Expect = 5.2e-06, Sum P(2) = 5.2e-06
 Identities = 30/119 (25%), Positives = 55/119 (46%)

Query:   894 DETCSPYDLGHAVLLVGYGK-QDNIPYWLVRNSWGPIGPDEGFFKIERGNN-ACGI---E 948
             D+ C P++        G      N   W V +     G D+   +I  G   +CGI   +
Sbjct:   153 DQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAEIYSGGPISCGIMATD 212

Query:   949 QIAGYATIDVVKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKI 1006
             ++  Y     + ++    PY + H V + G+G  ++ + +W+VRNSWG    ++G+ +I
Sbjct:   213 KLDAYT--GGLYSEYVQEPY-INHIVSVAGWGVDENGVEFWVVRNSWGEPWGEKGWLRI 268

 Score = 88 (36.0 bits), Expect = 5.2e-06, Sum P(2) = 5.2e-06
 Identities = 30/110 (27%), Positives = 42/110 (38%)

Query:   742 VPDAWDWRK-K--NVTGPAGDQAA---CGSCWAFSIAGMLEGQYAIKTGKL---VEFSKS 792
             +P  WDWR  K  N      +Q     CGSCWA      L  +  IK          S  
Sbjct:    54 LPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAWPSAYLSVQ 113

Query:   793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 839
              +++C      C G       EY H  G+  E   +Y  K+ + + F +C
Sbjct:   114 NVIDCG-DAGSCSGGDHSGVWEYAHNKGIPDETCNNYQAKDQDCKPFNQC 162

 Score = 87 (35.7 bits), Expect = 6.6e-06, Sum P(2) = 6.6e-06
 Identities = 30/110 (27%), Positives = 42/110 (38%)

Query:    11 VPDAWDWRK-K--NVTGPAGDQ---ADCGSCWAFSIAGMLEGQYAIKTGKL---VEFSKS 61
             +P  WDWR  K  N      +Q     CGSCWA      L  +  IK          S  
Sbjct:    54 LPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAWPSAYLSVQ 113

Query:    62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESE--KDYPYKNANGEKF-KC 108
              +++C      C G       EY H  G+  E   +Y  K+ + + F +C
Sbjct:   114 NVIDCG-DAGSCSGGDHSGVWEYAHNKGIPDETCNNYQAKDQDCKPFNQC 162


>UNIPROTKB|Q9UJW2
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0000166 "nucleotide binding" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 EMBL:CH471081
            GO:GO:0000166 GO:GO:0030247 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604
            GO:GO:0004197 GO:GO:0005044 EMBL:AL359380 MEROPS:C01.973 CTD:27283
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            OMA:WGQLTSS EMBL:AB022277 EMBL:AF195116 EMBL:AF195117 EMBL:AK312918
            EMBL:AL589946 IPI:IPI00099386 IPI:IPI00478705 PIR:JC7189
            RefSeq:NP_055279.3 UniGene:Hs.127011 ProteinModelPortal:Q9UJW2
            SMR:Q9UJW2 IntAct:Q9UJW2 STRING:Q9UJW2 PhosphoSite:Q9UJW2
            DMDM:212276468 PRIDE:Q9UJW2 DNASU:27283 Ensembl:ENST00000259782
            GeneID:27283 KEGG:hsa:27283 UCSC:uc003pcj.2 GeneCards:GC06P054220
            H-InvDB:HIX0025004 HGNC:HGNC:14599 HPA:HPA035427 MIM:606749
            neXtProt:NX_Q9UJW2 PharmGKB:PA37905 InParanoid:Q9UJW2
            PhylomeDB:Q9UJW2 GenomeRNAi:27283 NextBio:50212 ArrayExpress:Q9UJW2
            Bgee:Q9UJW2 CleanEx:HS_TINAG Genevestigator:Q9UJW2
            GermOnline:ENSG00000137251 Uniprot:Q9UJW2
        Length = 476

 Score = 110 (43.8 bits), Expect = 5.7e-09, Sum P(2) = 5.7e-09
 Identities = 35/118 (29%), Positives = 52/118 (44%)

Query:   858 NGSETMKKILYKYGPLSVLLN--SDLIHDYNG-----TPIRKNDETCSPYDLGHAVLLVG 910
             N +E MK+I+   GP+  ++    D  H   G     T   K  E        HAV L G
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT-HAVKLTG 417

Query:   911 YG-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE 963
             +G     +     +W+  NSWG    + G+F+I RG N   IE++   A   +  +DE
Sbjct:   418 WGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTSSDE 475

 Score = 109 (43.4 bits), Expect = 5.7e-09, Sum P(2) = 5.7e-09
 Identities = 30/90 (33%), Positives = 44/90 (48%)

Query:    24 GPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 80
             GP  DQ +C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDR 291

Query:    81 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 109
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   292 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 106 (42.4 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 30/90 (33%), Positives = 43/90 (47%)

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 811
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDR 291

Query:   812 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 840
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   292 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 106 (42.4 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 32/105 (30%), Positives = 47/105 (44%)

Query:   127 NGSETMKKILYKYGPLSVLLN--SDLIHDYNG-----TPIRKNDETCSPYDLGHAVLLVG 179
             N +E MK+I+   GP+  ++    D  H   G     T   K  E        HAV L G
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT-HAVKLTG 417

Query:   180 YG-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             +G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   418 WGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462

 Score = 104 (41.7 bits), Expect = 2.4e-08, Sum P(2) = 2.4e-08
 Identities = 32/105 (30%), Positives = 48/105 (45%)

Query:   493 NGSETMKKILYKYGPLS--VGLNSHLIHFYNG-----TPIRKNDETCSPYDLGHAVLLVG 545
             N +E MK+I+   GP+   + +     H+  G     T   K  E        HAV L G
Sbjct:   360 NETEIMKEIMQN-GPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQT-HAVKLTG 417

Query:   546 YG-----KQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQI 585
             +G     +     +W+A NSWG    + G+F+I RG N   IE++
Sbjct:   418 WGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462

 Score = 99 (39.9 bits), Expect = 6.1e-08, Sum P(2) = 6.1e-08
 Identities = 29/91 (31%), Positives = 43/91 (47%)

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCGGCDGLE 445
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC     ++
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNS-GSID 290

Query:   446 QPIEYTHQAGLESEKDYP-YRNGNGEKFKCA 475
             +   Y  + GL S   YP +++ N     CA
Sbjct:   291 RAWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 87 (35.7 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query:   972 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             HAV L G+G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462


>GENEDB_PFALCIPARUM|PFD0230c
            symbol:PFD0230c "protease, putative"
            species:5833 "Plasmodium falciparum" [GO:0030163 "protein catabolic
            process" evidence=ISS] [GO:0020011 "apicoplast" evidence=RCA]
            InterPro:IPR000668 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 EMBL:AL844503 InterPro:IPR014882
            Pfam:PF08773 HSSP:P80067 RefSeq:XP_001351359.1
            ProteinModelPortal:Q8I1Y2 IntAct:Q8I1Y2 MINT:MINT-1604190
            MEROPS:C01.139 EnsemblProtists:PFD0230c:mRNA GeneID:812449
            KEGG:pfa:PFD0230c EuPathDB:PlasmoDB:PF3D7_0404700
            HOGENOM:HOG000284023 ProtClustDB:CLSZ2514828 ChEMBL:CHEMBL1250370
            Uniprot:Q8I1Y2
        Length = 939

 Score = 99 (39.9 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 26/78 (33%), Positives = 43/78 (55%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCDGCFFEPSI 82
             DQ DCGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC+G +   S+
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNGGYIYLSL 556

Query:    83 EYTHQAGLESEKDYP-YK 99
             +Y ++  L ++K +  YK
Sbjct:   557 KYAYENYLYTQKCFEKYK 574

 Score = 91 (37.1 bits), Expect = 3.5e-06, Sum P(3) = 3.5e-06
 Identities = 25/78 (32%), Positives = 42/78 (53%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCDGCFFEPSI 813
             DQ  CGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC+G +   S+
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNGGYIYLSL 556

Query:   814 EYTHQAGLESEKDYP-YK 830
             +Y ++  L ++K +  YK
Sbjct:   557 KYAYENYLYTQKCFEKYK 574

 Score = 81 (33.6 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV-VKNDET 964
             I YW V NSWG    + G+F I R NN+  I+       +++ VK  ET
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIKSYILACDVNLFVKQKET 939

 Score = 77 (32.2 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 34/120 (28%), Positives = 59/120 (49%)

Query:   456 LESEKDYPYRNGNGEKFKCAYD--KSK---VKLFTGK-DFLYFNGSETMKKILYKYGPLS 509
             L  E+ Y   + N +  +  YD  KS    VK+   K ++L     E +KK +Y  GP++
Sbjct:   678 LIDEQKYNNNHNNNDDDE-EYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVA 736

Query:   510 VGL--NSHLIHFYNGTP----IRKNDETCS-PY---DLGHAVLLVGYGKQDDIPYWLARN 559
               +  +S  I +  G      I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   737 AAIEPSSEFIGYKKGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 76 (31.8 bits), Expect = 2.2e-08, Sum P(4) = 2.2e-08
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query:   985 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             I YW V NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 76 (31.8 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             I YW V NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 74 (31.1 bits), Expect = 1.4e-08, Sum P(4) = 1.4e-08
 Identities = 30/107 (28%), Positives = 54/107 (50%)

Query:   831 NANGEKFKCAYDKS-KVKLFTGK-DFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYN 886
             N + E++      S  VK+   K ++L     E +KK +Y  GP++  +  +S+ I  Y 
Sbjct:   691 NDDDEEYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVAAAIEPSSEFI-GYK 749

Query:   887 -----GTPIRKNDETCS-PY---DLGHAVLLVGYGKQDNIPYWLVRN 924
                  G  I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   750 KGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 74 (31.1 bits), Expect = 5.2e-07, Sum P(3) = 5.2e-07
 Identities = 30/107 (28%), Positives = 54/107 (50%)

Query:   100 NANGEKFKCAYDKS-KVKLFTGK-DFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYN 155
             N + E++      S  VK+   K ++L     E +KK +Y  GP++  +  +S+ I  Y 
Sbjct:   691 NDDDEEYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVAAAIEPSSEFI-GYK 749

Query:   156 -----GTPIRKNDETCS-PY---DLGHAVLLVGYGKQDNIPYWLVRN 193
                  G  I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   750 KGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 73 (30.8 bits), Expect = 5.8e-06, Sum P(4) = 5.8e-06
 Identities = 22/73 (30%), Positives = 37/73 (50%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCGGCDGLEQP 447
             DQ  CGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC G   +   
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNG-GYIYLS 555

Query:   448 IEYTHQAGLESEK 460
             ++Y ++  L ++K
Sbjct:   556 LKYAYENYLYTQK 568

 Score = 72 (30.4 bits), Expect = 5.4e-08, Sum P(4) = 5.4e-08
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             I YW   NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 59 (25.8 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query:   273 DTLAIEGSLTFDNENIL--ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTS 330
             D L I+ SL   ++N L  + F  F +K  ++  N+  IKE+      +   K++ Y  +
Sbjct:   582 DDLEIKSSLMSQDDNSLLCDQFDVFKIKNEKKKNNEINIKEQITMNINNDSNKNQEYTNN 641

Query:   331 E 331
             +
Sbjct:   642 D 642

 Score = 59 (25.8 bits), Expect = 1.4e-08, Sum P(4) = 1.4e-08
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query:   639 DTLAIEGSLTFDNENIL--ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTS 696
             D L I+ SL   ++N L  + F  F +K  ++  N+  IKE+      +   K++ Y  +
Sbjct:   582 DDLEIKSSLMSQDDNSLLCDQFDVFKIKNEKKKNNEINIKEQITMNINNDSNKNQEYTNN 641

Query:   697 E 697
             +
Sbjct:   642 D 642

 Score = 38 (18.4 bits), Expect = 4.3e-05, Sum P(4) = 4.3e-05
 Identities = 11/39 (28%), Positives = 18/39 (46%)

Query:   671 NDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCK 709
             ND++  E ++ FK +           E+ D   EE+L K
Sbjct:   691 NDDD--EEYDIFKSNSCDVKINVSKFEYLDIQDEELLKK 727


>UNIPROTKB|Q8I1Y2
            symbol:PFD0230c "Protease, putative" species:36329
            "Plasmodium falciparum 3D7" [GO:0020011 "apicoplast" evidence=RCA]
            [GO:0030163 "protein catabolic process" evidence=ISS]
            InterPro:IPR000668 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 EMBL:AL844503 InterPro:IPR014882
            Pfam:PF08773 HSSP:P80067 RefSeq:XP_001351359.1
            ProteinModelPortal:Q8I1Y2 IntAct:Q8I1Y2 MINT:MINT-1604190
            MEROPS:C01.139 EnsemblProtists:PFD0230c:mRNA GeneID:812449
            KEGG:pfa:PFD0230c EuPathDB:PlasmoDB:PF3D7_0404700
            HOGENOM:HOG000284023 ProtClustDB:CLSZ2514828 ChEMBL:CHEMBL1250370
            Uniprot:Q8I1Y2
        Length = 939

 Score = 99 (39.9 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 26/78 (33%), Positives = 43/78 (55%)

Query:    28 DQADCGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCDGCFFEPSI 82
             DQ DCGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC+G +   S+
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNGGYIYLSL 556

Query:    83 EYTHQAGLESEKDYP-YK 99
             +Y ++  L ++K +  YK
Sbjct:   557 KYAYENYLYTQKCFEKYK 574

 Score = 91 (37.1 bits), Expect = 3.5e-06, Sum P(3) = 3.5e-06
 Identities = 25/78 (32%), Positives = 42/78 (53%)

Query:   759 DQAACGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCDGCFFEPSI 813
             DQ  CGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC+G +   S+
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNGGYIYLSL 556

Query:   814 EYTHQAGLESEKDYP-YK 830
             +Y ++  L ++K +  YK
Sbjct:   557 KYAYENYLYTQKCFEKYK 574

 Score = 81 (33.6 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 19/49 (38%), Positives = 26/49 (53%)

Query:   917 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDV-VKNDET 964
             I YW V NSWG    + G+F I R NN+  I+       +++ VK  ET
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIKSYILACDVNLFVKQKET 939

 Score = 77 (32.2 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 34/120 (28%), Positives = 59/120 (49%)

Query:   456 LESEKDYPYRNGNGEKFKCAYD--KSK---VKLFTGK-DFLYFNGSETMKKILYKYGPLS 509
             L  E+ Y   + N +  +  YD  KS    VK+   K ++L     E +KK +Y  GP++
Sbjct:   678 LIDEQKYNNNHNNNDDDE-EYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVA 736

Query:   510 VGL--NSHLIHFYNGTP----IRKNDETCS-PY---DLGHAVLLVGYGKQDDIPYWLARN 559
               +  +S  I +  G      I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   737 AAIEPSSEFIGYKKGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 76 (31.8 bits), Expect = 2.2e-08, Sum P(4) = 2.2e-08
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query:   985 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
             I YW V NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 76 (31.8 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
 Identities = 15/32 (46%), Positives = 19/32 (59%)

Query:   186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
             I YW V NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 74 (31.1 bits), Expect = 1.4e-08, Sum P(4) = 1.4e-08
 Identities = 30/107 (28%), Positives = 54/107 (50%)

Query:   831 NANGEKFKCAYDKS-KVKLFTGK-DFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYN 886
             N + E++      S  VK+   K ++L     E +KK +Y  GP++  +  +S+ I  Y 
Sbjct:   691 NDDDEEYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVAAAIEPSSEFI-GYK 749

Query:   887 -----GTPIRKNDETCS-PY---DLGHAVLLVGYGKQDNIPYWLVRN 924
                  G  I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   750 KGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 74 (31.1 bits), Expect = 5.2e-07, Sum P(3) = 5.2e-07
 Identities = 30/107 (28%), Positives = 54/107 (50%)

Query:   100 NANGEKFKCAYDKS-KVKLFTGK-DFLHFNGSETMKKILYKYGPLSVLL--NSDLIHDYN 155
             N + E++      S  VK+   K ++L     E +KK +Y  GP++  +  +S+ I  Y 
Sbjct:   691 NDDDEEYDIFKSNSCDVKINVSKFEYLDIQDEELLKKYIYYNGPVAAAIEPSSEFI-GYK 749

Query:   156 -----GTPIRKNDETCS-PY---DLGHAVLLVGYGKQDNIPYWLVRN 193
                  G  I+  D T +  Y    + HAV++VG+G +D +P ++ +N
Sbjct:   750 KGIILGNFIKMYDGTKNNAYIWNKVDHAVVIVGWG-EDTLPNFVKKN 795

 Score = 73 (30.8 bits), Expect = 5.8e-06, Sum P(4) = 5.8e-06
 Identities = 22/73 (30%), Positives = 37/73 (50%)

Query:   393 DQAACGSCWAFSIAGMLEGQYAIKTG--KLVE---FSKSQLVECAKQCSGCGGCDGLEQP 447
             DQ  CGSC+A S + ++  +  IK    K ++   FS  QL+ C     GC G   +   
Sbjct:   497 DQKDCGSCYANSASFIINSRVRIKYNYIKNIDSLFFSNEQLILCDIFNQGCNG-GYIYLS 555

Query:   448 IEYTHQAGLESEK 460
             ++Y ++  L ++K
Sbjct:   556 LKYAYENYLYTQK 568

 Score = 72 (30.4 bits), Expect = 5.4e-08, Sum P(4) = 5.4e-08
 Identities = 14/32 (43%), Positives = 18/32 (56%)

Query:   552 IPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             I YW   NSWG    + G+F I R NN+  I+
Sbjct:   891 IKYWKVLNSWGTNWGNSGYFYILRNNNSFNIK 922

 Score = 59 (25.8 bits), Expect = 7.1e-09, Sum P(4) = 7.1e-09
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query:   273 DTLAIEGSLTFDNENIL--ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTS 330
             D L I+ SL   ++N L  + F  F +K  ++  N+  IKE+      +   K++ Y  +
Sbjct:   582 DDLEIKSSLMSQDDNSLLCDQFDVFKIKNEKKKNNEINIKEQITMNINNDSNKNQEYTNN 641

Query:   331 E 331
             +
Sbjct:   642 D 642

 Score = 59 (25.8 bits), Expect = 1.4e-08, Sum P(4) = 1.4e-08
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query:   639 DTLAIEGSLTFDNENIL--ETFKAFIVKRGRQYANDEEIKERFEYFKQDGHKKHERYGTS 696
             D L I+ SL   ++N L  + F  F +K  ++  N+  IKE+      +   K++ Y  +
Sbjct:   582 DDLEIKSSLMSQDDNSLLCDQFDVFKIKNEKKKNNEINIKEQITMNINNDSNKNQEYTNN 641

Query:   697 E 697
             +
Sbjct:   642 D 642

 Score = 38 (18.4 bits), Expect = 4.3e-05, Sum P(4) = 4.3e-05
 Identities = 11/39 (28%), Positives = 18/39 (46%)

Query:   671 NDEEIKERFEYFKQDGHKKHERYGTSEFSDRSPEEILCK 709
             ND++  E ++ FK +           E+ D   EE+L K
Sbjct:   691 NDDD--EEYDIFKSNSCDVKINVSKFEYLDIQDEELLKK 727


>UNIPROTKB|H0YE42
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000525733 Uniprot:H0YE42
        Length = 82

 Score = 144 (55.7 bits), Expect = 9.8e-09, P = 9.8e-09
 Identities = 24/52 (46%), Positives = 30/52 (57%)

Query:   377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 428
             P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+  L
Sbjct:    29 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQAL 80

 Score = 144 (55.7 bits), Expect = 9.8e-09, P = 9.8e-09
 Identities = 24/52 (46%), Positives = 30/52 (57%)

Query:   743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 794
             P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+  L
Sbjct:    29 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQAL 80

 Score = 142 (55.0 bits), Expect = 1.6e-08, P = 1.6e-08
 Identities = 24/52 (46%), Positives = 30/52 (57%)

Query:    12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQL 63
             P  WDWR K       DQ  CGSCWAFS+ G +EGQ+ +  G L+  S+  L
Sbjct:    29 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQAL 80


>UNIPROTKB|Q3SZI1
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 109 (43.4 bits), Expect = 4.9e-08, Sum P(2) = 4.9e-08
 Identities = 30/90 (33%), Positives = 45/90 (50%)

Query:    24 GPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 80
             GP  DQ +C + WAFS A +   + AI++ G+     S   L+ C AK+  GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDR 291

Query:    81 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 109
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   292 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 106 (42.4 bits), Expect = 2.1e-07, Sum P(3) = 2.1e-07
 Identities = 30/90 (33%), Positives = 44/90 (48%)

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 811
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK+  GC+    + 
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDR 291

Query:   812 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 840
             +  Y  + GL S   YP +K+ N     CA
Sbjct:   292 AWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 101 (40.6 bits), Expect = 4.9e-08, Sum P(2) = 4.9e-08
 Identities = 32/104 (30%), Positives = 50/104 (48%)

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKN----DETCSPYDL--GHAVLLVGY 546
             N +E M++I+   GP+   +  H   F   T I ++    +E    Y     HAV L G+
Sbjct:   360 NETEIMREIMQN-GPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGW 418

Query:   547 G-----KQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQI 585
             G     +     +W+A NSWG    + G+F+I RG N   IE++
Sbjct:   419 GTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462

 Score = 100 (40.3 bits), Expect = 6.2e-08, Sum P(2) = 6.2e-08
 Identities = 33/117 (28%), Positives = 53/117 (45%)

Query:   858 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT--PIRKNDETCSPYDL--GHAVLLVGY 911
             N +E M++I+   GP+  ++  + D  +   G    I   +E    Y     HAV L G+
Sbjct:   360 NETEIMREIMQN-GPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGW 418

Query:   912 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE 963
             G     +     +W+  NSWG    + G+F+I RG N   IE++   A   +   DE
Sbjct:   419 GTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTSADE 475

 Score = 99 (39.9 bits), Expect = 7.9e-08, Sum P(2) = 7.9e-08
 Identities = 30/104 (28%), Positives = 49/104 (47%)

Query:   127 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT--PIRKNDETCSPYDL--GHAVLLVGY 180
             N +E M++I+   GP+  ++  + D  +   G    I   +E    Y     HAV L G+
Sbjct:   360 NETEIMREIMQN-GPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLTGW 418

Query:   181 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   419 GTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462

 Score = 99 (39.9 bits), Expect = 5.7e-07, Sum P(2) = 5.7e-07
 Identities = 29/91 (31%), Positives = 44/91 (48%)

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCGGCDGLE 445
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK+  GC     ++
Sbjct:   233 GPL-DQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNS-GSVD 290

Query:   446 QPIEYTHQAGLESEKDYP-YRNGNGEKFKCA 475
             +   Y  + GL S   YP +++ N     CA
Sbjct:   291 RAWWYLRKRGLVSHACYPLFKDQNATNNGCA 321

 Score = 87 (35.7 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query:   972 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             HAV L G+G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 462

 Score = 41 (19.5 bits), Expect = 2.1e-07, Sum P(3) = 2.1e-07
 Identities = 18/64 (28%), Positives = 30/64 (46%)

Query:   518 HFYNGTPIRKNDETCS----PYDLGHAVLLVGYGKQDDI---PY-WLARNS---WGPIGP 566
             H+  G+ I++N  +C+     +     V LV  G  + +    Y W A+N    WG +  
Sbjct:   126 HYEEGSVIKENCNSCTCSGQQWKCSQHVCLVQPGLIEHVNKGDYGWTAQNYSQFWG-MTL 184

Query:   567 DEGF 570
             +EGF
Sbjct:   185 EEGF 188


>RGD|1359482
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 108 (43.1 bits), Expect = 7.9e-08, Sum P(2) = 7.9e-08
 Identities = 34/104 (32%), Positives = 51/104 (49%)

Query:   493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRK-----NDETCSPYDLG-HAVLLVGY 546
             N +E M++I+   GP+   +  H   FY  T I +     N+E      L  HAV L G+
Sbjct:   359 NETEIMREIIQN-GPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGW 417

Query:   547 G-----KQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQI 585
             G     +     +W+A NSWG    + G+F+I RG N   IE++
Sbjct:   418 GTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 461

 Score = 100 (40.3 bits), Expect = 7.9e-08, Sum P(2) = 7.9e-08
 Identities = 28/90 (31%), Positives = 43/90 (47%)

Query:    24 GPAGDQADCGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 80
             GP  DQ +C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   232 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDR 290

Query:    81 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 109
             +  +  + GL S   YP +K  +     CA
Sbjct:   291 AWWFLRKRGLVSHACYPLFKEQSTNNNSCA 320

 Score = 98 (39.6 bits), Expect = 9.2e-07, Sum P(2) = 9.2e-07
 Identities = 32/117 (27%), Positives = 55/117 (47%)

Query:   858 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT---PIRKNDETCSPYDLG-HAVLLVGY 911
             N +E M++I+   GP+  ++  + D  +   G     +  N+E      L  HAV L G+
Sbjct:   359 NETEIMREIIQN-GPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGW 417

Query:   912 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYATIDVVKNDE 963
             G     +     +W+  NSWG    + G+F+I RG N   IE++   A   +  +D+
Sbjct:   418 GTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTSSDD 474

 Score = 97 (39.2 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
 Identities = 30/104 (28%), Positives = 50/104 (48%)

Query:   127 NGSETMKKILYKYGPLSVLL--NSDLIHDYNGT---PIRKNDETCSPYDLG-HAVLLVGY 180
             N +E M++I+   GP+  ++  + D  +   G     +  N+E      L  HAV L G+
Sbjct:   359 NETEIMREIIQN-GPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLTGW 417

Query:   181 G-----KQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 219
             G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   418 GTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 461

 Score = 97 (39.2 bits), Expect = 4.6e-06, Sum P(3) = 4.6e-06
 Identities = 28/90 (31%), Positives = 42/90 (46%)

Query:   755 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCDGCFFEP 811
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC+    + 
Sbjct:   232 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDR 290

Query:   812 SIEYTHQAGLESEKDYP-YKNANGEKFKCA 840
             +  +  + GL S   YP +K  +     CA
Sbjct:   291 AWWFLRKRGLVSHACYPLFKEQSTNNNSCA 320

 Score = 90 (36.7 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
 Identities = 27/91 (29%), Positives = 42/91 (46%)

Query:   389 GPAGDQAACGSCWAFSIAGMLEGQYAIKT-GKLV-EFSKSQLVEC-AKQCSGCGGCDGLE 445
             GP  DQ  C + WAFS A +   + AI++ G+     S   L+ C AK   GC     ++
Sbjct:   232 GPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNS-GSID 289

Query:   446 QPIEYTHQAGLESEKDYP-YRNGNGEKFKCA 475
             +   +  + GL S   YP ++  +     CA
Sbjct:   290 RAWWFLRKRGLVSHACYPLFKEQSTNNNSCA 320

 Score = 86 (35.3 bits), Expect = 1.6e-05, Sum P(2) = 1.6e-05
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query:   972 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQI 1018
             HAV L G+G     +     +W+  NSWG    + G+F+I RG N   IE++
Sbjct:   410 HAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKL 461

 Score = 39 (18.8 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:   249 CGVASCLCLPSLTDRI 264
             C    CL LP L D I
Sbjct:   149 CSQHVCLVLPELIDHI 164

 Score = 39 (18.8 bits), Expect = 4.6e-06, Sum P(3) = 4.6e-06
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:   615 CGVASCLCLPSLTDRI 630
             C    CL LP L D I
Sbjct:   149 CSQHVCLVLPELIDHI 164
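
Note: each HSP above opens with a "Score = ..." line followed by an "Identities = ..." line. The sketch below is one possible way to pull the numbers out of such a pair of lines. It is not part of the BLAST output. The function name, the regular expressions, and the returned keys are illustrative assumptions, written only against the line layout visible in this report. The percentages printed in this report appear to be truncated rather than rounded (for example 30/104 is shown as 28%), which the last assertion in the usage example checks.

```python
import re

# Patterns written against the header lines seen in this report only
# (assumption: other BLAST versions may format these lines differently).
SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([\d.eE+-]+), "
    r"(?:Sum )?P(?:\((\d+)\))? = ([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp_header(score_line, ident_line):
    """Return the numeric fields of one HSP header as a dict (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        # Number of HSPs in the Sum P group; 1 when no "(n)" is present.
        "group_size": int(s.group(4)) if s.group(4) else 1,
        "p_value": float(s.group(5)),
        "identities": int(i.group(1)),
        "align_len": int(i.group(2)),
        "identity_pct": int(i.group(3)),
        "positives": int(i.group(4)),
        "positive_pct": int(i.group(6)),
    }


hsp = parse_hsp_header(
    " Score = 99 (39.9 bits), Expect = 7.9e-08, Sum P(2) = 7.9e-08",
    " Identities = 30/104 (28%), Positives = 49/104 (47%)",
)
# The reported percentage matches integer (truncating) division.
assert 100 * hsp["identities"] // hsp["align_len"] == hsp["identity_pct"]
```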

WARNING:  HSPs involving 25 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.138   0.437    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1026      1002   0.00078  123 3  11 22  0.41    34
                                                     38  0.45    37
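
Note: the bit scores printed beside each raw score in the alignments above are consistent with the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, evaluated with the Lambda and K from the gapped row (Q=9,R=2) of the table above. The sketch below is not part of the BLAST output. The function and constant names are illustrative, and the choice of the gapped row is an inference from the reported numbers, not something the report states.

```python
import math

# Values copied from the "Q=9,R=2" row of the parameters table above.
# Assumption: this gapped row is the one used for the reported bit scores.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from HSP header lines in this report.
for raw in (108, 99, 87, 41, 39):
    print(raw, round(bit_score(raw), 1))
```

With these constants a raw score of 99 converts to 39.9 bits and 108 to 43.1 bits, matching the header lines in this report.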


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  275
  No. of states in DFA:  613 (65 KB)
  Total size of DFA:  517 KB (2227 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  80.10u 0.13s 80.23t   Elapsed:  00:00:03
  Total cpu time:  80.22u 0.13s 80.35t   Elapsed:  00:00:03
  Start:  Thu Aug 15 13:38:46 2013   End:  Thu Aug 15 13:38:49 2013
WARNINGS ISSUED:  2
