RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7460
         (1026 letters)



>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  222 bits (569), Expect = 6e-67
 Identities = 95/215 (44%), Positives = 124/215 (57%), Gaps = 8/215 (3%)

Query: 12  PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
           P++ DWR+K    P  DQ  CGSCWAFS  G LEG YAIKTGKLV  S+ QLV+C+   +
Sbjct: 1   PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGN 60

Query: 72  -GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGS 129
            GC+G   + + EY    GL SE DYPY   +G    C Y+ SKV    TG   +     
Sbjct: 61  NGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPGDE 117

Query: 130 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
           E +K  L  YGP+SV ++ S     Y G     +   CS  +L HAVLLVGYG ++ + Y
Sbjct: 118 EALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGVDY 175

Query: 189 WLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 223
           W+V+NSWG    ++G+ +I RG+N CGI   A Y 
Sbjct: 176 WIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210



 Score =  222 bits (567), Expect = 1e-66
 Identities = 95/215 (44%), Positives = 125/215 (58%), Gaps = 8/215 (3%)

Query: 743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
           P++ DWR+K    P  DQ +CGSCWAFS  G LEG YAIKTGKLV  S+ QLV+C+   +
Sbjct: 1   PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGN 60

Query: 803 -GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGS 860
            GC+G   + + EY    GL SE DYPY   +G    C Y+ SKV    TG   +     
Sbjct: 61  NGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPGDE 117

Query: 861 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 919
           E +K  L  YGP+SV ++ S     Y G     +   CS  +L HAVLLVGYG ++ + Y
Sbjct: 118 EALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGVDY 175

Query: 920 WLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 954
           W+V+NSWG    ++G+ +I RG+N CGI   A Y 
Sbjct: 176 WIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210



 Score =  221 bits (566), Expect = 2e-66
 Identities = 97/217 (44%), Positives = 125/217 (57%), Gaps = 11/217 (5%)

Query: 377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
           P++ DWR+K    P  DQ +CGSCWAFS  G LEG YAIKTGKLV  S+ QLV+C+   S
Sbjct: 1   PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCST--S 58

Query: 437 GCGGCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLF-TGKDFLYFN 493
           G  GC+G   +   EY    GL SE DYPY   +G    C Y+ SKV    TG   +   
Sbjct: 59  GNNGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPG 115

Query: 494 GSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
             E +K  L  YGP+SV ++ S    FY G     +   CS  +L HAVLLVGYG ++ +
Sbjct: 116 DEEALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGV 173

Query: 553 PYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 589
            YW+ +NSWG    ++G+ +I RG+N CGI   A Y 
Sbjct: 174 DYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210



 Score = 91.5 bits (228), Expect = 7e-21
 Identities = 29/62 (46%), Positives = 41/62 (66%)

Query: 961  NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAG 1020
            +   CS  +L HAVLLVGYG ++ + YW+V+NSWG    ++G+ +I RG+N CGI   A 
Sbjct: 149  SGPCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYAS 208

Query: 1021 YA 1022
            Y 
Sbjct: 209  YP 210


>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  220 bits (564), Expect = 4e-66
 Identities = 90/219 (41%), Positives = 124/219 (56%), Gaps = 11/219 (5%)

Query: 11  VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
           +P+++DWR+K    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C    
Sbjct: 1   LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGN 60

Query: 71  SGCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFN 127
           +GC+G   + + EY  +  G+ +E DYPY   +G    C + KS  K    K +  + +N
Sbjct: 61  NGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVPYN 117

Query: 128 GSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
             E ++  L K GP+SV ++   D    Y           CS   L HAVL+VGYG ++ 
Sbjct: 118 DEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVY--KHTECSGE-LDHAVLIVGYGTENG 174

Query: 186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
           +PYW+V+NSWG    + G+F+I RG N CGI   A Y  
Sbjct: 175 VPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213



 Score =  220 bits (563), Expect = 6e-66
 Identities = 90/219 (41%), Positives = 124/219 (56%), Gaps = 11/219 (5%)

Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
           +P+++DWR+K    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C    
Sbjct: 1   LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGN 60

Query: 802 SGCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFN 858
           +GC+G   + + EY  +  G+ +E DYPY   +G    C + KS  K    K +  + +N
Sbjct: 61  NGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVPYN 117

Query: 859 GSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
             E ++  L K GP+SV ++   D    Y           CS   L HAVL+VGYG ++ 
Sbjct: 118 DEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVY--KHTECSGE-LDHAVLIVGYGTENG 174

Query: 917 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
           +PYW+V+NSWG    + G+F+I RG N CGI   A Y  
Sbjct: 175 VPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213



 Score =  213 bits (545), Expect = 2e-63
 Identities = 91/227 (40%), Positives = 125/227 (55%), Gaps = 26/227 (11%)

Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
           +P+++DWR+K    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C    
Sbjct: 1   LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDC---D 57

Query: 436 SGCGGCDG--LEQPIEYTHQA-GLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY- 491
           +G  GC+G   +   EY  +  G+ +E DYPY   +G    C + KS  K    K +   
Sbjct: 58  TGNNGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDV 114

Query: 492 -FNGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLL 543
            +N  E ++  L K GP+SV ++++   F       Y  T        CS   L HAVL+
Sbjct: 115 PYNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE-------CSGE-LDHAVLI 166

Query: 544 VGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
           VGYG ++ +PYW+ +NSWG    + G+F+I RG N CGI   A Y  
Sbjct: 167 VGYGTENGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213



 Score = 88.4 bits (220), Expect = 1e-19
 Identities = 30/63 (47%), Positives = 39/63 (61%), Gaps = 1/63 (1%)

Query: 961  NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAG 1020
                CS   L HAVL+VGYG ++ +PYW+V+NSWG    + G+F+I RG N CGI   A 
Sbjct: 152  KHTECSGE-LDHAVLIVGYGTENGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEAS 210

Query: 1021 YAT 1023
            Y  
Sbjct: 211  YPI 213


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  170 bits (433), Expect = 1e-48
 Identities = 80/218 (36%), Positives = 104/218 (47%), Gaps = 49/218 (22%)

Query: 11  VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
           +P+++DWRKK    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C+   
Sbjct: 1   LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60

Query: 71  S-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
           + GC+G   + + EY  +  GLE+E  YPY  +                    DF     
Sbjct: 61  NCGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQF--- 103

Query: 129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DNI 186
                   YK G             Y+          C    L HAVL+VGYG +  +  
Sbjct: 104 --------YKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVENGK 136

Query: 187 PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 223
            YW+V+NSWG    + G+F+I RG NN CGIE      
Sbjct: 137 DYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174



 Score =  169 bits (431), Expect = 2e-48
 Identities = 80/218 (36%), Positives = 104/218 (47%), Gaps = 49/218 (22%)

Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
           +P+++DWRKK    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C+   
Sbjct: 1   LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60

Query: 802 S-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 859
           + GC+G   + + EY  +  GLE+E  YPY  +                    DF     
Sbjct: 61  NCGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQF--- 103

Query: 860 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DNI 917
                   YK G             Y+          C    L HAVL+VGYG +  +  
Sbjct: 104 --------YKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVENGK 136

Query: 918 PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 954
            YW+V+NSWG    + G+F+I RG NN CGIE      
Sbjct: 137 DYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174



 Score =  161 bits (410), Expect = 1e-45
 Identities = 79/220 (35%), Positives = 101/220 (45%), Gaps = 52/220 (23%)

Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
           +P+++DWRKK    P  DQ  CGSCWAFS  G LEG+Y IKTGKLV  S+ QLV+C+   
Sbjct: 1   LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60

Query: 436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
           +   GC+G   +   EY  +  GLE+E  YPY                       DF   
Sbjct: 61  N--CGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQ-- 102

Query: 493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--D 550
                     YK G             Y+          C    L HAVL+VGYG +  +
Sbjct: 103 ---------FYKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVEN 134

Query: 551 DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 589
              YW+ +NSWG    + G+F+I RG NN CGIE      
Sbjct: 135 GKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174



 Score = 79.9 bits (198), Expect = 4e-17
 Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 3/65 (4%)

Query: 961  NDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQ 1017
            +   C    L HAVL+VGYG +  +   YW+V+NSWG    + G+F+I RG NN CGIE 
Sbjct: 110  DHPGCGSGTLDHAVLIVGYGTEVENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEA 169

Query: 1018 IAGYA 1022
                 
Sbjct: 170  SVASY 174


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score =  151 bits (381), Expect = 7e-40
 Identities = 91/305 (29%), Positives = 142/305 (46%), Gaps = 22/305 (7%)

Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQD--------GHKKHERYGTSEFSDRSPEEILCK 709
           F+ F     R Y    E ++R   F+++            H R+G ++F D S  E   +
Sbjct: 38  FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97

Query: 710 TGFKWSERTYERIVADREKVEKMLMEVEKD-GPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
                +   Y    A ++   +   +   D   VPDA DWR+K    P  +Q ACGSCWA
Sbjct: 98  Y---LNGAAY--FAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWA 152

Query: 769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEY--THQAG-LESEK 825
           FS  G +E Q+A+   KLV  S+ QLV C    +GC G     + E+   +  G + +EK
Sbjct: 153 FSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEK 212

Query: 826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHD 884
            YPY + NG+  +C+             ++    SE  M   L K GP+S+ +++     
Sbjct: 213 SYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMS 272

Query: 885 YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA 944
           Y+   +     +C    L H VLLVGY     +PYW+++NSWG    ++G+ ++  G NA
Sbjct: 273 YHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNA 328

Query: 945 CGIEQ 949
           C +  
Sbjct: 329 CLLTG 333



 Score =  151 bits (381), Expect = 8e-40
 Identities = 96/307 (31%), Positives = 146/307 (47%), Gaps = 25/307 (8%)

Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQD--------GHKKHERYGTSEFSDRSPEEILCK 343
           F+ F     R Y    E ++R   F+++            H R+G ++F D S  E   +
Sbjct: 38  FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97

Query: 344 TGFKWSERTYERIVADREKVEKMLMEVEKD-GPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
                +   Y    A ++   +   +   D   VPDA DWR+K    P  +Q ACGSCWA
Sbjct: 98  Y---LNGAAY--FAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWA 152

Query: 403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLE-QPIEY--THQAG-LES 458
           FS  G +E Q+A+   KLV  S+ QLV C    +GCGG  GL  Q  E+   +  G + +
Sbjct: 153 FSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGG--GLMLQAFEWVLRNMNGTVFT 210

Query: 459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLI 517
           EK YPY +GNG+  +C+             ++    SE  M   L K GP+S+ +++   
Sbjct: 211 EKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSF 270

Query: 518 HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGN 577
             Y+   +     +C    L H VLLVGY    ++PYW+ +NSWG    ++G+ ++  G 
Sbjct: 271 MSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGV 326

Query: 578 NACGIEQ 584
           NAC +  
Sbjct: 327 NACLLTG 333



 Score =  140 bits (355), Expect = 2e-36
 Identities = 73/212 (34%), Positives = 109/212 (51%), Gaps = 8/212 (3%)

Query: 11  VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
           VPDA DWR+K    P  +Q  CGSCWAFS  G +E Q+A+   KLV  S+ QLV C    
Sbjct: 126 VPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVD 185

Query: 71  SGCDGCFFEPSIEY--THQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
           +GC G     + E+   +  G + +EK YPY + NG+  +C+             ++   
Sbjct: 186 NGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSME 245

Query: 128 GSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
            SE  M   L K GP+S+ +++     Y+   +     +C    L H VLLVGY     +
Sbjct: 246 SSERVMAAWLAKNGPISIAVDASSFMSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEV 301

Query: 187 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ 218
           PYW+++NSWG    ++G+ ++  G NAC +  
Sbjct: 302 PYWVIKNSWGEDWGEKGYVRVTMGVNACLLTG 333


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score =  144 bits (365), Expect = 1e-36
 Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 54/364 (14%)

Query: 623 LPSLTDRITDQVVARVDTLAIEGSLTFDNENILET---FKAFIVKRGRQYANDEEIKERF 679
            P L     D  V  +  L  +G ++ D +   E    F+ F  K  R++A   E   RF
Sbjct: 88  FPRLDKSKRDSYVDELTRLFKDGYISDDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRF 147

Query: 680 EYFKQD-----GHKKHERY--GTSEFSDRSPEEI--------LCKTGFKWSERTY--ERI 722
             F+ +      HK  E Y    ++FSD + EE         +       S       R 
Sbjct: 148 LTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARH 207

Query: 723 VADREKVEKMLMEVEKDGPVPDA-------WDWRKKNVTGPAGDQAA-CGSCWAFSIAGM 774
           V++   ++ +      D  V D         DWR+ +      DQ   CGSCWAFS  G 
Sbjct: 208 VSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGS 267

Query: 775 LEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANG 834
           +E  Y I   K V+ S+ +LV C  +  GC G + + ++EY    GL S  D PY   +G
Sbjct: 268 VESLYKIYRDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDVPYLAKDG 327

Query: 835 EKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS--DLIHD----YNGT 888
              KC    +K        +L   G + + K L    P  V +    +L+      YNG 
Sbjct: 328 ---KCVVSSTKKVYIDS--YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNG- 380

Query: 889 PIRKNDETCSPYDLGHAVLLV--GYGKQDNIPYWLVRNSWGPIGPDEGFFKIER---GNN 943
                   C    L HAVLLV  GY ++    YW+++NSWG    + G+ ++ER   G +
Sbjct: 381 -------ECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTD 432

Query: 944 ACGI 947
            CGI
Sbjct: 433 KCGI 436



 Score =  140 bits (355), Expect = 2e-35
 Identities = 103/366 (28%), Positives = 148/366 (40%), Gaps = 57/366 (15%)

Query: 257 LPSLTDRITDQVVARVDTLAIEGSLTFDNENILET---FKAFIVKRGRQYANDEEIKERF 313
            P L     D  V  +  L  +G ++ D +   E    F+ F  K  R++A   E   RF
Sbjct: 88  FPRLDKSKRDSYVDELTRLFKDGYISDDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRF 147

Query: 314 EYFKQD-----GHKKHERY--GTSEFSDRSPEEI--------LCKTGFKWSERTY--ERI 356
             F+ +      HK  E Y    ++FSD + EE         +       S       R 
Sbjct: 148 LTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARH 207

Query: 357 VADREKVEKMLMEVEKDGPVPDA-------WDWRKKNVTGPAGDQAA-CGSCWAFSIAGM 408
           V++   ++ +      D  V D         DWR+ +      DQ   CGSCWAFS  G 
Sbjct: 208 VSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGS 267

Query: 409 LEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGL-EQPIEYTHQAGLESEKDYPYRNG 467
           +E  Y I   K V+ S+ +LV C  +  GC G  G  +  +EY    GL S  D PY   
Sbjct: 268 VESLYKIYRDKSVDLSEQELVNCDTKSQGCSG--GYPDTALEYVKNKGLSSSSDVPYL-- 323

Query: 468 NGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSH--LIHF----YN 521
             +  KC    +K        +L   G + + K L    P  V +     L+ +    YN
Sbjct: 324 -AKDGKCVVSSTKKVYIDS--YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYN 379

Query: 522 GTPIRKNDETCSPYDLGHAVLLV--GYGKQDDIPYWLARNSWGPIGPDEGFFKIER---G 576
           G         C    L HAVLLV  GY ++    YW+ +NSWG    + G+ ++ER   G
Sbjct: 380 G--------ECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEG 430

Query: 577 NNACGI 582
            + CGI
Sbjct: 431 TDKCGI 436



 Score =  128 bits (324), Expect = 2e-31
 Identities = 71/213 (33%), Positives = 101/213 (47%), Gaps = 27/213 (12%)

Query: 16  DWRKKNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCD 74
           DWR+ +      DQ  +CGSCWAFS  G +E  Y I   K V+ S+ +LV C  +  GC 
Sbjct: 239 DWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQELVNCDTKSQGCS 298

Query: 75  GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKK 134
           G + + ++EY    GL S  D PY   +G   KC    +K        +L   G + + K
Sbjct: 299 GGYPDTALEYVKNKGLSSSSDVPYLAKDG---KCVVSSTKKVYIDS--YLVAKGKDVLNK 353

Query: 135 ILYKYGPLSVLLNS--DLIHD----YNGTPIRKNDETCSPYDLGHAVLLV--GYGKQDNI 186
            L    P  V +    +L+      YNG         C    L HAVLLV  GY ++   
Sbjct: 354 SLV-ISPTVVYIAVSRELLKYKSGVYNG--------ECGKS-LNHAVLLVGEGYDEKTKK 403

Query: 187 PYWLVRNSWGPIGPDEGFFKIER---GNNACGI 216
            YW+++NSWG    + G+ ++ER   G + CGI
Sbjct: 404 RYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436



 Score = 54.7 bits (132), Expect = 2e-07
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 6/56 (10%)

Query: 965  CSPYDLGHAVLLVG--YGKQDDIPYWLVRNSWGPIGPDEGFFKIER---GNNACGI 1015
            C    L HAVLLVG  Y ++    YW+++NSWG    + G+ ++ER   G + CGI
Sbjct: 382  CGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score =  141 bits (358), Expect = 1e-35
 Identities = 103/319 (32%), Positives = 151/319 (47%), Gaps = 46/319 (14%)

Query: 652 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHE-------RYGTSEFSDRS 702
           EN+  +F  FI + G++Y   +E+++R+  F ++  K   H        + G + F D S
Sbjct: 164 ENV-NSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLS 222

Query: 703 PEEI------LCKTGFKWSERTYERIVADREKVEKMLMEVEKDGPVPD-AWDWRKKNVTG 755
            EE       L    FK S       V + + V K      KD       +DWR  N   
Sbjct: 223 FEEFKKKYLTLKSFDFK-SNGKKSPRVINYDDVIKKYKP--KDATFDHAKYDWRLHNGVT 279

Query: 756 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF----FEP 811
           P  DQ  CGSCWAFS  G++E QYAI+  +LV  S+ +LV+C+ + +GC G      FE 
Sbjct: 280 PVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339

Query: 812 SIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYG 871
            IE     GL SE DYPY +   E   C  D+ K K +  K ++     +  K+ +   G
Sbjct: 340 MIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-EDKFKEAIRFLG 392

Query: 872 PLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP----------YW 920
           P+SV +  SD    Y G      D  C   +  HAV+LVGYG ++             Y+
Sbjct: 393 PISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYYY 448

Query: 921 LVRNSWGPIGPDEGFFKIE 939
           +++NSWG    ++GF +IE
Sbjct: 449 IIKNSWGESWGEKGFIRIE 467



 Score =  133 bits (337), Expect = 7e-33
 Identities = 102/319 (31%), Positives = 150/319 (47%), Gaps = 45/319 (14%)

Query: 286 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHE-------RYGTSEFSDRS 336
           EN+  +F  FI + G++Y   +E+++R+  F ++  K   H        + G + F D S
Sbjct: 164 ENV-NSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLS 222

Query: 337 PEEI------LCKTGFKWSERTYERIVADREKVEKMLMEVEKDGPVPD-AWDWRKKNVTG 389
            EE       L    FK S       V + + V K      KD       +DWR  N   
Sbjct: 223 FEEFKKKYLTLKSFDFK-SNGKKSPRVINYDDVIKKYKP--KDATFDHAKYDWRLHNGVT 279

Query: 390 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG---CDGLEQ 446
           P  DQ  CGSCWAFS  G++E QYAI+  +LV  S+ +LV+C+ + +GC G    +  E 
Sbjct: 280 PVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339

Query: 447 PIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG 506
            IE     GL SE DYPY +   E   C  D+ K K +  K ++     +  K+ +   G
Sbjct: 340 MIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-EDKFKEAIRFLG 392

Query: 507 PLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP----------YW 555
           P+SV +  S    FY G      D  C   +  HAV+LVGYG ++             Y+
Sbjct: 393 PISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYYY 448

Query: 556 LARNSWGPIGPDEGFFKIE 574
           + +NSWG    ++GF +IE
Sbjct: 449 IIKNSWGESWGEKGFIRIE 467



 Score =  125 bits (315), Expect = 5e-30
 Identities = 76/210 (36%), Positives = 109/210 (51%), Gaps = 26/210 (12%)

Query: 14  AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
            +DWR  N   P  DQ +CGSCWAFS  G++E QYAI+  +LV  S+ +LV+C+ + +GC
Sbjct: 269 KYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGC 328

Query: 74  DGCF----FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
            G      FE  IE     GL SE DYPY +   E   C  D+ K K +  K ++     
Sbjct: 329 YGGLIPNAFEDMIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-E 381

Query: 130 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP- 187
           +  K+ +   GP+SV +  SD    Y G      D  C   +  HAV+LVGYG ++    
Sbjct: 382 DKFKEAIRFLGPISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNS 437

Query: 188 ---------YWLVRNSWGPIGPDEGFFKIE 208
                    Y++++NSWG    ++GF +IE
Sbjct: 438 DTKKMEKRYYYIIKNSWGESWGEKGFIRIE 467


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score =  134 bits (340), Expect = 2e-35
 Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 31/242 (12%)

Query: 12  PDAWDWR----KKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 61
           P ++DW       N   P  +Q  CGSC+AF+    LE +  I + K          S  
Sbjct: 2   PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61

Query: 62  QLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGK 121
            ++ C++   GCDG F     ++    G+ +E  +PY   +     C    S+ + +   
Sbjct: 62  HVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYYFS 119

Query: 122 DFLHFNG------SETMKKILYKYGPLSVLL--NSDLIHDYNGT-PIRKNDETCS----- 167
           D+ +  G       + MK  +Y+ GP+ V     SD      G      NDE        
Sbjct: 120 DYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDN 179

Query: 168 --PYDL-GHAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
             P++L  HAVLLVG+G+ +     YW+V+NSWG    ++G+FKI RG N CGIE  A +
Sbjct: 180 FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239

Query: 223 AT 224
           A 
Sbjct: 240 AY 241



 Score =  134 bits (340), Expect = 2e-35
 Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 31/242 (12%)

Query: 743 PDAWDWR----KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 792
           P ++DW       N   P  +Q  CGSC+AF+    LE +  I + K          S  
Sbjct: 2   PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61

Query: 793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGK 852
            ++ C++   GCDG F     ++    G+ +E  +PY   +     C    S+ + +   
Sbjct: 62  HVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYYFS 119

Query: 853 DFLHFNG------SETMKKILYKYGPLSVLL--NSDLIHDYNGT-PIRKNDETCS----- 898
           D+ +  G       + MK  +Y+ GP+ V     SD      G      NDE        
Sbjct: 120 DYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDN 179

Query: 899 --PYDL-GHAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
             P++L  HAVLLVG+G+ +     YW+V+NSWG    ++G+FKI RG N CGIE  A +
Sbjct: 180 FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239

Query: 954 AT 955
           A 
Sbjct: 240 AY 241



 Score =  128 bits (324), Expect = 3e-33
 Identities = 71/244 (29%), Positives = 109/244 (44%), Gaps = 34/244 (13%)

Query: 377 PDAWDWR----KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 426
           P ++DW       N   P  +Q  CGSC+AF+    LE +  I + K          S  
Sbjct: 2   PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61

Query: 427 QLVECAKQCSGCGGCDGLEQPI-EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFT 485
            ++ C++   GC G  G    + ++    G+ +E  +PY   +     C    S+ + + 
Sbjct: 62  HVLSCSQYSQGCDG--GFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYY 117

Query: 486 GKDFLYFNG------SETMKKILYKYGPLSVGL--NSHLIHFYNGT-PIRKNDETCS--- 533
             D+ Y  G       + MK  +Y+ GP+ V     S    +  G      NDE      
Sbjct: 118 FSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDN 177

Query: 534 ----PYDL-GHAVLLVGYGKQDD--IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
               P++L  HAVLLVG+G+ +     YW+ +NSWG    ++G+FKI RG N CGIE  A
Sbjct: 178 DNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQA 237

Query: 587 GYAT 590
            +A 
Sbjct: 238 VFAY 241


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score =  119 bits (300), Expect = 3e-30
 Identities = 60/234 (25%), Positives = 97/234 (41%), Gaps = 32/234 (13%)

Query: 12  PDAWDWRKK----NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 65
           P+++D R+K       G   DQ +CGSCWAFS       +  I++   + V  S   L+ 
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60

Query: 66  CAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF------------------ 106
           C   C  GC+G + + + +Y    G+ +    PY                          
Sbjct: 61  CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQ 120

Query: 107 -KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKND 163
             C     + K      +   +    + K +   GP+     +  D ++ Y     +   
Sbjct: 121 DGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-YKSGVYQH-- 177

Query: 164 ETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
            T      GHAV ++G+G ++ +PYWL  NSWG    + G+F+I RG+N CGIE
Sbjct: 178 -TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230



 Score =  118 bits (298), Expect = 6e-30
 Identities = 60/234 (25%), Positives = 96/234 (41%), Gaps = 32/234 (13%)

Query: 743 PDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 796
           P+++D R+K       G   DQ  CGSCWAFS       +  I++   + V  S   L+ 
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60

Query: 797 CAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF------------------ 837
           C   C  GC+G + + + +Y    G+ +    PY                          
Sbjct: 61  CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQ 120

Query: 838 -KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKND 894
             C     + K      +   +    + K +   GP+     +  D ++ Y     +   
Sbjct: 121 DGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-YKSGVYQH-- 177

Query: 895 ETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
            T      GHAV ++G+G ++ +PYWL  NSWG    + G+F+I RG+N CGIE
Sbjct: 178 -TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230



 Score =  116 bits (292), Expect = 4e-29
 Identities = 64/235 (27%), Positives = 99/235 (42%), Gaps = 33/235 (14%)

Query: 377 PDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 430
           P+++D R+K       G   DQ  CGSCWAFS       +  I++   + V  S   L+ 
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60

Query: 431 CAKQCSGCG-GCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKF--------------- 472
           C   CSGCG GC+G   +   +Y    G+ +    PY                       
Sbjct: 61  C---CSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTP 117

Query: 473 ----KCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKN 528
                C     + K      +   +    + K +   GP+      +    Y  + + ++
Sbjct: 118 KCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQH 177

Query: 529 DETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
             T      GHAV ++G+G ++ +PYWLA NSWG    + G+F+I RG+N CGIE
Sbjct: 178 --TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230



 Score = 76.9 bits (190), Expect = 1e-15
 Identities = 24/46 (52%), Positives = 34/46 (73%)

Query: 971  GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
            GHAV ++G+G ++ +PYWL  NSWG    + G+F+I RG+N CGIE
Sbjct: 185  GHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score =  114 bits (286), Expect = 2e-28
 Identities = 57/209 (27%), Positives = 83/209 (39%), Gaps = 17/209 (8%)

Query: 15  WDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ--- 69
            D R   +T P  +Q   GSCWAF+ A  LE  Y IK G  + V+ S   L  CA     
Sbjct: 2   VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60

Query: 70  ---CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
               S   G      ++     G+  E+DYPY   +  +   +           KD+  +
Sbjct: 61  GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120

Query: 125 HFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIR--KNDETCSPYDLGHAVLLVGY 180
             N  E +K+ L K GP+     + S       G                 GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180

Query: 181 GKQ--DNIPYWLVRNSWGPIGPDEGFFKI 207
                +    ++V+NSWG    D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209



 Score =  113 bits (285), Expect = 2e-28
 Identities = 59/209 (28%), Positives = 87/209 (41%), Gaps = 16/209 (7%)

Query: 380 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ-CS 436
            D R   +T P  +Q + GSCWAF+ A  LE  Y IK G  + V+ S   L  CA   C 
Sbjct: 2   VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60

Query: 437 GCG-GCDG---LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
           G    CDG   L   ++     G+  E+DYPY   +  +   +           KD+  +
Sbjct: 61  GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120

Query: 491 YFNGSETMKKILYKYGPLSVGLNSHLIHFY----NGTPIRKNDETCSPYDLGHAVLLVGY 546
             N  E +K+ L K GP+  G + +                          GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180

Query: 547 GKQ--DDIPYWLARNSWGPIGPDEGFFKI 573
                +    ++ +NSWG    D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209



 Score =  112 bits (283), Expect = 4e-28
 Identities = 57/209 (27%), Positives = 84/209 (40%), Gaps = 17/209 (8%)

Query: 746 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ--- 800
            D R   +T P  +Q + GSCWAF+ A  LE  Y IK G  + V+ S   L  CA     
Sbjct: 2   VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60

Query: 801 ---CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
               S   G      ++     G+  E+DYPY   +  +   +           KD+  +
Sbjct: 61  GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120

Query: 856 HFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIR--KNDETCSPYDLGHAVLLVGY 911
             N  E +K+ L K GP+     + S       G                 GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180

Query: 912 GKQ--DNIPYWLVRNSWGPIGPDEGFFKI 938
                +    ++V+NSWG    D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209



 Score = 52.1 bits (125), Expect = 3e-07
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)

Query: 959  VKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKI 1006
            +           GHAV++VGY     +    ++V+NSWG    D G+ +I
Sbjct: 160  IVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 87.1 bits (216), Expect = 5e-19
 Identities = 59/225 (26%), Positives = 96/225 (42%), Gaps = 31/225 (13%)

Query: 376 VPDAWDWRKKNVTG-----PAGDQ---AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQ 427
           +P +WDWR  NV G     P  +Q     CGSCWA      L  +  I   +   +    
Sbjct: 1   LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADR--INIARKGAWPSVY 56

Query: 428 L-VECAKQCSGCGGCDGLE--QPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKS----- 479
           L V+    C+G G C G +     EY H+ G+  E   PY+  +GE        +     
Sbjct: 57  LSVQVVIDCAGGGSCHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFG 116

Query: 480 ---KVKLFTG---KDFLYFNGSETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKNDETC 532
               +K +T     D+   +G + M   +Y  GP+S G+ +   +  Y G   ++  +  
Sbjct: 117 ECFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDP 176

Query: 533 SPYDLGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERG 576
                 H + + G+G  ++ + YW+ RNSWG    + G+F+I   
Sbjct: 177 LI---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218



 Score = 85.2 bits (211), Expect = 3e-18
 Identities = 60/224 (26%), Positives = 97/224 (43%), Gaps = 30/224 (13%)

Query: 742 VPDAWDWRKKNVTG-----PAGDQ---AACGSCWAFSIAGMLEGQYAIKT---GKLVEFS 790
           +P +WDWR  NV G     P  +Q     CGSCWA      L  +  I        V  S
Sbjct: 1   LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLS 58

Query: 791 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKS------ 844
              +++CA   S C G       EY H+ G+  E   PY+  +GE        +      
Sbjct: 59  VQVVIDCAGGGS-CHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGE 117

Query: 845 --KVKLFTG---KDFLHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCS 898
              +K +T     D+   +G + M   +Y  GP+S  ++ ++ + +Y G   ++  +   
Sbjct: 118 CFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPL 177

Query: 899 PYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERG 941
                H + + G+G  +N + YW+VRNSWG    + G+F+I   
Sbjct: 178 I---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218



 Score = 84.4 bits (209), Expect = 4e-18
 Identities = 60/224 (26%), Positives = 97/224 (43%), Gaps = 30/224 (13%)

Query: 11  VPDAWDWRKKNVTG-----PAGDQ---ADCGSCWAFSIAGMLEGQYAIKT---GKLVEFS 59
           +P +WDWR  NV G     P  +Q     CGSCWA      L  +  I        V  S
Sbjct: 1   LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLS 58

Query: 60  KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKS------ 113
              +++CA   S C G       EY H+ G+  E   PY+  +GE        +      
Sbjct: 59  VQVVIDCAGGGS-CHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGE 117

Query: 114 --KVKLFTG---KDFLHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCS 167
              +K +T     D+   +G + M   +Y  GP+S  ++ ++ + +Y G   ++  +   
Sbjct: 118 CFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPL 177

Query: 168 PYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERG 210
                H + + G+G  +N + YW+VRNSWG    + G+F+I   
Sbjct: 178 I---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218



 Score = 45.5 bits (108), Expect = 5e-05
 Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 972  HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERG 1009
            H + + G+G  ++ + YW+VRNSWG    + G+F+I   
Sbjct: 180  HIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 85.3 bits (211), Expect = 4e-17
 Identities = 60/250 (24%), Positives = 93/250 (37%), Gaps = 44/250 (17%)

Query: 741 PVPDAWDWRKKNVTG--------PAGDQAACGSC------WAFSIAGMLEGQYAIKTGKL 786
           P P AW W   +V G        PA     C S        A     M+        G+ 
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261

Query: 787 VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDY--PYKNANGEKFKCAYDKS 844
              S   +++C++   GC G F E   ++    G+ +   Y  PY + +G +  C   + 
Sbjct: 262 TFLSARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRP 321

Query: 845 KVKLF------TGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI---HDYNGT----- 888
             + +       G  +      + +   +Y++GP+  SV  NSD      +         
Sbjct: 322 SRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVS 381

Query: 889 ----PIRKNDETCSPY---DLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGP--DEGFFKI 938
                    D     Y   ++ H VL++G+G  +N   YWLV + WG      D G  KI
Sbjct: 382 LDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKI 441

Query: 939 ERGNNACGIE 948
            RG NA  IE
Sbjct: 442 ARGVNAYNIE 451



 Score = 85.3 bits (211), Expect = 5e-17
 Identities = 60/250 (24%), Positives = 93/250 (37%), Gaps = 44/250 (17%)

Query: 10  PVPDAWDWRKKNVTG--------PAGDQADCGSC------WAFSIAGMLEGQYAIKTGKL 55
           P P AW W   +V G        PA     C S        A     M+        G+ 
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261

Query: 56  VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDY--PYKNANGEKFKCAYDKS 113
              S   +++C++   GC G F E   ++    G+ +   Y  PY + +G +  C   + 
Sbjct: 262 TFLSARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRP 321

Query: 114 KVKLF------TGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI---HDYNGT----- 157
             + +       G  +      + +   +Y++GP+  SV  NSD      +         
Sbjct: 322 SRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVS 381

Query: 158 ----PIRKNDETCSPY---DLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGP--DEGFFKI 207
                    D     Y   ++ H VL++G+G  +N   YWLV + WG      D G  KI
Sbjct: 382 LDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKI 441

Query: 208 ERGNNACGIE 217
            RG NA  IE
Sbjct: 442 ARGVNAYNIE 451



 Score = 70.3 bits (172), Expect = 3e-12
 Identities = 57/252 (22%), Positives = 95/252 (37%), Gaps = 47/252 (18%)

Query: 375 PVPDAWDWRKKNVTG--------PAGDQAACGSC------WAFSIAGMLEGQYAIKTGKL 420
           P P AW W   +V G        PA     C S        A     M+        G+ 
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261

Query: 421 VEFSKSQLVECAKQCSGCGGCDGL-EQPIEYTHQAGLESEKDY--PYRNGNGEKFKCAYD 477
              S   +++C++   GC G  G  E+  ++    G+ +   Y  PY +G+G +  C   
Sbjct: 262 TFLSARHVLDCSQYGQGCAG--GFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTR 319

Query: 478 KSKVK-LFTGKDFL--YF---NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDET 531
           +   +  FT    L  Y+      + +   +Y++GP+   + ++   +       ++   
Sbjct: 320 RPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRY 379

Query: 532 CSPYD-----------------LGHAVLLVGYGK-QDDIPYWLARNSWGPIGP--DEGFF 571
            S  D                 + H VL++G+G  ++   YWL  + WG      D G  
Sbjct: 380 VSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTR 439

Query: 572 KIERGNNACGIE 583
           KI RG NA  IE
Sbjct: 440 KIARGVNAYNIE 451



 Score = 43.7 bits (103), Expect = 5e-04
 Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 3/51 (5%)

Query: 969  DLGHAVLLVGYGK-QDDIPYWLVRNSWGPIGP--DEGFFKIERGNNACGIE 1016
            ++ H VL++G+G  ++   YWLV + WG      D G  KI RG NA  IE
Sbjct: 401  NVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIE 451


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 56.9 bits (137), Expect = 4e-08
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 972  HAVLLVGYGKQDD----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
            HA++LVG+G+++       YW+ RNSWG     EG+FKI RG N  GIE
Sbjct: 620  HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668



 Score = 56.5 bits (136), Expect = 6e-08
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 173 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
           HA++LVG+G+++       YW+ RNSWG     EG+FKI RG N  GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668



 Score = 56.5 bits (136), Expect = 6e-08
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 904 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
           HA++LVG+G+++       YW+ RNSWG     EG+FKI RG N  GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668



 Score = 56.1 bits (135), Expect = 8e-08
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 539 HAVLLVGYGKQDD----IPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
           HA++LVG+G+++       YW+ RNSWG     EG+FKI RG N  GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 49.5 bits (118), Expect = 6e-06
 Identities = 49/220 (22%), Positives = 73/220 (33%), Gaps = 29/220 (13%)

Query: 11  VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
           +P  +D R +    P  DQ   GSCWAF+    LE  Y            +         
Sbjct: 99  LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157

Query: 71  S-GCD------GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 123
             G D      G     +   T  +G   E D PY   +        +    K       
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSY---FSPTNLPVTKHVQEAQI 214

Query: 124 L-HFNGS---ETMKKILYKYGPLSVLLNSDLIHDY-NGTPIRKNDETCSPYDLGHAVLLV 178
           +           +K +   YG +S  +  D  +      P    D      + GHAVL+V
Sbjct: 215 IPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVLIV 271

Query: 179 GYG---KQDNIPY-------WLVRNSWGPIGPDEGFFKIE 208
           GY      +N  Y       ++++NSWG    + G+F I 
Sbjct: 272 GYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311



 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 49/220 (22%), Positives = 74/220 (33%), Gaps = 29/220 (13%)

Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
           +P  +D R +    P  DQ + GSCWAF+    LE  Y            +         
Sbjct: 99  LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157

Query: 802 S-GCD------GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 854
             G D      G     +   T  +G   E D PY   +        +    K       
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSY---FSPTNLPVTKHVQEAQI 214

Query: 855 L-HFNGS---ETMKKILYKYGPLSVLLNSDLIHDY-NGTPIRKNDETCSPYDLGHAVLLV 909
           +           +K +   YG +S  +  D  +      P    D      + GHAVL+V
Sbjct: 215 IPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVLIV 271

Query: 910 GYG---KQDNIPY-------WLVRNSWGPIGPDEGFFKIE 939
           GY      +N  Y       ++++NSWG    + G+F I 
Sbjct: 272 GYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311



 Score = 40.6 bits (95), Expect = 0.003
 Identities = 46/222 (20%), Positives = 72/222 (32%), Gaps = 32/222 (14%)

Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
           +P  +D R +    P  DQ + GSCWAF+    LE  Y            +         
Sbjct: 99  LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157

Query: 436 S------GCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF 489
                     G +        T  +G   E D PY   +   +    +    K    ++ 
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPY---SENSYFSPTNLPVTK--HVQEA 212

Query: 490 LYFNG------SETMKKILYKYGPLSVGLNSHLIHFY-NGTPIRKNDETCSPYDLGHAVL 542
                      +  +K +   YG +S  +     +      P    D      + GHAVL
Sbjct: 213 QIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVL 269

Query: 543 LVGYGKQDDIPY----------WLARNSWGPIGPDEGFFKIE 574
           +VGY    DI            ++ +NSWG    + G+F I 
Sbjct: 270 IVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311



 Score = 34.8 bits (80), Expect = 0.20
 Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 10/47 (21%)

Query: 971  GHAVLLVGYGKQDDIPY----------WLVRNSWGPIGPDEGFFKIE 1007
            GHAVL+VGY    DI            ++++NSWG    + G+F I 
Sbjct: 265  GHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 48.5 bits (115), Expect = 2e-05
 Identities = 21/41 (51%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 173 HAVLLVGYGKQDN-----IPYWLVRNSWGPIGPDEGFFKIE 208
           HAV +VGYG   N       YW+VRNSWG    DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763



 Score = 48.5 bits (115), Expect = 2e-05
 Identities = 21/41 (51%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 904 HAVLLVGYGKQDN-----IPYWLVRNSWGPIGPDEGFFKIE 939
           HAV +VGYG   N       YW+VRNSWG    DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763



 Score = 48.1 bits (114), Expect = 2e-05
 Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 5/41 (12%)

Query: 972  HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIE 1007
            HAV +VGYG     + +   YW+VRNSWG    DEG+FK++
Sbjct: 723  HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763



 Score = 46.2 bits (109), Expect = 1e-04
 Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 5/41 (12%)

Query: 539 HAVLLVGYG-----KQDDIPYWLARNSWGPIGPDEGFFKIE 574
           HAV +VGYG     + +   YW+ RNSWG    DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763


>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
           (I29).  This domain is found at the N-terminus of some
           C1 peptidases such as Cathepsin L where it acts as a
           propeptide. There are also a number of proteins that are
           composed solely of multiple copies of this domain such
           as the peptidase inhibitor salarin. This family is
           classified as I29 by MEROPS. Peptide proteinase
           inhibitors can be found as single domain proteins or as
           single or multiple domains within proteins; these are
           referred to as either simple or compound inhibitors,
           respectively. In many cases they are synthesised as part
           of a larger precursor protein, either as a prepropeptide
           or as an N-terminal domain associated with an inactive
           peptidase or zymogen. This domain prevents access of the
           substrate to the active site. Removal of the N-terminal
           inhibitor domain either by interaction with a second
           peptidase or by autocatalytic cleavage activates the
           zymogen. Other inhibitors interact direct with
           proteinases using a simple noncovalent lock and key
           mechanism; while yet others use a conformational
           change-based trapping mechanism that depends on their
           structural and thermodynamic properties.
          Length = 57

 Score = 41.5 bits (98), Expect = 5e-05
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 9/57 (15%)

Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEE 339
           F+ +  K G+ Y+++EE   RF  FK+     + H K      + G ++FSD +PEE
Sbjct: 1   FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57



 Score = 41.5 bits (98), Expect = 5e-05
 Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 9/57 (15%)

Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEE 705
           F+ +  K G+ Y+++EE   RF  FK+     + H K      + G ++FSD +PEE
Sbjct: 1   FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57


>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
           (I29).  This domain is found at the N-terminus of some
           C1 peptidases such as Cathepsin L where it acts as a
           propeptide. There are also a number of proteins that are
           composed solely of multiple copies of this domain such
           as the peptidase inhibitor salarin. This family is
           classified as I29 by MEROPS.
          Length = 58

 Score = 39.1 bits (92), Expect = 4e-04
 Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 9/58 (15%)

Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEEI 340
           F+ +  K G+ Y ++EE   RF+ FK+     + H K        G ++F+D + EE 
Sbjct: 1   FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58



 Score = 39.1 bits (92), Expect = 4e-04
 Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 9/58 (15%)

Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEEI 706
           F+ +  K G+ Y ++EE   RF+ FK+     + H K        G ++F+D + EE 
Sbjct: 1   FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58


>gnl|CDD|240447 cd13432, LDT_IgD_like_2, IgD-like repeat domain of mycobacterial
           L,D-transpeptidases.  Immunoglobulin-like domain found
           in actinobacterial L,D-transpeptidases, including
           Mycobacterium tuberculosis LdtMt2, which is a
           non-classical transpeptidase that generates 3->3
           transpeptide linkages. LdtMt2 is associated with
           virulence and resistance to amoxicillin. This domain may
           occur in a tandem-repeat arrangement and is found
           N-terminal to the catalytic L,D-transpeptidase domain;
           this model represents the  repeat adjacent to the
           catalytic domain.
          Length = 99

 Score = 29.0 bits (66), Expect = 3.1
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 9/38 (23%)

Query: 357 VADREKVEKMLMEVEKDGPVPDAW--------DWRKKN 386
           V DR  VEK  ++V    PV  AW         WR K 
Sbjct: 27  VTDRAAVEKA-LKVTTSPPVEGAWYWLSDREVHWRPKE 63



 Score = 29.0 bits (66), Expect = 3.1
 Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 9/38 (23%)

Query: 723 VADREKVEKMLMEVEKDGPVPDAW--------DWRKKN 752
           V DR  VEK  ++V    PV  AW         WR K 
Sbjct: 27  VTDRAAVEKA-LKVTTSPPVEGAWYWLSDREVHWRPKE 63


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.433 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 52,786,039
Number of extensions: 5260080
Number of successful extensions: 4485
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4343
Number of HSP's successfully gapped: 77
Length of query: 1026
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 919
Effective length of database: 6,191,724
Effective search space: 5690194356
Effective search space used: 5690194356
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)