RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2558
         (348 letters)



>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  282 bits (725), Expect = 1e-95
 Identities = 109/222 (49%), Positives = 130/222 (58%), Gaps = 17/222 (7%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP +FDWRE  AVT VKDQ  CGS WAFS  G +EG Y  KT KLVSLSEQ+L+DCD  +
Sbjct: 1   LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGN 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVS-RD 243
           +GC GG   NAF+ I     GG+  E  YPY   D  C+  K  +   KI GY  V   D
Sbjct: 61  NGCNGGLPDNAFEYIKKN--GGIVTESDYPYTAHDGTCKFKKSNSKYAKIKGYGDVPYND 118

Query: 244 ETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           E  +   L +NGP++VAI+AY    Q Y +GV    +  C G    L H+VLIVGYG + 
Sbjct: 119 EEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE--CSG---ELDHAVLIVGYGTE- 172

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
                  VPYWI+KNSWG  WGE GYFR+ RG   CGI    
Sbjct: 173 -----NGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEA 209


>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  279 bits (716), Expect = 2e-94
 Identities = 106/220 (48%), Positives = 130/220 (59%), Gaps = 16/220 (7%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-D 185
           P + DWRE  AVT VKDQ  CGS WAFST G +EG YA KT KLVSLSEQ+L+DC    +
Sbjct: 1   PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGN 60

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDE 244
           +GC GG+  NAF+ + +   GGL  E  YPY G D  C+ N      KI GY +V   DE
Sbjct: 61  NGCNGGNPDNAFEYVKN---GGLASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDE 117

Query: 245 TDMAKYLVENGPMAVAINAY-ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
             +   L   GP++VAI+A  + QFY  G+           N NL+H+VL+VGYG +   
Sbjct: 118 EALKAALANYGPVSVAIDASSSFQFYKGGIYSG----PCCSNTNLNHAVLLVGYGTE--- 170

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
                V YWI+KNSWG  WGEKGY R+ RG   CGI  Y 
Sbjct: 171 ---NGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYA 207


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score =  234 bits (599), Expect = 5e-75
 Identities = 130/321 (40%), Positives = 175/321 (54%), Gaps = 26/321 (8%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           F A+YL     F     +A    R   A +    +P A DWRE  AVT VK+Q  CGS W
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLS--AVPDAVDWREKGAVTPVKNQGACGSCW 151

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AFS  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  E
Sbjct: 152 AFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTE 211

Query: 212 KTYPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           K+YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +  
Sbjct: 212 KSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFM 271

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY
Sbjct: 272 SYHSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGY 319

Query: 328 FRLYRGDGSCGINDYVRSALV 348
            R+  G  +C +  Y  S  V
Sbjct: 320 VRVTMGVNACLLTGYPVSVHV 340


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  217 bits (555), Expect = 1e-70
 Identities = 95/220 (43%), Positives = 112/220 (50%), Gaps = 51/220 (23%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP +FDWR+  AVT VKDQ  CGS WAFS TG +EG Y  KT KLVSLSEQ+L+DC    
Sbjct: 1   LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60

Query: 186 D-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
           + GC GG   NAF+ I     GGLE E  YPY G                          
Sbjct: 61  NCGCNGGLPDNAFEYIKKN--GGLETESCYPYTG-------------------------- 92

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
                        +VAI+A   QFY +G+       C  G+  L H+VLIVGYG +    
Sbjct: 93  -------------SVAIDASDFQFYKSGIYDHPG--C--GSGTLDHAVLIVGYGTEVE-- 133

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYV 343
                 YWI+KNSWG  WGE GYFR+ RG +  CGI   V
Sbjct: 134 --NGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASV 171


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score =  204 bits (521), Expect = 1e-61
 Identities = 121/319 (37%), Positives = 168/319 (52%), Gaps = 39/319 (12%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +++   F  F+++H K Y T  E   R   F  NL KI    + E+     G+N F DLS
Sbjct: 163 LENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLS 222

Query: 96  TAEFQAKYL---GFKLKP---------SYAD---RSVPAMIPNITLPR-AFDWREYDAVT 139
             EF+ KYL    F  K          +Y D   +  P    + T     +DWR ++ VT
Sbjct: 223 FEEFKKKYLTLKSFDFKSNGKKSPRVINYDDVIKKYKPK---DATFDHAKYDWRLHNGVT 279

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ  CGS WAFST G +E  YA +  +LVSLSEQEL+DC  +++GC GG I NAF+ 
Sbjct: 280 PVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339

Query: 200 IMSKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMA-KYLVENGPM 257
           ++    GGL  E  YPY  D  + C +++   + KI  YVS+  D+   A ++L   GP+
Sbjct: 340 MIEL--GGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIPEDKFKEAIRFL---GPI 394

Query: 258 AVAINAY-ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-----TKFTHKAVPY 311
           +V+I       FY  G       F     E  +H+V++VGYG++      TK   K   Y
Sbjct: 395 SVSIAVSDDFAFYKGG------IFDGECGEEPNHAVILVGYGMEEIYNSDTKKMEKRY-Y 447

Query: 312 WIIKNSWGEGWGEKGYFRL 330
           +IIKNSWGE WGEKG+ R+
Sbjct: 448 YIIKNSWGESWGEKGFIRI 466


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score =  183 bits (466), Expect = 5e-54
 Identities = 111/334 (33%), Positives = 153/334 (45%), Gaps = 53/334 (15%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEF 99
           F  F +++N+ +AT  E  +R   F  N  +++    +  G   Y   +N+FSDL+  EF
Sbjct: 126 FEEFNKKYNRKHATHAERLNRFLTFRNNYLEVK----SHKGDEPYSKEINKFSDLTEEEF 181

Query: 100 QAKYLGFKLKPSYADRSV-----------PAMIPNITLPRAF---------------DWR 133
           +  +   K+ P     S            P  + N+   +                 DWR
Sbjct: 182 RKLFPVIKVPPKSNSTSHNNDFKARHVSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWR 241

Query: 134 EYDAVTGVKDQ-TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
             DAVT VKDQ   CGS WAFS+ G++E +Y     K V LSEQEL++CD +  GC GG 
Sbjct: 242 RADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQELVNCDTKSQGCSGGY 301

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
              A + + +K   GL      PY   D  C       +V I+ Y  V++ +  + K LV
Sbjct: 302 PDTALEYVKNK---GLSSSSDVPYLAKDGKCV-VSSTKKVYIDSY-LVAKGKDVLNKSLV 356

Query: 253 ENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
              P  V I     L  Y +GV      +     ++L+H+VL+VG G D    T K   Y
Sbjct: 357 -ISPTVVYIAVSRELLKYKSGV------YNGECGKSLNHAVLLVGEGYDEK--TKKR--Y 405

Query: 312 WIIKNSWGEGWGEKGYFRLYR---GDGSCGINDY 342
           WIIKNSWG  WGE GY RL R   G   CGI   
Sbjct: 406 WIIKNSWGTDWGENGYMRLERTNEGTDKCGILTV 439


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score =  143 bits (363), Expect = 4e-41
 Identities = 81/247 (32%), Positives = 120/247 (48%), Gaps = 39/247 (15%)

Query: 126 LPRAFDWRE----YDAVTGVKDQTMCGSSWAFSTTGNIEG---VYAAKTKKLVS---LSE 175
           LP++FDW +    ++ V+ V++Q  CGS +AF++   +E    + + KT  L     LS 
Sbjct: 1   LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSP 60

Query: 176 QELIDCDQEDDGCEGGSISNAFDTIMSKLGG--GLEEEKTYPYRGDDKA-CRLNKKATQV 232
           Q ++ C Q   GC+GG     F  ++ K     G+  E  +PY  DD   C+ +    + 
Sbjct: 61  QHVLSCSQYSQGCDGG-----FPFLVGKFAEDFGIVTEDYFPYTADDDRPCKASPSECRR 115

Query: 233 -------KINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQF--FCD 282
                   + G    + +E +M   +  NGP+ VA   Y+   FY  GV H        D
Sbjct: 116 YYFSDYNYVGGCYGCT-NEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSD 174

Query: 283 GGNENL------SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
           G N+N       +H+VL+VG+G D  K       YWI+KNSWG  WGEKGYF++ RG   
Sbjct: 175 GDNDNFNPFELTNHAVLLVGWGEDEIK----GEKYWIVKNSWGSSWGEKGYFKIRRGTNE 230

Query: 337 CGINDYV 343
           CGI    
Sbjct: 231 CGIESQA 237


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score =  131 bits (332), Expect = 1e-36
 Identities = 78/254 (30%), Positives = 107/254 (42%), Gaps = 52/254 (20%)

Query: 127 PRAFDWREY----DAVTGVKDQTMCGSSWAFSTTGN------IEGVYAAKTKKLVSLSEQ 176
           P +FD RE      ++  ++DQ  CGS WAFS          I+       K+ V LS Q
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQS----NGKENVLLSAQ 56

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY------------------R 217
           +L+ C     DGC GG    A+  + +    G+      PY                   
Sbjct: 57  DLLSCCSGCGDGCNGGYPDAAWKYLTTT---GVVTGGCQPYTIPPCGHHPEGPPPCCGTP 113

Query: 218 GDDKACR--LNKKATQVKINGYV--SVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTG 272
                C+    K   + K  G    SV  DETD+ K ++ NGP+  A   Y    +Y +G
Sbjct: 114 YCTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSG 173

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V          G +   H+V I+G+GV+        VPYW+  NSWG  WGE GYFR+ R
Sbjct: 174 VYQHT-----SGKQLGGHAVKIIGWGVENG------VPYWLAANSWGTDWGENGYFRILR 222

Query: 333 GDGSCGINDYVRSA 346
           G   CGI   V + 
Sbjct: 223 GSNECGIESEVVAG 236


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score =  126 bits (319), Expect = 1e-34
 Identities = 66/233 (28%), Positives = 98/233 (42%), Gaps = 40/233 (17%)

Query: 126 LPRAFDWREYDAV---TGVKDQ---TMCGSSWAFSTTGNIEGVYAAKTK---KLVSLSEQ 176
           LP+++DWR  + V   +  ++Q     CGS WA  +T  +        K     V LS Q
Sbjct: 1   LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQ 60

Query: 177 ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN---------- 226
            +IDC      C GG     ++        G+ +E   PY+  D  C             
Sbjct: 61  VVIDCAG-GGSCHGGDPGGVYEYAHKH---GIPDETCNPYQAKDGECNPFNRCGTCNPFG 116

Query: 227 -----KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY-ALQFYVTGVSHPIQFF 280
                K  T   ++ Y SVS     M   +   GP++  I A  AL+ Y  GV       
Sbjct: 117 ECFAIKNYTLYFVSDYGSVS-GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYV-- 173

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
               +  ++H + + G+GVD        V YWI++NSWGE WGE+G+FR+   
Sbjct: 174 ---QDPLINHIISVAGWGVD-----ENGVEYWIVRNSWGEPWGERGWFRIVTS 218


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score =  126 bits (317), Expect = 1e-34
 Identities = 59/219 (26%), Positives = 97/219 (44%), Gaps = 21/219 (9%)

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK--KLVSLSEQELIDCD----- 182
            D R    +T VK+Q   GS WAF++   +E  Y  K    + V LS Q L  C      
Sbjct: 2   VDLR-PLRLTPVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY----RGDDKACRLNKKATQVKINGYV 238
             +  C+GG   +A   +++  G  +  E+ YPY     G++        A +VK+  Y 
Sbjct: 61  GINGSCDGGGPLSALLKLVALKG--IPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYR 118

Query: 239 SV-SRDETDMAKYLVENGPMAVAINAYALQFYVTGVS-HPIQFFCDGGNENL-SHSVLIV 295
            V   +  D+ + L + GP+    + Y+    +     +    +    + +L  H+V+IV
Sbjct: 119 RVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIV 178

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           GY  +  +        +I+KNSWG  WG+ GY R+   D
Sbjct: 179 GYDDNYVEGK----GAFIVKNSWGTDWGDNGYGRISYED 213


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 80.7 bits (199), Expect = 6e-17
 Identities = 63/222 (28%), Positives = 89/222 (40%), Gaps = 27/222 (12%)

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY---------AAKTKKLVSLS 174
            +LP  FD R+   V+ VKDQ   GS WAF+TT ++E                K L+ + 
Sbjct: 97  ASLPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPESAWDFSENNMKNLLGVP 156

Query: 175 EQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VK 233
            ++  D    D G     +S A+ T  S  G   E +   PY  +      N   T+ V+
Sbjct: 157 YEKGFDYTSNDGG--NADMSAAYLTEWS--GPVYETDD--PYSENSYFSPTNLPVTKHVQ 210

Query: 234 INGY---VSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
                       D  ++       G ++ ++   A      G+  P  +      EN  H
Sbjct: 211 EAQIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSL--GICIP--YPYVDSGENWGH 266

Query: 291 SVLIVGYGVDRTKFTHKAVPY----WIIKNSWGEGWGEKGYF 328
           +VLIVGY         K  P     +IIKNSWG  WGE GYF
Sbjct: 267 AVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYF 308


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 73.8 bits (181), Expect = 2e-14
 Identities = 74/298 (24%), Positives = 112/298 (37%), Gaps = 95/298 (31%)

Query: 126 LPRAFDW--------REYDAVTGVKDQTMCGSSWAFSTT----GNIE-GVYAAKTKKLVS 172
           LP+ F W        REYD    V +Q +CGS +  S        IE  +     KK ++
Sbjct: 381 LPKNFTWGDPFNNNTREYD----VTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLN 436

Query: 173 -----LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG--GLEEEKTYPYRGDDKACRL 225
                LS Q ++ C   D GC GG     F  ++SK+    G+  +K +PY   ++ C  
Sbjct: 437 NFDDLLSIQTVLSCSFYDQGCNGG-----FPYLVSKMAKLQGIPLDKVFPYTATEQTCPY 491

Query: 226 NKKA---------TQVKINGYVSVSRDETDM----------------AK----------- 249
                           +IN     S  ++DM                AK           
Sbjct: 492 QVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPARWYAKDYNYIGGCYGC 551

Query: 250 -----------YLVENGPMAVAINAYALQF-YVTGV----SHPIQFFCD----------- 282
                       +  NGP+  +  A    + Y  GV      P    C            
Sbjct: 552 NQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYN 611

Query: 283 -GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
             G E ++H++++VG+G    +   K   YWI +NSWG+ WG++GYF++ RG    GI
Sbjct: 612 ITGWEKVNHAIVLVGWG--EEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGI 667


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 66.1 bits (161), Expect = 7e-12
 Identities = 50/259 (19%), Positives = 90/259 (34%), Gaps = 48/259 (18%)

Query: 125 TLPRAFDWREYDAVT---GVKDQT---MCGSSWAFSTTGNIEG---VYAAKTKKL---VS 172
             P A+ W +    +        +    C SS+  +    +     V + +T  L     
Sbjct: 204 PPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTF 263

Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY--PYRGDDKACRLNKKAT 230
           LS + ++DC Q   GC GG      +        G+    +Y  PY   D   R  K   
Sbjct: 264 LSARHVLDCSQYGQGCAGGF---PEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRR 320

Query: 231 QVK---------INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFC 281
             +         + GY     D  ++   +  +GP+  ++ A +  +     S     + 
Sbjct: 321 PSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYV 380

Query: 282 DGGN---------------ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG--WGE 324
              +                N++H+VLI+G+G D          YW++ + WG    W +
Sbjct: 381 SLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDEN-----GGDYWLVLDPWGSRRSWCD 435

Query: 325 KGYFRLYRGDGSCGINDYV 343
            G  ++ RG  +  I   V
Sbjct: 436 GGTRKIARGVNAYNIESEV 454


>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS. Peptide proteinase
          inhibitors can be found as single domain proteins or as
          single or multiple domains within proteins; these are
          referred to as either simple or compound inhibitors,
          respectively. In many cases they are synthesised as
          part of a larger precursor protein, either as a
          prepropeptide or as an N-terminal domain associated
          with an inactive peptidase or zymogen. This domain
          prevents access of the substrate to the active site.
          Removal of the N-terminal inhibitor domain either by
          interaction with a second peptidase or by autocatalytic
          cleavage activates the zymogen. Other inhibitors
          interact direct with proteinases using a simple
          noncovalent lock and key mechanism; while yet others
          use a conformational change-based trapping mechanism
          that depends on their structural and thermodynamic
          properties.
          Length = 57

 Score = 59.2 bits (144), Expect = 8e-12
 Identities = 20/57 (35%), Positives = 31/57 (54%)

Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
          F  + ++H K+Y++  E   R  IF  NL+KI+        S   G+N+FSDL+  E
Sbjct: 1  FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57


>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS.
          Length = 58

 Score = 58.8 bits (143), Expect = 1e-11
 Identities = 20/58 (34%), Positives = 31/58 (53%)

Query: 42 FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
          F  + +++ K+Y +  E   R  IF  NLR I+      + S   GLN+F+DL+  EF
Sbjct: 1  FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 54.3 bits (130), Expect = 5e-08
 Identities = 53/220 (24%), Positives = 96/220 (43%), Gaps = 34/220 (15%)

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFD 198
           ++DQ  C  SW F++  ++E +   K  +  ++S   + +C + +  D C+ GS    F 
Sbjct: 547 IEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFL 606

Query: 199 TIMSKLGGGLEEEKTYPYR----GDD------------KACRL----NKKATQVKINGYV 238
            I+    G L  +  Y Y     G+D               ++     K+   +    Y 
Sbjct: 607 QIIED-NGFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYR 665

Query: 239 SVS----RDETDMAKYLVENGPMAV-AINAYALQFYVTGVS---HPIQFFCDGGNENLSH 290
           +       D+ D    ++++  M   ++ AY     V G       +Q  C  G++   H
Sbjct: 666 AYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKAENVLGYEFNGKKVQNLC--GDDTADH 723

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +V IVGYG +      +   YWI++NSWG+ WG++GYF++
Sbjct: 724 AVNIVGYG-NYINDEDEKKSYWIVRNSWGKYWGDEGYFKV 762


>gnl|CDD|202517 pfam03051, Peptidase_C1_2, Peptidase C1-like family.  This family
           is closely related to the Peptidase_C1 family pfam00112,
           containing several prokaryotic and eukaryotic
           aminopeptidases and bleomycin hydrolases.
          Length = 438

 Score = 39.2 bits (92), Expect = 0.002
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 4/47 (8%)

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           D G   ++H++++ G   D      K    W ++NSWG+  G+KGYF
Sbjct: 353 DYGESLMTHAMVLTGVDEDDDGKPTK----WKVENSWGDDSGKKGYF 395


>gnl|CDD|238328 cd00585, Peptidase_C1B, Peptidase C1B subfamily (MEROPS database
           nomenclature); composed of eukaryotic bleomycin
           hydrolases (BH) and bacterial aminopeptidases C (pepC).
           The proteins of this subfamily contain a large insert
           relative to the C1A peptidase (papain) subfamily. BH is
           a cysteine peptidase that detoxifies bleomycin by
           hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. Bleomycin, a glycopeptide derived from the
           fungus Streptomyces verticullus, is an effective
           anticancer drug due to its ability to induce DNA strand
           breaks. Human BH is the major cause of tumor cell
           resistance to bleomycin chemotherapy, and is also
           genetically linked to Alzheimer's disease. In addition
           to its peptidase activity, the yeast BH (Gal6) binds DNA
           and acts as a repressor in the Gal4 regulatory system.
           BH forms a hexameric ring barrel structure with the
           active sites imbedded in the central channel. The
           bacterial homolog of BH, called pepC, is a cysteine
           aminopeptidase possessing broad specificity. Although
           its crystal structure has not been solved, biochemical
           analysis shows that pepC also forms a hexamer. .
          Length = 437

 Score = 38.3 bits (90), Expect = 0.004
 Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 4/41 (9%)

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           ++H++++ G  +D      K    W ++NSWGE  G+KGYF
Sbjct: 358 MTHAMVLTGVDLDEDGKPVK----WKVENSWGEKVGKKGYF 394


>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
           metabolism].
          Length = 444

 Score = 37.1 bits (86), Expect = 0.013
 Identities = 14/41 (34%), Positives = 26/41 (63%), Gaps = 4/41 (9%)

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           ++H++++ G  +D T    +    W ++NSWG+  G+KGYF
Sbjct: 361 MTHAMVLTGVDLDETGNPLR----WKVENSWGKDVGKKGYF 397


>gnl|CDD|227934 COG5647, COG5647, Cullin, a subunit of E3 ubiquitin ligase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 773

 Score = 32.1 bits (73), Expect = 0.53
 Identities = 18/91 (19%), Positives = 34/91 (37%), Gaps = 6/91 (6%)

Query: 166 KTKKLVSLSE--QELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY-PYRGDDKA 222
             +KL+S  +    L++   E DG    ++ NAF T ++           Y     D   
Sbjct: 371 YVQKLLSCHDLFPSLVNESFEGDGSIVKALGNAFKTFINGNESADSGPSEYLAKYIDGLL 430

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
            +  K++   KI   +        + +Y+ E
Sbjct: 431 KKDGKQSFIGKIKDLLQDI---ITLFRYVEE 458


>gnl|CDD|118686 pfam10158, LOH1CR12, Tumour suppressor protein.  This is a region
           of 130 amino acids that is the most conserved region of
           hypothetical proteins involved in loss of heterozygosity
           and thus tumour suppression. The exact function is not
           known.
          Length = 131

 Score = 28.2 bits (63), Expect = 3.2
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
            + +  K YA+  E +S+++  S  LR+IQ++ D
Sbjct: 71  EMVERQKRYASYAEQFSKVNEISAQLRRIQMILD 104


>gnl|CDD|234887 PRK01001, PRK01001, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 795

 Score = 29.0 bits (65), Expect = 4.6
 Identities = 21/82 (25%), Positives = 38/82 (46%), Gaps = 12/82 (14%)

Query: 141 VKDQTMCGSSWAFSTTGN---IEGVYAAKTKKLVSLS---EQELIDCDQEDDGCEGGS-- 192
            KD  + G++  F  +GN     G+Y +K + L SL     + ++  D+ED      S  
Sbjct: 167 NKDSAIYGTALVFLRSGNGYLPLGIYNSKEECLESLDLPMTRAVVFADKEDAAKPTASGS 226

Query: 193 ---ISNAF-DTIMSKLGGGLEE 210
              +SN +   ++S+  G +E 
Sbjct: 227 YYVLSNEYMQLVVSQESGAIEG 248


>gnl|CDD|236784 PRK10876, recB, exonuclease V subunit beta; Provisional.
          Length = 1181

 Score = 28.8 bits (65), Expect = 5.9
 Identities = 12/29 (41%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 293 LIVGYGVDRTKFTHKAVPYWIIK-NSWGE 320
           LI   GVDR  +  K +P W+ K ++W E
Sbjct: 247 LIESSGVDRRSYNSKNLPKWLEKISAWAE 275


>gnl|CDD|214730 smart00579, FBD, domain in FBox and BRCT domain containing plant
           proteins. 
          Length = 72

 Score = 26.0 bits (58), Expect = 7.2
 Identities = 9/25 (36%), Positives = 16/25 (64%), Gaps = 1/25 (4%)

Query: 232 VKINGYVSVSRDETDMAKYLVENGP 256
           ++I GY   + +E ++ KY +EN P
Sbjct: 10  LEIKGY-RGTEEEKELVKYFLENAP 33


>gnl|CDD|223992 COG1064, AdhP, Zn-dependent alcohol dehydrogenases [General
           function prediction only].
          Length = 339

 Score = 27.9 bits (63), Expect = 7.5
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query: 280 FCDGGNENLSHSVLIVGYGVD 300
           +C  GNENL  +  I GY  D
Sbjct: 99  YCRSGNENLCPNQKITGYTTD 119


>gnl|CDD|235092 PRK02963, PRK02963, carbon starvation induced protein; Validated.
          Length = 316

 Score = 27.7 bits (62), Expect = 9.7
 Identities = 19/107 (17%), Positives = 32/107 (29%), Gaps = 16/107 (14%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVK-HTALFNYFLEQHNKTYATLVEY--YSRLHI-- 65
             + L    S F +    +   L  +         FLEQ  +     +EY  + R  +  
Sbjct: 4   NAVDLAQDYSGFTLAPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAK 63

Query: 66  ---------FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKY 103
                        L K   L D   G+ +       D++ A+   K 
Sbjct: 64  ILDDLCGNQLQPLLLKT--LLDRAEGAFLINAVGIDDVAQADEMVKL 108


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.414 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,561,875
Number of extensions: 1659557
Number of successful extensions: 1274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1220
Number of HSP's successfully gapped: 36
Length of query: 348
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 250
Effective length of database: 6,590,910
Effective search space: 1647727500
Effective search space used: 1647727500
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.4 bits)