RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3960
         (351 letters)



>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  257 bits (658), Expect = 1e-85
 Identities = 98/199 (49%), Positives = 128/199 (64%), Gaps = 5/199 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T GA+EGAY +K  KL  LS+Q L+DCS   GNNGC+GG    +++++
Sbjct: 17  DQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTS-GNNGCNGGNPDNAFEYV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
              GL ++ DY PY G+D  C   ++   A +TG+ NV P  E+ALK ALA +GPVSVAI
Sbjct: 76  KNGGLASESDY-PYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAI 134

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DAS  SF FY  G+Y    C  S   L+HAVL VGYG  +G  YW VKNSW T WG +GY
Sbjct: 135 DASS-SFQFYKGGIYSGPCC--SNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGY 191

Query: 331 VLMSIKDNNCGVMTAPTYV 349
           + ++   N CG+ +  +Y 
Sbjct: 192 IRIARGSNLCGIASYASYP 210


>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  254 bits (650), Expect = 2e-84
 Identities = 93/202 (46%), Positives = 123/202 (60%), Gaps = 8/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   GA+EG Y +K  KL  LS+Q L+DC  G  NNGC+GG    ++++I
Sbjct: 18  DQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTG--NNGCNGGLPDNAFEYI 75

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHI-ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
            K  G+ T+ DY PY   D  C    + +  A + G+ +V  N E+AL+ ALAK+GPVSV
Sbjct: 76  KKNGGIVTESDY-PYTAHDGTCKFKKSNSKYAKIKGYGDVPYNDEEALQAALAKNGPVSV 134

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           AIDA +  F  Y +GVY   +C+     LDHAVL VGYG  +G PYW VKNSW T WG  
Sbjct: 135 AIDAYEDDFQLYKSGVYKHTECSGE---LDHAVLIVGYGTENGVPYWIVKNSWGTDWGEN 191

Query: 329 GYVLMSIKDNNCGVMTAPTYVT 350
           GY  ++   N CG+ +  +Y  
Sbjct: 192 GYFRIARGVNECGIASEASYPI 213


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  174 bits (444), Expect = 7e-54
 Identities = 83/211 (39%), Positives = 104/211 (49%), Gaps = 51/211 (24%)

Query: 144 KGYNSL-LDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
           KG  +   DQ  CGSCW+F  TGA+EG Y +K  KL  LS+Q L+DCS G GN GC+GG 
Sbjct: 10  KGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGG-GNCGCNGGL 68

Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
              ++++I K+ GL T+  Y PY G                                   
Sbjct: 69  PDNAFEYIKKNGGLETESCY-PYTG----------------------------------- 92

Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYWQVKN 319
                SVAIDAS   F FY +G+Y    C      LDHAVL VGYG    +GK YW VKN
Sbjct: 93  -----SVAIDAS--DFQFYKSGIYDHPGC--GSGTLDHAVLIVGYGTEVENGKDYWIVKN 143

Query: 320 SWSTYWGNQGYVLMSI-KDNNCGVMTAPTYV 349
           SW T WG  GY  ++  K+N CG+  +    
Sbjct: 144 SWGTDWGENGYFRIARGKNNECGIEASVASY 174


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score =  125 bits (315), Expect = 4e-33
 Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 26/232 (11%)

Query: 129 YNKASKD--AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
           Y KA  D  A+P  V +  KG    + +Q  CGSCW+F   G +E  + +   KL  LS+
Sbjct: 116 YRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSE 175

Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH---GLPTQDDYGPYL---GQDAYCHIANTT 237
           Q L+ C   + +NGC GG   ++++W++++    + T+  Y PY+   G    C  ++  
Sbjct: 176 QQLVSCD--HVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY-PYVSGNGDVPECSNSSEL 232

Query: 238 AT-ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
           A  A + G+V++  +SE  +   LAK+GP+S+A+DAS  SF  Y +GV        S  G
Sbjct: 233 APGARIDGYVSM-ESSERVMAAWLAKNGPISIAVDAS--SFMSYHSGVL------TSCIG 283

Query: 297 --LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
             L+H VL VGY      PYW +KNSW   WG +GYV +++  N C +   P
Sbjct: 284 EQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYP 335


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score =  120 bits (303), Expect = 2e-32
 Identities = 62/201 (30%), Positives = 90/201 (44%), Gaps = 15/201 (7%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAV--LSQQALIDCSWGY---GNNGCDGGEDFR 205
           +Q   GSCW+F +  A+E AY +K  +     LS Q L  C+       N  CDGG    
Sbjct: 14  NQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLS 73

Query: 206 SYQWIMK-HGLPTQDDYGPYLGQD----AYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
           +   ++   G+P ++DY PY  +          A   A   +  +  V  N+ + +K AL
Sbjct: 74  ALLKLVALKGIPPEEDY-PYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEAL 132

Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEK--CNNSPDGLDHAVLAVGYG--ELDGKPYWQ 316
           AK GPV    D            +Y +         D   HAV+ VGY    ++GK  + 
Sbjct: 133 AKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFI 192

Query: 317 VKNSWSTYWGNQGYVLMSIKD 337
           VKNSW T WG+ GY  +S +D
Sbjct: 193 VKNSWGTDWGDNGYGRISYED 213


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score =  122 bits (309), Expect = 2e-31
 Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 20/201 (9%)

Query: 151 DQSV-CGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           DQ + CGSCW+F + G+VE   Y  ++  +V LS+Q L++C     + GC GG    + +
Sbjct: 251 DQGLNCGSCWAFSSVGSVES-LYKIYRDKSVDLSEQELVNCD--TKSQGCSGGYPDTALE 307

Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           ++   GL +  D  PYL +D  C +++T      +  V      +D L  +L    P  V
Sbjct: 308 YVKNKGLSSSSDV-PYLAKDGKCVVSSTKKVYIDSYLVA---KGKDVLNKSLVI-SPTVV 362

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV--GYGELDGKPYWQVKNSWSTYWG 326
            I  S +    Y +GV Y+ +C      L+HAVL V  GY E   K YW +KNSW T WG
Sbjct: 363 YIAVS-RELLKYKSGV-YNGECG---KSLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWG 417

Query: 327 NQGYVLM---SIKDNNCGVMT 344
             GY+ +   +   + CG++T
Sbjct: 418 ENGYMRLERTNEGTDKCGILT 438


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score =  122 bits (307), Expect = 5e-31
 Identities = 74/214 (34%), Positives = 113/214 (52%), Gaps = 25/214 (11%)

Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
              G   + DQ  CGSCW+F T G VE  Y ++  +L  LS+Q L+DCS  + NNGC GG
Sbjct: 274 LHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCS--FKNNGCYGG 331

Query: 202 EDFRSYQWIMK-HGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLA 259
               +++ +++  GL ++DDY PY+      C+I        +  +V++    ED  K A
Sbjct: 332 LIPNAFEDMIELGGLCSEDDY-PYVSDTPELCNIDRCKEKYKIKSYVSI---PEDKFKEA 387

Query: 260 LAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG------ELDGKP 313
           +   GP+SV+I  S   F+FY  G+ +D +C   P   +HAV+ VGYG          K 
Sbjct: 388 IRFLGPISVSIAVSD-DFAFYKGGI-FDGECGEEP---NHAVILVGYGMEEIYNSDTKKM 442

Query: 314 ----YWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
               Y+ +KNSW   WG +G+  + I+ +  G+M
Sbjct: 443 EKRYYYIIKNSWGESWGEKGF--IRIETDENGLM 474


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score =  112 bits (282), Expect = 3e-29
 Identities = 56/220 (25%), Positives = 83/220 (37%), Gaps = 39/220 (17%)

Query: 150 LDQSVCGSCWSFGTTGAV--------EGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
            DQ  CGSCW+F    A          G      K+  +LS Q L+ C      +GC+GG
Sbjct: 20  RDQGNCGSCWAFSAVEAFSDRLCIQSNG------KENVLLSAQDLLSCC-SGCGDGCNGG 72

Query: 202 EDFRSYQWIMKHGLPT------QDDYGPYLGQDAYCHIANTTATATMTGFVNVT------ 249
               +++++   G+ T            +  +           T         T      
Sbjct: 73  YPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKH 132

Query: 250 --------PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
                   P+ E  +   +  +GPV  A    +  F +Y +GVY  +  +    G  HAV
Sbjct: 133 KGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYE-DFLYYKSGVY--QHTSGKQLGG-HAV 188

Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCG 341
             +G+G  +G PYW   NSW T WG  GY  +    N CG
Sbjct: 189 KIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECG 228


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score =  112 bits (282), Expect = 3e-29
 Identities = 55/193 (28%), Positives = 84/193 (43%), Gaps = 25/193 (12%)

Query: 155 CGSCWSFGTTGAVEGAYYMKHK---KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
           CGSCW+ G+T A+     +  K       LS Q +IDC+   G   C GG+    Y++  
Sbjct: 28  CGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCA---GGGSCHGGDPGGVYEYAH 84

Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN-------------SEDALKL 258
           KHG+P  +   PY  +D  C+  N   T    G      N               D +  
Sbjct: 85  KHGIP-DETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVSGRDKMMA 143

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQV 317
            +   GP+S  I A++   + Y  GVY +   +   +   H +   G+G + +G  YW V
Sbjct: 144 EIYARGPISCGIMATEALEN-YTGGVYKEYVQDPLIN---HIISVAGWGVDENGVEYWIV 199

Query: 318 KNSWSTYWGNQGY 330
           +NSW   WG +G+
Sbjct: 200 RNSWGEPWGERGW 212


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score =  105 bits (265), Expect = 9e-27
 Identities = 68/226 (30%), Positives = 104/226 (46%), Gaps = 32/226 (14%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLA------VLSQQALIDCSWGYGNNGCDGGEDF 204
           +Q  CGSC++F +  A+E    +   K        +LS Q ++ CS  Y + GCDGG  F
Sbjct: 22  NQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCS-QY-SQGCDGGFPF 79

Query: 205 RSYQWIMKHGLPTQDDYGPYLGQDAYCHIAN--------TTATATMTGFVNVTPNSEDAL 256
              ++    G+ T+D +      D  C  A+         +    + G    T  +ED +
Sbjct: 80  LVGKFAEDFGIVTEDYFPYTADDDRPC-KASPSECRRYYFSDYNYVGGCYGCT--NEDEM 136

Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVY----YDEKCNNSPDGL------DHAVLAVGY 306
           K  + ++GP+ VA +     F FY  GVY     DE  +   D        +HAVL VG+
Sbjct: 137 KWEIYRNGPIVVAFEVYS-DFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGW 195

Query: 307 GE--LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
           GE  + G+ YW VKNSW + WG +GY  +    N CG+ +   +  
Sbjct: 196 GEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAY 241


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 64.6 bits (157), Expect = 2e-11
 Identities = 58/277 (20%), Positives = 100/277 (36%), Gaps = 82/277 (29%)

Query: 151 DQSVCGSCWSFGTTGAVE-----GAYYMKHKKLA-----VLSQQALIDCSWGYGNNGCDG 200
           +Q +CGSC+      A +            KK       +LS Q ++ CS  + + GC+G
Sbjct: 402 NQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCS--FYDQGCNG 459

Query: 201 GEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP---------- 250
           G  +   +     G+P  D   PY   +  C      +  +M G  N+            
Sbjct: 460 GFPYLVSKMAKLQGIP-LDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSET 518

Query: 251 ------------------------------------NSEDALKLALAKHGPVSVAIDASQ 274
                                               N E  +   + ++GP+  + +AS 
Sbjct: 519 QSDMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASP 578

Query: 275 KSFSFYVNGVYYDE------KCNNSP------------DGLDHAVLAVGYGE--LDGKP- 313
             +  Y +GVYY E      +C                + ++HA++ VG+GE  ++GK  
Sbjct: 579 DFYD-YADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLY 637

Query: 314 -YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
            YW  +NSW   WG +GY  +    N  G+ +   ++
Sbjct: 638 KYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFI 674


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 58.7 bits (142), Expect = 1e-09
 Identities = 50/233 (21%), Positives = 75/233 (32%), Gaps = 40/233 (17%)

Query: 140 RYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYY---------MKHKKLAVLSQQALIDCS 190
           R +    + + DQ   GSCW+F TT ++E                K L  +         
Sbjct: 105 RRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPESAWDFSENNMKNLLGVP------YE 158

Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMT--GFVNV 248
            G+     DGG    S  ++ +   P  +   PY     +    N   T  +     +  
Sbjct: 159 KGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSYFSP-TNLPVTKHVQEAQIIPS 217

Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC-----NNSPDGLDHAVLA 303
                D   +        +V       S S Y++       C      +S +   HAVL 
Sbjct: 218 RKKYLDNGNIKAMFGFYGAV-------SSSMYIDATNSLGICIPYPYVDSGENWGHAVLI 270

Query: 304 VGYGEL----------DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
           VGY +            G   + +KNSW T WG  GY  +S      G   A 
Sbjct: 271 VGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYYALNGDAEAL 323


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 55.5 bits (133), Expect = 2e-08
 Identities = 61/217 (28%), Positives = 84/217 (38%), Gaps = 43/217 (19%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY-QW 209
           DQ  C   W F +   +E    MK  +   +S   + +CS G   + CD G +   + Q 
Sbjct: 549 DQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQI 608

Query: 210 IMKHG-LPTQDDYGPY----LGQDAYC-----HIANTTATATMTGFVNVTPNSEDA---- 255
           I  +G LP   +Y  Y    +G+D  C     H  N      +       PNS D     
Sbjct: 609 IEDNGFLPADSNY-LYNYTKVGED--CPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYR 665

Query: 256 -----------------LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLD 298
                            +K  +   G V   I A +    +  NG      C +  D  D
Sbjct: 666 AYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKA-ENVLGYEFNGKKVQNLCGD--DTAD 722

Query: 299 HAVLAVGYG-----ELDGKPYWQVKNSWSTYWGNQGY 330
           HAV  VGYG     E + K YW V+NSW  YWG++GY
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGY 759


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 52.2 bits (125), Expect = 2e-07
 Identities = 49/229 (21%), Positives = 82/229 (35%), Gaps = 41/229 (17%)

Query: 155 CGSCWSFGTTGAVEGAYYM----KHKKLA---VLSQQALIDCSWGYGNNGCDGGEDFRSY 207
           C S +      A+  A  M    +   L     LS + ++DCS  YG  GC GG      
Sbjct: 232 CNSSYVEAALAAMM-ARVMVASNRTDPLGQQTFLSARHVLDCS-QYGQ-GCAGGFPEEVG 288

Query: 208 QWIMKHGLPTQDDYG-PYLGQDAYCHIANTTATATMTGFVNVTP--------NSEDALKL 258
           ++    G+ T D Y  PY   D       T   +    F N  P           D +  
Sbjct: 289 KFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIW 348

Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG-----------------LDHAV 301
            + +HGPV  ++ A+   ++   +    ++    S D                  ++H V
Sbjct: 349 EIYRHGPVPASVYANSDWYN--CDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTV 406

Query: 302 LAVGYGELD-GKPYWQVKNSWSTY--WGNQGYVLMSIKDNNCGVMTAPT 347
           L +G+G  + G  YW V + W +   W + G   ++   N   + +   
Sbjct: 407 LIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVV 455


>gnl|CDD|202517 pfam03051, Peptidase_C1_2, Peptidase C1-like family.  This family
           is closely related to the Peptidase_C1 family pfam00112,
           containing several prokaryotic and eukaryotic
           aminopeptidases and bleomycin hydrolases.
          Length = 438

 Score = 35.4 bits (82), Expect = 0.043
 Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 297 LDHAVLAVGYGELDGKPY--WQVKNSWSTYWGNQGYVLMS 334
           + HA++  G  E D      W+V+NSW    G +GY +MS
Sbjct: 359 MTHAMVLTGVDEDDDGKPTKWKVENSWGDDSGKKGYFVMS 398


>gnl|CDD|222715 pfam14371, DUF4412, Domain of unknown function (DUF4412).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria, archaea and
           eukaryotes, and is typically between 75 and 104 amino
           acids in length.
          Length = 74

 Score = 32.3 bits (74), Expect = 0.056
 Identities = 12/50 (24%), Positives = 18/50 (36%), Gaps = 7/50 (14%)

Query: 95  TYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMK 144
           T +G E + G   EK+R+            LWV          P++ E  
Sbjct: 2   TKLGTETIAGYKCEKYRVTDGSTG---TAELWV----TPDLGFPLKTESI 44


>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
           metabolism].
          Length = 444

 Score = 32.5 bits (74), Expect = 0.32
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 297 LDHAVLAVGYGEL-DGKPY-WQVKNSWSTYWGNQGYVLMS 334
           + HA++  G      G P  W+V+NSW    G +GY + S
Sbjct: 361 MTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVAS 400


>gnl|CDD|234540 TIGR04318, lacto_ODC_hypo, putative ornithine decarboxylase.  In at
           least ten species of Lactobacillus, this close homolog
           to known ornithine decarboxylase occurs in a three-gene
           neighborhood, along with an amino acid permease family
           transporter and pyridoxal phosphate-dependent enyzme
           from the cystathionine gamma-lyase family. Species
           include L. acidophilus, L. amylovorus, L. crispatus, L.
           delbrueckii, L. farciminis, L. helveticus, L. johnsonii,
           etc. The combination of a decarboxylase with an
           antiporter in a two-gene system suggests a
           decarboxylation/antiport proton-motive cycle for
           transient resistance to acidic conditions. The substrate
           for this decarboxylase might be ornithine but is
           unknown.
          Length = 695

 Score = 32.4 bits (74), Expect = 0.34
 Identities = 19/48 (39%), Positives = 21/48 (43%), Gaps = 10/48 (20%)

Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGP 223
            K+  L    L DC+WG       G E F     IMKH  P Q  YGP
Sbjct: 287 DKIGKLCDYILFDCAWG-------GYEQFVP---IMKHLSPLQLTYGP 324


>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Protein of this family often fused to HTH domain; Some
           proteins could have an additional fusion with
           RecB-family nuclease domain; Core domain appears to have
           a Rossmann-like fold; loosely associated with CRISPR/Cas
           systems; also known as DxTHG family.
          Length = 400

 Score = 32.0 bits (73), Expect = 0.50
 Identities = 16/105 (15%), Positives = 35/105 (33%), Gaps = 9/105 (8%)

Query: 47  CFHAYGIPDATIEPQSVLPDVSDFKVNIYRLFFLRPRFHENEKIRYN---WTYIGEELVN 103
             + + I   TI P + +  + +  +N Y+      + +  + +             L N
Sbjct: 180 PLNIHEIESETIAPVTEISYLKEKFINKYKDNNKDLKRYFKKALNEGLEKIIKFASALYN 239

Query: 104 GIIL------EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYE 142
           G+ L           +    E   +    +  NK  ++ I V+ E
Sbjct: 240 GLPLYLLYLKPSIEELREPLEFKLEIIEIIESNKNQEEEIKVKRE 284


>gnl|CDD|238328 cd00585, Peptidase_C1B, Peptidase C1B subfamily (MEROPS database
           nomenclature); composed of eukaryotic bleomycin
           hydrolases (BH) and bacterial aminopeptidases C (pepC).
           The proteins of this subfamily contain a large insert
           relative to the C1A peptidase (papain) subfamily. BH is
           a cysteine peptidase that detoxifies bleomycin by
           hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. Bleomycin, a glycopeptide derived from the
           fungus Streptomyces verticullus, is an effective
           anticancer drug due to its ability to induce DNA strand
           breaks. Human BH is the major cause of tumor cell
           resistance to bleomycin chemotherapy, and is also
           genetically linked to Alzheimer's disease. In addition
           to its peptidase activity, the yeast BH (Gal6) binds DNA
           and acts as a repressor in the Gal4 regulatory system.
           BH forms a hexameric ring barrel structure with the
           active sites imbedded in the central channel. The
           bacterial homolog of BH, called pepC, is a cysteine
           aminopeptidase possessing broad specificity. Although
           its crystal structure has not been solved, biochemical
           analysis shows that pepC also forms a hexamer. .
          Length = 437

 Score = 31.0 bits (71), Expect = 0.83
 Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 299 HAVLAVGYGE-LDGKPY-WQVKNSWSTYWGNQGYVLMS 334
           HA++  G     DGKP  W+V+NSW    G +GY +MS
Sbjct: 360 HAMVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMS 397


>gnl|CDD|129242 TIGR00136, gidA, glucose-inhibited division protein A.  GidA, the
           longer of two forms of GidA-related proteins, appears to
           be present in all complete eubacterial genomes so far,
           as well as Saccharomyces cerevisiae. A subset of these
           organisms have a closely related protein. GidA is absent
           in the Archaea. It appears to act with MnmE, in an
           alpha2/beta2 heterotetramer, in the
           5-carboxymethylaminomethyl modification of uridine 34 in
           certain tRNAs. The shorter, related protein, previously
           called gid or gidA(S), is now called TrmFO (see model
           TIGR00137) [Protein synthesis, tRNA and rRNA base
           modification].
          Length = 617

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 7/51 (13%)

Query: 96  YIG---EELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEM 143
           YIG   ++LV     E +R+ TS  E    Y L +R + A      +  E+
Sbjct: 404 YIGVLIDDLVTKGTKEPYRMFTSRAE----YRLLLREDNADFRLTEIGREL 450


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.2 bits (65), Expect = 4.0
 Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            D A+ AV +G + G+ Y  +   W TY+G+ G
Sbjct: 56  YDIALWAVPFGLIGGRLY-HLATDWRTYFGDGG 87


>gnl|CDD|215341 PLN02635, PLN02635, disproportionating enzyme.
          Length = 538

 Score = 29.0 bits (65), Expect = 4.2
 Identities = 16/45 (35%), Positives = 18/45 (40%), Gaps = 13/45 (28%)

Query: 200 GGEDFRSYQWIMKHG------LP-------TQDDYGPYLGQDAYC 231
           G E FR   W+   G      LP         +D  PY GQDA C
Sbjct: 49  GDEAFRFLDWLASTGCSVWQVLPLVPPGRKGGEDGSPYSGQDANC 93


>gnl|CDD|182581 PRK10600, PRK10600, nitrate/nitrite sensor protein NarX;
           Provisional.
          Length = 569

 Score = 28.9 bits (65), Expect = 5.5
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 327 NQGYVLMSIKDNNCGV 342
           NQ  V +S++DN CGV
Sbjct: 497 NQNQVKLSVQDNGCGV 512


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.422 

Gapped
Lambda     K      H
   0.267   0.0789    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,628,175
Number of extensions: 1642845
Number of successful extensions: 1282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1238
Number of HSP's successfully gapped: 30
Length of query: 351
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 253
Effective length of database: 6,590,910
Effective search space: 1667500230
Effective search space used: 1667500230
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)