RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy282
         (233 letters)



>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  120 bits (303), Expect = 8e-34
 Identities = 49/131 (37%), Positives = 70/131 (53%), Gaps = 10/131 (7%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+ +E DYPY   +G    C + KS  K    K +  + +N  E ++  L K GP+SV +
Sbjct: 80  GIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVPYNDEEALQAALAKNGPVSVAI 136

Query: 60  N--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
           +   D    Y           CS   L HAVL+VGYG ++ +PYW+V+NSWG    + G+
Sbjct: 137 DAYEDDFQLYKSGVY--KHTECSGE-LDHAVLIVGYGTENGVPYWIVKNSWGTDWGENGY 193

Query: 118 FKIERGNNACG 128
           F+I RG N CG
Sbjct: 194 FRIARGVNECG 204



 Score = 81.8 bits (203), Expect = 5e-19
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 15/92 (16%)

Query: 132 LHFNGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVL 184
           + +N  E ++  L K GP+SV ++++   F       Y  T        CS   L HAVL
Sbjct: 114 VPYNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE-------CSGE-LDHAVL 165

Query: 185 LVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           +VGYG ++ +PYW+V+NSWG    + G+F+I 
Sbjct: 166 IVGYGTENGVPYWIVKNSWGTDWGENGYFRIA 197


>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  120 bits (303), Expect = 9e-34
 Identities = 53/129 (41%), Positives = 72/129 (55%), Gaps = 7/129 (5%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           GL SE DYPY   +G    C Y+ SKV    TG   +     E +K  L  YGP+SV ++
Sbjct: 79  GLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAID 135

Query: 61  -SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
            S     Y G     +   CS  +L HAVLLVGYG ++ + YW+V+NSWG    ++G+ +
Sbjct: 136 ASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIR 193

Query: 120 IERGNNACG 128
           I RG+N CG
Sbjct: 194 IARGSNLCG 202



 Score = 89.6 bits (223), Expect = 5e-22
 Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 3/86 (3%)

Query: 131 FLHFNGSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYG 189
            +     E +K  L  YGP+SV ++ S    FY G     +   CS  +L HAVLLVGYG
Sbjct: 111 NVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYG 168

Query: 190 KQDDIPYWLVRNSWGPIGPDEGFFKI 215
            ++ + YW+V+NSWG    ++G+ +I
Sbjct: 169 TENGVDYWIVKNSWGTSWGEKGYIRI 194


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score = 75.7 bits (187), Expect = 4e-17
 Identities = 39/130 (30%), Positives = 50/130 (38%), Gaps = 47/130 (36%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
           GLE+E  YPY  +                    DF             YK G        
Sbjct: 81  GLETESCYPYTGSVAID--------------ASDFQF-----------YKSG-------- 107

Query: 62  DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFK 119
                Y+          C    L HAVL+VGYG +  +   YW+V+NSWG    + G+F+
Sbjct: 108 ----IYDHP-------GCGSGTLDHAVLIVGYGTEVENGKDYWIVKNSWGTDWGENGYFR 156

Query: 120 IERG-NNACG 128
           I RG NN CG
Sbjct: 157 IARGKNNECG 166



 Score = 66.5 bits (163), Expect = 9e-14
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 5/73 (6%)

Query: 145 YKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNS 202
           Y Y   SV +++    FY    I  +   C    L HAVL+VGYG +  +   YW+V+NS
Sbjct: 88  YPY-TGSVAIDASDFQFYKSG-IYDHPG-CGSGTLDHAVLIVGYGTEVENGKDYWIVKNS 144

Query: 203 WGPIGPDEGFFKI 215
           WG    + G+F+I
Sbjct: 145 WGTDWGENGYFRI 157


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score = 77.0 bits (190), Expect = 4e-17
 Identities = 46/146 (31%), Positives = 69/146 (47%), Gaps = 21/146 (14%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG------SETMKKILYKYGPL 55
           G+ +E  +PY   +     C    S+ + +   D+ +  G       + MK  +Y+ GP+
Sbjct: 89  GIVTEDYFPYTADDDRP--CKASPSECRRYYFSDYNYVGGCYGCTNEDEMKWEIYRNGPI 146

Query: 56  SVLL--NSDLIHDYNGT-PIRKNDETCS-------PYDL-GHAVLLVGYGKQDD--IPYW 102
            V     SD      G      NDE          P++L  HAVLLVG+G+ +     YW
Sbjct: 147 VVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYW 206

Query: 103 LVRNSWGPIGPDEGFFKIERGNNACG 128
           +V+NSWG    ++G+FKI RG N CG
Sbjct: 207 IVKNSWGSSWGEKGYFKIRRGTNECG 232



 Score = 63.2 bits (154), Expect = 5e-12
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 13/89 (14%)

Query: 140 MKKILYKYGPLSVGL--NSHLIHFYNGT-PIRKNDETCS-------PYDL-GHAVLLVGY 188
           MK  +Y+ GP+ V     S    +  G      NDE          P++L  HAVLLVG+
Sbjct: 136 MKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGW 195

Query: 189 GKQDD--IPYWLVRNSWGPIGPDEGFFKI 215
           G+ +     YW+V+NSWG    ++G+FKI
Sbjct: 196 GEDEIKGEKYWIVKNSWGSSWGEKGYFKI 224


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score = 76.5 bits (189), Expect = 7e-17
 Identities = 30/111 (27%), Positives = 50/111 (45%), Gaps = 6/111 (5%)

Query: 20  KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKNDE 77
            C     + K      +   +    + K +   GP+     +  D ++ Y     +    
Sbjct: 122 GCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-YKSGVYQH--- 177

Query: 78  TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACG 128
           T      GHAV ++G+G ++ +PYWL  NSWG    + G+F+I RG+N CG
Sbjct: 178 TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECG 228



 Score = 60.7 bits (148), Expect = 4e-11
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 180 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 215
           GHAV ++G+G ++ +PYWL  NSWG    + G+F+I
Sbjct: 185 GHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRI 220


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score = 76.7 bits (188), Expect = 2e-16
 Identities = 38/124 (30%), Positives = 63/124 (50%), Gaps = 5/124 (4%)

Query: 5   SEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDL 63
           +EK YPY + NG+  +C+             ++    SE  M   L K GP+S+ +++  
Sbjct: 210 TEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASS 269

Query: 64  IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERG 123
              Y+   +     +C    L H VLLVGY    ++PYW+++NSWG    ++G+ ++  G
Sbjct: 270 FMSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMG 325

Query: 124 NNAC 127
            NAC
Sbjct: 326 VNAC 329



 Score = 58.9 bits (142), Expect = 2e-10
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 4/91 (4%)

Query: 138 ETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYW 197
             M   L K GP+S+ +++     Y+   +     +C    L H VLLVGY    ++PYW
Sbjct: 249 RVMAAWLAKNGPISIAVDASSFMSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYW 304

Query: 198 LVRNSWGPIGPDEGFFKIEHTLRSHLTHDIP 228
           +++NSWG    ++G+ ++   + + L    P
Sbjct: 305 VIKNSWGEDWGEKGYVRVTMGVNACLLTGYP 335


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score = 71.0 bits (174), Expect = 6e-15
 Identities = 32/127 (25%), Positives = 51/127 (40%), Gaps = 8/127 (6%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSV-- 57
           G+  E+DYPY   +  +   +           KD+  +  N  E +K+ L K GP+    
Sbjct: 83  GIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGF 142

Query: 58  LLNSDLIHDYNG--TPIRKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGP 113
            + S       G                 GHAV++VGY     +    ++V+NSWG    
Sbjct: 143 DVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWG 202

Query: 114 DEGFFKI 120
           D G+ +I
Sbjct: 203 DNGYGRI 209



 Score = 65.6 bits (160), Expect = 5e-13
 Identities = 36/182 (19%), Positives = 57/182 (31%), Gaps = 15/182 (8%)

Query: 45  MKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLV 104
           +K    +Y  LS      L    N   +  N        L   + LV       IP    
Sbjct: 36  IKGGEDEYVDLSPQ---YLYICANDECLGINGSCDGGGPLSALLKLV---ALKGIPPEED 89

Query: 105 RNSWGPIGPDEGFFKIERGNNACGKDF---LHFNGSETMKKILYKYGPLSVGLNSHLIHF 161
                    +E   +               +  N  E +K+ L K GP+  G + +    
Sbjct: 90  YPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFD 149

Query: 162 Y----NGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKI 215
                                 GHAV++VGY     +    ++V+NSWG    D G+ +I
Sbjct: 150 RLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209

Query: 216 EH 217
            +
Sbjct: 210 SY 211


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score = 61.6 bits (150), Expect = 4e-11
 Identities = 42/138 (30%), Positives = 60/138 (43%), Gaps = 26/138 (18%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
           GL S  D PY   +G   KC    +K        +L   G + + K L    P  V +  
Sbjct: 313 GLSSSSDVPYLAKDG---KCVVSSTKKVYIDS--YLVAKGKDVLNKSLV-ISPTVVYIAV 366

Query: 62  --DLIHD----YNGTPIRKNDETCSPYDLGHAVLLVG--YGKQDDIPYWLVRNSWGPIGP 113
             +L+      YNG         C    L HAVLLVG  Y ++    YW+++NSWG    
Sbjct: 367 SRELLKYKSGVYNGE--------CGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWG 417

Query: 114 DEGFFKIER---GNNACG 128
           + G+ ++ER   G + CG
Sbjct: 418 ENGYMRLERTNEGTDKCG 435



 Score = 47.8 bits (114), Expect = 2e-06
 Identities = 29/96 (30%), Positives = 44/96 (45%), Gaps = 18/96 (18%)

Query: 131 FLHFNGSETMKKILYKYGPLSVGLNSH--LIHF----YNGTPIRKNDETCSPYDLGHAVL 184
           +L   G + + K L    P  V +     L+ +    YNG         C    L HAVL
Sbjct: 342 YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNGE--------CGKS-LNHAVL 391

Query: 185 LVG--YGKQDDIPYWLVRNSWGPIGPDEGFFKIEHT 218
           LVG  Y ++    YW+++NSWG    + G+ ++E T
Sbjct: 392 LVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERT 427


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 57.4 bits (139), Expect = 5e-10
 Identities = 34/142 (23%), Positives = 59/142 (41%), Gaps = 18/142 (12%)

Query: 76  DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLHFN 135
           DETC+PY              +  P+           P    F I+        D+   +
Sbjct: 90  DETCNPY-QAK--------DGECNPFNRCGTCN----PFGECFAIKNYTLYFVSDYGSVS 136

Query: 136 GSETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD- 193
           G + M   +Y  GP+S G+ +   +  Y G   ++  +        H + + G+G  ++ 
Sbjct: 137 GRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLI---NHIISVAGWGVDENG 193

Query: 194 IPYWLVRNSWGPIGPDEGFFKI 215
           + YW+VRNSWG    + G+F+I
Sbjct: 194 VEYWIVRNSWGEPWGERGWFRI 215



 Score = 55.1 bits (133), Expect = 3e-09
 Identities = 24/101 (23%), Positives = 50/101 (49%), Gaps = 5/101 (4%)

Query: 34  KDFLHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVG 92
            D+   +G + M   +Y  GP+S  ++ ++ + +Y G   ++  +        H + + G
Sbjct: 130 SDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLI---NHIISVAG 186

Query: 93  YGKQDD-IPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFL 132
           +G  ++ + YW+VRNSWG    + G+F+I   +    +  L
Sbjct: 187 WGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNL 227


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 51.5 bits (123), Expect = 1e-07
 Identities = 22/47 (46%), Positives = 30/47 (63%), Gaps = 4/47 (8%)

Query: 86  HAVLLVGYGKQDD----IPYWLVRNSWGPIGPDEGFFKIERGNNACG 128
           HA++LVG+G+++       YW+ RNSWG     EG+FKI RG N  G
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSG 666



 Score = 44.9 bits (106), Expect = 2e-05
 Identities = 18/39 (46%), Positives = 26/39 (66%), Gaps = 4/39 (10%)

Query: 181 HAVLLVGYGKQDD----IPYWLVRNSWGPIGPDEGFFKI 215
           HA++LVG+G+++       YW+ RNSWG     EG+FKI
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKI 658


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 50.9 bits (122), Expect = 1e-07
 Identities = 42/131 (32%), Positives = 63/131 (48%), Gaps = 19/131 (14%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN- 60
           GL SE DYPY +   E   C  D+ K K +  K ++     +  K+ +   GP+SV +  
Sbjct: 345 GLCSEDDYPYVSDTPEL--CNIDRCKEK-YKIKSYVSIP-EDKFKEAIRFLGPISVSIAV 400

Query: 61  SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP----------YWLVRNSWGP 110
           SD    Y G      D  C   +  HAV+LVGYG ++             Y++++NSWG 
Sbjct: 401 SDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGE 456

Query: 111 IGPDEGFFKIE 121
              ++GF +IE
Sbjct: 457 SWGEKGFIRIE 467



 Score = 40.1 bits (94), Expect = 5e-04
 Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 15/80 (18%)

Query: 148 GPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP----------Y 196
           GP+SV +  S    FY G      D  C   +  HAV+LVGYG ++             Y
Sbjct: 392 GPISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYY 447

Query: 197 WLVRNSWGPIGPDEGFFKIE 216
           ++++NSWG    ++GF +IE
Sbjct: 448 YIIKNSWGESWGEKGFIRIE 467


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 51.2 bits (122), Expect = 2e-07
 Identities = 24/54 (44%), Positives = 34/54 (62%), Gaps = 6/54 (11%)

Query: 86  HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIER-GNNACGKDFLH 133
           HAV +VGYG     + +   YW+VRNSWG    DEG+FK++  G + C  +F+H
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGPSHCEDNFIH 776



 Score = 47.0 bits (111), Expect = 4e-06
 Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 5/41 (12%)

Query: 181 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           HAV +VGYG     + +   YW+VRNSWG    DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 43.3 bits (102), Expect = 6e-05
 Identities = 36/155 (23%), Positives = 60/155 (38%), Gaps = 28/155 (18%)

Query: 2   GLESEKDY--PYKNANGEKFKCAYDKSKVKLF------TGKDFLHFNGSETMKKILYKYG 53
           G+ +   Y  PY + +G +  C   +   + +       G  +      + +   +Y++G
Sbjct: 295 GILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHG 354

Query: 54  PL--SVLLNSDLI---HDYNGT---------PIRKNDETCSPY---DLGHAVLLVGYGK- 95
           P+  SV  NSD      +                  D     Y   ++ H VL++G+G  
Sbjct: 355 PVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTD 414

Query: 96  QDDIPYWLVRNSWGPIGP--DEGFFKIERGNNACG 128
           ++   YWLV + WG      D G  KI RG NA  
Sbjct: 415 ENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYN 449



 Score = 31.4 bits (71), Expect = 0.35
 Identities = 11/28 (39%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 178 DLGHAVLLVGYGK-QDDIPYWLVRNSWG 204
           ++ H VL++G+G  ++   YWLV + WG
Sbjct: 401 NVNHTVLIIGWGTDENGGDYWLVLDPWG 428


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 33.7 bits (77), Expect = 0.068
 Identities = 17/48 (35%), Positives = 25/48 (52%), Gaps = 10/48 (20%)

Query: 180 GHAVLLVGYGKQDDIPY----------WLVRNSWGPIGPDEGFFKIEH 217
           GHAVL+VGY    DI            ++++NSWG    + G+F I +
Sbjct: 265 GHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISY 312



 Score = 33.7 bits (77), Expect = 0.075
 Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 10/47 (21%)

Query: 85  GHAVLLVGYGKQDDIPY----------WLVRNSWGPIGPDEGFFKIE 121
           GHAVL+VGY    DI            ++++NSWG    + G+F I 
Sbjct: 265 GHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311


>gnl|CDD|219395 pfam07387, Seadorna_VP7, Seadornavirus VP7.  This family consists
           of several Seadornavirus specific VP7 proteins of around
           305 residues in length. The function of this family is
           unknown. However, it appears to be distantly related to
           protein kinases.
          Length = 308

 Score = 28.6 bits (64), Expect = 3.0
 Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 6/40 (15%)

Query: 130 DFLHFNGSETMKKILYKYGPL------SVGLNSHLIHFYN 163
           DF H NG+ T+ K   + GPL      S+    +   ++N
Sbjct: 68  DFQHLNGNSTLAKGKTEIGPLFLKKIRSLPCCLNDALYFN 107


>gnl|CDD|220789 pfam10508, Proteasom_PSMB, Proteasome non-ATPase 26S subunit.  The
           26S proteasome, a eukaryotic ATP-dependent, dumb-bell
           shaped, protease complex with a molecular mass of approx
           20kDa consists of a central 20S proteasome,functioning
           as a catalytic machine, and two large V-shaped terminal
           modules, having possible regulatory roles,composed of
           multiple subunits of 25- 110 kDa attached to the central
           portion in opposite orientations. It is responsible for
           degradation of abnormal intracellular proteins,
           including oxidatively damaged proteins, and may play a
           role as a component of a cellular anti-oxidative system.
           Expression of catalytic core subunits including PSMB5
           and peptidase activities of the proteasome were elevated
           following incubation with 3-methylcholanthrene. The 20S
           proteasome comprises a cylindrical stack of four rings,
           two outer rings formed by seven alpha-subunits
           (alpha1-alpha7) and two inner rings of seven
           beta-subunits (beta1-beta7). Two outer rings of alpha
           subunits maintain structure, while the central beta
           rings contain the proteolytic active core subunits beta1
           (PSMB6), beta2 (PSMB7), and beta5 (PSMB5). Expression of
           PSMB5 can be altered by chemical reactants, such as
           3-methylcholanthrene.
          Length = 504

 Score = 27.8 bits (62), Expect = 5.0
 Identities = 15/53 (28%), Positives = 18/53 (33%), Gaps = 9/53 (16%)

Query: 128 GKDFLHFNGSETMKKILYKYG----PLSVGLNSHLIH-----FYNGTPIRKND 171
           GK  L F  S  ++  L   G     LSV L    +      FY  T     D
Sbjct: 325 GKQLLMFKTSGALEHTLKAAGAHQTMLSVELKKRALQALTSIFYFATEQATED 377


>gnl|CDD|132560 TIGR03521, GldG, gliding-associated putative ABC transporter
           substrate-binding component GldG.  Members of this
           protein family are exclusive to the Bacteroidetes phylum
           (previously Cytophaga-Flavobacteria-Bacteroides). GldG
           is a protein linked to a type of rapid surface gliding
           motility found in certain Bacteroidetes, such as
           Flavobacterium johnsoniae and Cytophaga hutchinsonii.
           Knockouts of GldG abolish the gliding phenotype. GldG,
           along with GldA and GldF are believed to compose an ABC
           transporter and are observed as an operon. Gliding
           motility appears closely linked to chitin utilization in
           the model species Flavobacterium johnsoniae.
           Bacteroidetes with members of this protein family appear
           to have all of the genes associated with gliding
           motility.
          Length = 552

 Score = 27.3 bits (61), Expect = 7.6
 Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 3/25 (12%)

Query: 48  ILYKYGPLSVLLNSDLIHDYNGTPI 72
           +L+KYG   + +N DL+ D N  PI
Sbjct: 298 LLFKYG---IRINPDLVEDLNSAPI 319


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.142    0.459 

Gapped
Lambda     K      H
   0.267   0.0789    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,114,126
Number of extensions: 1131162
Number of successful extensions: 841
Number of sequences better than 10.0: 1
Number of HSP's gapped: 806
Number of HSP's successfully gapped: 41
Length of query: 233
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 139
Effective length of database: 6,768,326
Effective search space: 940797314
Effective search space used: 940797314
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.6 bits)