RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1727
         (240 letters)



>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  173 bits (441), Expect = 2e-54
 Identities = 67/152 (44%), Positives = 78/152 (51%), Gaps = 17/152 (11%)

Query: 63  LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVKIQS 121
           LVDCD  + GC GGL  NAFE I     GG+  E DYPY   +  C   K      KI+ 
Sbjct: 53  LVDCDTGNNGCNGGLPDNAFEYIKKN--GGIVTESDYPYTAHDGTCKFKKSNSKYAKIKG 110

Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKFLCKGGMDNLDHGVL 178
           Y +V   DE  +   L KNGP++VAI+A     Q Y  GV    +  C G    LDH VL
Sbjct: 111 YGDVPYNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE--CSG---ELDHAVL 165

Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
           IVGYG           PYWI+KNSWG  WGE 
Sbjct: 166 IVGYGTENGV------PYWIVKNSWGTDWGEN 191


>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  165 bits (421), Expect = 2e-51
 Identities = 63/151 (41%), Positives = 79/151 (52%), Gaps = 16/151 (10%)

Query: 63  LVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
           LVDC    + GC GG   NAFE + +   GGL  E DYPY G +  C  N  ++  KI  
Sbjct: 52  LVDCSTSGNNGCNGGNPDNAFEYVKN---GGLASESDYPYTGKDGTCKYNSSKVGAKITG 108

Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINAN-AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
           Y NV   DE  +   L   GP++VAI+A+ + QFY GG+             NL+H VL+
Sbjct: 109 YSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSG----PCCSNTNLNHAVLL 164

Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
           VGYG            YWI+KNSWG  WGEK
Sbjct: 165 VGYGTENGV------DYWIVKNSWGTSWGEK 189


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  118 bits (299), Expect = 2e-33
 Identities = 57/161 (35%), Positives = 64/161 (39%), Gaps = 51/161 (31%)

Query: 63  LVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
           LVDC    + GC GGL  NAFE I     GGLE E  YPY GS                 
Sbjct: 53  LVDCSGGGNCGCNGGLPDNAFEYIKKN--GGLETESCYPYTGS----------------- 93

Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
                                 VAI+A+  QFY  G+       C  G   LDH VLIVG
Sbjct: 94  ----------------------VAIDASDFQFYKSGIYDHPG--C--GSGTLDHAVLIVG 127

Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWG 222
           YG          + YWI+KNSWG  WGE    F I +    
Sbjct: 128 YGTEVEN----GKDYWIVKNSWGTDWGENGY-FRIARGKNN 163


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score =  121 bits (305), Expect = 6e-33
 Identities = 67/154 (43%), Positives = 91/154 (59%), Gaps = 18/154 (11%)

Query: 62  KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRV- 117
           +LV CD VD GCGGGLM  AFE ++  + G +  EK YPY    G    C  N  E+   
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPEC-SNSSELAPG 235

Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
            +I  YV++ S E  MA +L KNGP+++A++A++   Y  GV       C G  + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289

Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
           VL+VGY +     T ++ PYW+IKNSWG  WGEK
Sbjct: 290 VLLVGYNM-----TGEV-PYWVIKNSWGEDWGEK 317


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 96.0 bits (239), Expect = 8e-23
 Identities = 62/157 (39%), Positives = 92/157 (58%), Gaps = 20/157 (12%)

Query: 62  KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEIRVKIQ 120
           +LVDC   + GC GGL+ NAFE +I    GGL  E DYPY       C++++ + + KI+
Sbjct: 317 ELVDCSFKNNGCYGGLIPNAFEDMIEL--GGLCSEDDYPYVSDTPELCNIDRCKEKYKIK 374

Query: 121 SYVNVSSDE-TEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
           SYV++  D+  E  ++L   GP++V+I  +    FY GG+     F  + G +  +H V+
Sbjct: 375 SYVSIPEDKFKEAIRFL---GPISVSIAVSDDFAFYKGGI-----FDGECG-EEPNHAVI 425

Query: 179 IVGYGVHK-----TKFTHKIQPYWIIKNSWGPHWGEK 210
           +VGYG+ +     TK   K + Y+IIKNSWG  WGEK
Sbjct: 426 LVGYGMEEIYNSDTKKMEK-RYYYIIKNSWGESWGEK 461



 Score = 40.5 bits (95), Expect = 4e-04
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 11  DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
             LE+V  F  F+++H K Y T +E  +R   F  NL KI
Sbjct: 161 TNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKI 200


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score = 91.7 bits (228), Expect = 2e-22
 Identities = 52/165 (31%), Positives = 73/165 (44%), Gaps = 27/165 (16%)

Query: 64  VDCDKVDAGCGGGLMSNAFETIISKLGG--GLEGEKDYPYKG-SNRACHLNKEEIR---V 117
           + C +   GC GG     F  ++ K     G+  E  +PY    +R C  +  E R    
Sbjct: 64  LSCSQYSQGCDGG-----FPFLVGKFAEDFGIVTEDYFPYTADDDRPCKASPSECRRYYF 118

Query: 118 KIQSYVN---VSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKF--LCKGGMD 171
              +YV      ++E EM   + +NGP+ VA    +   FY  GV H      +  G  D
Sbjct: 119 SDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDND 178

Query: 172 NL------DHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
           N       +H VL+VG+G  + K       YWI+KNSWG  WGEK
Sbjct: 179 NFNPFELTNHAVLLVGWGEDEIKGE----KYWIVKNSWGSSWGEK 219


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score = 90.5 bits (225), Expect = 5e-21
 Identities = 55/154 (35%), Positives = 70/154 (45%), Gaps = 29/154 (18%)

Query: 63  LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
           LV+CD    GC GG    A E + +K   GL    D PY   +  C +     +V I SY
Sbjct: 287 LVNCDTKSQGCSGGYPDTALEYVKNK---GLSSSSDVPYLAKDGKC-VVSSTKKVYIDSY 342

Query: 123 VNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM------DNLDHG 176
           + V+  +  + K LV   P  V I           VS  L    K G+       +L+H 
Sbjct: 343 L-VAKGKDVLNKSLVI-SPTVVYIA----------VSREL-LKYKSGVYNGECGKSLNHA 389

Query: 177 VLIVGYGV-HKTKFTHKIQPYWIIKNSWGPHWGE 209
           VL+VG G   KTK     + YWIIKNSWG  WGE
Sbjct: 390 VLLVGEGYDEKTK-----KRYWIIKNSWGTDWGE 418



 Score = 35.1 bits (81), Expect = 0.027
 Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 2/45 (4%)

Query: 11  DKLEHVAM--FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR 53
            KLE      F  F +K+N+ +AT  E   R   FR N  +++  
Sbjct: 116 PKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSH 160


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score = 78.3 bits (193), Expect = 1e-17
 Identities = 36/152 (23%), Positives = 64/152 (42%), Gaps = 13/152 (8%)

Query: 66  CDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI----RVKIQS 121
           C  ++  C GG   +A   +++  G  +  E+DYPY   +       E      +VK++ 
Sbjct: 59  CLGINGSCDGGGPLSALLKLVALKG--IPPEEDYPYGAESDGEEPKSEAALNAAKVKLKD 116

Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSH--PLKFLCKGGMDNLDHGVL 178
           Y  V   +  ++ + L K GP+    +  +             + +L     D   H V+
Sbjct: 117 YRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVV 176

Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
           IVGY  +  +     +  +I+KNSWG  WG+ 
Sbjct: 177 IVGYDDNYVE----GKGAFIVKNSWGTDWGDN 204


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score = 64.6 bits (158), Expect = 1e-12
 Identities = 46/172 (26%), Positives = 64/172 (37%), Gaps = 38/172 (22%)

Query: 63  LVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK--------- 112
           L+ C      GC GG    A++ + +    G+      PY       H            
Sbjct: 58  LLSCCSGCGDGCNGGYPDAAWKYLTTT---GVVTGGCQPYTIPPCGHHPEGPPPCCGTPY 114

Query: 113 -------------EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYF-GGV 158
                        EE + K +S  +V SDET++ K ++ NGP+  A        Y+  GV
Sbjct: 115 CTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGV 174

Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
                    GG     H V I+G+GV          PYW+  NSWG  WGE 
Sbjct: 175 YQHTSGKQLGG-----HAVKIIGWGVENG------VPYWLAANSWGTDWGEN 215


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 59.7 bits (145), Expect = 7e-11
 Identities = 42/181 (23%), Positives = 67/181 (37%), Gaps = 34/181 (18%)

Query: 49  KIQIRGEGTHLALK---LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN 105
            I  +G    + L    ++DC    + C GG     +E        G+  E   PY+  +
Sbjct: 45  NIARKGAWPSVYLSVQVVIDCAGGGS-CHGGDPGGVYEYAHKH---GIPDETCNPYQAKD 100

Query: 106 R---------------ACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA-N 149
                            C   K      +  Y +VS    +M   +   GP++  I A  
Sbjct: 101 GECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVSG-RDKMMAEIYARGPISCGIMATE 159

Query: 150 AMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
           A++ Y GGV              ++H + + G+GV           YWI++NSWG  WGE
Sbjct: 160 ALENYTGGVYKE-----YVQDPLINHIISVAGWGVD-----ENGVEYWIVRNSWGEPWGE 209

Query: 210 K 210
           +
Sbjct: 210 R 210


>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS. Peptide proteinase
          inhibitors can be found as single domain proteins or as
          single or multiple domains within proteins; these are
          referred to as either simple or compound inhibitors,
          respectively. In many cases they are synthesised as
          part of a larger precursor protein, either as a
          prepropeptide or as an N-terminal domain associated
          with an inactive peptidase or zymogen. This domain
          prevents access of the substrate to the active site.
          Removal of the N-terminal inhibitor domain either by
          interaction with a second peptidase or by autocatalytic
          cleavage activates the zymogen. Other inhibitors
          interact direct with proteinases using a simple
          noncovalent lock and key mechanism; while yet others
          use a conformational change-based trapping mechanism
          that depends on their structural and thermodynamic
          properties.
          Length = 57

 Score = 49.5 bits (119), Expect = 2e-08
 Identities = 16/33 (48%), Positives = 24/33 (72%)

Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
          F  + +KH KSY+++EE  +R  IF+ NLKKI+
Sbjct: 1  FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIE 33


>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS.
          Length = 58

 Score = 48.4 bits (116), Expect = 4e-08
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
          F  + +K+ KSY ++EE   R +IF+ NL+ I+
Sbjct: 1  FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIE 33


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 52.0 bits (124), Expect = 1e-07
 Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 12/99 (12%)

Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA-NAMQFYFGGVSHPLKFLCKGGMD 171
           E    K+ +++ +  DE      ++  G +   I A N + + F G    ++ LC  G D
Sbjct: 670 EHFHDKMDAFIKIIKDE------IMNKGSVIAYIKAENVLGYEFNGKK--VQNLC--GDD 719

Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
             DH V IVGYG +      K + YWI++NSWG +WG++
Sbjct: 720 TADHAVNIVGYGNYINDEDEK-KSYWIVRNSWGKYWGDE 757


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 50.3 bits (120), Expect = 3e-07
 Identities = 33/129 (25%), Positives = 52/129 (40%), Gaps = 34/129 (26%)

Query: 97  KDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF-YF 155
           KDY Y G    C               N  + E  M   + +NGP+  +  A+   + Y 
Sbjct: 540 KDYNYIGGCYGC---------------NQCNGEKIMMNEIYRNGPIVASFEASPDFYDYA 584

Query: 156 GGV----SHPLKFLCK------------GGMDNLDHGVLIVGYGVHKTKFTHKIQPYWII 199
            GV      P    C              G + ++H +++VG+G  + +   K+  YWI 
Sbjct: 585 DGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWG--EEEINGKLYKYWIG 642

Query: 200 KNSWGPHWG 208
           +NSWG +WG
Sbjct: 643 RNSWGKNWG 651


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 46.4 bits (110), Expect = 4e-06
 Identities = 25/67 (37%), Positives = 30/67 (44%), Gaps = 7/67 (10%)

Query: 154 YFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPY----WIIKNSWGPHWGE 209
              G+  P  +      +N  H VLIVGY         K  P     +IIKNSWG +WGE
Sbjct: 247 NSLGICIP--YPYVDSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGE 304

Query: 210 KTMPFWI 216
               FWI
Sbjct: 305 NGY-FWI 310


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 45.3 bits (107), Expect = 2e-05
 Identities = 39/178 (21%), Positives = 64/178 (35%), Gaps = 35/178 (19%)

Query: 54  GEGTHLALKLV-DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY--PYK---GSNRA 107
           G+ T L+ + V DC +   GC GG      E        G+     Y  PY    G  RA
Sbjct: 259 GQQTFLSARHVLDCSQYGQGCAGGF---PEEVGKFAETFGILTTDSYYIPYDSGDGVERA 315

Query: 108 CHLNKEEIR------VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
           C   +   R        +  Y    +D  E+   + ++GP+  ++ AN+  +     S  
Sbjct: 316 CKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTE 375

Query: 162 LKFLCKGGMD---------------NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWG 204
                                    N++H VLI+G+G  +         YW++ + WG
Sbjct: 376 DVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENG-----GDYWLVLDPWG 428


>gnl|CDD|176025 cd08379, C2D_MCTP_PRT_plant, C2 domain fourth repeat found in
           Multiple C2 domain and Transmembrane region Proteins
           (MCTP); plant subset.  MCTPs are involved in Ca2+
           signaling at the membrane.  Plant-MCTPs are composed of
           a variable N-terminal sequence, four C2 domains, two
           transmembrane regions (TMRs), and a short C-terminal
           sequence.  It is one of four protein classes that are
           anchored to membranes via a transmembrane region; the
           others being synaptotagmins, extended synaptotagmins,
           and ferlins. MCTPs are the only membrane-bound C2 domain
           proteins that contain two functional TMRs. MCTPs are
           unique in that they bind Ca2+ but not phospholipids. C2
           domains fold into an 8-standed beta-sandwich that can
           adopt 2 structural arrangements: Type I and Type II,
           distinguished by a circular permutation involving their
           N- and C-terminal beta strands. Many C2 domains are
           Ca2+-dependent membrane-targeting modules that bind a
           wide variety of substances including bind phospholipids,
           inositol polyphosphates, and intracellular proteins.
           Most C2 domain proteins are either signal transduction
           enzymes that contain a single C2 domain, such as protein
           kinase C, or membrane trafficking proteins which contain
           at least two C2 domains, such as synaptotagmin 1.
           However, there are a few exceptions to this including
           RIM isoforms and some splice variants of piccolo/aczonin
           and intersectin which only have a single C2 domain.  C2
           domains with a calcium binding region have negatively
           charged residues, primarily aspartates, that serve as
           ligands for calcium ions. This cd contains the fourth C2
           repeat, C2D, and has a type-II topology.
          Length = 126

 Score = 28.1 bits (63), Expect = 2.1
 Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 7/47 (14%)

Query: 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS-SAPVT 240
            Y + K  +GP W         +++S  PRW EQ T  +Y     +T
Sbjct: 26  AYCVAK--YGPKWVR----TRTVEDSSNPRWNEQYTWPVYDPCTVLT 66


>gnl|CDD|211774 TIGR02993, ectoine_eutD, ectoine utilization protein EutD.  Members
           of this family are putative peptidases or hydrolases
           similar to Xaa-Pro aminopeptidase (pfam00557). They
           belong to ectoine utilization operons, as found in
           Sinorhizobium meliloti 1021 (where it is known to be
           induced by ectoine), Mesorhizobium loti, Silicibacter
           pomeroyi, Agrobacterium tumefaciens, and Pseudomonas
           putida. The exact function is unknown.
          Length = 391

 Score = 29.1 bits (65), Expect = 2.5
 Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 19/66 (28%)

Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMP--------------FWIIKNSWGPRWGE 227
           YG+HK   T      + I  S+ P WGE+TM               F  +   W   WG 
Sbjct: 310 YGIHKDSRTG-----YPIGLSYPPDWGERTMSLRPGDNTVLQPGMTFHFMTGLWMEDWGL 364

Query: 228 QVTKSI 233
           ++T+SI
Sbjct: 365 EITESI 370


>gnl|CDD|188453 TIGR03938, deacetyl_PgaB, poly-beta-1,6-N-acetyl-D-glucosamine
           N-deacetylase PgaB.  Two well-characterized systems
           produce polysaccharide based on N-acetyl-D-glucosamine
           in straight chains with beta-1,6 linkages. These are
           encoded by the icaADBC operon in Staphylococcus species,
           where the system is designated polysaccharide
           intercellular adhesin (PIA), and the pgaABCD operon in
           Gram-negative bacteria such as E. coli. Both systems
           include a putative polysaccharide deacetylase. The PgaB
           protein, described here, contains an additional domain
           lacking from its Gram-positive counterpart IcaB
           (TIGR03933). Deacetylation by this protein appears
           necessary to allow export through the porin PgaA [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 619

 Score = 28.0 bits (63), Expect = 4.6
 Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 3/34 (8%)

Query: 17  AMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
           A      +     Y T  EY +R+   R +LKK 
Sbjct: 158 AATTRAYDPATGRYETDAEYRQRI---RDDLKKS 188


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
            functions are known are involvedin recombination,
            recombinational repair, and/or non-homologous end
            joining.They are components of an exonuclease complex
            with MRE11 homologs. This family is distantly related to
            the SbcC family of bacterial proteins.This family is
            based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 28.1 bits (62), Expect = 5.6
 Identities = 12/53 (22%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 1    MEATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR 53
            M+       H KLE     +     H  +   ++ Y K ++ F+  L++ Q R
Sbjct: 1047 MQVLQMKQEHQKLEE--NIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFR 1097


>gnl|CDD|165867 PLN02223, PLN02223, phosphoinositide phospholipase C.
          Length = 537

 Score = 27.7 bits (61), Expect = 6.0
 Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 1/26 (3%)

Query: 205 PHWGEKTMPFWIIKNSWGPRWGEQVT 230
           PH  EK M   +  N W P WGE+ T
Sbjct: 447 PH-DEKIMKTTVKNNEWKPTWGEEFT 471


>gnl|CDD|145887 pfam02973, Sialidase, Sialidase, N-terminal domain. 
          Length = 190

 Score = 27.1 bits (60), Expect = 8.0
 Identities = 16/80 (20%), Positives = 26/80 (32%), Gaps = 8/80 (10%)

Query: 75  GGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN-KEEIRVKIQSYVNVSSDETEMA 133
           GG   N    I +  G  L GE    Y        +  K + +  IQS  ++S+      
Sbjct: 4   GGEYENNNINISNGQGYDLSGELGKLYVLEEGTIIMRFKSDKKNSIQSLFSISNSTA--- 60

Query: 134 KYLVKNGPMAVAINANAMQF 153
                N    + I  + +  
Sbjct: 61  ----GNEYFHIYITNSEIGL 76


>gnl|CDD|223291 COG0213, DeoA, Thymidine phosphorylase [Nucleotide transport and
           metabolism].
          Length = 435

 Score = 27.2 bits (61), Expect = 8.3
 Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 5/39 (12%)

Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAIN 147
            L+KEEI   I  YVN +  + +++ +L     MA+   
Sbjct: 16  ALSKEEIEFFINGYVNGTIPDYQISAFL-----MAIFFR 49


>gnl|CDD|219433 pfam07500, TFIIS_M, Transcription factor S-II (TFIIS), central
          domain.  Transcription elongation by RNA polymerase II
          is regulated by the general elongation factor TFIIS.
          This factor stimulates RNA polymerase II to transcribe
          through regions of DNA that promote the formation of
          stalled ternary complexes. TFIIS is composed of three
          structural domains, termed I, II, and III. The two
          C-terminal domains (II and III), this domain and
          pfam01096 are required for transcription activity.
          Length = 115

 Score = 26.1 bits (58), Expect = 9.2
 Identities = 7/32 (21%), Positives = 12/32 (37%), Gaps = 7/32 (21%)

Query: 17 AMFNHFLEKHNKSYATKEEYHKRLRIFRANLK 48
           +F  +         T ++Y  + R    NLK
Sbjct: 40 ELFKLYGN-------TGKKYKNKYRSLMFNLK 64


>gnl|CDD|199985 TIGR00009, L28, ribosomal protein L28.  This model describes
          bacterial and chloroplast forms of the 50S ribosomal
          protein L28, a polypeptide about 60 amino acids in
          length. Mitochondrial homologs differ substantially in
          architecture and are not included [Protein synthesis,
          Ribosomal proteins: synthesis and modification].
          Length = 56

 Score = 24.9 bits (55), Expect = 9.4
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 37 HKRLRIFRANLKKIQIRGEGTHLALKL 63
           K  R F+ NL+K ++  +G    L++
Sbjct: 23 RKTKRRFKPNLQKKRVWVDGRFKRLRV 49


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.136    0.430 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,901,916
Number of extensions: 1089515
Number of successful extensions: 1039
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1005
Number of HSP's successfully gapped: 39
Length of query: 240
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 146
Effective length of database: 6,768,326
Effective search space: 988175596
Effective search space used: 988175596
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.7 bits)