RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4960
         (341 letters)



>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
           nomenclature); composed of cysteine peptidases (CPs)
           similar to papain, including the mammalian CPs
           (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
           is an endopeptidase with specific substrate preferences,
           primarily for bulky hydrophobic or aromatic residues at
           the S2 subsite, a hydrophobic pocket in papain that
           accommodates the P2 sidechain of the substrate (the
           second residue away from the scissile bond). Most
           members of the papain subfamily are endopeptidases. Some
           exceptions to this rule can be explained by specific
           details of the catalytic domains like the occluding loop
           in cathepsin B which confers an additional
           carboxydipeptidyl activity and the mini-chain of
           cathepsin H resulting in an N-terminal exopeptidase
           activity. Papain-like CPs have different functions in
           various organisms. Plant CPs are used to mobilize
           storage proteins in seeds. Parasitic CPs act
           extracellularly to help invade tissues and cells, to
           hatch or to evade the host immune system. Mammalian CPs
           are primarily lysosomal enzymes with the exception of
           cathepsin W, which is retained in the endoplasmic
           reticulum. They are responsible for protein degradation
           in the lysosome. Papain-like CPs are synthesized as
           inactive proenzymes with N-terminal propeptide regions,
           which are removed upon activation. In addition to its
           inhibitory role, the propeptide is required for proper
           folding of the newly synthesized enzyme and its
           stabilization in denaturing pH conditions. Residues
           within the propeptide region also play a role in the
           transport of the proenzyme to lysosomes or acidified
           vesicles. Also included in this subfamily are proteins
           classified as non-peptidase homologs, which lack
           peptidase activity or have missing active site residues.
          Length = 210

 Score =  202 bits (515), Expect = 3e-64
 Identities = 90/215 (41%), Positives = 119/215 (55%), Gaps = 12/215 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD-H 185
           P+S+DWR  +   + PV+ QG CGSCWAF+T   LE   A+    L  LS+ QLV+C   
Sbjct: 1   PESVDWR--EKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTS 58

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
           GN  CNGGN D AFEYVK  GL S++DYPY  K+     C Y   K    +   + V  G
Sbjct: 59  GNNGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPG 115

Query: 245 VDH--MMHLLQSGPIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGI 301
            +      L   GP+ V ++     + Y G     +   C+   L+HAV +VGYG +NG+
Sbjct: 116 DEEALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGV 173

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
             WIV+NSWG    + GY +I RG+N CGI SYA 
Sbjct: 174 DYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYAS 208


>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  192 bits (490), Expect = 2e-60
 Identities = 84/218 (38%), Positives = 120/218 (55%), Gaps = 15/218 (6%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S DWR+     + PV+ QG+CGSCWAF+    LE +  +    L  LS+ QLV+CD 
Sbjct: 1   LPESFDWREKGA--VTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDT 58

Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
           GN  CNGG  D AFEY+K+  G+ +++DYPY   +     C ++K  +K      +    
Sbjct: 59  GNNGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVP 115

Query: 245 VDH----MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK 298
            +        L ++GP+ V ++      + Y           C+  +LDHAV IVGYG +
Sbjct: 116 YNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVY--KHTECSG-ELDHAVLIVGYGTE 172

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
           NG+  WIV+NSWG    ++GYF+I RG N CGI S A 
Sbjct: 173 NGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEAS 210


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  158 bits (401), Expect = 2e-47
 Identities = 80/216 (37%), Positives = 103/216 (47%), Gaps = 49/216 (22%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+S DWR  K   + PV+ QG+CGSCWAF+ T  LE +  +    L  LS+ QLV+C  
Sbjct: 1   LPESFDWR--KKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSG 58

Query: 186 -GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTS 243
            GN  CNGG  D AFEY+K+  GLE+++ YPY                  V +       
Sbjct: 59  GGNCGCNGGLPDNAFEYIKKNGGLETESCYPY---------------TGSVAI------- 96

Query: 244 GVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEK--NGI 301
                         G+Y                +   C    LDHAV IVGYG +  NG 
Sbjct: 97  DASDFQFY--KS--GIY----------------DHPGCGSGTLDHAVLIVGYGTEVENGK 136

Query: 302 LTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYAY 336
             WIV+NSWG    ++GYF+I RG  N CGIE+   
Sbjct: 137 DYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA 172


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score =  152 bits (385), Expect = 3e-42
 Identities = 96/328 (29%), Positives = 142/328 (43%), Gaps = 46/328 (14%)

Query: 43  FKTYIVKWNRTYTDDNEIKTRFEYFKQD-----GKETDEYY--GTSGSSDRSPQEI---- 91
           F+ +  K+NR +    E   RF  F+ +       + DE Y    +  SD + +E     
Sbjct: 126 FEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLF 185

Query: 92  ------LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLP---------KSLDWRQSK 136
                  +          K R  ++   +K  L + K              + LDWR  +
Sbjct: 186 PVIKVPPKSNSTSHNNDFKARHVSNPTYLKN-LKKAKNTDEDVKDPSKITGEGLDWR--R 242

Query: 137 VKVLNPVESQGR-CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNI 195
              +  V+ QG  CGSCWAF++   +ES   + +     LS+ +LV CD  +  C+GG  
Sbjct: 243 ADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQELVNCDTKSQGCSGGYP 302

Query: 196 DVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQSG 255
           D A EYVK  GL S +D PY  K+    +C     K KV++    V  G D +   L   
Sbjct: 303 DTALEYVKNKGLSSSSDVPYLAKDG---KCVVSSTK-KVYIDSYLVAKGKDVLNKSLVIS 358

Query: 256 PIGVYLN-HRLIESYDGNPIRRNDWACNPHKLDHAVAIV--GYGEKNGILTWIVRNSWG- 311
           P  VY+   R +  Y        +  C    L+HAV +V  GY EK     WI++NSWG 
Sbjct: 359 PTVVYIAVSRELLKYKSGVY---NGECG-KSLNHAVLLVGEGYDEKTKKRYWIIKNSWGT 414

Query: 312 DIGPDHGYFQIER---GANACGIESYAY 336
           D G + GY ++ER   G + CGI +   
Sbjct: 415 DWGEN-GYMRLERTNEGTDKCGILTVGL 441


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score =  125 bits (316), Expect = 2e-33
 Identities = 72/220 (32%), Positives = 111/220 (50%), Gaps = 12/220 (5%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P ++DWR+     + PV++QG CGSCWAF+    +ESQ A+    L  LS+ QLV CDH 
Sbjct: 127 PDAVDWREKGA--VTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHV 184

Query: 187 NLNCNGGNIDVAFEYV--KQYG-LESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVT- 242
           +  C GG +  AFE+V     G + ++  YPY +       C+   E A     D +V+ 
Sbjct: 185 DNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSM 244

Query: 243 -SGVDHMMH-LLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
            S    M   L ++GPI + ++     SY    +     +C   +L+H V +VGY     
Sbjct: 245 ESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLT----SCIGEQLNHGVLLVGYNMTGE 300

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAYLASV 340
           +  W+++NSWG+   + GY ++  G NAC +  Y     V
Sbjct: 301 VPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYPVSVHV 340


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
           cathepsin B and similar proteins, including
           tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
           B is a lysosomal papain-like cysteine peptidase which is
           expressed in all tissues and functions primarily as an
           exopeptidase through its carboxydipeptidyl activity.
           Together with other cathepsins, it is involved in the
           degradation of proteins, proenzyme activation, Ag
           processing, metabolism and apoptosis. Cathepsin B has
           been implicated in a number of human diseases such as
           cancer, rheumatoid arthritis, osteoporosis and
           Alzheimer's disease. The unique carboxydipeptidyl
           activity of cathepsin B is attributed to the presence of
           an occluding loop in its active site which favors the
           binding of the C-termini of substrate proteins. Some
           members of this group do not possess the occluding loop.
           TIN-Ag is an extracellular matrix basement protein which
           was originally identified as a target Ag involved in
           anti-tubular basement membrane antibody-mediated
           interstitial nephritis. It plays a role in renal
           tubulogenesis and is defective in hereditary
           tubulointerstitial disorders. TIN-Ag is exclusively
           expressed in kidney tissues. .
          Length = 236

 Score =  120 bits (303), Expect = 2e-32
 Identities = 72/253 (28%), Positives = 103/253 (40%), Gaps = 62/253 (24%)

Query: 127 PKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQ--VALLKKTLYPLSKSQLVE 182
           P+S D R+       +  +  QG CGSCWAF+       +  +    K    LS   L+ 
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60

Query: 183 CDHG-NLNCNGGNIDVAFEYVKQYGLESQADYPY---------------RNKENITFRC- 225
           C  G    CNGG  D A++Y+   G+ +    PY                     T +C 
Sbjct: 61  CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQ 120

Query: 226 -----TYEKEKAKVFVQDTWVTSGVDHMM-HLLQSGPI---------------GVYLNHR 264
                TYE++K K       V S    +M  ++ +GP+               GVY +  
Sbjct: 121 DGCEKTYEEDKHKG-KSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQHTS 179

Query: 265 LIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIE 323
                 G                HAV I+G+G +NG+  W+  NSWG D G ++GYF+I 
Sbjct: 180 --GKQLGG---------------HAVKIIGWGVENGVPYWLAANSWGTDWG-ENGYFRIL 221

Query: 324 RGANACGIESYAY 336
           RG+N CGIES   
Sbjct: 222 RGSNECGIESEVV 234


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
           Dipeptidyl Peptidase I (DPPI), an atypical papain-like
           cysteine peptidase with chloride dependency and
           dipeptidyl aminopeptidase activity, resulting from its
           tetrameric structure which limits substrate access. Each
           subunit of the tetramer is composed of three peptides:
           the heavy and light chains, which together adopts the
           papain fold and forms the catalytic domain; and the
           residual propeptide region, which forms a beta barrel
           and points towards the substrate's N-terminus. The
           subunit composition is the result of the unique
           characteristic of procathepsin C maturation involving
           the cleavage of the catalytic domain and the
           non-autocatalytic excision of an activation peptide
           within its propeptide region. By removing N-terminal
           dipeptide extensions, cathepsin C activates granule
           serine peptidases (granzymes) involved in cell-mediated
           apoptosis, inflammation and tissue remodelling.
           Loss-of-function mutations in cathepsin C are associated
           with Papillon-Lefevre and Haim-Munk syndromes, rare
           diseases characterized by hyperkeratosis and early-onset
           periodontitis. Cathepsin C is widely expressed in many
           tissues with high levels in lung, kidney and placenta.
           It is also highly expressed in cytotoxic lymphocytes and
           mature myeloid cells.
          Length = 243

 Score =  117 bits (294), Expect = 5e-31
 Identities = 65/239 (27%), Positives = 110/239 (46%), Gaps = 31/239 (12%)

Query: 126 LPKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP------LSK 177
           LPKS DW    +    ++PV +QG CGSC+AFA+   LE+++ +      P      LS 
Sbjct: 1   LPKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSP 60

Query: 178 SQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEK-EKAKVFV 236
             ++ C   +  C+GG   +  ++ + +G+ ++  +PY   ++    C     E  + + 
Sbjct: 61  QHVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYYF 118

Query: 237 QDTW-------VTSGVDHMMHLLQSGPI--GVYLNHRLIESYDG--NPIRRNDWACNPHK 285
            D          T+  +    + ++GPI     +        +G  +    ++ +   + 
Sbjct: 119 SDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDND 178

Query: 286 -------LDHAVAIVGYGE--KNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
                   +HAV +VG+GE    G   WIV+NSWG    + GYF+I RG N CGIES A
Sbjct: 179 NFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQA 237


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
           nomenclature), also referred to as the papain family;
           composed of two subfamilies of cysteine peptidases
           (CPs), C1A (papain) and C1B (bleomycin hydrolase).
           Papain-like enzymes are mostly endopeptidases with some
           exceptions like cathepsins B, C, H and X, which are
           exopeptidases. Papain-like CPs have different functions
           in various organisms. Plant CPs are used to mobilize
           storage proteins in seeds while mammalian CPs are
           primarily lysosomal enzymes responsible for protein
           degradation in the lysosome. Papain-like CPs are
           synthesized as inactive proenzymes with N-terminal
           propeptide regions, which are removed upon activation.
           Bleomycin hydrolase (BH) is a CP that detoxifies
           bleomycin by hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. It forms a hexameric ring barrel structure
           with the active sites imbedded in the central channel.
           Some members of the C1 family are proteins classified as
           non-peptidase homologs which lack peptidase activity or
           have missing active site residues.
          Length = 223

 Score =  102 bits (255), Expect = 1e-25
 Identities = 58/226 (25%), Positives = 86/226 (38%), Gaps = 22/226 (9%)

Query: 129 SLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQ--VALLKKTLYPLSKSQLV----- 181
           S+D R  ++    PV++QG  GSCWAFA+   LES   +   +     LS   L      
Sbjct: 1   SVDLRPLRLT---PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICAND 57

Query: 182 ECDHGNLNCNGGNIDVAFEY-VKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW 240
           EC   N +C+GG    A    V   G+  + DYPY  + +     +     A       +
Sbjct: 58  ECLGINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDY 117

Query: 241 VTS---GVDHMM-HLLQSGP--IGVYLNHRLIESYDGNPIRRNDWACNPHKL--DHAVAI 292
                  ++ +   L + GP   G  +        +G       +          HAV I
Sbjct: 118 RRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVI 177

Query: 293 VGYG--EKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
           VGY      G   +IV+NSWG    D+GY +I        +   A 
Sbjct: 178 VGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYED-VYEMTFGAN 222


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
           papain-like lysosomal cysteine peptidase exhibiting
           carboxymonopeptidase activity. It can also act as a
           carboxydipeptidase, like cathepsin B, but has been shown
           to preferentially cleave substrates through a
           monopeptidyl carboxypeptidase pathway. The propeptide
           region of cathepsin X, the shortest among papain-like
           peptidases, is covalently attached to the active site
           cysteine in the inactive form of the enzyme. Little is
           known about the biological function of cathepsin X. Some
           studies point to a role in early tumorigenesis. A more
           recent study indicates that cathepsin X expression is
           restricted to immune cells suggesting a role in
           phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 99.4 bits (248), Expect = 2e-24
 Identities = 70/244 (28%), Positives = 111/244 (45%), Gaps = 33/244 (13%)

Query: 126 LPKSLDWRQ-SKVKVLNPVESQ---GRCGSCWAFATTAILESQVALLKKTLYP---LSKS 178
           LPKS DWR  + V  ++P  +Q     CGSCWA  +T+ L  ++ + +K  +P   LS  
Sbjct: 1   LPKSWDWRNVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLSVQ 60

Query: 179 QLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYR------NKENITFRCTYE---- 228
            +++C  G  +C+GG+    +EY  ++G+  +   PY+      N  N    C       
Sbjct: 61  VVIDCAGGG-SCHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECF 119

Query: 229 --KEKAKVFVQDTWVTSGVDHMM-HLLQSGPI--GVYLNHRLIESYDGNPIRRNDWACNP 283
             K     FV D    SG D MM  +   GPI  G+      +E+Y G   +        
Sbjct: 120 AIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMA-TEALENYTGGVYKEYVQDPLI 178

Query: 284 HKLDHAVAIVGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGAN-----ACGIESYAYL 337
              +H +++ G+G ++NG+  WIVRNSWG+   + G+F+I   +         IE     
Sbjct: 179 ---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEEDCAW 235

Query: 338 ASVK 341
           A   
Sbjct: 236 ADPI 239


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score =  102 bits (257), Expect = 2e-24
 Identities = 96/344 (27%), Positives = 152/344 (44%), Gaps = 58/344 (16%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQD--------GKETDEYY-GTSGSSD 85
            +++ V++F  +I +  + Y   +E++ R+  F ++         KE   Y  G +   D
Sbjct: 161 TNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGD 220

Query: 86  RSPQEI------LQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPK-SLDWRQSKVK 138
            S +E       L+    +  GK+  R+    + +KK+  + K         DWR     
Sbjct: 221 LSFEEFKKKYLTLKSFDFKSNGKKSPRVINYDDVIKKY--KPKDATFDHAKYDWRLHNG- 277

Query: 139 VLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVA 198
            + PV+ Q  CGSCWAF+T  ++ESQ A+ K  L  LS+ +LV+C   N  C GG I  A
Sbjct: 278 -VTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNA 336

Query: 199 FE-YVKQYGLESQADYPY----RNKENITFRCTYEKEKAKVFVQ--DTWVTSGVDHMMHL 251
           FE  ++  GL S+ DYPY        NI  RC  EK K K +V   +      +  +   
Sbjct: 337 FEDMIELGGLCSEDDYPYVSDTPELCNID-RCK-EKYKIKSYVSIPEDKFKEAIRFL--- 391

Query: 252 LQSGPIGVYLNHRLIESYDGNPIRRN---DWACNPHKLDHAVAIVGYGEKNGI------- 301
              GPI V +        D     +    D  C   + +HAV +VGYG +          
Sbjct: 392 ---GPISVSIA-----VSDDFAFYKGGIFDGECG-EEPNHAVILVGYGMEEIYNSDTKKM 442

Query: 302 ---LTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAYLA 338
                +I++NSWG+   + G+ +IE   N     C + + AY+ 
Sbjct: 443 EKRYYYIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEAYVP 486


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 72.2 bits (177), Expect = 6e-14
 Identities = 53/254 (20%), Positives = 95/254 (37%), Gaps = 48/254 (18%)

Query: 125 PLPKSLDWRQ----SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYP------ 174
           P P +  W      S +    P      C S +  A  A + ++V +      P      
Sbjct: 204 PPPAAWSWGDVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQTF 263

Query: 175 LSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADY--PYRNKENITFRCTYEKEKA 232
           LS   +++C      C GG  +   ++ + +G+ +   Y  PY + + +   C   +   
Sbjct: 264 LSARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRPSR 323

Query: 233 KVFVQDTW--------VTSGVDHMMHLLQSGPI--GVYLN---------------HRLIE 267
           + +  +          VT   + +  + + GP+   VY N               +  ++
Sbjct: 324 RYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLD 383

Query: 268 SYD----GNPIRRNDWACNPHKLDHAVAIVGYGE-KNGILTWIVRNSWGDIGP--DHGYF 320
            Y       P+R    +     ++H V I+G+G  +NG   W+V + WG      D G  
Sbjct: 384 DYSTASADRPLRHYFAS----NVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTR 439

Query: 321 QIERGANACGIESY 334
           +I RG NA  IES 
Sbjct: 440 KIARGVNAYNIESE 453


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 56.1 bits (135), Expect = 1e-08
 Identities = 28/55 (50%), Positives = 36/55 (65%), Gaps = 4/55 (7%)

Query: 285 KLDHAVAIVGYGEK--NGILT--WIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           K++HA+ +VG+GE+  NG L   WI RNSWG      GYF+I RG N  GIES +
Sbjct: 617 KVNHAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQS 671



 Score = 32.6 bits (74), Expect = 0.30
 Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 22/165 (13%)

Query: 87  SPQEILQ---RTGLRLTGKEKERLEADRERVKKFLNERKKG----PLPKSLDWRQ--SKV 137
           SP  ++Q      L          E D E  +       +      LPK+  W    +  
Sbjct: 335 SPVSLMQLGNTNFLAHVHGSDATNEMDLENYEDTEKAPHRELEIDELPKNFTWGDPFNNN 394

Query: 138 KVLNPVESQGRCGSCWAFATTAILESQVAL-----LKKTLYP-----LSKSQLVECDHGN 187
                V +Q  CGSC+  +     + ++ +     L K         LS   ++ C   +
Sbjct: 395 TREYDVTNQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCSFYD 454

Query: 188 LNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKA 232
             CNGG   +  +  K  G+     +PY   E     C Y+ +++
Sbjct: 455 QGCNGGFPYLVSKMAKLQGIPLDKVFPYTATEQT---CPYQVDQS 496


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 49.3 bits (117), Expect = 2e-06
 Identities = 54/223 (24%), Positives = 87/223 (39%), Gaps = 48/223 (21%)

Query: 143 VESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC---DHGNLNCNGGNIDVAF 199
           +E QG C   W FA+   LE+   +     + +S   +  C   +H +  C+ G+  + F
Sbjct: 547 IEDQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKD-RCDEGSNPLEF 605

Query: 200 -EYVKQYG-LESQADYPY----------------------------RNKENITFRC---- 225
            + ++  G L + ++Y Y                              KE  +       
Sbjct: 606 LQIIEDNGFLPADSNYLYNYTKVGEDCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYR 665

Query: 226 TYEKEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHK 285
            YE E     + D ++    D +M+    G +  Y+    +  Y+ N  +  +  C    
Sbjct: 666 AYESEHFHDKM-DAFIKIIKDEIMN---KGSVIAYIKAENVLGYEFNGKKVQN-LCGDDT 720

Query: 286 LDHAVAIVGYG-----EKNGILTWIVRNSWGDIGPDHGYFQIE 323
            DHAV IVGYG     E      WIVRNSWG    D GYF+++
Sbjct: 721 ADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763


>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
           modification, protein turnover, chaperones].
          Length = 372

 Score = 45.6 bits (108), Expect = 2e-05
 Identities = 55/225 (24%), Positives = 83/225 (36%), Gaps = 39/225 (17%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILE-----------SQVALLKKTLYP 174
           LP   D R      ++PV+ QG  GSCWAFATT  LE           S+  +      P
Sbjct: 99  LPSYFDRRDEGK--VSPVKDQGSGGSCWAFATTRSLESYLNPESAWDFSENNMKNLLGVP 156

Query: 175 LSKSQLVECDHGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRC--TYEKEK 231
             K      D+ +   +GGN D++  Y+ ++ G   + D PY      +       +  +
Sbjct: 157 YEK----GFDYTS--NDGGNADMSAAYLTEWSGPVYETDDPYSENSYFSPTNLPVTKHVQ 210

Query: 232 AKVFVQDTW--VTSGVDHMMHLLQSGPIGVYLNHRLIESYD-GNPIRRNDWACNPHKLDH 288
               +      + +G    M     G +   +      S     P    D   +     H
Sbjct: 211 EAQIIPSRKKYLDNGNIKAM-FGFYGAVSSSMYIDATNSLGICIPYPYVD---SGENWGH 266

Query: 289 AVAIVGY----------GEKNGILTWIVRNSWGDIGPDHGYFQIE 323
           AV IVGY              G   +I++NSWG    ++GYF I 
Sbjct: 267 AVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311


>gnl|CDD|213529 TIGR00423, TIGR00423, radical SAM domain protein, CofH subfamily.
           This protein family includes the CofH protein of
          coenzyme F(420) biosynthesis from Methanocaldococcus
          jannaschii, but appears to hit genomes more broadly
          than just the subset that make coenzyme F(420), so that
          narrower group is being built as a separate family
          [Hypothetical proteins, Conserved].
          Length = 309

 Score = 36.2 bits (84), Expect = 0.020
 Identities = 14/52 (26%), Positives = 24/52 (46%), Gaps = 7/52 (13%)

Query: 45 TYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTG 96
          T++V  N  +T+    K +F  F+   K+ D Y         S +EIL++  
Sbjct: 2  TFVVNRNINFTNICVGKCKFCAFRAREKDKDAYV-------LSLEEILEKVK 46


>gnl|CDD|182167 PRK09956, PRK09956, hypothetical protein; Provisional.
          Length = 308

 Score = 32.8 bits (74), Expect = 0.23
 Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 16/82 (19%)

Query: 35  DSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQR 94
           D I  VD   T +V   R +T+D++++T F Y  Q G +T  +        R  QEI +R
Sbjct: 187 DLIGMVDRITTLLV---RGFTNDSQLQTLFNYLLQCG-DTSRF-------TRFIQEIAER 235

Query: 95  TGLRLTGKEKERLEADRERVKK 116
           + L     +KERL    ER+++
Sbjct: 236 SPL-----QKERLMTIAERLRQ 252


>gnl|CDD|202517 pfam03051, Peptidase_C1_2, Peptidase C1-like family.  This family
           is closely related to the Peptidase_C1 family pfam00112,
           containing several prokaryotic and eukaryotic
           aminopeptidases and bleomycin hydrolases.
          Length = 438

 Score = 32.7 bits (75), Expect = 0.31
 Identities = 14/37 (37%), Positives = 18/37 (48%), Gaps = 2/37 (5%)

Query: 286 LDHAVAIVGYGEKNG--ILTWIVRNSWGDIGPDHGYF 320
           + HA+ + G  E +      W V NSWGD     GYF
Sbjct: 359 MTHAMVLTGVDEDDDGKPTKWKVENSWGDDSGKKGYF 395


>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS. Peptide proteinase
          inhibitors can be found as single domain proteins or as
          single or multiple domains within proteins; these are
          referred to as either simple or compound inhibitors,
          respectively. In many cases they are synthesised as
          part of a larger precursor protein, either as a
          prepropeptide or as an N-terminal domain associated
          with an inactive peptidase or zymogen. This domain
          prevents access of the substrate to the active site.
          Removal of the N-terminal inhibitor domain either by
          interaction with a second peptidase or by autocatalytic
          cleavage activates the zymogen. Other inhibitors
          interact direct with proteinases using a simple
          noncovalent lock and key mechanism; while yet others
          use a conformational change-based trapping mechanism
          that depends on their structural and thermodynamic
          properties.
          Length = 57

 Score = 29.5 bits (67), Expect = 0.35
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query: 43 FKTYIVKWNRTYTDDNEIKTRFEYFKQ 69
          F+ +  K  ++Y+ + E   RF  FK+
Sbjct: 1  FEQWKKKHGKSYSSEEEEARRFAIFKE 27


>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS.
          Length = 58

 Score = 29.1 bits (66), Expect = 0.43
 Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 43 FKTYIVKWNRTYTDDNEIKTRFEYFKQ 69
          F+ +  K+ ++Y  + E   RF+ FK+
Sbjct: 1  FEDWKKKYGKSYYSEEEELYRFQIFKE 27


>gnl|CDD|238328 cd00585, Peptidase_C1B, Peptidase C1B subfamily (MEROPS database
           nomenclature); composed of eukaryotic bleomycin
           hydrolases (BH) and bacterial aminopeptidases C (pepC).
           The proteins of this subfamily contain a large insert
           relative to the C1A peptidase (papain) subfamily. BH is
           a cysteine peptidase that detoxifies bleomycin by
           hydrolysis of an amide group. It acts as a
           carboxypeptidase on its C-terminus to convert itself
           into an aminopeptidase and peptide ligase. BH is found
           in all tissues in mammals as well as in many other
           eukaryotes. Bleomycin, a glycopeptide derived from the
           fungus Streptomyces verticullus, is an effective
           anticancer drug due to its ability to induce DNA strand
           breaks. Human BH is the major cause of tumor cell
           resistance to bleomycin chemotherapy, and is also
           genetically linked to Alzheimer's disease. In addition
           to its peptidase activity, the yeast BH (Gal6) binds DNA
           and acts as a repressor in the Gal4 regulatory system.
           BH forms a hexameric ring barrel structure with the
           active sites imbedded in the central channel. The
           bacterial homolog of BH, called pepC, is a cysteine
           aminopeptidase possessing broad specificity. Although
           its crystal structure has not been solved, biochemical
           analysis shows that pepC also forms a hexamer. .
          Length = 437

 Score = 31.8 bits (73), Expect = 0.45
 Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 2/37 (5%)

Query: 288 HAVAIVGY--GEKNGILTWIVRNSWGDIGPDHGYFQI 322
           HA+ + G    E    + W V NSWG+     GYF +
Sbjct: 360 HAMVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVM 396


>gnl|CDD|178652 PLN03105, TCP24, transcription factor TCP24 (TEOSINTE BRANCHED1,
           CYCLOIDEA, AND PCF FAMILY 24); Provisional.
          Length = 324

 Score = 31.0 bits (69), Expect = 0.96
 Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 2/69 (2%)

Query: 53  TYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRE 112
           + TD   + T F++  Q+  +T     +SG+S+ S    L RT +R   +E+ R    ++
Sbjct: 104 SITDLPLLNTNFDHLDQNQNQTKSA-CSSGTSESSLLS-LSRTEIRGKARERARERTAKD 161

Query: 113 RVKKFLNER 121
           R K   N  
Sbjct: 162 RDKDLQNAH 170


>gnl|CDD|215551 PLN03050, PLN03050, pyridoxine (pyridoxamine) 5'-phosphate oxidase;
           Provisional.
          Length = 246

 Score = 29.8 bits (67), Expect = 1.6
 Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 9/68 (13%)

Query: 164 QVALLKKTLYP--LSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNK--- 218
           +VA  +K   P       L+ C  GN   NGG+  VA  ++  +G E    YP ++    
Sbjct: 46  EVADGEKASNPPGRHPRVLLVCGPGN---NGGDGLVAARHLAHFGYEVTVCYPKQSSKPH 102

Query: 219 -ENITFRC 225
            EN+  +C
Sbjct: 103 YENLVTQC 110


>gnl|CDD|184691 PRK14464, PRK14464, ribosomal RNA large subunit methyltransferase
           N; Provisional.
          Length = 344

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 8/44 (18%)

Query: 268 SYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG 311
           S DG+  RR          +  VA+  Y  + G+LT  VRNS G
Sbjct: 281 SVDGDAYRRPSG-------ERIVAMARYLHRRGVLT-KVRNSAG 316


>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
           metabolism].
          Length = 444

 Score = 29.8 bits (67), Expect = 2.2
 Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 2/37 (5%)

Query: 286 LDHAVAIVGYGEKNG--ILTWIVRNSWGDIGPDHGYF 320
           + HA+ + G         L W V NSWG      GYF
Sbjct: 361 MTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYF 397


>gnl|CDD|223637 COG0563, Adk, Adenylate kinase and related kinases [Nucleotide
           transport and metabolism].
          Length = 178

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 13/52 (25%), Positives = 20/52 (38%), Gaps = 2/52 (3%)

Query: 50  WNRTYTDDNE--IKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRL 99
             R   DDNE  +K R + + +      EYY  +       +E+L      L
Sbjct: 126 GRRVREDDNEETVKKRLKVYHEQTAPLIEYYSVTIDGSGEIEEVLADILKAL 177


>gnl|CDD|235066 PRK02746, pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase;
           Provisional.
          Length = 345

 Score = 29.1 bits (66), Expect = 3.4
 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 4/54 (7%)

Query: 286 LDHAVAIVGYGEKNGILTW-IVRNSWGDIGPDHGY-FQIERGANACGIESYAYL 337
           L  A+ I   GE + I+T  I +++W   G  H Y  Q E  A   G+E +  L
Sbjct: 100 LTTAIEITLAGEFDAIVTAPIAKSAWHAAG--HHYPGQTEVLAELAGVERFGML 151


>gnl|CDD|132433 TIGR03392, FeS_syn_CsdA, cysteine desulfurase, catalytic subunit
           CsdA.  Members of this protein family are CsdS. This
           protein, found Escherichia coli, Yersinia pestis,
           Photorhabdus luminescens, and related species, and
           related to SufS, works together with and physically
           interacts with CsdE (a paralog of SufE). CsdA has
           cysteine desulfurase activity that is enhanced by CsdE,
           a sulfur acceptor protein. This gene pair, although
           involved in FeS cluster biosynthesis, is not found next
           to other such genes as are its paralogs from the Suf or
           Isc systems [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Other].
          Length = 398

 Score = 29.0 bits (65), Expect = 3.6
 Identities = 19/73 (26%), Positives = 33/73 (45%), Gaps = 13/73 (17%)

Query: 72  KETDEYY-GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSL 130
             T ++Y  +SG+  RS      +    LT     R E  R++V +FLN     P  +++
Sbjct: 34  DATQQFYRLSSGTVHRSQH----QQAQSLT----ARYELARQQVARFLN----APDAENI 81

Query: 131 DWRQSKVKVLNPV 143
            W +   + +N V
Sbjct: 82  VWTRGTTESINLV 94


>gnl|CDD|182799 PRK10874, PRK10874, cysteine sulfinate desulfinase; Provisional.
          Length = 401

 Score = 29.2 bits (66), Expect = 3.6
 Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 13/71 (18%)

Query: 74  TDEYY-GTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDW 132
           T ++Y  ++G+  RS          RLT     R EA RE+V + LN     P  K++ W
Sbjct: 39  TQQFYSLSAGNVHRSQF----AAAQRLT----ARYEAAREQVAQLLN----APDAKNIVW 86

Query: 133 RQSKVKVLNPV 143
            +   + +N V
Sbjct: 87  TRGTTESINLV 97


>gnl|CDD|151502 pfam11056, UvsY, Recombination, repair and ssDNA binding protein
           UvsY.  UvsY protein enhances the rate of
           single-stranded-DNA-dependant ATP hydrolysis by UvsX
           protein. The enhancement of ATP hydrolysis by UvsY
           protein is shown to result from the ability of UvsY
           protein to increase the affinity of UvsX protein for
           single-stranded DNA.
          Length = 128

 Score = 27.3 bits (61), Expect = 6.8
 Identities = 21/70 (30%), Positives = 29/70 (41%), Gaps = 10/70 (14%)

Query: 49  KWNRTYTDDNEIKTRFEY-FKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKE-KER 106
           KW R Y+D  + K R E   K+  KE   YY     + R+  E+           E K  
Sbjct: 26  KWLRYYSDAKKEKIRLEAQKKKLLKERLLYY-----TGRADDEV---CMEIYEKSEIKTV 77

Query: 107 LEADRERVKK 116
           + AD E +K 
Sbjct: 78  IAADEEVIKI 87


>gnl|CDD|221228 pfam11796, DUF3323, Protein of unknown function N-terminus
           (DUF3323).  Proteins in this entry are encoded within a
           conserved gene four-gene neighborhood found sporadically
           in a phylogenetically broad range of bacteria including:
           Nocardia farcinica, Symbiobacterium thermophilum, and
           Streptomyces avermitilis (Actinobacteria), Geobacillus
           kaustophilus (Firmicutes), Azoarcus sp. EbN1 and
           Ralstonia solanacearum (Beta-proteobacteria).
          Length = 215

 Score = 27.7 bits (62), Expect = 7.7
 Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 75  DEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERK-KGPLPKSLDWR 133
           D    ++G +     E L+     +  ++ ER EA+    + F  E +  G  P+   W 
Sbjct: 42  DAALRSTGFAGVGLAEALEALFGPILTRKAERAEAESAWREAFFAEDRLAGDDPELRAWL 101

Query: 134 Q 134
           +
Sbjct: 102 R 102


>gnl|CDD|233043 TIGR00599, rad18, DNA repair protein rad18.  All proteins in this
           family for which functions are known are involved in
           nucleotide excision repair.This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 397

 Score = 28.0 bits (62), Expect = 7.8
 Identities = 11/38 (28%), Positives = 18/38 (47%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILES 163
           +  S DW  + +  L P+++  RC  C  F    +L S
Sbjct: 6   ITDSSDWLTTPIPSLYPLDTSLRCHICKDFFDVPVLTS 43


>gnl|CDD|239583 cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase
           (Delta6-FADS)-like CD includes the integral-membrane
           enzymes: delta-4, delta-5, delta-6, delta-8,
           delta-8-sphingolipid, and delta-11 desaturases found in
           vertebrates, higher plants, fungi, and bacteria. These
           desaturases are required for the synthesis of highly
           unsaturated fatty acids (HUFAs), which are mainly
           esterified into phospholipids and contribute to
           maintaining membrane fluidity. While HUFAs may be
           required for cold tolerance in bacteria, plants and
           fish, the primary role of HUFAs in mammals is cell
           signaling. These enzymes are described as front-end
           desaturases because they introduce a double bond between
           the pre-exiting double bond and the carboxyl (front) end
           of the fatty acid. Various substrates are involved, with
           both acyl-coenzyme A (CoA) and acyl-lipid desaturases
           present in this CD. Acyl-lipid desaturases are localized
           in the membranes of cyanobacterial thylakoid, plant
           endoplasmic reticulum (ER), and plastid; and acyl-CoA
           desaturases are present in ER membrane. ER-bound plant
           acyl-lipid desaturases and acyl-CoA desaturases require
           cytochrome b5 as an electron donor. Most of the
           eukaryotic desaturase domains have an adjacent
           N-terminal cytochrome b5-like domain. This domain family
           has extensive hydrophobic regions that would be capable
           of spanning the membrane bilayer at least twice.
           Comparison of sequences also reveals the existence of
           three regions of conserved histidine cluster motifs that
           contain the residues: HXXXH, HXX(X)HH, and Q/HXXHH.
           These histidine residues are reported to be
           catalytically essential and proposed to be the ligands
           for the iron atoms contained within the homolog,
           stearoyl CoA desaturase.
          Length = 204

 Score = 27.6 bits (62), Expect = 8.8
 Identities = 18/80 (22%), Positives = 28/80 (35%), Gaps = 21/80 (26%)

Query: 248 MMHLLQSGPIGVY--LNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWI 305
           ++ L     + V   LNH  +   D     +NDW     ++     I G    +  L W+
Sbjct: 114 VVQLAGGLWLAVVFQLNHFGMPVEDPPGESKNDWLE--RQVLTTRNITG----SPFLDWL 167

Query: 306 VRNSWGDIGPDHGY--FQIE 323
                      HG   +QIE
Sbjct: 168 -----------HGGLNYQIE 176


>gnl|CDD|227934 COG5647, COG5647, Cullin, a subunit of E3 ubiquitin ligase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 773

 Score = 27.8 bits (62), Expect = 9.0
 Identities = 21/78 (26%), Positives = 30/78 (38%), Gaps = 16/78 (20%)

Query: 61  KTRFEYFKQDG-----KETDEYYGTSGSSD---RSPQEILQRTGLRLTGKEKERLEADRE 112
           K    Y+K        +ET E+Y    S      S  E L+        K  + LE + E
Sbjct: 212 KENLSYYKSVFEPIFLEETWEFYEMESSEVIELLSVTEYLE--------KAHKILEREEE 263

Query: 113 RVKKFLNERKKGPLPKSL 130
            V+ +L    K PL + L
Sbjct: 264 LVEIYLKVSTKKPLLEVL 281


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 28.0 bits (62), Expect = 9.2
 Identities = 31/94 (32%), Positives = 40/94 (42%), Gaps = 9/94 (9%)

Query: 35  DSIK-QVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQ 93
           DS K ++DA K  I K N    + NE    FE  K+  KE  E            QE  +
Sbjct: 388 DSTKDEIDAIKDLIKKANAQVVNHNERIKNFEKQKKSSKEQLE-KFLVNEFKSDVQEYNK 446

Query: 94  RTGLRLTGKEKERLEA----DRERVKKFLNERKK 123
             GL    K+   LE     ++E VKK  NE K+
Sbjct: 447 YCGLE---KKINNLEKEIKNNQEEVKKLSNEIKE 477


>gnl|CDD|220177 pfam09318, DUF1975, Domain of unknown function (DUF1975).  Members
           of this family of functionally uncharacterized domains
           are predominantly found in the N-terminal region of
           various prokaryotic alpha-glucosyltransferases.
          Length = 203

 Score = 27.3 bits (61), Expect = 9.6
 Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 6   CDHQETNTEQVTYNVNTDSAIYV-WRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRF 64
            + +    +   Y+        V +RD    ++  V+ F        R Y D     +  
Sbjct: 90  LEFEYNPNDIRIYDQGGQLIARVYYRDETKGTVDYVEYFDPNGRLVRRDYYDYRGFLSST 149

Query: 65  EYFKQDGKE-TDEYYGTSGS 83
           +YF  DGK   +EYY   GS
Sbjct: 150 QYFDDDGKVILEEYYNPDGS 169


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 27.8 bits (62), Expect = 9.9
 Identities = 16/93 (17%), Positives = 35/93 (37%), Gaps = 8/93 (8%)

Query: 56  DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVK 115
           D+ E++        D  + +         +   +E        ++ +EK RL        
Sbjct: 14  DEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARL-------- 65

Query: 116 KFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           K L ++KK  + K L+ + + +      + +GR
Sbjct: 66  KELKKQKKQEIQKILEQQNAAIDADMNNKGKGR 98


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.410 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,942,356
Number of extensions: 1591761
Number of successful extensions: 1612
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1576
Number of HSP's successfully gapped: 57
Length of query: 341
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 243
Effective length of database: 6,590,910
Effective search space: 1601591130
Effective search space used: 1601591130
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.4 bits)