RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11074
         (710 letters)



>gnl|CDD|199850 cd03868, M14_CPD_I, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase D, domain I subgroup.  The
           first carboxypeptidase (CP)-like domain of
           Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD
           differs from all other metallocarboxypeptidases in that
           it contains multiple CP-like domains. CPD belongs to the
           N/E-like subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPD is a
           single-chain protein containing a signal peptide, three
           tandem repeats of CP-like domains separated by short
           bridge regions, followed by a transmembrane domain, and
           a C-terminal cytosolic tail. The first two CP-like
           domains of CPD contain all of the essential active site
           and substrate-binding residues, while the third CP-like domain
           lacks critical residues necessary for enzymatic activity
           and is inactive towards standard CP substrates. Domain I
           is optimally active at pH 6.3-7.5 and prefers substrates
           with C-terminal Arg, whereas domain II is active at pH
           5.0-6.5 and prefers substrates with C-terminal Lys. This
           Domain I family contains two contiguous surface
           cysteines that may become palmitoylated and target the
           enzyme to membranes, thus regulating intracellular
           trafficking. CPD functions in the processing of proteins
           that transit the secretory pathway, and is present in
           all vertebrates as well as Drosophila. It is broadly
           distributed in all tissue types. Within cells, CPD is
           present in the trans Golgi network and immature
           secretory vesicles, but is excluded from mature
           vesicles. It is thought to play a role in the processing
           of proteins that are initially processed by furin or
           related endopeptidases present in the trans Golgi
           network, such as growth factors and receptors. CPD is
           implicated in the pathogenesis of lupus erythematosus
           (LE); it is regulated by TGF-beta in various cell types
           of murine and human origin and is significantly
           down-regulated in CD14 positive cells isolated from
           patients with LE. As down-regulation of CPD leads to
           down-modulation of TGF-beta, CPD may have a role in a
           positive feedback loop. In D. melanogaster, the CPD
           variant 1B short (DmCPD1Bs) is necessary and sufficient
           for viability of the fruit fly.
          Length = 293

 Score =  377 bits (969), Expect = e-126
 Identities = 148/296 (50%), Positives = 192/296 (64%), Gaps = 38/296 (12%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y++  EL   +    K+YP + R  +IG+SV+GR+LWA+ I+   DN + R   +PMFKY
Sbjct: 1   YHHYDELTDLLASLEKAYPDIARLHSIGRSVEGRDLWALRIS---DNVNSREPGRPMFKY 57

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
           V NMHGDETVG  ++++L QYL+   G D R+T+L+N+TDIY++PS+NPDGF  ++EG C
Sbjct: 58  VGNMHGDETVGRQVLIYLAQYLLENYGGDPRVTRLVNTTDIYLMPSMNPDGFERSQEGDC 117

Query: 480 DSLDGYVGRKNAHGVDLNRNFPDQFE--YEAKKVYEPETQAIMNFIYSNPFVLSGNLHGG 537
               GY GR NA+ VDLNRNFPDQFE  +   +  +PET A++N+I SNPFVLS NLHGG
Sbjct: 118 S-CGGYGGRGNANNVDLNRNFPDQFEGKHVRAQERQPETLAMINWIRSNPFVLSANLHGG 176

Query: 538 AV------------------------------ARDYASRNPMMAPGHACGFD--FKDGIT 565
           +V                              A  YAS +P M  G  C  +  FKDGIT
Sbjct: 177 SVVASYPYDDSSSHNECGVYSKSPDDAVFRYLALTYASNHPTMRTGKPCCENETFKDGIT 236

Query: 566 NGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 621
           NG +WY V GGMQD+NY+HSNCFEIT+ELSCCKYP AS+L   W  N+ESL+  IE
Sbjct: 237 NGAHWYDVPGGMQDYNYLHSNCFEITLELSCCKYPPASELPQEWNNNRESLLAYIE 292



 Score =  164 bits (416), Expect = 2e-45
 Identities = 58/119 (48%), Positives = 87/119 (73%), Gaps = 3/119 (2%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y++  EL   +    K+YP + R  +IG+SV+GR+LWA+ I+ +V+S   R   +PMFKY
Sbjct: 1   YHHYDELTDLLASLEKAYPDIARLHSIGRSVEGRDLWALRISDNVNS---REPGRPMFKY 57

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
           V NMHGDETVG  ++++L QYL+   G D R+T+L+N+TDIY++PS+NPDGF  ++EG+
Sbjct: 58  VGNMHGDETVGRQVLIYLAQYLLENYGGDPRVTRLVNTTDIYLMPSMNPDGFERSQEGD 116



 Score = 91.6 bits (228), Expect = 3e-20
 Identities = 31/45 (68%), Positives = 37/45 (82%)

Query: 172 MQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           MQD+NY+HSNCFEIT+ELSCCKYP AS+L   W  N+ESL+  IE
Sbjct: 248 MQDYNYLHSNCFEITLELSCCKYPPASELPQEWNNNRESLLAYIE 292


>gnl|CDD|199842 cd03858, M14_CP_N-E_like, Peptidase M14 carboxypeptidase subfamily
           N/E-like.  Carboxypeptidase (CP) N/E-like subfamily of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding CPs which hydrolyze single,
           C-terminal amino acids from polypeptide chains, and have
           a recognition site for the free C-terminal carboxyl
           group, which is a key determinant of specificity. The
           N/E subfamily includes eight members, of which five
           (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically
           active, while the other three are non-active (CPX1,
           CPX2, ACLP/AEBP1) and lack the critical active site and
           substrate-binding residues considered necessary for CP
           activity. These non-active members may function as
           binding proteins or display catalytic activity towards
           other substrates. Unlike the A/B CP subfamily, enzymes
           belonging to the N/E subfamily are not produced as
           inactive precursors that require proteolysis to produce
           the active form; rather, they rely on their substrate
           specificity and subcellular compartmentalization to
           prevent inappropriate cleavages that would otherwise
           damage the cell. In addition, all members of the N/E
           subfamily contain an extra C-terminal domain that is not
           present in the A/B subfamily. This domain has structural
           homology to transthyretin and other proteins and has
           been proposed to function as a folding domain. The
           active N/E enzymes fulfill a variety of cellular
           functions, including prohormone processing, regulation
           of peptide hormone activity, alteration of
           protein-protein or protein-cell interactions and
           transcriptional regulation.
          Length = 293

 Score =  375 bits (965), Expect = e-126
 Identities = 144/298 (48%), Positives = 184/298 (61%), Gaps = 42/298 (14%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           ++N  EL+AF+ K  K YP++ R  +IGKSV+GR LW +EI+   DNP      +P FKY
Sbjct: 1   HHNYEELEAFLKKLNKKYPNITRLYSIGKSVEGRPLWVLEIS---DNPGEHEPGEPEFKY 57

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
           V NMHG+E VG  L++ L QYL    G D RIT L+NST I+I+PS+NPDG+  A+EG C
Sbjct: 58  VGNMHGNEVVGRELLLLLAQYLCENYGGDPRITNLVNSTRIHIMPSMNPDGYEKAQEGDC 117

Query: 480 DSLDGYVGRKNAHGVDLNRNFPDQFE--YEAKKVYEPETQAIMNFIYSNPFVLSGNLHGG 537
               G  GR NA+GVDLNRNFPD F   Y   +  +PET+A+M +I S PFVLS NLHGG
Sbjct: 118 ---GGVTGRANANGVDLNRNFPDLFATIYSDNEPRQPETKAVMKWIKSIPFVLSANLHGG 174

Query: 538 AV------------------------------ARDYASRNPMMAPGHAC----GFDFKDG 563
           A+                              A+ YA+ +P M  G  C       F  G
Sbjct: 175 ALVANYPYDDTPSGTERTEYSPTPDDDVFRYLAKTYANAHPTMHKGQPCCCNDDESFPGG 234

Query: 564 ITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 621
           ITNG  WY V+GGMQD+NY+H+NCFEIT+ELSCCK+P AS+L  +W  NKE+L+  IE
Sbjct: 235 ITNGAAWYSVSGGMQDYNYLHTNCFEITLELSCCKFPPASELPKFWEENKEALLNYIE 292



 Score =  162 bits (412), Expect = 6e-45
 Identities = 61/126 (48%), Positives = 83/126 (65%), Gaps = 3/126 (2%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           ++N  EL+AF+ K  K YP++ R  +IGKSV+GR LW +EI+   D+P      +P FKY
Sbjct: 1   HHNYEELEAFLKKLNKKYPNITRLYSIGKSVEGRPLWVLEIS---DNPGEHEPGEPEFKY 57

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGNL 165
           V NMHG+E VG  L++ L QYL    G D RIT L+NST I+I+PS+NPDG+  A+EG+ 
Sbjct: 58  VGNMHGNEVVGRELLLLLAQYLCENYGGDPRITNLVNSTRIHIMPSMNPDGYEKAQEGDC 117

Query: 166 NTKQKR 171
                R
Sbjct: 118 GGVTGR 123



 Score = 95.4 bits (238), Expect = 1e-21
 Identities = 29/45 (64%), Positives = 38/45 (84%)

Query: 172 MQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           MQD+NY+H+NCFEIT+ELSCCK+P AS+L  +W  NKE+L+  IE
Sbjct: 248 MQDYNYLHTNCFEITLELSCCKFPPASELPKFWEENKEALLNYIE 292


>gnl|CDD|199848 cd03866, M14_CPM, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase M subgroup.  Peptidase M14
           Carboxypeptidase (CP) M (CPM) belongs to the N/E
           subfamily of the M14 family of metallocarboxypeptidases
           (MCPs). The M14 family are zinc-binding CPs which
           hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. CPM is an extracellular
           glycoprotein, bound to cell membranes via a
           glycosyl-phosphatidylinositol on the C-terminus of the
           protein. It specifically removes C-terminal basic
           residues such as lysine and arginine from peptides and
           proteins. The highest levels of CPM have been found in
           human lung and placenta, but significant amounts are
           present in kidney, blood vessels, intestine, brain, and
           peripheral nerves. CPM has also been found in soluble
           form in various body fluids, including amniotic fluid,
           seminal plasma and urine. Due to its wide distribution
           in a variety of tissues, it is believed that it plays an
           important role in the control of peptide hormones and
           growth factor activity on the cell surface and in the
           membrane-localized degradation of extracellular
           proteins; for example, it hydrolyses the C-terminal
           arginine of epidermal growth factor (EGF) resulting in
           des-Arg-EGF which binds to the EGF receptor (EGFR) with
           an equal or greater affinity than native EGF.  CPM is a
           required processing enzyme that generates specific
           agonists for the B1 receptor.
          Length = 292

 Score =  266 bits (680), Expect = 2e-83
 Identities = 127/298 (42%), Positives = 171/298 (57%), Gaps = 45/298 (15%)

Query: 359 RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFK 418
            Y+N+ +++ ++    K+Y  +    +IG+SV+GR LW + +     +P    +  P FK
Sbjct: 4   NYHNTVQMEQYLKDVNKNYSSITHLHSIGQSVEGRELWVLVLGQ---HPREHRIGIPEFK 60

Query: 419 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGK 478
           YVANMHG+E VG  L++ LI YLV   G D  IT+L+NST I+I+PS+NPDGF A+K   
Sbjct: 61  YVANMHGNEVVGRELLLHLINYLVTSYGSDSVITRLINSTRIHIMPSMNPDGFEASK--- 117

Query: 479 CDSLDGY--VGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFIYSNPFVLSGNLHG 536
               D Y  VGR N +G DLNRNFPD FE   ++  +PET+A+M ++ S  FVLS NLHG
Sbjct: 118 ---PDCYYTVGRYNKNGYDLNRNFPDAFEENNEQR-QPETRAVMEWLKSETFVLSANLHG 173

Query: 537 GAV-------------------------------ARDYASRNPMMAPGHACG--FDFKDG 563
           GA+                               A+ YA  +  M  G+ C     F  G
Sbjct: 174 GALVASYPYDNGNGGSEQQGYRSVTPDDDVFVYLAKTYAYNHTNMYRGNHCSDLQSFPSG 233

Query: 564 ITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 621
           ITNG  WY + GGMQD+NYV + C EIT+ELSCCKYP    L  +W ANK SLI+ I+
Sbjct: 234 ITNGYQWYPLQGGMQDYNYVWAQCLEITLELSCCKYPPEEQLPAFWEANKASLIEYIK 291



 Score =  120 bits (303), Expect = 3e-30
 Identities = 51/117 (43%), Positives = 75/117 (64%), Gaps = 3/117 (2%)

Query: 45  RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFK 104
            Y+N+ +++ ++    K+Y  +    +IG+SV+GR LW + +      P    +  P FK
Sbjct: 4   NYHNTVQMEQYLKDVNKNYSSITHLHSIGQSVEGRELWVLVLGQ---HPREHRIGIPEFK 60

Query: 105 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAK 161
           YVANMHG+E VG  L++ LI YLV   G D  IT+L+NST I+I+PS+NPDGF A+K
Sbjct: 61  YVANMHGNEVVGRELLLHLINYLVTSYGSDSVITRLINSTRIHIMPSMNPDGFEASK 117



 Score = 70.3 bits (172), Expect = 4e-13
 Identities = 28/48 (58%), Positives = 34/48 (70%)

Query: 169 QKRMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           Q  MQD+NYV + C EIT+ELSCCKYP    L  +W ANK SLI+ I+
Sbjct: 244 QGGMQDYNYVWAQCLEITLELSCCKYPPEEQLPAFWEANKASLIEYIK 291


>gnl|CDD|199845 cd03863, M14_CPD_II, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase D, domain II subgroup.  The
           second carboxypeptidase (CP)-like domain of
           Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD
           differs from all other metallocarboxypeptidases in that
           it contains multiple CP-like domains. CPD belongs to the
           N/E-like subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPD is a
           single-chain protein containing a signal peptide, three
           tandem repeats of CP-like domains separated by short
           bridge regions, followed by a transmembrane domain, and
           a C-terminal cytosolic tail. The first two CP-like
           domains of CPD contain all of the essential active site
           and substrate-binding residues, while the third CP-like
           domain lacks critical residues necessary for enzymatic
           activity and is inactive towards standard CP substrates.
           Domain I is optimally active at pH 6.3-7.5 and prefers
           substrates with C-terminal Arg, whereas domain II is
           active at pH 5.0-6.5 and prefers substrates with
           C-terminal Lys. CPD functions in the processing of
           proteins that transit the secretory pathway, and is
           present in all vertebrates as well as Drosophila. It is
           broadly distributed in all tissue types. Within cells,
           CPD is present in the trans-Golgi network and immature
           secretory vesicles, but is excluded from mature
           vesicles. It is thought to play a role in the processing
           of proteins that are initially processed by furin or
           related endopeptidases present in the trans-Golgi
           network, such as growth factors and receptors. CPD is
           implicated in the pathogenesis of lupus erythematosus
           (LE); it is regulated by TGF-beta in various cell types
           of murine and human origin and is significantly
           down-regulated in CD14 positive cells isolated from
           patients with LE. As down-regulation of CPD leads to
           down-modulation of TGF-beta, CPD may have a role in a
           positive feedback loop.
          Length = 296

 Score =  246 bits (630), Expect = 3e-76
 Identities = 120/296 (40%), Positives = 174/296 (58%), Gaps = 40/296 (13%)

Query: 359 RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFK 418
           R+++ ++++ F+ +    YP + R  ++GKSV+ R L+ +EI+   DNP      +P FK
Sbjct: 7   RHHHFSDMEIFLRRFANEYPSITRLYSVGKSVELRELYVMEIS---DNPGVHEAGEPEFK 63

Query: 419 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGK 478
           Y+ NMHG+E VG  L++ LI+YL    G D  +T L+ ST I+I+PS+NPDG+  ++EG 
Sbjct: 64  YIGNMHGNEVVGRELLLNLIEYLCKNFGTDPEVTDLVQSTRIHIMPSMNPDGYEKSQEG- 122

Query: 479 CDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFIYSNPFVLSGNLHGGA 538
                G VGR N++  DLNRNFPDQF ++     +PET A+M+++ + PFVLS NLHGG+
Sbjct: 123 --DRGGTVGRNNSNNYDLNRNFPDQF-FQITDPPQPETLAVMSWLKTYPFVLSANLHGGS 179

Query: 539 ----------------------------VARDYASRNPMMAPGHACGFD-----FKDGIT 565
                                       +A  Y+  N  M  G  C        F  GIT
Sbjct: 180 LVVNYPFDDDEQGIAIYSKSPDDAVFQQLALSYSKENSKMYQGSPCKDLYPTEYFPHGIT 239

Query: 566 NGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 621
           NG  WY V GGMQD+NY+++NCFE+T+EL C KYPKA +L  YW  N+ SL++ I+
Sbjct: 240 NGAQWYNVPGGMQDWNYLNTNCFEVTIELGCVKYPKAEELPKYWEQNRRSLLQFIK 295



 Score =  108 bits (272), Expect = 4e-26
 Identities = 50/133 (37%), Positives = 82/133 (61%), Gaps = 3/133 (2%)

Query: 45  RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFK 104
           R+++ ++++ F+ +    YP + R  ++GKSV+ R L+ +EI+   D+P      +P FK
Sbjct: 7   RHHHFSDMEIFLRRFANEYPSITRLYSVGKSVELRELYVMEIS---DNPGVHEAGEPEFK 63

Query: 105 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
           Y+ NMHG+E VG  L++ LI+YL    G D  +T L+ ST I+I+PS+NPDG+  ++EG+
Sbjct: 64  YIGNMHGNEVVGRELLLNLIEYLCKNFGTDPEVTDLVQSTRIHIMPSMNPDGYEKSQEGD 123

Query: 165 LNTKQKRMQDFNY 177
                 R    NY
Sbjct: 124 RGGTVGRNNSNNY 136



 Score = 68.1 bits (166), Expect = 2e-12
 Identities = 25/45 (55%), Positives = 36/45 (80%)

Query: 172 MQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           MQD+NY+++NCFE+T+EL C KYPKA +L  YW  N+ SL++ I+
Sbjct: 251 MQDWNYLNTNCFEVTIELGCVKYPKAEELPKYWEQNRRSLLQFIK 295


>gnl|CDD|215816 pfam00246, Peptidase_M14, Zinc carboxypeptidase. 
          Length = 277

 Score =  235 bits (602), Expect = 3e-72
 Identities = 90/291 (30%), Positives = 128/291 (43%), Gaps = 55/291 (18%)

Query: 366 LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHG 425
           ++A++     +YP LVR  +IGKSV+GR L  ++I+     P      KP     A +H 
Sbjct: 1   IEAWLDALAAAYPDLVRLVSIGKSVEGRPLLVLKISKGPHGPG-----KPAVLIDAGIHA 55

Query: 426 DETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGY 485
            E +G A +++LI+ L+     D  IT+LL+ TDIYIVP +NPDG+     G        
Sbjct: 56  REWIGPAALLYLIRQLL--SNNDPEITRLLDKTDIYIVPVLNPDGYEYTHTGDR---LWR 110

Query: 486 VGRKNAHGVDLNRNFPDQFEYEAKKVY-------------EPETQAIMNFIYSNPFVLSG 532
             R NA+GVDLNRNFPD +                     EPET+A+ +FI S  F L  
Sbjct: 111 KNRSNANGVDLNRNFPDLWNEVGASSNPCSETYRGPAPFSEPETRAVADFIRSYKFKLYI 170

Query: 533 NLHGGA-----------------------VARDYASRNPMMAPGHACGFDFKDGITNGNY 569
           +LHG +                       +A+ YA     M      G  +  GITNG+ 
Sbjct: 171 DLHGYSQLILYPYGYTYSSLPPDDEELKSLAKRYAKALGAMLY----GTRYTYGITNGDT 226

Query: 570 WYKVTGGMQDFNYVHSNC-FEITMELSCCK----YPKASDLKHYWAANKES 615
            Y  +GG  D+ Y      +  T+EL           AS +        E+
Sbjct: 227 IYPASGGSDDWAYGELGIKYSYTLELRDTGRYGFLLPASQIPPTGEETLEA 277



 Score =  124 bits (313), Expect = 1e-31
 Identities = 43/113 (38%), Positives = 65/113 (57%), Gaps = 7/113 (6%)

Query: 52  LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHG 111
           ++A++     +YP LVR  +IGKSV+GR L  ++I+     P      KP     A +H 
Sbjct: 1   IEAWLDALAAAYPDLVRLVSIGKSVEGRPLLVLKISKGPHGPG-----KPAVLIDAGIHA 55

Query: 112 DETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
            E +G A +++LI+ L+     D  IT+LL+ TDIYIVP +NPDG+     G+
Sbjct: 56  REWIGPAALLYLIRQLL--SNNDPEITRLLDKTDIYIVPVLNPDGYEYTHTGD 106



 Score = 45.4 bits (108), Expect = 5e-05
 Identities = 8/44 (18%), Positives = 13/44 (29%), Gaps = 5/44 (11%)

Query: 172 MQDFNYVHSNC-FEITMELSCCK----YPKASDLKHYWAANKES 210
             D+ Y      +  T+EL           AS +        E+
Sbjct: 234 SDDWAYGELGIKYSYTLELRDTGRYGFLLPASQIPPTGEETLEA 277


>gnl|CDD|199846 cd03864, M14_CPN, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase N subgroup.  Peptidase M14
           Carboxypeptidase N (CPN, also known as kininase I,
           creatine kinase conversion factor, plasma
           carboxypeptidase B, arginine carboxypeptidase, and
           protaminase; EC 3.4.17.3) is an extracellular
           glycoprotein synthesized in the liver and released into
           the blood, where it is present in high concentrations.
           CPN belongs to the N/E subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. CPN plays an important role in protecting
           the body from excessive buildup of potentially
           deleterious peptides that normally act as local
           autocrine or paracrine hormones. It specifically removes
           C-terminal basic residues. As CPN can cleave lysine more
           avidly than arginine residues, it is also called lysine
           carboxypeptidase. CPN substrates include peptides found
           in the bloodstream, such as kinins (e.g. bradykinin,
           kallidin, met-lys-bradykinin), complement anaphylatoxins
           and creatine kinase MM (CK-MM). By removing just one
           amino acid, CPN can alter peptide activity and receptor
           binding. Examples include bradykinin, a nine-residue
           peptide released from kininogen in response to tissue
           injury, which is inactivated by CPN; anaphylatoxins,
           which are regulated by CPN through cleavage and removal
           of their C-terminal arginines, resulting in a
           10-100-fold reduction in their biological activities;
           and creatine kinase MM, a cytosolic enzyme that
           catalyzes the reversible transfer of a phosphate group
           from ATP to creatine, which is regulated by CPN through
           cleavage of C-terminal lysines. As in the other N/E
           subfamily members, two surface loops surrounding the
           active-site groove restrict access to the catalytic
           center, thus preventing larger protein carboxypeptidase
           inhibitors from inhibiting CPN.
          Length = 313

 Score =  229 bits (586), Expect = 2e-69
 Identities = 121/299 (40%), Positives = 172/299 (57%), Gaps = 60/299 (20%)

Query: 378 PHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFL 437
           P++ R  +IG+SV+GR+L+ +E +   DNP     ++P FKYV NMHG+E +G  L++ L
Sbjct: 19  PYITRIYSIGRSVEGRHLYVLEFS---DNPGIHEPLEPEFKYVGNMHGNEVLGRELLIQL 75

Query: 438 IQYLVLK-DGKDDRITQLLNSTDIYIVPSINPDGF-AAAKEGKCDSLDGY-VGRKNAHGV 494
            ++L  +    ++RIT+L++ T I+I+PS+NPDG+  AA++G     +GY VGR NA+GV
Sbjct: 76  SEFLCEEYRNGNERITRLIHDTRIHILPSMNPDGYEVAARQGP--EFNGYLVGRNNANGV 133

Query: 495 DLNRNFPD--QFEYEAKKV----------------YEPETQAIMNFIYSNPFVLSGNLHG 536
           DLNRNFPD     Y  +K                  EPET A++ ++ +  FVLS NLHG
Sbjct: 134 DLNRNFPDLNTLMYYNEKYGGPNHHLPLPDNWKSQVEPETLAVIQWMQNYNFVLSANLHG 193

Query: 537 GAV----------------------------------ARDYASRNPMMAPGHACGFDFKD 562
           GAV                                  A+ Y+  +  M  G  CG  F +
Sbjct: 194 GAVVANYPYDKSREPRVRGFRRTAYSPTPDDKLFQKLAKTYSYAHGWMHKGWNCGDYFDE 253

Query: 563 GITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 621
           GITNG  WY ++ GMQDFNY+H+NCFEIT+ELSC K+P   +L+  W AN+E+LI  IE
Sbjct: 254 GITNGASWYSLSKGMQDFNYLHTNCFEITLELSCDKFPPEEELEREWLANREALISYIE 312



 Score = 92.3 bits (229), Expect = 2e-20
 Identities = 42/102 (41%), Positives = 72/102 (70%), Gaps = 5/102 (4%)

Query: 64  PHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFL 123
           P++ R  +IG+SV+GR+L+ +E +   D+P     ++P FKYV NMHG+E +G  L++ L
Sbjct: 19  PYITRIYSIGRSVEGRHLYVLEFS---DNPGIHEPLEPEFKYVGNMHGNEVLGRELLIQL 75

Query: 124 IQYLVLK-DGKDDRITQLLNSTDIYIVPSINPDGF-AAAKEG 163
            ++L  +    ++RIT+L++ T I+I+PS+NPDG+  AA++G
Sbjct: 76  SEFLCEEYRNGNERITRLIHDTRIHILPSMNPDGYEVAARQG 117



 Score = 75.7 bits (186), Expect = 7e-15
 Identities = 29/47 (61%), Positives = 37/47 (78%)

Query: 170 KRMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           K MQDFNY+H+NCFEIT+ELSC K+P   +L+  W AN+E+LI  IE
Sbjct: 266 KGMQDFNYLHTNCFEITLELSCDKFPPEEELEREWLANREALISYIE 312


>gnl|CDD|214748 smart00631, Zn_pept, Zn_pept domain. 
          Length = 277

 Score =  219 bits (561), Expect = 3e-66
 Identities = 88/288 (30%), Positives = 127/288 (44%), Gaps = 60/288 (20%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y++  E++A++ +    YP LVR  +IGKSV+GR +W ++I+    N       KP    
Sbjct: 1   YHSYEEIEAWLKELAARYPDLVRLVSIGKSVEGRPIWVLKIS----NGGSHD--KPAIFI 54

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            A +H  E +G A  ++LI  L+   G+D R+T LL+ TDIYIVP +NPDG+     G  
Sbjct: 55  DAGIHAREWIGPATALYLINQLLENYGRDPRVTNLLDKTDIYIVPVLNPDGYEYTHTGDR 114

Query: 480 DSLDGYVGRKNAHGVDLNRNFPDQFE---------YEAKKVY-EPETQAIMNFIYSN-PF 528
                     N  GVDLNRNFP  +          Y     + EPET+A+ +FI SN  F
Sbjct: 115 LWRKNRSPNSNCRGVDLNRNFPFHWGETGNPCSETYAGPSPFSEPETKAVRDFIRSNRRF 174

Query: 529 VLSGNLHGG---------------------------AVARDYASRNPMMAPGHACGFDFK 561
            L  +LH                             A+A+  AS +         G  + 
Sbjct: 175 KLYIDLHSYSQLILYPYGYTKNDLPPNVDDLDAVAKALAKALASVH---------GTRYT 225

Query: 562 DGITNGNYWYKVTGGMQDFNYVHSN-CFEITMELSCC-----KYPKAS 603
            GI+NG  +   +GG  D+ Y      F  T+EL          P + 
Sbjct: 226 YGISNGAIYP-ASGGSDDWAYGVLGIPFSFTLELRDDGRYGFLLPPSQ 272



 Score =  139 bits (352), Expect = 6e-37
 Identities = 45/119 (37%), Positives = 70/119 (58%), Gaps = 6/119 (5%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y++  E++A++ +    YP LVR  +IGKSV+GR +W ++I++           KP    
Sbjct: 1   YHSYEEIEAWLKELAARYPDLVRLVSIGKSVEGRPIWVLKISNGGSHD------KPAIFI 54

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
            A +H  E +G A  ++LI  L+   G+D R+T LL+ TDIYIVP +NPDG+     G+
Sbjct: 55  DAGIHAREWIGPATALYLINQLLENYGRDPRVTNLLDKTDIYIVPVLNPDGYEYTHTGD 113



 Score = 35.0 bits (81), Expect = 0.096
 Identities = 7/33 (21%), Positives = 10/33 (30%), Gaps = 6/33 (18%)

Query: 172 MQDFNYVHSN-CFEITMELSCC-----KYPKAS 198
             D+ Y      F  T+EL          P + 
Sbjct: 240 SDDWAYGVLGIPFSFTLELRDDGRYGFLLPPSQ 272


>gnl|CDD|199847 cd03865, M14_CPE, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase E subgroup.  Peptidase M14
           Carboxypeptidase (CP) E (CPE, also known as
           carboxypeptidase H, and enkephalin convertase; EC
           3.4.17.10) belongs to the N/E subfamily of the M14
           family of metallocarboxypeptidases (MCPs). The M14 family
           are zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPE is an
           important enzyme responsible for the proteolytic
           processing of prohormone intermediates (such as
           pro-insulin, pro-opiomelanocortin, or
           pro-gonadotropin-releasing hormone) by specifically
           removing C-terminal basic residues. In addition, it has
           been proposed that the regulated secretory pathway (RSP)
           of the nervous and endocrine systems utilizes
           membrane-bound CPE as a sorting receptor. A naturally
           occurring point mutation in CPE reduces the stability of
           the enzyme and causes its degradation, leading to an
           accumulation of numerous neuroendocrine peptides that
           result in obesity and hyperglycemia. Reduced CPE enzyme
           and receptor activity could underlie abnormal placental
           phenotypes from the observation that CPE is
           down-regulated in enlarged placentas of interspecific
           hybrid (interspecies hybrid placental dysplasia, IHPD)
           and cloned mice.
          Length = 323

 Score =  221 bits (564), Expect = 3e-66
 Identities = 128/321 (39%), Positives = 168/321 (52%), Gaps = 62/321 (19%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y+   EL   ++      P + R  T+G+S +GR L  +E++   DNP      +P FKY
Sbjct: 5   YHRYPELREALVSVWLQCPSISRIYTVGRSFEGRELLVIEMS---DNPGEHEPGEPEFKY 61

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGK-DDRITQLLNSTDIYIVPSINPDGFAAAKEGK 478
           V NMHG+E VG  L+++L QYL  +  K ++ I  L++ST I+I+PS+NPDGF  A    
Sbjct: 62  VGNMHGNEAVGRELLIYLAQYLCNEYQKGNETIINLIHSTRIHIMPSLNPDGFEKAASQP 121

Query: 479 CDSLDGYVGRKNAHGVDLNRNFPD----------------QFEYEAKKVYE------PET 516
            +  D +VGR NA G+DLNRNFPD                      KK  +      PET
Sbjct: 122 GELKDWFVGRSNAQGIDLNRNFPDLDRIVYVNEREGGPNNHLLKNMKKAVDENTKLAPET 181

Query: 517 QAIMNFIYSNPFVLSGNLHGG-----------------------------AVARDYASRN 547
           +A++++I   PFVLS NLHGG                             ++AR Y+S N
Sbjct: 182 KAVIHWIMDIPFVLSANLHGGDLVANYPYDETRSGSAHEYSACPDDAIFKSLARAYSSLN 241

Query: 548 PMMA-----PGHACGFD--FKDGITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYP 600
           P M+     P      D  F DG TNG  WY V GGMQDFNY+ SNCFEIT+ELSC K+P
Sbjct: 242 PAMSDPNRPPCRKNDDDSSFVDGTTNGGAWYSVPGGMQDFNYLSSNCFEITVELSCDKFP 301

Query: 601 KASDLKHYWAANKESLIKLIE 621
               LK YW  NK SL+  IE
Sbjct: 302 PEETLKQYWEDNKNSLVNYIE 322



 Score = 87.8 bits (217), Expect = 9e-19
 Identities = 49/123 (39%), Positives = 75/123 (60%), Gaps = 6/123 (4%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y+   EL   ++      P + R  T+G+S +GR L  +E++   D+P      +P FKY
Sbjct: 5   YHRYPELREALVSVWLQCPSISRIYTVGRSFEGRELLVIEMS---DNPGEHEPGEPEFKY 61

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGK-DDRITQLLNSTDIYIVPSINPDGF--AAAKE 162
           V NMHG+E VG  L+++L QYL  +  K ++ I  L++ST I+I+PS+NPDGF  AA++ 
Sbjct: 62  VGNMHGNEAVGRELLIYLAQYLCNEYQKGNETIINLIHSTRIHIMPSLNPDGFEKAASQP 121

Query: 163 GNL 165
           G L
Sbjct: 122 GEL 124



 Score = 69.7 bits (170), Expect = 7e-13
 Identities = 29/45 (64%), Positives = 33/45 (73%)

Query: 172 MQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           MQDFNY+ SNCFEIT+ELSC K+P    LK YW  NK SL+  IE
Sbjct: 278 MQDFNYLSSNCFEITVELSCDKFPPEETLKQYWEDNKNSLVNYIE 322


>gnl|CDD|199849 cd03867, M14_CPZ, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase Z subgroup.  Peptidase
           M14-like domain of carboxypeptidase (CP) Z (CPZ); CPZ
           belongs to the N/E subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPZ is a
           secreted Zn-dependent enzyme whose biological function
           is largely unknown. Unlike other members of the N/E
           subfamily, CPZ has a bipartite structure, which consists
           of an N-terminal cysteine-rich domain (CRD) whose
           sequence is similar to Wnt-binding proteins, and a
           C-terminal CP catalytic domain that removes C-terminal
           Arg residues from substrates. CPZ is enriched in the
           extracellular matrix and is widely distributed during
           early embryogenesis.  That the CRD of CPZ can bind to
           Wnt4 suggests that CPZ plays a role in Wnt signaling.
          Length = 315

 Score =  219 bits (560), Expect = 1e-65
 Identities = 115/321 (35%), Positives = 173/321 (53%), Gaps = 66/321 (20%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           +++ +++ + + +T     H+ R  +IG+S +GR+L  +E +    NP    L++P  KY
Sbjct: 1   HHSYSQMVSVLKRTAARCSHIARTYSIGRSFEGRDLLVIEFS---SNPGQHELLEPEVKY 57

Query: 420 VANMHGDETVGYALMVFLIQYLVLK--DGKDDRITQLLNSTDIYIVPSINPDGF-AAAKE 476
           + NMHG+E +G  L+++L Q+L  +   G + RI  L+N+T I+++PS+NPDG+ AAA E
Sbjct: 58  IGNMHGNEVLGRELLIYLAQFLCSEYLLG-NQRIQTLINTTRIHLLPSMNPDGYEAAASE 116

Query: 477 GKCDSLDGYV-GRKNAHGVDLNRNFPDQFE---------------------YEAKKVYEP 514
           G     +G+  GR+NA  +DLNRNFPD                        Y   KV  P
Sbjct: 117 GA--GYNGWTNGRQNAQNIDLNRNFPDLTSEVYRRRRQRGARTDHIPIPDSYWFGKV-AP 173

Query: 515 ETQAIMNFIYSNPFVLSGNLHGG------------------------------AVARDYA 544
           ET+A+M ++ S PFVLS +LHGG                               +AR YA
Sbjct: 174 ETKAVMKWMRSIPFVLSASLHGGDLVVSYPYDFSRHPLEEKMFSPTPDEKVFKMLARTYA 233

Query: 545 SRNPMMA--PGHACGFDF--KDGITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYP 600
             +P M+      CG +F  + GI NG  WY  +GGM DFNY+H+NCFE+T+EL C K+P
Sbjct: 234 DAHPTMSDRSTRRCGGNFHKRGGIINGAEWYSFSGGMSDFNYLHTNCFEVTVELGCDKFP 293

Query: 601 KASDLKHYWAANKESLIKLIE 621
              +L   W  NKE+L+  +E
Sbjct: 294 PEEELYLIWQENKEALLSFME 314



 Score = 95.7 bits (238), Expect = 2e-21
 Identities = 43/121 (35%), Positives = 78/121 (64%), Gaps = 7/121 (5%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           +++ +++ + + +T     H+ R  +IG+S +GR+L  +E +    +P    L++P  KY
Sbjct: 1   HHSYSQMVSVLKRTAARCSHIARTYSIGRSFEGRDLLVIEFS---SNPGQHELLEPEVKY 57

Query: 106 VANMHGDETVGYALMVFLIQYLVLK--DGKDDRITQLLNSTDIYIVPSINPDGF-AAAKE 162
           + NMHG+E +G  L+++L Q+L  +   G + RI  L+N+T I+++PS+NPDG+ AAA E
Sbjct: 58  IGNMHGNEVLGRELLIYLAQFLCSEYLLG-NQRIQTLINTTRIHLLPSMNPDGYEAAASE 116

Query: 163 G 163
           G
Sbjct: 117 G 117



 Score = 65.6 bits (160), Expect = 2e-11
 Identities = 23/45 (51%), Positives = 32/45 (71%)

Query: 172 MQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           M DFNY+H+NCFE+T+EL C K+P   +L   W  NKE+L+  +E
Sbjct: 270 MSDFNYLHTNCFEVTVELGCDKFPPEEELYLIWQENKEALLSFME 314


>gnl|CDD|199867 cd06245, M14_CPD_III, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase D, domain III subgroup.  The
           third carboxypeptidase (CP)-like domain of
           Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD
           differs from all other metallocarboxypeptidases in that
           it contains multiple CP-like domains. CPD belongs to the
           N/E-like subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPD is a
           single-chain protein containing a signal peptide, three
           tandem repeats of CP-like domains separated by short
           bridge regions, followed by a transmembrane domain, and
           a C-terminal cytosolic tail. The first two CP-like
           domains of CPD contain all of the essential active site
           and substrate-binding residues; the third CP-like domain
           lacks critical residues necessary for enzymatic activity
           and is inactive towards standard CP substrates. Domain I
           is optimally active at pH 6.3-7.5 and prefers substrates
           with C-terminal Arg, whereas domain II is active at pH
           5.0-6.5 and prefers substrates with C-terminal Lys. CPD
           functions in the processing of proteins that transit the
           secretory pathway, and is present in all vertebrates as
           well as Drosophila. It is broadly distributed in all
           tissue types. Within cells, CPD is present in the
           trans-Golgi network and immature secretory vesicles, but
           is excluded from mature vesicles. It is thought to play
           a role in the processing of proteins that are initially
           processed by furin or related endopeptidases present in
           the trans-Golgi network, such as growth factors and
           receptors. CPD is implicated in the pathogenesis of
           lupus erythematosus (LE); it is regulated by TGF-beta in
           various cell types of murine and human origin and is
           significantly down-regulated in CD14 positive cells
           isolated from patients with LE. As down-regulation of
           CPD leads to down-modulation of TGF-beta, CPD may have a
           role in a positive feedback loop.
          Length = 284

 Score =  194 bits (494), Expect = 9e-57
 Identities = 94/290 (32%), Positives = 154/290 (53%), Gaps = 36/290 (12%)

Query: 359 RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFK 418
           RY++  EL  F+   V +YPH+    ++G+SV+ R++ ++EI++  +N +     +P  +
Sbjct: 1   RYHSYKELSEFLRGLVLNYPHITNLTSLGQSVEFRHILSLEISNKPNNSEPE---EPKIR 57

Query: 419 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGK 478
           +VA +HG+  VG  L++ L ++L +  GK+  +T+L++ T I IVPS+NPDG   A+E +
Sbjct: 58  FVAGIHGNAPVGTELLLALAEFLCMNYGKNPAVTKLIDRTRIVIVPSLNPDGRERAQEKQ 117

Query: 479 CDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFIYSNPFVLSGNLHGGA 538
           C S     G  NAHG DL+ +F       +    +PET+AI++ +    F LS  L GG+
Sbjct: 118 CTSK---EGHTNAHGKDLDTDFTSNASNMSADS-QPETKAIIDNLIQKDFTLSVALDGGS 173

Query: 539 V----------------------ARDYASRNPMM------APGHACGFDFKDGITNGNYW 570
           V                      A+ YA+ +P M       P ++   +   G+  G  W
Sbjct: 174 VVATYPYDKPVQTVENKETLKHLAKVYANNHPSMHLGQPGCPNNS-DENIPGGVMRGAEW 232

Query: 571 YKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLI 620
               G M+DF+    +C EIT+  SCC +P A+ L   WA NK+SL+ +I
Sbjct: 233 NSHLGSMKDFSVDFGHCPEITVYTSCCLFPSAAQLPDLWAENKKSLLSMI 282



 Score =  102 bits (256), Expect = 3e-24
 Identities = 43/125 (34%), Positives = 78/125 (62%), Gaps = 3/125 (2%)

Query: 45  RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFK 104
           RY++  EL  F+   V +YPH+    ++G+SV+ R++ ++EI+   + P+     +P  +
Sbjct: 1   RYHSYKELSEFLRGLVLNYPHITNLTSLGQSVEFRHILSLEIS---NKPNNSEPEEPKIR 57

Query: 105 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
           +VA +HG+  VG  L++ L ++L +  GK+  +T+L++ T I IVPS+NPDG   A+E  
Sbjct: 58  FVAGIHGNAPVGTELLLALAEFLCMNYGKNPAVTKLIDRTRIVIVPSLNPDGRERAQEKQ 117

Query: 165 LNTKQ 169
             +K+
Sbjct: 118 CTSKE 122



 Score = 58.0 bits (140), Expect = 5e-09
 Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 2/62 (3%)

Query: 154 PDGFAAAKEGNLNTKQKRMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIK 213
           P G     E   N+    M+DF+    +C EIT+  SCC +P A+ L   WA NK+SL+ 
Sbjct: 223 PGGVMRGAE--WNSHLGSMKDFSVDFGHCPEITVYTSCCLFPSAAQLPDLWAENKKSLLS 280

Query: 214 LI 215
           +I
Sbjct: 281 MI 282


>gnl|CDD|199851 cd03869, M14_CPX_like, Peptidase M14 carboxypeptidase subfamily
           N/E-like; Carboxypeptidase X subgroup.  Peptidase
           M14-like domain of carboxypeptidase (CP)-like protein X
           (CPX). CPX forms a distinct subgroup of the N/E
           subfamily of the M14 family of metallocarboxypeptidases
           (MCPs). The M14 family are zinc-binding CPs which
           hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Proteins belonging to this
           subgroup include CP-like protein X1 (CPX1), CP-like
           protein X2 (CPX2), and aortic CP-like protein (ACLP)
           and its isoform adipocyte enhancer binding protein-1
           (AEBP1). AEBP1 is a truncated form of ACLP, which may
           arise from alternative splicing of the gene. These
           proteins are inactive towards standard CP substrates
           because they lack one or more critical active site and
           substrate-binding residues that are necessary for
           activity. They may function as binding proteins rather
           than as active CPs or display catalytic activity toward
           other substrates.  Proteins in this subgroup also
           contain an N-terminal discoidin domain. The CP domain is
           important for the function of AEBP1 as a transcriptional
           repressor. AEBP1 is involved in several biological
           processes including adipogenesis, macrophage cholesterol
           homeostasis, and inflammation. In macrophages, AEBP1
           promotes the expression of IL-6, TNF-alpha, MCP-1, and
           iNOS whose expression is tightly regulated by NF-kappaB
           activity. ACLP, a secreted protein that associates with
           the extracellular matrix, is essential for abdominal
           wall development and contributes to dermal wound
           healing.
          Length = 326

 Score =  194 bits (494), Expect = 4e-56
 Identities = 106/316 (33%), Positives = 155/316 (49%), Gaps = 68/316 (21%)

Query: 370 ILKTV-KSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDET 428
           ++K V +  P++ R  +IGKS QG  L+A+EI+   DNP    L +P F+Y A MHG+E 
Sbjct: 14  LMKVVNEECPNITRIYSIGKSYQGLKLYAMEIS---DNPGEHELGEPEFRYTAGMHGNEV 70

Query: 429 VGYALMVFLIQYLV--LKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYV 486
           +G  L++ L+Q+L     DG + R+  L+  T I+++PS+NPDG+  A E   +     +
Sbjct: 71  LGRELLLLLMQFLCQEYLDG-NPRVVHLVEETRIHLLPSLNPDGYEKAYEMGSELGGWAL 129

Query: 487 GRKNAHGVDLNRNFPD----QFEYEAKK----------------------VYEPETQAIM 520
           GR    G+D+N NFPD     +E E KK                         PET+A++
Sbjct: 130 GRWTEEGIDINHNFPDLNTILWEAEEKKWVPRKVPNHHIPIPEWYLSPNATVAPETRAVI 189

Query: 521 NFIYSNPFVLSGNLHGGA------------------------------VARDYASRNPMM 550
            ++   PFVL  NL GG                               +A  YAS +  M
Sbjct: 190 AWMEKIPFVLGANLQGGELVVSYPYDMTRTPWATQEATPTPDDAVFRWLATSYASTHLTM 249

Query: 551 A-----PGHACGFDFKDGITNGNYWYKVTGGMQDFNYVHSNCFEITMELSCCKYPKASDL 605
                   H   F  + GI NG  W+ V G M DF+Y+H+NCFE+++ L C K+P  S+L
Sbjct: 250 TDPSRRVCHTEDFQKEMGIVNGASWHTVAGSMNDFSYLHTNCFELSVYLGCDKFPHESEL 309

Query: 606 KHYWAANKESLIKLIE 621
              W  NKESL+  +E
Sbjct: 310 PEEWENNKESLLVFME 325



 Score = 86.4 bits (214), Expect = 2e-18
 Identities = 44/111 (39%), Positives = 70/111 (63%), Gaps = 7/111 (6%)

Query: 56  ILKTV-KSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDET 114
           ++K V +  P++ R  +IGKS QG  L+A+EI+   D+P    L +P F+Y A MHG+E 
Sbjct: 14  LMKVVNEECPNITRIYSIGKSYQGLKLYAMEIS---DNPGEHELGEPEFRYTAGMHGNEV 70

Query: 115 VGYALMVFLIQYLV--LKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEG 163
           +G  L++ L+Q+L     DG + R+  L+  T I+++PS+NPDG+  A E 
Sbjct: 71  LGRELLLLLMQFLCQEYLDG-NPRVVHLVEETRIHLLPSLNPDGYEKAYEM 120



 Score = 66.0 bits (161), Expect = 2e-11
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query: 166 NTKQKRMQDFNYVHSNCFEITMELSCCKYPKASDLKHYWAANKESLIKLIE 216
           +T    M DF+Y+H+NCFE+++ L C K+P  S+L   W  NKESL+  +E
Sbjct: 275 HTVAGSMNDFSYLHTNCFELSVYLGCDKFPHESELPEEWENNKESLLVFME 325


>gnl|CDD|199839 cd00596, Peptidase_M14_like, M14 family of metallocarboxypeptidases
           and related proteins.  The M14 family of
           metallocarboxypeptidases (MCPs), also known as
           funnelins, are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing a preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavage.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers. MCPs can also be classified based on their
           involvement in specific physiological processes; the
           pancreatic MCPs participate only in alimentary digestion
           and include carboxypeptidase A and B (A/B subfamily),
           while others, namely regulatory MCPs or the N/E
           subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA), and
           deficiency in which is the established cause of Canavan
           disease. Another subfamily (referred to as subfamily C)
           includes an exceptional type of activity in the MCP
           family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 211

 Score =  118 bits (297), Expect = 3e-30
 Identities = 61/223 (27%), Positives = 93/223 (41%), Gaps = 35/223 (15%)

Query: 417 FKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKE 476
              +A +HG+E +G  + + L+++L+   G D   T LL++  ++IVP++NPDG A  + 
Sbjct: 1   VLIIAGIHGNEVIGVEVALLLLEHLLANYGND---TALLDNRRLWIVPNVNPDGIARVQR 57

Query: 477 GKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVY--------EPETQAIMNFIYSNPF 528
           G          R NA+GVDLNRNFP  +                EPET+A+   I    F
Sbjct: 58  GW---------RGNANGVDLNRNFPGIWGKRGAPETYGGPAPLSEPETRALAALIKQRKF 108

Query: 529 VLSGNLHGGAVA---------RDYASRNPMMAPG-----HACGFDFKDGITNGNYWYKVT 574
            L  NLH G++A          D    +            A     K+G+          
Sbjct: 109 DLVINLHSGSLAILYPYGHSDNDPDDFSNPKEIAQSLAIAADKHGGKEGVGFIVQEKATQ 168

Query: 575 GGMQDFNYVHSNCFEITMELSCCKYPKAS-DLKHYWAANKESL 616
           GG +D+ Y +      T+EL     P  +         N E+L
Sbjct: 169 GGFEDWAYDNHGKLAFTVELGKGIPPLFAEKFFLRLKRNDEAL 211



 Score = 58.7 bits (142), Expect = 1e-09
 Identities = 21/61 (34%), Positives = 38/61 (62%), Gaps = 3/61 (4%)

Query: 103 FKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKE 162
              +A +HG+E +G  + + L+++L+   G D   T LL++  ++IVP++NPDG A  + 
Sbjct: 1   VLIIAGIHGNEVIGVEVALLLLEHLLANYGND---TALLDNRRLWIVPNVNPDGIARVQR 57

Query: 163 G 163
           G
Sbjct: 58  G 58


>gnl|CDD|199843 cd03859, M14_CPT, Peptidase M14 Carboxypeptidase T subfamily.
           Peptidase M14-like domain of carboxypeptidase (CP) T
           (CPT). CPT belongs to the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPT has
           moderate similarity to CPA and CPB, and exhibits
           dual-substrate specificity by cleaving C-terminal
           hydrophobic amino acid residues like CPA and C-terminal
           positively charged residues like CPB. CPA and CPB are
           M14 family peptidases but do not belong to this CPT
           group. The substrate specificity difference between CPT
           and CPA and CPB is ascribed to a few amino acid
           substitutions at the substrate-binding pocket while the
           spatial organization of the binding site remains the
           same as in all Zn-CPs. CPT has increased thermal
           stability in the presence of Ca2+ ions, and two disulfide
           bridges which give an additional stabilization factor.
          Length = 297

 Score =  120 bits (304), Expect = 3e-30
 Identities = 60/205 (29%), Positives = 102/205 (49%), Gaps = 34/205 (16%)

Query: 357 LQRYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPM 416
            + Y+N  E+   +     +YP+LV+ ++IGKS +GR++WAV+I+ +    +     KP 
Sbjct: 1   DRGYHNYLEMVDELNAAAAAYPNLVKVKSIGKSYEGRDIWAVKISDNPATDEN----KPE 56

Query: 417 FKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKE 476
             Y +  H  E +   + ++LI+YL+   G D RI  L++S +++ VP +NPDG+   + 
Sbjct: 57  VLYTSTHHAREWLSLEMAIYLIKYLLENYGTDPRIQNLVDSRELWFVPVVNPDGYEYDE- 115

Query: 477 GKCDSLDGYVGRKN----------AHGVDLNRNFPDQFEY--------EAKKVY------ 512
               +      RKN          + GVDLNRN+  ++           + + Y      
Sbjct: 116 ---ATGSYRSWRKNRRPNSGDISSSDGVDLNRNYGYKWGCDSGGSSSDPSSETYRGPSAF 172

Query: 513 -EPETQAIMNFIYSNP-FVLSGNLH 535
            EPETQAI +F+ S+     + N H
Sbjct: 173 SEPETQAIRDFVESHVNIKTAINYH 197



 Score = 93.9 bits (234), Expect = 5e-21
 Identities = 38/117 (32%), Positives = 69/117 (58%), Gaps = 4/117 (3%)

Query: 43  LQRYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPM 102
            + Y+N  E+   +     +YP+LV+ ++IGKS +GR++WAV+I+ +  + +     KP 
Sbjct: 1   DRGYHNYLEMVDELNAAAAAYPNLVKVKSIGKSYEGRDIWAVKISDNPATDEN----KPE 56

Query: 103 FKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAA 159
             Y +  H  E +   + ++LI+YL+   G D RI  L++S +++ VP +NPDG+  
Sbjct: 57  VLYTSTHHAREWLSLEMAIYLIKYLLENYGTDPRIQNLVDSRELWFVPVVNPDGYEY 113


>gnl|CDD|200604 cd11308, Peptidase_M14NE-CP-C_like, Peptidase associated domain:
           C-terminal domain of M14 N/E carboxypeptidase; putative
           folding, regulation, or interaction domain.  This domain
           is found C-terminal to the M14 carboxypeptidase (CP) N/E
           subfamily containing zinc-binding enzymes that hydrolyze
           single C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. The N/E subfamily includes enzymatically
           active members (carboxypeptidase N, E, M, D, and Z), as
           well as non-active members (carboxypeptidase-like
           protein 1, -2, aortic CP-like protein, and adipocyte
           enhancer binding protein-1) which lack the critical
           active site and substrate-binding residues considered
           necessary for activity. The active N/E enzymes fulfill a
           variety of cellular functions, including prohormone
           processing, regulation of peptide hormone activity,
           alteration of protein-protein or protein-cell
           interactions and transcriptional regulation. For M14
           CPs, it has been suggested that this domain may assist
           in folding of the CP domain, regulate enzyme activity,
           or be involved in interactions with other proteins or
           with membranes; for carboxypeptidase M, it may interact
           with the bradykinin 1 receptor at the cell surface. This
           domain may also be found in other peptidase families.
          Length = 76

 Score = 93.7 bits (234), Expect = 2e-23
 Identities = 34/76 (44%), Positives = 45/76 (59%)

Query: 221 GVYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLARGKYVVTASAPGYEPVTTE 280
           G+ G VTD  GNP+ +A I+V   +  VT    G+YWRLL  G Y VTASAPGY+PVT  
Sbjct: 1   GIKGFVTDATGNPIANATISVEGINHDVTTAKDGDYWRLLLPGTYNVTASAPGYQPVTKT 60

Query: 281 PLDVPDTESVRLDFML 296
                +  +  ++F L
Sbjct: 61  VTVPNNFSATVVNFTL 76



 Score = 93.7 bits (234), Expect = 2e-23
 Identities = 34/76 (44%), Positives = 45/76 (59%)

Query: 626 GVYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLARGKYVVTASAPGYEPVTTE 685
           G+ G VTD  GNP+ +A I+V   +  VT    G+YWRLL  G Y VTASAPGY+PVT  
Sbjct: 1   GIKGFVTDATGNPIANATISVEGINHDVTTAKDGDYWRLLLPGTYNVTASAPGYQPVTKT 60

Query: 686 PLDVPDTESVRLDFML 701
                +  +  ++F L
Sbjct: 61  VTVPNNFSATVVNFTL 76


>gnl|CDD|199844 cd03860, M14_CP_A-B_like, Peptidase M14 carboxypeptidase subfamily
           A/B-like.  The Peptidase M14 Carboxypeptidase (CP) A/B
           subfamily is one of two main M14 CP subfamilies defined
           by sequence and structural homology, the other being the
           N/E subfamily. CPs hydrolyze single, C-terminal amino
           acids from polypeptide chains. They have a recognition
           site for the free C-terminal carboxyl group, which is a
           key determinant of specificity. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by a globular N-terminal pro-region linked to the
           enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. There are nine members in the A/B family:
           CPA1, CPA2, CPA3, CPA4, CPA5, CPA6, CPB, CPO and CPU.
           CPA1, CPA2 and CPB are produced by the pancreas. The A
           forms have slightly different specificities, with CPA1
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulkier aromatic side chains. CPA3
           is found in secretory granules of mast cells and
           functions in inflammatory processes. CPA4 is detected in
           hormone-regulated tissues, and is thought to play a role
           in prostate cancer. CPA5 is present in discrete regions
           of pituitary and other tissues, and cleaves aliphatic
           C-terminal residues. CPA6 is highly expressed in
           embryonic brain and optic muscle, suggesting that it may
           play a specific role in cell migration and axonal
           guidance. CPU (also called CPB2) is produced and
           secreted by the liver as the inactive precursor, PCPU,
           commonly referred to as thrombin-activatable
           fibrinolysis inhibitor (TAFI). Little is known about CPO
           but it has been suggested to have specificity for acidic
           residues.
          Length = 293

 Score = 99.1 bits (248), Expect = 7e-23
 Identities = 58/188 (30%), Positives = 89/188 (47%), Gaps = 34/188 (18%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y+   E+ A++ +  + YP LV  ETIG+S +GR +  ++I++   +       KP    
Sbjct: 1   YHTYDEIYAWLDELAQKYPDLVTVETIGRSYEGRPIKGLKISNGGRSN------KPAIFI 54

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            A +H  E +  A  +++I  LV  +  D  +T+LL++ D YIVP +NPDG+        
Sbjct: 55  DAGIHAREWISPATALYIINQLV--ESYDPEVTELLDNYDWYIVPVVNPDGYEYTHTT-- 110

Query: 480 DSLDGYVGRKNAH-------GVDLNRNFPDQFEYEAKKVY-------------EPETQAI 519
           D L     RKN         GVDLNRNF   +  E                  EPET+A+
Sbjct: 111 DRL----WRKNRSPNGGGCVGVDLNRNFDFHWGGEGASSNPCSETYAGPSAFSEPETRAV 166

Query: 520 MNFIYSNP 527
            +F+ S  
Sbjct: 167 RDFLLSLA 174



 Score = 79.1 bits (196), Expect = 5e-16
 Identities = 38/113 (33%), Positives = 63/113 (55%), Gaps = 8/113 (7%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y+   E+ A++ +  + YP LV  ETIG+S +GR +  ++I++   S       KP    
Sbjct: 1   YHTYDEIYAWLDELAQKYPDLVTVETIGRSYEGRPIKGLKISNGGRSN------KPAIFI 54

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFA 158
            A +H  E +  A  +++I  LV  +  D  +T+LL++ D YIVP +NPDG+ 
Sbjct: 55  DAGIHAREWISPATALYIINQLV--ESYDPEVTELLDNYDWYIVPVVNPDGYE 105


>gnl|CDD|199875 cd06905, M14-like_8, Peptidase M14-like domain; uncharacterized
           subfamily.  A functionally uncharacterized subgroup of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing a preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily
           is that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 360

 Score = 89.0 bits (221), Expect = 5e-19
 Identities = 45/112 (40%), Positives = 66/112 (58%), Gaps = 3/112 (2%)

Query: 45  RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFK 104
           RYY   EL AF+     +YPHL+  E+IGKS +GR++W + +T   +   G    KP F 
Sbjct: 3   RYYRYDELTAFLQAWASAYPHLISLESIGKSYEGRDIWLLTLT---NQATGPAREKPAFW 59

Query: 105 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
             AN+H  E  G A+ +++I +L+   G D  +T+LL+    YI+P INPDG
Sbjct: 60  IDANIHASEVAGSAVALYVIAHLLNGYGSDPEVTRLLDGVTFYILPRINPDG 111



 Score = 89.0 bits (221), Expect = 5e-19
 Identities = 45/112 (40%), Positives = 66/112 (58%), Gaps = 3/112 (2%)

Query: 359 RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFK 418
           RYY   EL AF+     +YPHL+  E+IGKS +GR++W + +T   +   G    KP F 
Sbjct: 3   RYYRYDELTAFLQAWASAYPHLISLESIGKSYEGRDIWLLTLT---NQATGPAREKPAFW 59

Query: 419 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 470
             AN+H  E  G A+ +++I +L+   G D  +T+LL+    YI+P INPDG
Sbjct: 60  IDANIHASEVAGSAVALYVIAHLLNGYGSDPEVTRLLDGVTFYILPRINPDG 111



 Score = 37.8 bits (88), Expect = 0.018
 Identities = 32/135 (23%), Positives = 49/135 (36%), Gaps = 40/135 (29%)

Query: 493 GVDLNRNFPDQFEYEAKK-------VYEPETQAIMNFIYSNPFV---LSGNLHGGAVARD 542
           G+D NRNFP  +  E ++         EPET+A+++FI  +P +   +S +   G + R 
Sbjct: 209 GLDFNRNFPVDWRPEGEQYGAGPFPFSEPETRAVVDFITDHPNINGAISYHTFSGVILRP 268

Query: 543 YASRNPMMAPGHACGFDFKD------------GITN-------GNYWY----KVTGGMQD 579
           Y        P         D             +T          + Y       G   D
Sbjct: 269 Y-----STKPDDQ--MPVDDLELYKALGEKGEELTGYPTVSVYHEFRYHPKEVTYGVFDD 321

Query: 580 FNYVHSNCFEITMEL 594
           + Y H   F  T+EL
Sbjct: 322 WAYDHLGIFAFTVEL 336


>gnl|CDD|133069 cd03857, M14-like_1, Peptidase M14-like domain; uncharacterized
           subfamily.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavage.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily
           is that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 226

 Score = 79.4 bits (196), Expect = 1e-16
 Identities = 61/238 (25%), Positives = 91/238 (38%), Gaps = 58/238 (24%)

Query: 386 IGKSVQGRNLWAVEITH--DVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVL 443
           IGKSV+GR LW V +T    +         KP     A +HG+E+ G   ++ L++ L  
Sbjct: 1   IGKSVEGRPLWMVTLTTAEGMKKRALAKEGKPRVWIDAQIHGNESAGSDALLELLRQLAS 60

Query: 444 KDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNRNFPDQ 503
                D   ++L +  I ++P  NPDG                 R+NA+G+DLNR+F   
Sbjct: 61  AS---DEEAKMLENIVIVLIPRANPDG------------AALFTRENANGLDLNRDF--- 102

Query: 504 FEYEAKKVYEPETQAIMN-FIYSNPFVLSGNLHG---GAV-----ARDYASRNP------ 548
                 K+ +PET+A+   FI   P     +LH    GAV          + N       
Sbjct: 103 -----LKLTQPETRAVREVFIEWKPQFFI-DLHEYGFGAVSFYDPFPQPFNYNFLNNLGI 156

Query: 549 --MMAPGHACGFDFKD-----------GITNGNYWYKVTGGMQDFNYVHSNCFEITME 593
             ++   HA                  G  +    Y   G M+     H N   I +E
Sbjct: 157 EKVIRQMHAKLVAGILELGKYMVAYVTGFDDSTPGY---GPMRTVTDFH-NVIGILIE 210



 Score = 66.3 bits (162), Expect = 4e-12
 Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 5/87 (5%)

Query: 72  IGKSVQGRNLWAVEITH--DVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVL 129
           IGKSV+GR LW V +T    +         KP     A +HG+E+ G   ++ L++ L  
Sbjct: 1   IGKSVEGRPLWMVTLTTAEGMKKRALAKEGKPRVWIDAQIHGNESAGSDALLELLRQLAS 60

Query: 130 KDGKDDRITQLLNSTDIYIVPSINPDG 156
                D   ++L +  I ++P  NPDG
Sbjct: 61  AS---DEEAKMLENIVIVLIPRANPDG 84


>gnl|CDD|133114 cd06904, M14_MpaA_like, Peptidase M14-like domain of Escherichia
           coli Murein Peptide Amidase A and related proteins.
           Peptidase M14-like domain of Escherichia coli Murein
           Peptide Amidase A (MpaA) and related proteins. MpaA is a
           member of the M14 family of metallocarboxypeptidases
           (MCPs); however, it has an exceptional type of activity:
           it hydrolyzes the gamma-D-glutamyl-meso-diaminopimelic
           acid (gamma-D-Glu-Dap) bond in murein peptides. MpaA is
           specific for cleavage of the gamma-D-Glu-Dap bond of
           free murein tripeptide; it may also cleave murein
           tetrapeptide. MpaA has a different substrate specificity
           and cellular role than endopeptidase I, ENP1 (ENP1 does
           not belong to this group). MpaA works on free murein
           peptide in the recycling pathway.
          Length = 178

 Score = 76.6 bits (189), Expect = 4e-16
 Identities = 53/209 (25%), Positives = 78/209 (37%), Gaps = 53/209 (25%)

Query: 419 YVANMHGDET----VGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAA 474
            +  +HGDE         L+  L             I + L+    Y++P +NPDG   A
Sbjct: 3   IIGGIHGDEPASVSDLEELLRILPGL----------ILRGLS---WYVIPVLNPDGLLRA 49

Query: 475 KEGKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVY----------EPETQAIMNFI- 523
                        R NA+GVDLNRNFP +        Y          EPE++A+M+ I 
Sbjct: 50  ------------TRCNANGVDLNRNFPTKDWPPGASRYRRYPGPKPGSEPESRALMDLIE 97

Query: 524 -YSNPFVLSGNLHGGAVARDYASRNPMMAPGHACGFDFKDGITNGNYWYKVTGGMQDFNY 582
            +    V+S   H      D    N  +A      F +  G  +  Y     G +  +  
Sbjct: 98  RFKPDVVVS--FHAPLGVLDGDGPNEPLAR----KFGYL-GFDDLGYPT--PGSLGSWAG 148

Query: 583 VHSNCFEITMEL--SCCKYPKAS-DLKHY 608
           V  N   IT+EL  +  +Y     DL  +
Sbjct: 149 VERNIPVITIELPYNLSEYSHIDADLVRW 177



 Score = 36.5 bits (85), Expect = 0.016
 Identities = 17/66 (25%), Positives = 25/66 (37%), Gaps = 17/66 (25%)

Query: 105 YVANMHGDET----VGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAA 160
            +  +HGDE         L+  L             I + L+    Y++P +NPDG   A
Sbjct: 3   IIGGIHGDEPASVSDLEELLRILPGL----------ILRGLS---WYVIPVLNPDGLLRA 49

Query: 161 KEGNLN 166
              N N
Sbjct: 50  TRCNAN 55


>gnl|CDD|199869 cd06248, M14_CP_A-B_like_1, Peptidase M14 carboxypeptidase
           subfamily A/B-like; uncharacterized subgroup.  The
           Peptidase M14 Carboxypeptidase A/B-like subfamily is one
           of two main M14 carboxypeptidase subfamilies, defined by
           sequence and structural homology, the other being N/E.
           Carboxypeptidases (CPs) hydrolyze single, C-terminal
           amino acids from polypeptide chains. They have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. The majority
           of the proteins in this subfamily have not yet been
           characterized. The A/B enzymes are normally
           synthesized as inactive precursors containing a preceding
           signal peptide, followed by a globular N-terminal
           pro-region linked to the enzyme; the proenzymes are
           called procarboxypeptidases. These enzymes exhibit
           distinct substrate specificity patterns; Carboxypeptidase
           A-like (CPA-like) enzymes favor hydrophobic residues
           while carboxypeptidase B-like (CPB-like) enzymes only
           cleave the basic residues lysine or arginine. There are
           nine members in the A/B family: CPA1, CPA2, CPA3, CPA4,
           CPA5, CPA6, CPB, CPO and CPU.  CPA1, CPA2 and CPB are
           produced by the pancreas. The A forms have slightly
           different specificities, with CPA1 preferring aliphatic
           and small aromatic residues, and CPA2 preferring the
           bulkier aromatic side chains. CPA3 is found in secretory
           granules of mast cells and functions in inflammatory
           processes. CPA4, detected in hormone-regulated tissues,
           is thought to play a role in prostate cancer. CPA5 is
           present in discrete regions of pituitary and other
           tissues, and cleaves aliphatic C-terminal residues. CPA6
           is highly expressed in embryonic brain and optic muscle,
           suggesting that it may play a specific role in cell
           migration and axonal guidance. CPU (also called CPB2) is
           produced and secreted by the liver as the inactive
           precursor PCPU, commonly referred to as
           thrombin-activatable fibrinolysis inhibitor (TAFI).
           Little is known about CPO but it has been suggested to
           have specificity for acidic residues.
          Length = 303

 Score = 75.4 bits (186), Expect = 7e-15
 Identities = 47/169 (27%), Positives = 70/169 (41%), Gaps = 32/169 (18%)

Query: 376 SYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMV 435
           ++P      TIGKS +GR +  + I             KP       +H  E +    + 
Sbjct: 20  AFPSNSELFTIGKSYEGRTITGIHIWGSGGKGS-----KPAIVIHGTVHAREWISTMTVE 74

Query: 436 FLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKN----- 490
           +L   L+   G D  +T LL+  D YI+P +NPDGF   +    D L     RKN     
Sbjct: 75  YLAYQLLTGYGSDATVTALLDKFDFYIIPVVNPDGFVYTQ--TSDRL----WRKNRQPTS 128

Query: 491 ---AHGVDLNRNFPDQFE------------YEAKKVYE-PETQAIMNFI 523
                G DLNRN+P Q++            Y  +   + PE +A+  F+
Sbjct: 129 GSSCVGTDLNRNWPYQWDGGGSSTNPCSETYRGESPGDAPEAKALAAFL 177



 Score = 62.3 bits (152), Expect = 2e-10
 Identities = 30/97 (30%), Positives = 44/97 (45%), Gaps = 5/97 (5%)

Query: 62  SYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMV 121
           ++P      TIGKS +GR +  + I        G    KP       +H  E +    + 
Sbjct: 20  AFPSNSELFTIGKSYEGRTITGIHIW--GSGGKGS---KPAIVIHGTVHAREWISTMTVE 74

Query: 122 FLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFA 158
           +L   L+   G D  +T LL+  D YI+P +NPDGF 
Sbjct: 75  YLAYQLLTGYGSDATVTALLDKFDFYIIPVVNPDGFV 111


>gnl|CDD|133084 cd06226, M14_CPT_like, Peptidase M14 Carboxypeptidase T-like
           subfamily.  Peptidase M14-like domain of an
           uncharacterized group of Peptidase M14 Carboxypeptidase
           (CP) T (CPT)-like proteins. This group belongs to the
           M14 family of metallocarboxypeptidases (MCPs). The M14
           family are zinc-binding CPs which hydrolyze single,
           C-terminal amino acids from polypeptide chains, and have
           a recognition site for the free C-terminal carboxyl
           group, which is a key determinant of specificity. CPT
           exhibits dual-substrate specificity by cleaving
           C-terminal hydrophobic amino acid residues and
           C-terminal positively charged residues. However, CPT
           does not belong to this CPT-like group.
          Length = 293

 Score = 72.4 bits (178), Expect = 8e-14
 Identities = 49/179 (27%), Positives = 77/179 (43%), Gaps = 41/179 (22%)

Query: 377 YPHLVRAETIGKSVQ------GRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVG 430
           YP+L +   IG S        G ++ A+++T+   N       KP+F  +  +H  E   
Sbjct: 1   YPNLAQWVDIGDSWDKPGGPAGYDIRALKLTNKATNSPTGP--KPVFFIMGAIHAREYTT 58

Query: 431 YALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKN 490
             L++   + L+   G D   T LL+  +I++VP +NPDG   A++       G   RKN
Sbjct: 59  AELVLRFAEDLLEGYGTDADATWLLDYHEIHVVPIVNPDGRKIAEQ-------GLSQRKN 111

Query: 491 AH------------GVDLNRNFP-------DQFEYEAKKVY-------EPETQAIMNFI 523
           A+            GVDLNRN+                + Y       EPET A+ ++I
Sbjct: 112 ANTSGGSNCSGSSYGVDLNRNYSFGWGGAGASSGDPCSETYRGPAPGSEPETAALEDYI 170



 Score = 52.4 bits (126), Expect = 3e-07
 Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 8/112 (7%)

Query: 63  YPHLVRAETIGKSVQ------GRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVG 116
           YP+L +   IG S        G ++ A+++T+   +       KP+F  +  +H  E   
Sbjct: 1   YPNLAQWVDIGDSWDKPGGPAGYDIRALKLTNKATNSPTGP--KPVFFIMGAIHAREYTT 58

Query: 117 YALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGNLNTK 168
             L++   + L+   G D   T LL+  +I++VP +NPDG   A++G    K
Sbjct: 59  AELVLRFAEDLLEGYGTDADATWLLDYHEIHVVPIVNPDGRKIAEQGLSQRK 110


>gnl|CDD|222268 pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like
           domain. 
          Length = 81

 Score = 65.4 bits (160), Expect = 2e-13
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 221 GVYGIVTDTYGNPLPSAIITVRWNDKAV---TVTNR-GEYW-RLLARGKYVVTASAPGYE 275
            + G VTD  G P+P A +T+   D      T T+  G +    L  G Y +T SAPGY+
Sbjct: 1   TISGTVTDASGAPIPGATVTLTNADTGTVRGTTTDADGRFSLTGLPPGTYTLTVSAPGYK 60

Query: 276 PVTTEPLDVPDTESVRLDFML 296
             T + + V   ++  LD  L
Sbjct: 61  SQTVKDVTVTAGQTTTLDITL 81



 Score = 65.4 bits (160), Expect = 2e-13
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 626 GVYGIVTDTYGNPLPSAIITVRWNDKAV---TVTNR-GEYW-RLLARGKYVVTASAPGYE 680
            + G VTD  G P+P A +T+   D      T T+  G +    L  G Y +T SAPGY+
Sbjct: 1   TISGTVTDASGAPIPGATVTLTNADTGTVRGTTTDADGRFSLTGLPPGTYTLTVSAPGYK 60

Query: 681 PVTTEPLDVPDTESVRLDFML 701
             T + + V   ++  LD  L
Sbjct: 61  SQTVKDVTVTAGQTTTLDITL 81


>gnl|CDD|199855 cd06229, M14_Endopeptidase_I, Peptidase M14 carboxypeptidase
           family-like domain of Endopeptidase I.  Peptidase
           M14-like domain of Gamma-D-glutamyl-L-diamino acid
           endopeptidase 1 (also known as
           Gamma-D-glutamyl-meso-diaminopimelate peptidase I, and
           Endopeptidase I (ENP1); EC 3.4.19.11). ENP1 is a member
           of the M14 family of metallocarboxypeptidases (MCPs).
           However, it has an exceptional type of activity,
           hydrolyzing the gamma-D-Glu-(L)meso-diaminopimelic acid
           (gamma-D-Glu-Dap) bond of
           L-Ala-gamma-D-Glu-(L)meso-diaminopimelic acid and
           L-Ala-gamma-D-Glu-(L)meso-diaminopimelic acid(L)-D-Ala
           peptides. ENP1 has a different substrate specificity and
           cellular role than MpaA (MpaA does not belong to this
           group). ENP1 hydrolyzes the gamma-D-Glu-Dap bond of
           MurNAc-tripeptide and MurNAc-tetrapeptide, as well as
           the amide bond of free tripeptide and tetrapeptide.
           ENP1 is active on spore cortex peptidoglycan, and is
           produced at stage IV of sporulation in forespore and
           spore integuments.
          Length = 256

 Score = 69.6 bits (171), Expect = 3e-13
 Identities = 53/179 (29%), Positives = 70/179 (39%), Gaps = 54/179 (30%)

Query: 389 SVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDE--TVGYALMVFLIQYLV---- 442
           SV GRN++ V++        G    K +F Y A+ H  E  T    LM FL  Y      
Sbjct: 1   SVLGRNIYEVQL--------GNGP-KTVF-YNASFHAREWITTLL-LMKFLEDYARAYEC 49

Query: 443 --LKDGKDDRITQLLNSTDIYIVPSINPDG-------FAAAKE---------GKCDSLDG 484
                G D  I  LL +  + IVP +NPDG         A +                  
Sbjct: 50  GEKLRGYD--IRDLLENVTLCIVPMVNPDGVEISQNGPYAIRNYYLELLVINAGSIDFKE 107

Query: 485 YVGRKNAHGVDLNRNFPDQFEYEAKK---------------VYEPETQAIMNFIYSNPF 528
           +  + NA GVDLNRNFP  +E E  +               + EPET+A+  F   N F
Sbjct: 108 W--KANARGVDLNRNFPAGWELEKARGPKAPSPRNYGGEYPLSEPETKALAEFTRENRF 164



 Score = 40.4 bits (95), Expect = 0.002
 Identities = 31/90 (34%), Positives = 40/90 (44%), Gaps = 21/90 (23%)

Query: 75  SVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDE--TVGYALMVFLIQYLV---- 128
           SV GRN++ V++        G    K +F Y A+ H  E  T    LM FL  Y      
Sbjct: 1   SVLGRNIYEVQL--------GNGP-KTVF-YNASFHAREWITTLL-LMKFLEDYARAYEC 49

Query: 129 --LKDGKDDRITQLLNSTDIYIVPSINPDG 156
                G D  I  LL +  + IVP +NPDG
Sbjct: 50  GEKLRGYD--IRDLLENVTLCIVPMVNPDG 77


>gnl|CDD|199863 cd06239, M14-like_1_2, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavage.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA), and a
           deficiency of which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 231

 Score = 63.5 bits (155), Expect = 4e-11
 Identities = 36/139 (25%), Positives = 64/139 (46%), Gaps = 27/139 (19%)

Query: 378 PHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFL 437
           P     E IG+SV+GR +++V+          + L+       + MHG+E+     ++ L
Sbjct: 7   PAKFEVEVIGQSVEGRPIYSVKFGTG----PRKILL------WSQMHGNESTTTKALLDL 56

Query: 438 IQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLN 497
           + +L      D    ++L+   + I+P +NPDG                 R NA+G+DLN
Sbjct: 57  LNFL--GTNSDQEAKKILDECTLVIIPMLNPDGAELYT------------RVNANGIDLN 102

Query: 498 RNFPDQFEYEA---KKVYE 513
           R+  D  + E+   + VY+
Sbjct: 103 RDAQDLTQPESRLLRDVYD 121



 Score = 50.8 bits (122), Expect = 6e-07
 Identities = 24/93 (25%), Positives = 45/93 (48%), Gaps = 12/93 (12%)

Query: 64  PHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFL 123
           P     E IG+SV+GR +++V+          + L+       + MHG+E+     ++ L
Sbjct: 7   PAKFEVEVIGQSVEGRPIYSVKFGTG----PRKILL------WSQMHGNESTTTKALLDL 56

Query: 124 IQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
           + +L      D    ++L+   + I+P +NPDG
Sbjct: 57  LNFL--GTNSDQEAKKILDECTLVIIPMLNPDG 87


>gnl|CDD|199854 cd06227, M14-like_2, Peptidase M14-like domain; uncharacterized
           subfamily.  A functionally uncharacterized subgroup of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing a preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA), and a
           deficiency of which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 269

 Score = 62.4 bits (152), Expect = 1e-10
 Identities = 42/193 (21%), Positives = 78/193 (40%), Gaps = 27/193 (13%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y+ + EL+      +           + +SV+ R    + +     NP+G    K     
Sbjct: 2   YHTTDELNL-EADALVKRIEFSSLGELIESVKVRVFAVITL-----NPNGDPKKKVKALL 55

Query: 420 VANMHGDETVGYALMVFLIQYLV----LKDGKDDRITQLLNSTDIYIVPSINPDGFAAAK 475
           +   H  E +     + L+  L           D +  +L++  + I+P+ NPDG    +
Sbjct: 56  LFGEHARELISPETGLHLLSTLCGEEAEPFDWGDLLKNILDNFVLKIIPNENPDGRKKVE 115

Query: 476 EGKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAK----KVY-------EPETQAIMNFIY 524
            G       Y  R+N +GVDLNRN+   + ++      + Y       EPET+A+ + + 
Sbjct: 116 SGN------YCLRENENGVDLNRNYGADWGFKEDDYPDEEYSGPAPFSEPETRAVRDLLK 169

Query: 525 SNPFVLSGNLHGG 537
           S    +  ++H G
Sbjct: 170 SFSPHVFISVHSG 182



 Score = 32.0 bits (73), Expect = 0.96
 Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 10/124 (8%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y+ + EL+      +           + +SV+ R    + +     +P+G    K     
Sbjct: 2   YHTTDELNL-EADALVKRIEFSSLGELIESVKVRVFAVITL-----NPNGDPKKKVKALL 55

Query: 106 VANMHGDETVGYALMVFLIQYLV----LKDGKDDRITQLLNSTDIYIVPSINPDGFAAAK 161
           +   H  E +     + L+  L           D +  +L++  + I+P+ NPDG    +
Sbjct: 56  LFGEHARELISPETGLHLLSTLCGEEAEPFDWGDLLKNILDNFVLKIIPNENPDGRKKVE 115

Query: 162 EGNL 165
            GN 
Sbjct: 116 SGNY 119


>gnl|CDD|133081 cd03870, M14_CPA, Peptidase M14 carboxypeptidase subfamily
           A/B-like; Carboxypeptidase A subgroup.  Peptidase M14
           Carboxypeptidase (CP) A (CPA) belongs to the A/B
           subfamily of the M14 family of metallocarboxypeptidases
           (MCPs). The M14 family are zinc-binding CPs which
           hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. CPA enzymes generally favor
           hydrophobic residues. A/B subfamily enzymes are normally
           synthesized as inactive precursors containing a preceding
           signal peptide, followed by a globular N-terminal
           pro-region linked to the enzyme; these proenzymes are
           called procarboxypeptidases. The procarboxypeptidase A
           (PCPA) is produced by the exocrine pancreas and stored
           as a stable zymogen in the pancreatic granules until
           secretion into the digestive tract occurs. This
           subfamily includes CPA1, CPA2 and CPA4 forms. Within
           these A forms, there are slightly different
           specificities, with CPA1 preferring aliphatic and small
           aromatic residues, and CPA2 preferring the bulkier
           aromatic side chains. CPA4, detected in
           hormone-regulated tissues, is thought to play a role in
           prostate cancer.
          Length = 301

 Score = 60.6 bits (147), Expect = 6e-10
 Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 34/188 (18%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y+   E+  F+   V  +P+LV    IG+S +GR ++ ++ +    N       +P    
Sbjct: 6   YHTLDEIYDFMDLLVAEHPNLVSKLQIGRSYEGRPIYVLKFSTGGSN-------RPAIWI 58

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            A +H  E +  A  V+  + +    G+D   T +L+S DI++    NPDGFA       
Sbjct: 59  DAGIHSREWITQATGVWFAKKITEDYGQDPSFTAILDSMDIFLEIVTNPDGFAFTH---- 114

Query: 480 DSLDGYVGRK--------NAHGVDLNRNFPDQF------------EYEAKKVY-EPETQA 518
                 + RK           GVD NRN+   F             Y  K    E E ++
Sbjct: 115 --SQNRLWRKTRSVTSGSLCVGVDPNRNWDAGFGGAGASSNPCSETYHGKYANSEVEVKS 172

Query: 519 IMNFIYSN 526
           I++F+ S+
Sbjct: 173 IVDFVKSH 180



 Score = 52.1 bits (125), Expect = 4e-07
 Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 7/113 (6%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y+   E+  F+   V  +P+LV    IG+S +GR ++ ++      S  G    +P    
Sbjct: 6   YHTLDEIYDFMDLLVAEHPNLVSKLQIGRSYEGRPIYVLKF-----STGGSN--RPAIWI 58

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFA 158
            A +H  E +  A  V+  + +    G+D   T +L+S DI++    NPDGFA
Sbjct: 59  DAGIHSREWITQATGVWFAKKITEDYGQDPSFTAILDSMDIFLEIVTNPDGFA 111


>gnl|CDD|199868 cd06246, M14_CPB2, Peptidase M14 carboxypeptidase subfamily
           A/B-like; Carboxypeptidase B2 subgroup.  Peptidase M14
           Carboxypeptidase (CP) B2 (CPB2, also known as plasma
           carboxypeptidase B, carboxypeptidase U, and CPU),
           belongs to the carboxypeptidase A/B subfamily of the M14
           family of metallocarboxypeptidases (MCPs). The M14
           family are zinc-binding CPs which hydrolyze single,
           C-terminal amino acids from polypeptide chains, and have
           a recognition site for the free C-terminal carboxyl
           group, which is a key determinant of specificity. CPB2
           enzyme displays B-like activity; it only cleaves the
           basic residues lysine or arginine. It is produced and
           secreted by the liver as the inactive precursor,
           procarboxypeptidase U or PCPB2, commonly referred to as
           thrombin-activatable fibrinolysis inhibitor (TAFI). It
           circulates in plasma as a zymogen bound to plasminogen,
           and the active enzyme, TAFIa, inhibits fibrinolysis. It
           is highly regulated; increased TAFI concentrations are
           thought to increase the risk of thrombosis and coronary
           artery disease by reducing fibrinolytic activity while
           low TAFI levels have been correlated with chronic liver
           disease.
          Length = 302

 Score = 59.5 bits (144), Expect = 2e-09
 Identities = 40/191 (20%), Positives = 75/191 (39%), Gaps = 34/191 (17%)

Query: 358 QRYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMF 417
           ++Y++  E+ ++I    + +P ++    IG S +   L+ ++++           +    
Sbjct: 3   EQYHSLNEIYSWIEFITERHPDMLEKIHIGSSFEKYPLYVLKVSGKEQTAKNAIWID--- 59

Query: 418 KYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEG 477
                +H  E +  A  ++ + +     G   + T LL   D YI+P +N DG+      
Sbjct: 60  ---CGIHAREWISPAFCLWFVGHATQFYGIIGQYTNLLRHMDFYIMPVVNVDGYDYT--W 114

Query: 478 KCDSLDGYVGRKNAH--------GVDLNRNF-------PDQFEYEAKKVY-------EPE 515
           K + +     RKN          G DLNRNF                + Y       EPE
Sbjct: 115 KKNRM----WRKNRSFYANNHCIGTDLNRNFASKHWCEEGASSSSCSETYCGLYPESEPE 170

Query: 516 TQAIMNFIYSN 526
            +A+ +F+  N
Sbjct: 171 VKAVASFLRRN 181



 Score = 49.1 bits (117), Expect = 4e-06
 Identities = 22/114 (19%), Positives = 51/114 (44%), Gaps = 6/114 (5%)

Query: 44  QRYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMF 103
           ++Y++  E+ ++I    + +P ++    IG S +   L+ ++++    +      +    
Sbjct: 3   EQYHSLNEIYSWIEFITERHPDMLEKIHIGSSFEKYPLYVLKVSGKEQTAKNAIWID--- 59

Query: 104 KYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGF 157
                +H  E +  A  ++ + +     G   + T LL   D YI+P +N DG+
Sbjct: 60  ---CGIHAREWISPAFCLWFVGHATQFYGIIGQYTNLLRHMDFYIMPVVNVDGY 110


>gnl|CDD|225421 COG2866, COG2866, Predicted carboxypeptidase [Amino acid transport
           and metabolism].
          Length = 374

 Score = 59.8 bits (145), Expect = 2e-09
 Identities = 48/173 (27%), Positives = 76/173 (43%), Gaps = 27/173 (15%)

Query: 350 ADSAGIDLQRYYNSTELDAFILKTVKSYPH-LVRAETIGKSVQGRN-LWAVEITHDVDNP 407
             ++ +          ++    +      H LV  E IG+SV+GR+      IT    NP
Sbjct: 89  YPASFMTFTLAPEEGRVEFAFFEPYSLEQHLLVELELIGRSVEGRDDPL---ITFPESNP 145

Query: 408 DGRTLMKPMFKYVANMH--GDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPS 465
           + +T++       A  H  G++ V      FL   ++     D ++ +LL+  D+++VP+
Sbjct: 146 EHKTIL-----ITAGQHARGEKMV----EWFLYNLILRYLDPDVQVRKLLDRADLHVVPN 196

Query: 466 INPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQA 518
           +NPDG                 R NA+GVDLNRNF    E E K+VY     A
Sbjct: 197 VNPDGSDLG-----------NLRTNANGVDLNRNFIAPNEEEGKEVYRWNDAA 238



 Score = 39.8 bits (93), Expect = 0.004
 Identities = 30/129 (23%), Positives = 57/129 (44%), Gaps = 16/129 (12%)

Query: 36  ADSAGIDLQRYYNSTELDAFILKTVKSYPH-LVRAETIGKSVQGRN-LWAVEITHDVDSP 93
             ++ +          ++    +      H LV  E IG+SV+GR+      IT    +P
Sbjct: 89  YPASFMTFTLAPEEGRVEFAFFEPYSLEQHLLVELELIGRSVEGRDDPL---ITFPESNP 145

Query: 94  DGRTLMKPMFKYVANMH--GDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPS 151
           + +T++       A  H  G++ V      FL   ++     D ++ +LL+  D+++VP+
Sbjct: 146 EHKTIL-----ITAGQHARGEKMV----EWFLYNLILRYLDPDVQVRKLLDRADLHVVPN 196

Query: 152 INPDGFAAA 160
           +NPDG    
Sbjct: 197 VNPDGSDLG 205


>gnl|CDD|133073 cd03862, M14-like_7, Peptidase M14-like domain; uncharacterized
           subfamily.  A functionally uncharacterized subgroup of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing a preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA), and a
           deficiency of which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 273

 Score = 58.8 bits (143), Expect = 2e-09
 Identities = 37/166 (22%), Positives = 65/166 (39%), Gaps = 42/166 (25%)

Query: 414 KPMFKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAA 473
            P+   V  +HG E +G  +++  ++ L+ +   D  + +LL    +  +P +NP G A 
Sbjct: 24  APVLGLVGGVHGLERIGTQVLLAFLESLLERLRWDKLLQELLEKVRLVFLPLVNPVGMAL 83

Query: 474 AKEGKCDSLDGYVGRKNAHGVDLNRNFPDQ--------------------FEYEAKKVYE 513
                         R N +GVDL RN P                      +  +     E
Sbjct: 84  KT------------RSNGNGVDLMRNAPVDAEDKPPFLVGGQRLSPRLPWYRGKNGAGME 131

Query: 514 PETQAIMNFI----YSNPFVLSGNLHGGAVARD-----YA-SRNPM 549
            E QA+  F+    + +PF ++ + H G    D     YA ++ P+
Sbjct: 132 LEAQALCRFVRELLFESPFSIALDCHSGFGLVDRIWFPYAYTKEPI 177



 Score = 38.4 bits (90), Expect = 0.007
 Identities = 17/67 (25%), Positives = 32/67 (47%)

Query: 100 KPMFKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAA 159
            P+   V  +HG E +G  +++  ++ L+ +   D  + +LL    +  +P +NP G A 
Sbjct: 24  APVLGLVGGVHGLERIGTQVLLAFLESLLERLRWDKLLQELLEKVRLVFLPLVNPVGMAL 83

Query: 160 AKEGNLN 166
               N N
Sbjct: 84  KTRSNGN 90


>gnl|CDD|133101 cd06243, M14-like_1_6, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA) and whose
           deficiency is the established cause of Canavan
           disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 236

 Score = 58.4 bits (141), Expect = 2e-09
 Identities = 38/134 (28%), Positives = 55/134 (41%), Gaps = 22/134 (16%)

Query: 386 IGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVLKD 445
           IG S +GR +  V +             +P    V   HGDE  G   ++ + + L    
Sbjct: 1   IGTSQRGRPIHLVRVGFAEGPSALDIANRPTVLLVGTQHGDEPAGREALLIIARDLAF-- 58

Query: 446 GKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNRNFPDQFE 505
           G+D+ +  LL+ T +  VP+ NPDG  A              R NA G+D+NR   D   
Sbjct: 59  GEDEELVPLLHQTTVLFVPTANPDGREADT------------RSNADGIDINR---DHLL 103

Query: 506 YEAKKVYEPETQAI 519
                   PE QA+
Sbjct: 104 LNT-----PEAQAL 112



 Score = 52.6 bits (126), Expect = 2e-07
 Identities = 27/93 (29%), Positives = 41/93 (44%), Gaps = 2/93 (2%)

Query: 72  IGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVLKD 131
           IG S +GR +  V +             +P    V   HGDE  G   ++ + + L    
Sbjct: 1   IGTSQRGRPIHLVRVGFAEGPSALDIANRPTVLLVGTQHGDEPAGREALLIIARDLAF-- 58

Query: 132 GKDDRITQLLNSTDIYIVPSINPDGFAAAKEGN 164
           G+D+ +  LL+ T +  VP+ NPDG  A    N
Sbjct: 59  GEDEELVPLLHQTTVLFVPTANPDGREADTRSN 91


>gnl|CDD|199861 cd06237, M14_Nna1_like_3, Peptidase M14-like domain of ATP/GTP
           binding proteins and cytosolic carboxypeptidases;
           uncharacterized bacterial subgroup.  A bacterial
           subgroup of the Peptidase M14-like domain of Nna-1
           (Nervous system Nuclear protein induced by Axotomy),
           also known as ATP/GTP binding protein (AGTPBP-1) and
           cytosolic carboxypeptidase (CCP)-like proteins. The
           Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Nna1-like proteins are active
           metallopeptidases that are thought to act on cytosolic
           proteins (such as alpha-tubulin in eukaryotes) to remove
           a C-terminal tyrosine. Nna1-like proteins from the
           different phyla are highly diverse, but they all contain
           a unique N-terminal conserved domain right before the CP
           domain. It has been suggested that this N-terminal
           domain might act as a folding domain.
          Length = 245

 Score = 56.3 bits (136), Expect = 9e-09
 Identities = 42/154 (27%), Positives = 66/154 (42%), Gaps = 32/154 (20%)

Query: 371 LKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVG 430
           L+++      V  E++G S QGR + A+E      NP+     K     +   H  E  G
Sbjct: 17  LESLIKQEPYVEVESLGLSTQGRPIPALESG----NPES----KEWIVLLGRQHPPEVTG 68

Query: 431 -YALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRK 489
             AL  F     +L D       +     ++  +P++NPDG             G   R 
Sbjct: 69  ALALFHFS--EALLSD--SALSKEFREKYNVLAIPNMNPDGVEL----------GN-WRH 113

Query: 490 NAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFI 523
           NA+GVDLNR++        KK  +PET+ I +++
Sbjct: 114 NANGVDLNRDW--------KKFNQPETRLIRDYL 139



 Score = 36.3 bits (84), Expect = 0.039
 Identities = 25/104 (24%), Positives = 42/104 (40%), Gaps = 13/104 (12%)

Query: 57  LKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVG 116
           L+++      V  E++G S QGR + A+E      +P+     K     +   H  E  G
Sbjct: 17  LESLIKQEPYVEVESLGLSTQGRPIPALESG----NPES----KEWIVLLGRQHPPEVTG 68

Query: 117 -YALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAA 159
             AL  F     +L D       +     ++  +P++NPDG   
Sbjct: 69  ALALFHFS--EALLSD--SALSKEFREKYNVLAIPNMNPDGVEL 108


>gnl|CDD|199852 cd03871, M14_CPB, Peptidase M14 carboxypeptidase subfamily
           A/B-like; Carboxypeptidase B subgroup.  Peptidase M14
           Carboxypeptidase B (CPB) belongs to the carboxypeptidase
           A/B subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity.
           Carboxypeptidase B (CPB) enzymes only cleave the basic
           residues lysine or arginine. A/B subfamily enzymes are
           normally synthesized as inactive precursors containing
           a preceding signal peptide, followed by a globular
           N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The
           procarboxypeptidase B (PCPB) is produced by the exocrine
           pancreas and stored as a stable zymogen in the pancreatic
           granules until secretion into the digestive tract
           occurs. PCPB has been reported to be a good serum marker
           for the diagnosis of acute pancreatitis and graft
           rejection in pancreas transplant recipients.
          Length = 300

 Score = 54.1 bits (130), Expect = 9e-08
 Identities = 44/184 (23%), Positives = 77/184 (41%), Gaps = 27/184 (14%)

Query: 361 YNSTE-LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           YN+ E ++A+  +     P L+    IG + +GR ++ +++     N       KP    
Sbjct: 6   YNNWETIEAWTEQVASENPDLISRSAIGTTFEGRPIYLLKVGKPGVN-------KPAIFM 58

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
               H  E +  A   + ++  V   G++  +T+LL+  D Y++P +N DG+        
Sbjct: 59  DCGFHAREWISPAFCQWFVREAVRTYGREAIMTELLDKLDFYVLPVLNIDGYIYT--WTK 116

Query: 480 DSLDGYVGRKNAH----GVDLNRNF------------PDQFEY-EAKKVYEPETQAIMNF 522
           + +       NA     G D NRNF            P    Y  +    E ET+A+ +F
Sbjct: 117 NRMWRKTRSTNAGSSCIGTDPNRNFNAGWCTVGASRNPCDETYCGSAPESEKETKALADF 176

Query: 523 IYSN 526
           I +N
Sbjct: 177 IRNN 180



 Score = 46.4 bits (110), Expect = 3e-05
 Identities = 28/112 (25%), Positives = 53/112 (47%), Gaps = 8/112 (7%)

Query: 47  YNSTE-LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           YN+ E ++A+  +     P L+    IG + +GR ++ ++    V  P      KP    
Sbjct: 6   YNNWETIEAWTEQVASENPDLISRSAIGTTFEGRPIYLLK----VGKPGVN---KPAIFM 58

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGF 157
               H  E +  A   + ++  V   G++  +T+LL+  D Y++P +N DG+
Sbjct: 59  DCGFHAREWISPAFCQWFVREAVRTYGREAIMTELLDKLDFYVLPVLNIDGY 110


>gnl|CDD|199858 cd06234, M14_Nna1_like_1, Peptidase M14-like domain of ATP/GTP
           binding proteins and cytosolic carboxypeptidases;
           uncharacterized bacterial subgroup.  A bacterial
           subgroup of the Peptidase M14-like domain of Nna-1
           (Nervous system Nuclear protein induced by Axotomy),
           also known as ATP/GTP binding protein (AGTPBP-1) and
           cytosolic carboxypeptidase (CCP)-like proteins. The
           Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Nna1-like proteins are active
           metallopeptidases that are thought to act on cytosolic
           proteins (such as alpha-tubulin in eukaryotes) to remove
           a C-terminal tyrosine. Nna1-like proteins from the
           different phyla are highly diverse, but they all contain
           a unique N-terminal conserved domain right before the CP
           domain. It has been suggested that this N-terminal
           domain might act as a folding domain.
          Length = 263

 Score = 51.9 bits (125), Expect = 3e-07
 Identities = 38/120 (31%), Positives = 56/120 (46%), Gaps = 20/120 (16%)

Query: 379 HLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLI 438
             VR E +G++VQGR++  + +        G    K     +A  H  ET+    M  L+
Sbjct: 27  PGVRLEVLGQTVQGRDIDLLTVG---TPGPG----KKKIWIIARQHPGETMAEWFMEGLL 79

Query: 439 QYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNR 498
           + L+  D  D     LL+    YIVP++NPDG          S  G++ R NA G +LNR
Sbjct: 80  ERLL--DPDDPVARALLDKAVFYIVPNMNPDG----------SARGHL-RTNAAGANLNR 126



 Score = 46.9 bits (112), Expect = 1e-05
 Identities = 32/103 (31%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 65  HLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFLI 124
             VR E +G++VQGR++  + +             K     +A  H  ET+    M  L+
Sbjct: 27  PGVRLEVLGQTVQGRDIDLLTVGTPGPG-------KKKIWIIARQHPGETMAEWFMEGLL 79

Query: 125 QYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGNLNT 167
           + L+  D  D     LL+    YIVP++NPDG A    G+L T
Sbjct: 80  ERLL--DPDDPVARALLDKAVFYIVPNMNPDGSA---RGHLRT 117


>gnl|CDD|199853 cd03872, M14_CPA6, Peptidase M14 carboxypeptidase subfamily
           A/B-like; Carboxypeptidase A6 subgroup.
           Carboxypeptidase (CP) A6 (CPA6, also known as CPAH; EC
           3.4.17.1), belongs to the carboxypeptidase A/B subfamily
           of the M14 family of metallocarboxypeptidases (MCPs).
           The M14 family are zinc-binding CPs which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. CPA6 prefers large hydrophobic C-terminal
           amino acids as well as histidine, while peptides with a
           penultimate glycine or proline are very poorly cleaved.
           Several neuropeptides are processed by CPA6, including
           Met- and Leu-enkephalin, angiotensin I, and neurotensin.
           CPA6 converts enkephalin and neurotensin into forms
           known to be inactive toward their receptors, but
           converts inactive angiotensin I into the biologically
           active angiotensin II. Thus, CPA6 plays a possible role
           in the regulation of neuropeptides in the extracellular
           environment within the olfactory bulb where it is highly
           expressed. It is also broadly expressed in embryonic
           tissue, being found in neuronal tissues, bone, skin as
           well as the lateral rectus eye muscle. A disruption in
           the CPA6 gene is linked to Duane syndrome, a defect in
           the abducens nerve/lateral rectus muscle connection.
          Length = 300

 Score = 51.6 bits (123), Expect = 6e-07
 Identities = 41/182 (22%), Positives = 83/182 (45%), Gaps = 25/182 (13%)

Query: 360 YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKY 419
           Y++  E+++++    K++  LV   +IGKS +GR+L+ +++         R+  K ++  
Sbjct: 2   YHSLEEIESWMFYLNKTHSDLVHMFSIGKSYEGRSLYVLKL-----GKRSRSYKKAVW-I 55

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
              +H  E +G A   + ++  +     D  + ++LN    Y++P  N DG+  +     
Sbjct: 56  DCGIHAREWIGPAFCQWFVKEALNTYNTDPAMKKMLNLLYFYVMPVFNVDGYHYSWTN-- 113

Query: 480 DSLDGYVGRKNA----HGVDLNRNFPDQFEYEAKKVY-------------EPETQAIMNF 522
           D        KN+    +GVD NRN+  ++  E   ++             EPE +A+  F
Sbjct: 114 DRFWRKTRSKNSRFQCYGVDANRNWKVKWCDEGASLHPCDDTYCGPFPESEPEVKAVAQF 173

Query: 523 IY 524
           + 
Sbjct: 174 LR 175



 Score = 42.7 bits (100), Expect = 5e-04
 Identities = 26/112 (23%), Positives = 57/112 (50%), Gaps = 6/112 (5%)

Query: 46  YYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKY 105
           Y++  E+++++    K++  LV   +IGKS +GR+L+ +++         R+  K ++  
Sbjct: 2   YHSLEEIESWMFYLNKTHSDLVHMFSIGKSYEGRSLYVLKL-----GKRSRSYKKAVW-I 55

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGF 157
              +H  E +G A   + ++  +     D  + ++LN    Y++P  N DG+
Sbjct: 56  DCGIHAREWIGPAFCQWFVKEALNTYNTDPAMKKMLNLLYFYVMPVFNVDGY 107


>gnl|CDD|182582 PRK10602, PRK10602, murein peptide amidase A; Provisional.
          Length = 237

 Score = 50.4 bits (121), Expect = 8e-07
 Identities = 41/136 (30%), Positives = 54/136 (39%), Gaps = 49/136 (36%)

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
           +A  HGDET   A +V L   L        R          ++V ++NPDG         
Sbjct: 45  LAGTHGDET---ASVVTLSCALRTLTPSLRRH---------HVVLAVNPDGCQ------- 85

Query: 480 DSLDGYVG-RKNAHGVDLNRNFPDQFEYEAKKVY---------------------EPETQ 517
                 +G R NA+GVDLNRNFP     E + VY                     EPETQ
Sbjct: 86  ------LGLRANANGVDLNRNFPAANWKEGETVYRWNSAAEERDVVLLTGDKPGSEPETQ 139

Query: 518 AIMNFIY--SNPFVLS 531
           A+   I+     +V+S
Sbjct: 140 ALCQLIHRLQPAWVVS 155


>gnl|CDD|133100 cd06242, M14-like_1_5, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA) and whose
           deficiency is the established cause of Canavan
           disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 268

 Score = 50.5 bits (121), Expect = 1e-06
 Identities = 37/155 (23%), Positives = 59/155 (38%), Gaps = 26/155 (16%)

Query: 365 ELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMH 424
           E+++F+         +     IGKS +GR++  V ++         +  K        +H
Sbjct: 7   EMESFMKSLASKNDWMSYHSDIGKSEEGRSIPYVYLS--TSKSSSSSSKKLRVWLQGGVH 64

Query: 425 GDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDG 484
           G+E  G    + L+  L   + K      +L   DI ++P  NPDG A            
Sbjct: 65  GNEPAGDEAALALLGKL-DNNPKW---ASVLEKIDIIVLPRYNPDGSA------------ 108

Query: 485 YVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAI 519
           Y  R  A G D NR   D       K+   +T+ I
Sbjct: 109 YFQRTLATGYDPNR---DHT-----KLARQQTRDI 135



 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 6/108 (5%)

Query: 51  ELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMH 110
           E+++F+         +     IGKS +GR++  V ++         +  K        +H
Sbjct: 7   EMESFMKSLASKNDWMSYHSDIGKSEEGRSIPYVYLS--TSKSSSSSSKKLRVWLQGGVH 64

Query: 111 GDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFA 158
           G+E  G    + L+  L   + K      +L   DI ++P  NPDG A
Sbjct: 65  GNEPAGDEAALALLGKL-DNNPKW---ASVLEKIDIIVLPRYNPDGSA 108


>gnl|CDD|133105 cd06247, M14_CPO, Peptidase M14 carboxypeptidase subfamily
           A/B-like; Carboxypeptidase O subgroup.  Peptidase M14
           carboxypeptidase (CP) O (CPO, also known as
           metallocarboxypeptidase C; EC 3.4.17.) belongs to the
           carboxypeptidase A/B subfamily of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding CPs which hydrolyze single, C-terminal
           amino acids from polypeptide chains, and have a
           recognition site for the free C-terminal carboxyl group,
           which is a key determinant of specificity. CPO has not
           been well characterized as yet, and little is known
           about it. Based on modeling studies, CPO has been
           suggested to have specificity for acidic residues rather
           than aliphatic/aromatic residues as in A-like enzymes or
           basic residues as in B-like enzymes. It remains to be
           demonstrated that CPO is functional as an MCP.
          Length = 298

 Score = 48.0 bits (114), Expect = 8e-06
 Identities = 43/188 (22%), Positives = 78/188 (41%), Gaps = 33/188 (17%)

Query: 359 RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFK 418
           +Y+   E+  ++ +  + Y  LV    +G + + R ++ ++I    D P      K +  
Sbjct: 3   KYHPMDEIYNWMDQIKEKYSELVSQHYLGCTYELRPMYYLKIGWPSDKP------KKIIW 56

Query: 419 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGK 478
               +H  E +  A   + ++ ++     D  + ++L + D Y++P +N DG+       
Sbjct: 57  MDCGIHAREWISPAFCQWFVKEILQNYKTDPILRKVLKNVDFYVLPVLNIDGYIYTWT-- 114

Query: 479 CDSLDGYVGRKN--------AHGVDLNRNFPDQF-----EYEAKK--------VYEPETQ 517
            D L     RKN         +GVDLNRNF  Q+         +           EPET+
Sbjct: 115 TDRL----WRKNRSPHNNGTCYGVDLNRNFNSQWCSIGASRNCRSNIFCGTGPESEPETK 170

Query: 518 AIMNFIYS 525
           A+   I S
Sbjct: 171 AVARLIES 178



 Score = 35.2 bits (81), Expect = 0.099
 Identities = 22/113 (19%), Positives = 52/113 (46%), Gaps = 6/113 (5%)

Query: 45  RYYNSTELDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFK 104
           +Y+   E+  ++ +  + Y  LV    +G + + R ++ ++I    D P      K +  
Sbjct: 3   KYHPMDEIYNWMDQIKEKYSELVSQHYLGCTYELRPMYYLKIGWPSDKP------KKIIW 56

Query: 105 YVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGF 157
               +H  E +  A   + ++ ++     D  + ++L + D Y++P +N DG+
Sbjct: 57  MDCGIHAREWISPAFCQWFVKEILQNYKTDPILRKVLKNVDFYVLPVLNIDGY 109


>gnl|CDD|199870 cd06250, M14_PaAOTO_like, Peptidase M14 Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA)-like
           subfamily; subgroup includes Pseudomonas aeruginosa
           AotO.  An uncharacterized subgroup of the
           Succinylglutamate desuccinylase (ASTE)/aspartoacylase
           (ASPA) subfamily which is part of the M14 family of
           metallocarboxypeptidases. This subgroup includes
           Pseudomonas aeruginosa AotO and related proteins. ASTE
           catalyzes the fifth and last step in arginine catabolism
           by the arginine succinyltransferase pathway, and
           aspartoacylase (ASPA, also known as aminoacylase 2, and
           ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid
           (NAA) into aspartate and acetate. NAA is abundant in the
           brain, and hydrolysis of NAA by ASPA may help maintain
           white matter. ASPA is an NAA scavenger in other tissues.
           Mutations in the gene encoding ASPA cause Canavan
           disease (CD), a fatal progressive neurodegenerative
           disorder involving dysmyelination and spongiform
           degeneration of white matter in children. This enzyme
           binds zinc which is necessary for activity. Measurement
           of elevated NAA levels in urine is used in the diagnosis
           of CD. The gene encoding P. aeruginosa AotO was
           characterized as part of an operon encoding an arginine
           and ornithine transport system; however, it is not
           essential for arginine and ornithine uptake.
          Length = 359

 Score = 46.1 bits (110), Expect = 4e-05
 Identities = 29/84 (34%), Positives = 42/84 (50%), Gaps = 15/84 (17%)

Query: 421 ANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCD 480
           A++H DE  G A++  LI+ L     K +   ++L   +I +VP  NP G          
Sbjct: 34  ASLHADELPGNAVIHHLIELL----KKLEAEGRILG--EITLVPVANPIGLNQK------ 81

Query: 481 SLDGYV-GRKN-AHGVDLNRNFPD 502
            L G+  GR + A G + NRNFPD
Sbjct: 82  -LGGFHLGRFDLATGTNFNRNFPD 104



 Score = 39.1 bits (92), Expect = 0.007
 Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 107 ANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
           A++H DE  G A++  LI+ L     K +   ++L   +I +VP  NP G
Sbjct: 34  ASLHADELPGNAVIHHLIELL----KKLEAEGRILG--EITLVPVANPIG 77


>gnl|CDD|199866 cd06244, M14-like_1_7, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing a preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate (NAA) and whose
           deficiency is the established cause of Canavan
           disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 268

 Score = 45.7 bits (108), Expect = 4e-05
 Identities = 46/188 (24%), Positives = 73/188 (38%), Gaps = 59/188 (31%)

Query: 387 GKSVQGRNLWAVEIT---HDVDNPDGRTLMKPMFK--------------------YVANM 423
           G+SV+G ++  V ++     VD    + L   + K                    +  N+
Sbjct: 2   GESVEGHDIPVVVVSDSSAKVDYYHNQALPTMLEKPPELADKIEDGTIEDYAPPIWFNNI 61

Query: 424 HGDETVGYALMVFLIQYLVLKD-----GKDDR---------ITQLLNSTDIYIVPSINPD 469
           H DET G    + +I+ L  +D       DD          +  LL +    + P+ NPD
Sbjct: 62  HPDETPGIDAQMEVIERLAYEDETEFNTTDDDGEEWTETVDVDDLLENVIFIVHPTENPD 121

Query: 470 GFAAAKEGKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFI-YSNPF 528
           G              Y+ R+N++G DLNR+   Q         +PETQ ++  I   NP 
Sbjct: 122 GRT------------YLTRENSNGFDLNRDNSFQ--------TQPETQNMVALIAEWNPM 161

Query: 529 VLSGNLHG 536
               +LHG
Sbjct: 162 SFL-DLHG 168



 Score = 31.4 bits (71), Expect = 1.4
 Identities = 29/131 (22%), Positives = 47/131 (35%), Gaps = 37/131 (28%)

Query: 73  GKSVQGRNLWAVEIT---HDVDSPDGRTLMKPMFK--------------------YVANM 109
           G+SV+G ++  V ++     VD    + L   + K                    +  N+
Sbjct: 2   GESVEGHDIPVVVVSDSSAKVDYYHNQALPTMLEKPPELADKIEDGTIEDYAPPIWFNNI 61

Query: 110 HGDETVGYALMVFLIQYLVLKD-----GKDDR---------ITQLLNSTDIYIVPSINPD 155
           H DET G    + +I+ L  +D       DD          +  LL +    + P+ NPD
Sbjct: 62  HPDETPGIDAQMEVIERLAYEDETEFNTTDDDGEEWTETVDVDDLLENVIFIVHPTENPD 121

Query: 156 GFAAAKEGNLN 166
           G       N N
Sbjct: 122 GRTYLTRENSN 132


>gnl|CDD|199841 cd03856, M14_Nna1_like, Peptidase M14-like domain of ATP/GTP
           binding proteins, cytosolic carboxypeptidases and
           related proteins.  Peptidase M14-like domain of Nna-1
           (Nervous system Nuclear protein induced by Axotomy),
           also known as ATP/GTP binding protein (AGTPBP-1) and
           cytosolic carboxypeptidase (CCP), and related proteins.
           The Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. This subfamily includes the human AGTPBP-1
           and AGBL -2, -3, -4, and -5, and the mouse Nna1/CCP-1
           and CCP -2 through -6. Nna1-like proteins are active
           metallopeptidases that are thought to act on cytosolic
           proteins such as alpha-tubulin, to remove a C-terminal
           tyrosine. Nna1 is widely expressed in the developing and
           adult nervous systems, including cerebellar Purkinje and
           granule neurons, mitral cells of the olfactory bulb and
           retinal photoreceptors. Nna1 is also induced in
           axotomized motor neurons. Mutations in Nna1 cause
           Purkinje cell degeneration (pcd). The Nna1 CP domain is
           required to prevent the retinal photoreceptor loss and
           cerebellar ataxia phenotypes of pcd mice, and a
           functional zinc-binding domain is needed for Nna-1 to
           support neuron survival in these mice. Nna1-like
           proteins from the different phyla are highly diverse,
           but they all contain a characteristic N-terminal
           conserved domain right before the CP domain. It has been
           suggested that this N-terminal domain might act as a
           folding domain.
          Length = 258

 Score = 45.4 bits (107), Expect = 4e-05
 Identities = 32/121 (26%), Positives = 49/121 (40%), Gaps = 22/121 (18%)

Query: 380 LVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQ 439
            +  ET+G+S +GR +    I     N             +A  H  ET     +   ++
Sbjct: 27  SLEVETLGQSEKGRPI-KALIKQSPKN-------NEWIVIIARQHPGETTAAWALFGFLE 78

Query: 440 YLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNRN 499
            L+L +      T L    +  IVP +NPDG             G   R NA+G+DLNR+
Sbjct: 79  ALLLDEALS---TPLFERFNFLIVPDLNPDG----------VETGNW-RHNANGIDLNRD 124

Query: 500 F 500
           +
Sbjct: 125 W 125



 Score = 38.0 bits (88), Expect = 0.009
 Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 11/91 (12%)

Query: 66  LVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQ 125
            +  ET+G+S +GR + A+      +              +A  H  ET     +   ++
Sbjct: 27  SLEVETLGQSEKGRPIKALIKQSPKN--------NEWIVIIARQHPGETTAAWALFGFLE 78

Query: 126 YLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
            L+L +      T L    +  IVP +NPDG
Sbjct: 79  ALLLDEALS---TPLFERFNFLIVPDLNPDG 106


>gnl|CDD|133118 cd06908, M14_AGBL4_like, Peptidase M14-like domain of ATP/GTP
           binding protein AGBL-4 and related proteins.  Peptidase
           M14-like domain of ATP/GTP binding protein-like
           (AGBL)-4, and related proteins. The Peptidase M14 family
           of metallocarboxypeptidases are zinc-binding
           carboxypeptidases (CPs) which hydrolyze single,
           C-terminal amino acids from polypeptide chains, and have
           a recognition site for the free C-terminal carboxyl
           group, which is a key determinant of specificity. This
           eukaryotic subgroup includes the human AGBL4 and the
           mouse cytosolic carboxypeptidase (CCP)-6. ATP/GTP
           binding protein (AGTPBP-1/Nna1)-like proteins are active
           metallopeptidases that are thought to act on cytosolic
           proteins such as alpha-tubulin, to remove a C-terminal
           tyrosine. Mutations in AGTPBP-1/Nna1 cause Purkinje cell
           degeneration (pcd). AGTPBP-1/Nna1 however does not
           belong to this subgroup. AGTPBP-1/Nna1-like proteins
           from the different phyla are highly diverse, but they
           all contain a unique N-terminal conserved domain right
           before the CP domain. It has been suggested that this
           N-terminal domain might act as a folding domain.
          Length = 261

 Score = 45.5 bits (108), Expect = 5e-05
 Identities = 41/158 (25%), Positives = 63/158 (39%), Gaps = 28/158 (17%)

Query: 375 KSYPHLVRAETIGKSVQGRNLWAVEITH-DVDNPDGRTLMKPMFKYVANMHGDETVGYAL 433
           K      R E +G+SVQ R L  + I   D    D     K +    A +H  E+    +
Sbjct: 6   KRSLDYFRREQLGQSVQKRRLDLLTIDSPDNLREDSE---KKVIFITARVHPGESPSSYV 62

Query: 434 MVFLIQYLVLKDGKDDRITQLLNSTDIY-IVPSINPDGFAAAKEGKCDSLDGYVGRKNAH 492
              LI +LV     +  I ++L    ++ IVP +NPDG           L  Y  R +  
Sbjct: 63  CQGLIDFLV----SNHPIAKVLREHLVFKIVPMLNPDGV---------FLGNY--RCSLM 107

Query: 493 GVDLNRNFPDQFEYEAKKVYEPETQAIMNFI---YSNP 527
           G DLNR++ D   +       P   A+ N +    ++ 
Sbjct: 108 GHDLNRHWHDPSPWA-----HPTLHAVKNLLKELDNDS 140



 Score = 38.2 bits (89), Expect = 0.009
 Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 7/97 (7%)

Query: 61  KSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYVANMHGDETVGYALM 120
           K      R E +G+SVQ R L  + I    +  +     K +    A +H  E+    + 
Sbjct: 6   KRSLDYFRREQLGQSVQKRRLDLLTIDSPDNLREDSE--KKVIFITARVHPGESPSSYVC 63

Query: 121 VFLIQYLVLKDGKDDRITQLLNSTDIY-IVPSINPDG 156
             LI +LV     +  I ++L    ++ IVP +NPDG
Sbjct: 64  QGLIDFLV----SNHPIAKVLREHLVFKIVPMLNPDG 96


>gnl|CDD|199865 cd06241, M14-like_1_4, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavage.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 266

 Score = 44.6 bits (106), Expect = 8e-05
 Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 13/139 (9%)

Query: 368 AFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDN-PDG-RTLMKPMFKYVANMHG 425
           A++ +   + P L+R E+ GK+ +GR L  + ++ D    P+  +   KP+    A +H 
Sbjct: 1   AYLQRLAAASP-LIRLESFGKTPEGRPLHLLVLSKDGAFDPEEAKRSGKPVVLVNAGIHP 59

Query: 426 DETVGY-ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDG 484
            E  G  A ++ L     L DGK D    LL+   +  +P  N DG          + +G
Sbjct: 60  GEIDGKDAGLMLLRD---LADGKKD---ALLDKVVLVFIPVFNVDGHERRSPYNRTNQNG 113

Query: 485 YVG---RKNAHGVDLNRNF 500
                 R NA  ++LNR+F
Sbjct: 114 PEEYGWRGNARNLNLNRDF 132



 Score = 36.9 bits (86), Expect = 0.028
 Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 10/106 (9%)

Query: 54  AFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVD-SPDG-RTLMKPMFKYVANMHG 111
           A++ +   + P L+R E+ GK+ +GR L  + ++ D    P+  +   KP+    A +H 
Sbjct: 1   AYLQRLAAASP-LIRLESFGKTPEGRPLHLLVLSKDGAFDPEEAKRSGKPVVLVNAGIHP 59

Query: 112 DETVGY-ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
            E  G  A ++ L     L DGK D    LL+   +  +P  N DG
Sbjct: 60  GEIDGKDAGLMLLRD---LADGKKD---ALLDKVVLVFIPVFNVDG 99


>gnl|CDD|199862 cd06238, M14-like_1_1, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies.  Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavage.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 270

 Score = 43.1 bits (102), Expect = 3e-04
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 6/51 (11%)

Query: 108 NMHGDETVGY--ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
           ++HG+E  G   AL+     Y +    + D I  LL++T + I P  NPDG
Sbjct: 56  SVHGNEISGTEAALLT---AYHLAA-AQGDEIEALLDNTVVLIDPMQNPDG 102



 Score = 43.1 bits (102), Expect = 3e-04
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 6/51 (11%)

Query: 422 NMHGDETVGY--ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 470
           ++HG+E  G   AL+     Y +    + D I  LL++T + I P  NPDG
Sbjct: 56  SVHGNEISGTEAALLT---AYHLAA-AQGDEIEALLDNTVVLIDPMQNPDG 102


>gnl|CDD|199859 cd06235, M14_Nna1_like_2, Peptidase M14-like domain of human
           Nna1/AGTPBP-1, AGBL2 -5, and related proteins.  Subgroup
           of the Peptidase M14-like domain of Nna-1 (Nervous
           system Nuclear protein induced by Axotomy), also known
           as ATP/GTP binding protein (AGTPBP-1) and cytosolic
           carboxypeptidase (CCP), and related proteins. The
           Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. This eukaryotic subgroup includes the human
           Nna1/AGTPBP-1 and AGBL -2, -3, -4, and -5, and the mouse
           Nna1/CCP-1 and CCP -2 through -6. Nna1-like proteins are
           active metallopeptidases that are thought to act on
           cytosolic proteins such as alpha-tubulin, to remove a
           C-terminal tyrosine. Nna1 is widely expressed in the
           developing and adult nervous systems, including
           cerebellar Purkinje and granule neurons, mitral cells of
           the olfactory bulb and retinal photoreceptors. Nna1 is
           also induced in axotomized motor neurons. Mutations in
           Nna1 cause Purkinje cell degeneration (pcd). The Nna1 CP
           domain is required to prevent the retinal photoreceptor
           loss and cerebellar ataxia phenotypes of pcd mice, and a
           functional zinc-binding domain is needed for Nna-1 to
           support neuron survival in these mice. Nna1-like
           proteins from the different phyla are highly diverse,
           but they all contain a unique N-terminal conserved
           domain right before the CP domain. It has been suggested
           that this N-terminal domain might act as a folding
           domain.
          Length = 258

 Score = 42.6 bits (101), Expect = 3e-04
 Identities = 35/164 (21%), Positives = 57/164 (34%), Gaps = 32/164 (19%)

Query: 381 VRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQY 440
           ++ + +  ++ G  +  + IT            K +    A  H  ET    +M   I +
Sbjct: 13  LKRKILCTTLGGLPIPLLTITSPSSKSI-PIKKKKVIVITARQHPGETNSSFVMQGFIDF 71

Query: 441 LVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVG--RKNAHGVDLNR 498
           L+           L  +    I+P +NPDG                G  R +  G+DLNR
Sbjct: 72  LL---SDSPEAQYLRENFIFKIIPMLNPDGV-------------IHGNYRCSLSGIDLNR 115

Query: 499 NF--PDQFEYEAKKVYEPETQAIMNFIYS----NPFVLSGNLHG 536
            +  PD       K   PE  ++   I         +L  +LHG
Sbjct: 116 QWKNPD-------KKLHPEIYSVKQLIKKLSQERNIILFIDLHG 152



 Score = 36.8 bits (86), Expect = 0.024
 Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 10/93 (10%)

Query: 67  VRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYV---ANMHGDETVGYALMVFL 123
           ++ + +  ++ G  +  + IT    SP  +++     K +   A  H  ET    +M   
Sbjct: 13  LKRKILCTTLGGLPIPLLTIT----SPSSKSIPIKKKKVIVITARQHPGETNSSFVMQGF 68

Query: 124 IQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
           I +L+           L  +    I+P +NPDG
Sbjct: 69  IDFLL---SDSPEAQYLRENFIFKIIPMLNPDG 98


>gnl|CDD|226135 COG3608, COG3608, Predicted deacylase [General function prediction
           only].
          Length = 331

 Score = 43.2 bits (102), Expect = 3e-04
 Identities = 39/188 (20%), Positives = 59/188 (31%), Gaps = 48/188 (25%)

Query: 370 ILKTVKSYPHLVRAETIGKSV------QGRNLWAVEITHDVD--NPDGRTLMKPMFKYVA 421
           +  +            +   V      +  + +    T      N  G +++       A
Sbjct: 1   MEGSKIVLTGDFDGGGVQLGVLRFPYSRDDSAYGHVGTPVFVFGNGPGPSVL-----LQA 55

Query: 422 NMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDS 481
            +HGDE  G   +  LI  L   D  D   T       + IVP  NP  F A   G+   
Sbjct: 56  GVHGDELPGVIALRRLIPAL---DPADISGT-------VIIVPIANPPAFEAQ--GRFSP 103

Query: 482 LDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEPETQAIMNFIYS-----NPFVLSGNLHG 536
            D           +LNR FP + +          T+ I + +          VL  +LH 
Sbjct: 104 GD---------DTNLNRAFPGRPD-------GSATERIADRLKRLLLPLADIVL--DLHS 145

Query: 537 GAVARDYA 544
           G    DY 
Sbjct: 146 GGEGLDYL 153



 Score = 35.5 bits (82), Expect = 0.074
 Identities = 23/79 (29%), Positives = 30/79 (37%), Gaps = 21/79 (26%)

Query: 94  DGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSIN 153
            G +++       A +HGDE  G   +  LI  L   D  D   T       + IVP  N
Sbjct: 47  PGPSVL-----LQAGVHGDELPGVIALRRLIPAL---DPADISGT-------VIIVPIAN 91

Query: 154 PDGFAAA------KEGNLN 166
           P  F A        + NLN
Sbjct: 92  PPAFEAQGRFSPGDDTNLN 110


>gnl|CDD|222338 pfam13715, Cna_B_2, Cna protein B-type domain.  This domain is
           found in Staphylococcus aureus collagen-binding surface
           protein. However, this region does not mediate collagen
           binding, the pfam05737 region carries out that function.
           The structure of the repetitive B-region has been solved
           and forms a beta sandwich structure. It is thought that
           this region forms a stalk in Staphylococcus aureus
           collagen-binding protein that presents the ligand
           binding domain away from the bacterial cell surface.
          Length = 86

 Score = 38.4 bits (90), Expect = 8e-04
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 222 VYGIVTD-TYGNPLPSAIITVRWNDKAVTVTNR-GEYWRLLARGKYVVTASAPGYEPVTT 279
           + G V D   G PLP A I V+      TVT+  G +   +  G Y +  S  GY+  T 
Sbjct: 2   IKGKVVDSDTGEPLPGATIFVK-GSNVGTVTDADGNFTIKVKEGDYTLVISYVGYKTKT- 59

Query: 280 EPLDVPDTESVRLDFML 296
             + V   +   L  +L
Sbjct: 60  --IKVTLNKGNVLTIVL 74



 Score = 38.4 bits (90), Expect = 8e-04
 Identities = 23/77 (29%), Positives = 33/77 (42%), Gaps = 6/77 (7%)

Query: 627 VYGIVTD-TYGNPLPSAIITVRWNDKAVTVTNR-GEYWRLLARGKYVVTASAPGYEPVTT 684
           + G V D   G PLP A I V+      TVT+  G +   +  G Y +  S  GY+  T 
Sbjct: 2   IKGKVVDSDTGEPLPGATIFVK-GSNVGTVTDADGNFTIKVKEGDYTLVISYVGYKTKT- 59

Query: 685 EPLDVPDTESVRLDFML 701
             + V   +   L  +L
Sbjct: 60  --IKVTLNKGNVLTIVL 74


>gnl|CDD|133117 cd06907, M14_AGBL2-3_like, Peptidase M14-like domain of ATP/GTP
           binding protein AGBL-2 and AGBL-3, and related proteins.
            Peptidase M14-like domain of ATP/GTP binding
           protein-like (AGBL)-2, and related proteins. The
           Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. This subgroup includes the human AGBL-2,
           and -3, and the mouse cytosolic carboxypeptidase
           (CCPs)-2, and -3. ATP/GTP binding protein
           (AGTPBP-1/Nna1)-like proteins are active
           metallopeptidases that are thought to act on cytosolic
           proteins such as alpha-tubulin, to remove a C-terminal
           tyrosine. Mutations in AGTPBP-1/Nna1 cause Purkinje cell
           degeneration (pcd). AGTPBP-1/Nna1 however does not
           belong to this subgroup. AGTPBP-1/Nna1-like proteins
           from the different phyla are highly diverse, but they
           all contain a unique N-terminal conserved domain right
           before the CP domain. It has been suggested that this
           N-terminal domain might act as a folding domain.
          Length = 261

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 37/142 (26%), Positives = 55/142 (38%), Gaps = 27/142 (19%)

Query: 366 LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYV--ANM 423
           L       V+S     +   + +++ G  ++ + IT    NP    L       V  A +
Sbjct: 1   LMEIANDPVRS--KFCKLRVLCRTLAGNTVYLLTITSPSSNPS---LAAAKKAVVLTARV 55

Query: 424 HGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIY-IVPSINPDGFAAAKEGKCDSL 482
           H  ET    +M   + +L      +    QLL  T I+ IVP +NPDG            
Sbjct: 56  HPGETNASWMMKGFLDFLT----SNSPDAQLLRDTFIFKIVPMLNPDGVI---------- 101

Query: 483 DGYVG--RKNAHGVDLNRNFPD 502
              VG  R +  G DLNRN+  
Sbjct: 102 ---VGNYRCSLAGRDLNRNYKT 120



 Score = 33.1 bits (76), Expect = 0.36
 Identities = 27/108 (25%), Positives = 44/108 (40%), Gaps = 12/108 (11%)

Query: 52  LDAFILKTVKSYPHLVRAETIGKSVQGRNLWAVEITHDVDSPDGRTLMKPMFKYV--ANM 109
           L       V+S     +   + +++ G  ++ + IT    +P    L       V  A +
Sbjct: 1   LMEIANDPVRS--KFCKLRVLCRTLAGNTVYLLTITSPSSNPS---LAAAKKAVVLTARV 55

Query: 110 HGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIY-IVPSINPDG 156
           H  ET    +M   + +L      +    QLL  T I+ IVP +NPDG
Sbjct: 56  HPGETNASWMMKGFLDFLT----SNSPDAQLLRDTFIFKIVPMLNPDG 99


>gnl|CDD|199857 cd06231, M14-like_4, Peptidase M14-like domain; uncharacterized
           subfamily.  A functionally uncharacterized subgroup of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 234

 Score = 38.5 bits (90), Expect = 0.006
 Identities = 22/81 (27%), Positives = 29/81 (35%), Gaps = 23/81 (28%)

Query: 421 ANMHGDETVG-YALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
           A +HG E  G    + FL                L    ++ ++P INP GF    E   
Sbjct: 49  AGVHGYEPAGPLGALEFL----------RAAAADLFQDVNLSVLPCINPSGF----EAIT 94

Query: 480 DSLDGYVGRKNAHGVDLNRNF 500
                   R N  G D NR+F
Sbjct: 95  --------RWNPTGEDPNRSF 107


>gnl|CDD|199864 cd06240, M14-like_1_3, Peptidase M14-like domain; uncharacterized
           subgroup.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies.  Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 274

 Score = 36.6 bits (85), Expect = 0.030
 Identities = 24/113 (21%), Positives = 42/113 (37%), Gaps = 26/113 (23%)

Query: 68  RAETIGKSVQGRNLWAVEITHD---------------------VDSPDGRTLM---KPMF 103
           +  TIGK+ +GR      I+                       +     R L    K + 
Sbjct: 1   KLVTIGKTEEGREQIMAAISSPENLAKLDHYKAILRKLADPRGLSEEQARKLAAEGKAIV 60

Query: 104 KYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 156
                +H  ET    +++ L   L  ++     I ++L++  + IVP +NPDG
Sbjct: 61  WITGGLHSTETGPPQMLMELAYRLASEE--SPEIKRILDNVILLIVPVLNPDG 111



 Score = 36.6 bits (85), Expect = 0.033
 Identities = 24/113 (21%), Positives = 42/113 (37%), Gaps = 26/113 (23%)

Query: 382 RAETIGKSVQGRNLWAVEITHD---------------------VDNPDGRTLM---KPMF 417
           +  TIGK+ +GR      I+                       +     R L    K + 
Sbjct: 1   KLVTIGKTEEGREQIMAAISSPENLAKLDHYKAILRKLADPRGLSEEQARKLAAEGKAIV 60

Query: 418 KYVANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDG 470
                +H  ET    +++ L   L  ++     I ++L++  + IVP +NPDG
Sbjct: 61  WITGGLHSTETGPPQMLMELAYRLASEE--SPEIKRILDNVILLIVPVLNPDG 111


>gnl|CDD|133086 cd06228, M14-like_3, Peptidase M14-like domain; uncharacterized
           subfamily.  A functionally uncharacterized subgroup of
           the M14 family of metallocarboxypeptidases (MCPs). The
           M14 family are zinc-binding carboxypeptidases (CPs)
           which hydrolyze single, C-terminal amino acids from
           polypeptide chains, and have a recognition site for the
           free C-terminal carboxyl group, which is a key
           determinant of specificity. Two major subfamilies of the
           M14 family, defined based on sequence and structural
           homology, are the A/B and N/E subfamilies. Enzymes
           belonging to the A/B subfamily are normally synthesized
           as inactive precursors containing preceding signal
           peptide, followed by an N-terminal pro-region linked to
           the enzyme; these proenzymes are called
           procarboxypeptidases. The A/B enzymes can be further
           divided based on their substrate specificity;
           Carboxypeptidase A-like (CPA-like) enzymes favor
           hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others. Another MCP subfamily is
           that of succinylglutamate desuccinylase/aspartoacylase,
           which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 332

 Score = 35.9 bits (83), Expect = 0.060
 Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 25/101 (24%)

Query: 459 DIYIVPSINPDGFA-AAKEGKCDSLDGYVGRKN-----------AHGVDLNRNFPDQFEY 506
            I + P +NPDG A       C        RKN           + GVD+NRNF   F +
Sbjct: 84  GIVVFPLVNPDGRAHDQTANSC-------WRKNRNPASAGPNPSSVGVDINRNF--DFLW 134

Query: 507 EAKKVYEPETQAIMNFIYSNPFVLSGNLHGGAVARDYASRN 547
           + +K ++P   A  +   ++P   S   HG A   +  +RN
Sbjct: 135 DFQKYFDP--GASRSVASTDP--ASETFHGTAAFSEPETRN 171


>gnl|CDD|222623 pfam14247, DUF4344, Domain of unknown function (DUF4344).  This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 247 and
           291 amino acids in length. There is a conserved EED
           sequence motif.
          Length = 220

 Score = 35.3 bits (82), Expect = 0.061
 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 16  TLSTAADYYFDFDDLTGLEDADSA---GIDLQRYYN 48
            +S A  +  + +D   L+D   A    +DLQRYYN
Sbjct: 139 AISAADLFALESEDEEDLDDEPFADEHSLDLQRYYN 174



 Score = 35.3 bits (82), Expect = 0.061
 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 330 TLSTAADYYFDFDDLTGLEDADSA---GIDLQRYYN 362
            +S A  +  + +D   L+D   A    +DLQRYYN
Sbjct: 139 AISAADLFALESEDEEDLDDEPFADEHSLDLQRYYN 174


>gnl|CDD|218345 pfam04952, AstE_AspA, Succinylglutamate desuccinylase /
           Aspartoacylase family.  This family includes
           Succinylglutamate desuccinylase EC:3.1.-.- that
           catalyzes the fifth and last step in arginine catabolism
           by the arginine succinyltransferase pathway. The family
           also includes aspartoacylase EC:3.5.1.15 which cleaves
           acylaspartate into a fatty acid and aspartate. Mutations
           in human ASPA lead to Canavan disease. This
           family is probably structurally related to pfam00246
           (Bateman A pers. obs.).
          Length = 289

 Score = 35.0 bits (81), Expect = 0.093
 Identities = 24/108 (22%), Positives = 38/108 (35%), Gaps = 22/108 (20%)

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            A +HG+E  G  L+  L++ L        R           +VP  NP  F A    + 
Sbjct: 8   SAGIHGNELAGVELLRRLLRQLDPAQRAGTRT----------LVPLANPPAFRAGS--RY 55

Query: 480 DSLDGYVGRKNAHGVDLNRNFPDQFEYEAKKVYEP-ETQAIMNFIYSN 526
             +D           DLNR FP + +  +   Y     + I +  +  
Sbjct: 56  IPID---------RRDLNRAFPGRADGSSGLPYRATRAERIADAFFRA 94



 Score = 31.9 bits (73), Expect = 0.91
 Identities = 17/67 (25%), Positives = 24/67 (35%), Gaps = 16/67 (23%)

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAK---- 161
            A +HG+E  G  L+  L++ L        R           +VP  NP  F A      
Sbjct: 8   SAGIHGNELAGVELLRRLLRQLDPAQRAGTRT----------LVPLANPPAFRAGSRYIP 57

Query: 162 --EGNLN 166
               +LN
Sbjct: 58  IDRRDLN 64


>gnl|CDD|214751 smart00634, BID_1, Bacterial Ig-like domain (group 1). 
          Length = 92

 Score = 32.7 bits (75), Expect = 0.13
 Identities = 15/65 (23%), Positives = 20/65 (30%), Gaps = 11/65 (16%)

Query: 226 VTDTYGNPLPSAIITVRWNDKAVTVTNRGEYW---RLLAR--------GKYVVTASAPGY 274
           VTD  GNP+    +T           ++G        +A         G Y VTAS    
Sbjct: 26  VTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLENG 85

Query: 275 EPVTT 279
                
Sbjct: 86  SSSVK 90



 Score = 32.7 bits (75), Expect = 0.13
 Identities = 15/65 (23%), Positives = 20/65 (30%), Gaps = 11/65 (16%)

Query: 631 VTDTYGNPLPSAIITVRWNDKAVTVTNRGEYW---RLLAR--------GKYVVTASAPGY 679
           VTD  GNP+    +T           ++G        +A         G Y VTAS    
Sbjct: 26  VTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLENG 85

Query: 680 EPVTT 684
                
Sbjct: 86  SSSVK 90


>gnl|CDD|220930 pfam10994, DUF2817, Protein of unknown function (DUF2817).  This
           family of proteins has no known function.
          Length = 341

 Score = 33.0 bits (76), Expect = 0.47
 Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 25/83 (30%)

Query: 423 MHGDETVGY---ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            HG E  G+   A+ V L++        +     L     + +V ++NP GFA       
Sbjct: 60  THGVE--GFCGSAIQVALLR--------ELLPRSLPAGVAVLLVHALNPYGFA------- 102

Query: 480 DSLDGYVGRKNAHGVDLNRNFPD 502
                ++ R N + VDLNRNF D
Sbjct: 103 -----HLRRVNENNVDLNRNFLD 120


>gnl|CDD|133110 cd06253, M14_ASTE_ASPA_like_3, Peptidase M14 Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA)-like;
           uncharacterized subgroup.  A functionally
           uncharacterized subgroup of the Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily
           which is part of the M14 family of
           metallocarboxypeptidases. ASTE catalyzes the fifth and
           last step in arginine catabolism by the arginine
           succinyltransferase pathway, and aspartoacylase (ASPA,
           also known as aminoacylase 2, and ACY-2; EC:3.5.1.15)
           cleaves N-acetyl L-aspartic acid (NAA) into aspartate
           and acetate. NAA is abundant in the brain, and
           hydrolysis of NAA by ASPA may help maintain white
           matter. ASPA is an NAA scavenger in other tissues.
           Mutations in the gene encoding ASPA cause Canavan
           disease (CD), a fatal progressive neurodegenerative
           disorder involving dysmyelination and spongiform
           degeneration of white matter in children. This enzyme
           binds zinc which is necessary for activity. Measurement
           of elevated NAA levels in urine is used in the diagnosis
           of CD.
          Length = 298

 Score = 32.6 bits (75), Expect = 0.60
 Identities = 26/99 (26%), Positives = 39/99 (39%), Gaps = 20/99 (20%)

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
           V  +HGDE  G  +   LI++L      + R    LN   + ++PS+NP G         
Sbjct: 25  VGGIHGDELQGLYICSLLIRFL---KELEKRGPLKLNG-IVDVIPSVNPLGLNLGTRFW- 79

Query: 480 DSLDGYVGRKNAHGVDLNRNFP-----DQFEYEAKKVYE 513
              D           D+NR FP     +  +  A  V+E
Sbjct: 80  -PTD---------NSDINRMFPGDPQGETTQRIAAAVFE 108



 Score = 29.2 bits (66), Expect = 8.3
 Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 106 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAK 161
           V  +HGDE  G  +   LI++L      + R    LN   + ++PS+NP G     
Sbjct: 25  VGGIHGDELQGLYICSLLIRFL---KELEKRGPLKLNG-IVDVIPSVNPLGLNLGT 76


>gnl|CDD|199860 cd06236, M14_AGBL5_like, Peptidase M14-like domain of ATP/GTP
           binding protein (AGBL)-5 and related proteins.
           Peptidase M14-like domain of ATP/GTP binding
           protein_like (AGBL)-5, and related proteins. The
           Peptidase M14 family of metallocarboxypeptidases are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. This eukaryotic subgroup includes the human
           AGBL5 and the mouse cytosolic carboxypeptidase (CCP)-5.
           ATP/GTP binding protein (AGTPBP-1/Nna1)-like proteins
           are active metallopeptidases that are thought to act on
           cytosolic proteins such as alpha-tubulin, to remove a
           C-terminal tyrosine. Mutations in AGTPBP-1/Nna1 cause
           Purkinje cell degeneration (pcd). AGTPBP-1/Nna1 however
           does not belong to this subgroup. AGTPBP-1/Nna1-like
           proteins from the different phyla are highly diverse,
           but they all contain a unique N-terminal conserved
           domain right before the CP domain. It has been suggested
           that this N-terminal domain might act as a folding
           domain.
          Length = 308

 Score = 32.4 bits (74), Expect = 0.69
 Identities = 38/156 (24%), Positives = 59/156 (37%), Gaps = 41/156 (26%)

Query: 384 ETIGKSVQGRNLWAVEIT--HDVDN----------PDGRTLMKPMF---KYV---ANMHG 425
           E +  S+ GR +  + IT  H +            PD  T     F   +     + +H 
Sbjct: 19  EVLCYSLDGRRVDLLTITSCHGLLEEREPRLPGLFPDRSTPRPFRFTGKRVFFLSSRVHP 78

Query: 426 DETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGY 485
            ET    +    +++L+ KD  D R   L  +    ++P +NPDG              Y
Sbjct: 79  GETPSSFVFNGFLKFLLRKD--DPRAQALRRNFVFKLIPMLNPDGV-------------Y 123

Query: 486 VG--RKNAHGVDLNRNF--PDQFE----YEAKKVYE 513
            G  R +  GV+LNR +  PD       Y AK +  
Sbjct: 124 RGHYRTDTRGVNLNRVYLNPDPQLHPSIYAAKALLL 159


>gnl|CDD|224387 COG1470, COG1470, Predicted membrane protein [Function unknown].
          Length = 513

 Score = 32.4 bits (74), Expect = 0.77
 Identities = 20/106 (18%), Positives = 39/106 (36%), Gaps = 3/106 (2%)

Query: 222 VYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLARGKYVVTASAPGYEPVTTEP 281
               + D  G  +P A++         +    G     +  GKYVV  +  G        
Sbjct: 189 SKLRLVDDDGAGVPKALVKDGNESFETSSK--GNLEVEITPGKYVVLIAKKGIYEKKKRA 246

Query: 282 LDVPDTESVRLDFML-GKKNAFSIDQLKNPFIEIMTLIVPSLSLLL 326
           + + D E+  L   +  KK+ F+     + ++EI      S ++ +
Sbjct: 247 VKLNDGETKDLKLSVTEKKSYFTELNSSDIYLEISPSTTASFTVSI 292



 Score = 30.9 bits (70), Expect = 2.8
 Identities = 16/83 (19%), Positives = 30/83 (36%), Gaps = 2/83 (2%)

Query: 627 VYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLARGKYVVTASAPGYEPVTTEP 686
               + D  G  +P A++         +    G     +  GKYVV  +  G        
Sbjct: 189 SKLRLVDDDGAGVPKALVKDGNESFETSSK--GNLEVEITPGKYVVLIAKKGIYEKKKRA 246

Query: 687 LDVPDTESVRLDFMLGKKNAFVT 709
           + + D E+  L   + +K ++ T
Sbjct: 247 VKLNDGETKDLKLSVTEKKSYFT 269


>gnl|CDD|133091 cd06233, M14-like_6, Peptidase M14-like domain; uncharacterized
           subfamily.  Peptidase M14-like domain of a functionally
           uncharacterized subgroup of the M14 family of
           metallocarboxypeptidases (MCPs). The M14 family are
           zinc-binding carboxypeptidases (CPs) which hydrolyze
           single, C-terminal amino acids from polypeptide chains,
           and have a recognition site for the free C-terminal
           carboxyl group, which is a key determinant of
           specificity. Two major subfamilies of the M14 family,
           defined based on sequence and structural homology, are
           the A/B and N/E subfamilies. Enzymes belonging to the
           A/B subfamily are normally synthesized as inactive
           precursors containing preceding signal peptide, followed
           by an N-terminal pro-region linked to the enzyme; these
           proenzymes are called procarboxypeptidases. The A/B
           enzymes can be further divided based on their substrate
           specificity; Carboxypeptidase A-like (CPA-like) enzymes
           favor hydrophobic residues while carboxypeptidase B-like
           (CPB-like) enzymes only cleave the basic residues lysine
           or arginine. The A forms have slightly different
           specificities, with Carboxypeptidase A1 (CPA1)
           preferring aliphatic and small aromatic residues, and
           CPA2 preferring the bulky aromatic side chains. Enzymes
           belonging to the N/E subfamily are not produced
           as inactive precursors and instead rely on their
           substrate specificity and subcellular
           compartmentalization to prevent inappropriate cleavages.
           They contain an extra C-terminal transthyretin-like
           domain, thought to be involved in folding or formation
           of oligomers.  MCPs can also be classified based on
           their involvement in specific physiological processes;
           the pancreatic MCPs participate only in alimentary
           digestion and include carboxypeptidase A and B (A/B
           subfamily), while others, namely regulatory MCPs or the
           N/E subfamily, are involved in more selective reactions,
           mainly in non-digestive tissues and fluids, acting on
           blood coagulation/fibrinolysis, inflammation and local
           anaphylaxis, pro-hormone and neuropeptide processing,
           cellular response and others.  Another MCP subfamily
           is that of succinylglutamate desuccinylase
           /aspartoacylase, which hydrolyzes N-acetyl-L-aspartate
           (NAA), and deficiency in which is the established cause
           of Canavan disease. Another subfamily (referred to as
           subfamily C) includes an exceptional type of activity in
           the MCP family, that of dipeptidyl-peptidase activity of
           gamma-glutamyl-(L)-meso-diaminopimelate peptidase I
           which is involved in bacterial cell wall metabolism.
          Length = 283

 Score = 32.2 bits (74), Expect = 0.80
 Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 32/97 (32%)

Query: 423 MHGDETVGY---ALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
            HG E  G+   A+ + L+        ++     L     + +V ++NP GFA       
Sbjct: 60  THGVE--GFCGSAIQLALL--------RELLPRSLPAGVAVLLVHALNPYGFA------- 102

Query: 480 DSLDGYVGRKNAHGVDLNRNF-------PDQFEYEAK 509
                ++ R N + VDLNRNF       P+   Y   
Sbjct: 103 -----HLRRVNENNVDLNRNFLDFSAPLPENPGYAEL 134


>gnl|CDD|129031 smart00795, Agro_virD5, Agrobacterium VirD5 protein.  The virD
           operon in Agrobacterium encodes a site-specific
           endonuclease, and a number of other poorly characterised
           products. This family represents the VirD5 protein.
          Length = 780

 Score = 32.2 bits (73), Expect = 1.0
 Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 13/91 (14%)

Query: 450 RITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLDGYVGRKNAHGVDLNRNFPDQFEYEAK 509
           RI +L  S  I       P+   + +  +    D    RK ++ +D + NFP+ FEY+  
Sbjct: 85  RIEELNTSASI-------PNECGSLQTDEEFRSDA-SKRKRSNLIDQDENFPESFEYDGM 136

Query: 510 KVYEPETQAIMNFIYSNP-----FVLSGNLH 535
           ++ EPE  A  N+    P      +L   L 
Sbjct: 137 RLGEPERAAYDNWSKPEPPTWKDLILKARLD 167


>gnl|CDD|148470 pfam06871, TraH_2, TraH_2.  This family consists of several TraH
           proteins which seem to be specific to Agrobacterium and
           Rhizobium species. This protein is thought to be
           involved in conjugal transfer but its function is
           unknown. This family does not appear to be related to
           pfam06122.
          Length = 206

 Score = 30.5 bits (69), Expect = 2.4
 Identities = 22/80 (27%), Positives = 31/80 (38%), Gaps = 13/80 (16%)

Query: 208 KESLIKLIENVHRG------VYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLA 261
           K  L+   EN+  G      V  IV   YGNP    ++   ++D A+     G +     
Sbjct: 88  KPDLVDACENLRMGTALFAKVLRIVAKWYGNPTSKDVLPQIFDD-AIYAWKTGYF----- 141

Query: 262 RGKYVVTASAPGYEPVTTEP 281
            G  V  A  PG   V  +P
Sbjct: 142 EGVSVFQAEDPG-GGVLRQP 160



 Score = 30.5 bits (69), Expect = 2.4
 Identities = 22/80 (27%), Positives = 31/80 (38%), Gaps = 13/80 (16%)

Query: 613 KESLIKLIENVHRG------VYGIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYWRLLA 666
           K  L+   EN+  G      V  IV   YGNP    ++   ++D A+     G +     
Sbjct: 88  KPDLVDACENLRMGTALFAKVLRIVAKWYGNPTSKDVLPQIFDD-AIYAWKTGYF----- 141

Query: 667 RGKYVVTASAPGYEPVTTEP 686
            G  V  A  PG   V  +P
Sbjct: 142 EGVSVFQAEDPG-GGVLRQP 160


>gnl|CDD|131518 TIGR02465, chlorocat_1_2, chlorocatechol 1,2-dioxygenase.  Members
           of this protein family are chlorocatechol
           1,2-dioxygenase. This protein is closely related to
           catechol 1,2-dioxygenase, TIGR02439, EC 1.13.11.1. Note
           that annotated database entries have appeared for the
           present protein family with the EC number that refers to
           that of family TIGR02439. This protein acts in pathways
           of the biodegradation of chlorinated aromatic compounds.
          Length = 246

 Score = 30.5 bits (69), Expect = 2.6
 Identities = 25/72 (34%), Positives = 33/72 (45%), Gaps = 16/72 (22%)

Query: 224 GIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYW-------RLLARGKYVVTASAPGYEP 276
           G V D  G P+  A+I V W+      T  G+Y            RGK +VTA+   YE 
Sbjct: 103 GTVRDLSGTPVAGAVIDV-WHS-----TPDGKYSGFHDNIPDDYYRGK-LVTAADGSYEV 155

Query: 277 VTTEPL--DVPD 286
            TT P+   +PD
Sbjct: 156 RTTMPVPYQIPD 167



 Score = 30.5 bits (69), Expect = 2.6
 Identities = 25/72 (34%), Positives = 33/72 (45%), Gaps = 16/72 (22%)

Query: 629 GIVTDTYGNPLPSAIITVRWNDKAVTVTNRGEYW-------RLLARGKYVVTASAPGYEP 681
           G V D  G P+  A+I V W+      T  G+Y            RGK +VTA+   YE 
Sbjct: 103 GTVRDLSGTPVAGAVIDV-WHS-----TPDGKYSGFHDNIPDDYYRGK-LVTAADGSYEV 155

Query: 682 VTTEPL--DVPD 691
            TT P+   +PD
Sbjct: 156 RTTMPVPYQIPD 167


>gnl|CDD|199856 cd06230, M14_ASTE_ASPA_like, Peptidase M14 Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily.
           The Peptidase M14 Succinylglutamate desuccinylase
           (ASTE)/aspartoacylase (ASPA) subfamily belongs to the
           M14 family of metallocarboxypeptidases (MCPs), and
           includes ASTE, which catalyzes the fifth and last step
           in arginine catabolism by the arginine
           succinyltransferase pathway, and aspartoacylase (ASPA,
           also known as aminoacylase 2, and ACY-2; EC:3.5.1.15)
           which cleaves N-acetyl L-aspartic acid (NAA) into
           aspartate and acetate. NAA is abundant in the brain, and
           hydrolysis of NAA by ASPA may help maintain white
           matter. ASPA is an NAA scavenger in other tissues.
           Mutations in the gene encoding ASPA cause Canavan
           disease (CD), a fatal progressive neurodegenerative
           disorder involving dysmyelination and spongiform
           degeneration of white matter in children. This enzyme
           binds zinc which is necessary for activity. Measurement
           of elevated NAA levels in urine is used in the diagnosis
           of CD.
          Length = 254

 Score = 30.4 bits (69), Expect = 2.8
 Identities = 20/84 (23%), Positives = 30/84 (35%), Gaps = 23/84 (27%)

Query: 420 VANMHGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKC 479
              +HG+E  G    V  I+ L L + +  +  +      + +VP+ NP    A      
Sbjct: 4   TGGVHGNEPCG----VQAIERL-LAELEPGQELKG----PVKLVPAANPLALEAG----- 49

Query: 480 DSLDGYVGRKNAHGVDLNRNFPDQ 503
                   R      DLNR FP  
Sbjct: 50  -------QRYGDR--DLNRIFPGD 64


>gnl|CDD|199872 cd06252, M14_ASTE_ASPA_like_2, Peptidase M14 Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA)-like;
           uncharacterized subgroup.  A functionally
           uncharacterized subgroup of the Succinylglutamate
           desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily
           which is part of the M14 family of
           metallocarboxypeptidases. ASTE catalyzes the fifth and
           last step in arginine catabolism by the arginine
           succinyltransferase pathway, and aspartoacylase (ASPA,
           also known as aminoacylase 2, and ACY-2; EC:3.5.1.15)
           cleaves N-acetyl L-aspartic acid (NAA) into aspartate
           and acetate. NAA is abundant in the brain, and
           hydrolysis of NAA by ASPA may help maintain white
           matter. ASPA is an NAA scavenger in other tissues.
           Mutations in the gene encoding ASPA cause Canavan
           disease (CD), a fatal progressive neurodegenerative
           disorder involving dysmyelination and spongiform
           degeneration of white matter in children. This enzyme
           binds zinc which is necessary for activity. Measurement
           of elevated NAA levels in urine is used in the diagnosis
           of CD.
          Length = 318

 Score = 29.9 bits (68), Expect = 4.9
 Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 21/78 (26%)

Query: 424 HGDETVGYALMVFLIQYLVLKDGKDDRITQLLNSTDIYIVPSINPDGFAAAKEGKCDSLD 483
           HGDE  G   ++ L             +        + IVP++N   F A    +   +D
Sbjct: 44  HGDEYEGQIALLRLA----------RTLDPEDVRGRVIIVPALNFPAFQAGT--RTSPID 91

Query: 484 GYVGRKNAHGVDLNRNFP 501
                    G +LNR FP
Sbjct: 92  ---------GGNLNRAFP 100


>gnl|CDD|219229 pfam06920, Ded_cyto, Dedicator of cytokinesis.  This family
           represents a conserved region approximately 200 residues
           long within a number of eukaryotic dedicator of
           cytokinesis proteins. These are potential guanine
           nucleotide exchange factors, which activate some small
           GTPases by exchanging bound GDP for free GTP.
          Length = 178

 Score = 28.8 bits (65), Expect = 7.6
 Identities = 5/19 (26%), Positives = 8/19 (42%)

Query: 519 IMNFIYSNPFVLSGNLHGG 537
           +  F++  PF   G   G 
Sbjct: 9   VNRFVFDTPFTKGGKKQGE 27


>gnl|CDD|212572 cd11699, DHR2_DOCK10, Dock Homology Region 2, a GEF domain, of
           Class D Dedicator of Cytokinesis 10.  Dock10, also
           called Zizimin3, is an atypical guanine nucleotide
           exchange factor (GEF) that lacks the conventional Dbl
           homology (DH) domain. As a GEF, it activates the small
           GTPase Cdc42 by exchanging bound GDP for free GTP.
           Dock10 is preferentially expressed in lymphocytes and
           may play a role in interleukin-4 induced activation of B
           cells. It may also play a role in the invasion of tumor
           cells. DOCK proteins are divided into four classes (A-D)
           based on sequence similarity and domain architecture;
           class D includes Dock9, 10 and 11. All DOCKs contain two
           homology domains: the DHR-1 (Dock homology region-1),
           also called CZH1 (CED-5, Dock180, and MBC-zizimin
           homology 1), and DHR-2 (also called CZH2 or Docker). The
           DHR-1 domain binds
           phosphatidylinositol-3,4,5-trisphosphate. This alignment
           model represents the DHR-2 domain of Dock10, which
           contains the catalytic GEF activity for Cdc42. Class D
           DOCKs also contain a Pleckstrin homology (PH) domain at
           the N-terminus.
          Length = 446

 Score = 29.2 bits (65), Expect = 7.8
 Identities = 14/38 (36%), Positives = 19/38 (50%)

Query: 500 FPDQFEYEAKKVYEPETQAIMNFIYSNPFVLSGNLHGG 537
           + D+ E E +K        I  F++  PF LSG  HGG
Sbjct: 260 YFDEKEQEDRKTDFEMHHNINRFVFETPFTLSGKKHGG 297


>gnl|CDD|119331 cd02742, GH20_hexosaminidase, Beta-N-acetylhexosaminidases of
           glycosyl hydrolase family 20 (GH20) catalyze the removal
           of beta-1,4-linked N-acetyl-D-hexosamine residues from
           the non-reducing ends of N-acetyl-beta-D-hexosaminides
           including N-acetylglucosides and N-acetylgalactosides.
           These enzymes are broadly distributed in microorganisms,
           plants and animals, and play roles in various key
           physiological and pathological processes. These
           processes include cell structural integrity, energy
           storage, cellular signaling, fertilization, pathogen
           defense, viral penetration, the development of
           carcinomas, inflammatory events and lysosomal storage
           disorders. The GH20 enzymes include the eukaryotic
           beta-N-acetylhexosaminidases A and B, the bacterial
           chitobiases, dispersin B, and lacto-N-biosidase.  The
           GH20 hexosaminidases are thought to act via a catalytic
           mechanism in which the catalytic nucleophile is not
           provided by the solvent or the enzyme, but by the
           substrate itself.
          Length = 303

 Score = 28.9 bits (65), Expect = 9.4
 Identities = 24/90 (26%), Positives = 33/90 (36%), Gaps = 22/90 (24%)

Query: 374 VKSYPHLVRAETIGKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMH--------G 425
           VKS+P L+      +   G  L  V    D   P G   +  +F  +A +         G
Sbjct: 102 VKSFPKLL-----TECYAGLKLRDVFDPLDPTLPKGYDFLDDLFGEIAELFPDRYLHIGG 156

Query: 426 DETVGY-----ALMVFLIQ---YLVLKDGK 447
           DE   +      LM   IQ    +V K GK
Sbjct: 157 DE-AHFKQDRKHLMSQFIQRVLDIVKKKGK 185


>gnl|CDD|151989 pfam11553, DUF3231, Protein of unknown function (DUF3231).  This
           bacterial family of proteins has no known function.
          Length = 166

 Score = 28.4 bits (64), Expect = 9.6
 Identities = 17/85 (20%), Positives = 30/85 (35%), Gaps = 9/85 (10%)

Query: 387 GKSVQGRNLWAVEITHDVDNPDGRTLMKPMFKYVANMHGDETVGYALMVFLIQYLVLKDG 446
               + R L A E+ H   N    +L K +  Y   +  DE +    +   +    L + 
Sbjct: 6   NILGKKRPLHAGEVAHLWSNYQTNSLGKCLLTYFLQVAEDEEI-KKYLQRALD---LSNK 61

Query: 447 KDDRITQLLNSTDIYIVPSINPDGF 471
               ++ LL    + +     P GF
Sbjct: 62  HIKILSSLLKEEGLPL-----PSGF 81


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.137    0.414 

Gapped
Lambda     K      H
   0.267   0.0587    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,562,161
Number of extensions: 3623302
Number of successful extensions: 3361
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3200
Number of HSP's successfully gapped: 149
Length of query: 710
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 606
Effective length of database: 6,324,786
Effective search space: 3832820316
Effective search space used: 3832820316
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.4 bits)
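
Note on the length statistics above: the effective lengths and the
effective search space in this footer appear to follow from the raw
lengths and the length adjustment by simple arithmetic. The sketch below
is a minimal check of that relationship, assuming the adjustment is
subtracted once from the query and once per database sequence. The
variable names are illustrative and are not part of the RPS-BLAST output.

```python
# Values copied from the statistics footer of this report.
query_len = 710              # Length of query
db_letters = 10_937_602      # Length of database
db_seqs = 44_354             # Number of Sequences
length_adjustment = 104      # Length adjustment

# Assumption: the adjustment is removed once from the query and once
# from every database sequence.
eff_query = query_len - length_adjustment
eff_db = db_letters - db_seqs * length_adjustment

# Effective search space as the product of the two effective lengths.
search_space = eff_query * eff_db

print("Effective length of query:   ", eff_query)
print("Effective length of database:", eff_db)
print("Effective search space:      ", search_space)
```

The three printed values can be compared directly with the "Effective
length of query", "Effective length of database" and "Effective search
space" lines of the footer.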