HHsearch alignment for GI: 254781012 and conserved domain: TIGR01975

>TIGR01975 isoAsp_dipep beta-aspartyl peptidase; InterPro: IPR010229 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys, and at least one other residue is required for catalysis, which may play an electrophilic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases. Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.
Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate; these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of proteins includes metallopeptidases belonging to the MEROPS peptidase family M38 (clan MJ, beta-aspartyl dipeptidase family). This entry includes the beta-aspartyl dipeptidase from Escherichia coli (EC 3.4.19.5, IadA), which degrades isoaspartyl dipeptides and may unblock degradation of proteins that cannot be repaired. This entry also describes closely related proteins from other species (e.g. Clostridium perfringens, Thermoanaerobacter tengcongensis) that may have an equivalent function. This family shows homology to dihydroorotases. The L-isoaspartyl derivative of Asp arises non-enzymatically over time as a form of protein damage. In this isomerisation, the connectivity of the polypeptide changes to pass through the beta-carboxyl of the side chain. Much, but not all, of this damage can be repaired by protein-L-isoaspartate (D-aspartate) O-methyltransferase.
Probab=99.01  E-value=8.7e-09  Score=73.77  Aligned_cols=295  Identities=15%  Similarity=0.149  Sum_probs=167.1

Q ss_pred             EEEECCCCEEEEECC----CHHHHHHH----H-HHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEE
Q ss_conf             558646642375088----17889874----9-87418873999838951489898999999999998386884699501
Q gi|254781012|r    3 KISLRVPDDWHLHLR----DGEILKTV----L-RDTAKNFRRALVMPNIDPPIITVDDACAYRQRILNALPPEYDFSPLM   73 (349)
Q Consensus         3 ~l~lp~~~D~HvH~R----eg~~~~~t----~-aa~aGG~TtV~~MPNT~Ppi~t~e~~~~~~~r~~~~~~~~~~~~~~~   73 (349)
T Consensus        52 ~i~vPGfID~HVHiiGGGGEgG~~TRTPE~~Ls~~t~~GvTtvVG~lGTDgi~R~m~~L~AK-ArAL~--eEGvs~y~lt  128 (391)
T TIGR01975        52 MIAVPGFIDQHVHIIGGGGEGGPKTRTPELTLSDITKGGVTTVVGLLGTDGITRDMESLLAK-ARALE--EEGVSCYVLT  128 (391)
T ss_pred             CEECCCCCCCCEEEEECCCCCCCCCCCHHHECCCEEECCCEEEEEEEECCCCCCCCHHHHHH-HHHHH--HCCEEEEEEE
T ss_conf             08738741474578405768868888432230120112624898886278622584678999-98652--0782699963


Q ss_pred             EEEECCCC--CHHHHHHHHHCCCEEEHHHCCCCCEECCCCC--CCHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCCHHH
Q ss_conf             34527899--9889987440687210221045510025654--1006889999999987398222025755555430014
Q gi|254781012|r   74 TIYLTETT--DPDDVEKGFTSQLVQAIKLYFAGSTTNSHHG--IRNIDRVMPVLERMETIGMPLCIHGEILNQDIDIFDR  149 (349)
Q Consensus        74 ~~~lt~~~--~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~--v~~~~~~~~~le~~~~~~~~i~~H~E~~~~~~~~~~~  149 (349)
T Consensus       129 GaY~vP~~TlTgsv~sDl~LIDkvIGvG-----eiAiSDHRSaqPt~~~L~-~~aAeARVGGLLgGK~Giv~~H~Gds~~  202 (391)
T TIGR01975       129 GAYEVPVRTLTGSVKSDLLLIDKVIGVG-----EIAISDHRSAQPTVEELT-NLAAEARVGGLLGGKPGIVNLHVGDSKR  202 (391)
T ss_pred             CCEECCCCCCCCCEEEEEEEEEEEEEEE-----EEEECCCCCCCCCHHHHH-HHHHHHCCCCCCCCCCCEEEEEECCCHH
T ss_conf             3313552001511243267883355424-----572214778874489999-9977511241116887568996369867


Q ss_pred             HHHHHHHHHHHHHH-HHCCCE---EEEECCCH--HHHHHHHHHCC--CCCEEEECCCCCCCHHHHHCCCCCCCEEECCCC
Q ss_conf             67654578999996-206953---99705785--89999987057--332023023100397886137888311321346
Q gi|254781012|r  150 ELMFIDKILDPLRN-KLPNLK---IILEHITT--SNGIDYVNNAT--NIAGSITVHHLIINRNAIFHDGLNPHYYCLPIP  221 (349)
Q Consensus       150 E~~~~~~~l~~~~a-~~~~~~---iHi~HiST--~~sv~lir~ak--~vtaEVTPHHL~lt~~d~~~~~~~~~~k~nPPl  221 (349)
T Consensus       203 ~L~~i---~~~v~~~dvPi~q~lPTH~nR~~~LFE~g~~fa~~GG~iDlTss~~p~~~~ege-----------------v  262 (391)
T TIGR01975       203 KLEPI---EELVEETDVPITQFLPTHINRNRELFEAGLEFAKKGGTIDLTSSIDPQFRKEGE-----------------V  262 (391)
T ss_pred             HHHHH---HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCC-----------------C
T ss_conf             77799---999850588700255776476756899999999739808760278887553554-----------------3


Q ss_pred             CCHHHHHHHHHHHHCCCCEE--EEECCC---CCCCHHHHHCCCCCCCCCCHHHHHHHHHHHH-HCCCCHHHHHHHHHHCC
Q ss_conf             99999999999997699418--971388---8653133215457754454367999999997-20699999999982431
Q gi|254781012|r  222 KREKHRLSLRKAALSGNPRF--FLGTDS---APHWDSSKESSCGCAGIYTARNALNCLAQIF-EEENKLENLESFVSING  295 (349)
Q Consensus       222 R~~~dr~aL~~ai~~G~i~d--~I~TDH---APH~~eeK~~~~~~~g~~~~~~~~~~~~~~~-~~~~~L~~l~~~~s~np  295 (349)
T Consensus       263 ~---p~eGlk~~l~~gvPle~vT~sSDgnGS~P~Fde~g~l--~~~~v~~~~sLf~~v~~av~~~~~pL~~al~v~T~nv  337 (391)
T TIGR01975       263 K---PAEGLKKLLEAGVPLEKVTFSSDGNGSLPLFDENGEL--KGLGVGSVESLFEEVREAVKEGDVPLEKALRVITSNV  337 (391)
T ss_pred             C---HHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCE--EEEEEECHHHHHHHHHHHHHHCCCCHHHCCEEECCCC
T ss_conf             7---6789999996389752356423578874637678746--3357604166789999998514787657646440364


Q ss_pred             HHHCCCCCC--------CCEEEEEECCCCEEECHHHCCCCCCEEECC
Q ss_conf             788188988--------852899947971784711226777714657
Q gi|254781012|r  296 ATWYGIPVN--------TRKISLKRREQPIIFDEKITTSTGSITIFN  334 (349)
Q Consensus       296 a~~~gl~~~--------~~~i~l~d~~~~~~v~~~~~~sk~~~tpF~  334 (349)
T Consensus       338 A~~L~L~P~KG~I~~g~DADlv~l~~~~-L~I--~~V~ArG~~mv~~  381 (391)
T TIGR01975       338 AKVLKLSPKKGEISPGKDADLVLLDKDS-LDI--HSVIARGKLMVKE  381 (391)
T ss_pred             CCCCCCCCCCCCCCCCCCCCEEEECCCC-CCE--EEEEECCCEEECC
T ss_conf             5541367888500789972279865264-844--1577304100017