HHsearch alignment for GI: peg_472 and conserved domain: TIGR00763

>TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type.
Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate; these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix. In yeast, Pim1 is located in the mitochondrial matrix, is required for mitochondrial function, and is constitutively expressed, with expression increasing after thermal stress, suggesting that Pim1 may play a role in the heat shock response.; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis.
Probab=99.29  E-value=5.3e-12  Score=108.67  Aligned_cols=154  Identities=23%  Similarity=0.402  Sum_probs=106.5

Q ss_pred             CE-EEHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCC--HHHH-HHHHH
Q ss_conf             70-62010787988889999999996146877788665878999779999779877822245332233--4555-66555
Q 537021.9.peg.4   21 AQ-SYMLSGTRGIGKTTTARIIARSLNYKTAHIDVPTVEFEGFGEHCQAIIRGNHVDVVELDAASHTS--IDDV-REIID   96 (369)
Q Consensus        21 ~h-a~lf~G~~G~GK~~~a~~~A~~l~c~~~~~~~~~~~~c~~c~~c~~i~~~~~~d~~e~~~~s~~~--id~i-r~l~~   96 (369)
T Consensus       449 GpqIlClvGPPGVGKTSlg~SIA~ALnRkFv-----------------R~SlGG~~DeAEIrG--HRRTYvGAMPGriiQ  509 (941)
T TIGR00763       449 GPQILCLVGPPGVGKTSLGKSIAKALNRKFV-----------------RFSLGGVRDEAEIRG--HRRTYVGAMPGRIIQ  509 (941)
T ss_pred             CCEEEEEECCCCCCHHHHHHHHHHHHCCEEE-----------------EEEECCCEEHHHCCC--CCCCCCCCCHHHHHH
T ss_conf             8767872072695422278999999688049-----------------995267220311278--643203467257899


Q ss_pred             HHHHHHHCCCCCEEEEECHHHCC--CCH----HHHHHHHHHHCCCC----------------CC--EEEEECCCCCCCHH
Q ss_conf             44565420465237751156648--001----67899999721221----------------11--46650675433035
Q 537021.9.peg.4   97 QIYYKPISARFRVYIMDEVQMLS--TAA----FNGLLKTLEEPPPH----------------VK--FIFATTEIRKIPIT  152 (369)
Q Consensus        97 ~~~~~p~~~~~kv~iid~a~~m~--~~a----~NaLLK~lEEPp~~----------------~~--fil~t~~~~~ll~T  152 (369)
T Consensus       510 ~lk~~~t~NP--l~LlDEIDK~~~~~~~~GDPaSALLEvLD-PEQN~~F~DHYldvp~DLS~V~CyFi~TAN~~d~IP~P  586 (941)
T TIGR00763       510 GLKKAKTKNP--LILLDEIDKIGLKSSFRGDPASALLEVLD-PEQNNAFSDHYLDVPFDLSKVLCYFIATANSIDTIPRP  586 (941)
T ss_pred             HHHHCCCCCC--EEEEEEEEEECCCCCCCCCHHHHHHHHCC-HHHCCCCCCCCCCCCCCHHHHHHHEEECCCCCCCCCCC
T ss_conf             9876041588--06862022001678865563788864128-64360425530023400420021000244757677722


Q ss_pred             HHHHHHHHHCCCCCCHH---H-HHHHH-HH-----HHHCCCCCCHHHHHHHHHC
Q ss_conf             67543332102454001---3-56787-64-----3101345625664456531
Q 537021.9.peg.4  153 VLSRCQRFDLHRISIGD---L-IELFT-KI-----LQEESIEFDPEAVAMIARA  196 (369)
Q Consensus       153 I~SRcq~~~f~~l~~~~---i-~~~L~-~i-----~~~E~i~~d~~~l~~ia~~  196 (369)
T Consensus       587 LLDRMEvI~lsGY~~~EK~~IA~~yLiP~~~~~~GL~~~~l~~~d~al~~lI~~  640 (941)
T TIGR00763       587 LLDRMEVIELSGYTEEEKLEIAKKYLIPKALEDHGLKPDELKISDEALLLLIKY  640 (941)
T ss_pred             CCCCEEEEECCCCCHHHHHHHHHHCCHHHHHHHHCCCCCCEEECHHHHHHHHHH
T ss_conf             137402452388876789999985471367987088813221268999999987
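
The statistics line of the hit above (Probab=..., E-value=..., Score=...) is a sequence of whitespace-separated Key=value pairs, so it can be read programmatically. The following is a minimal sketch in Python, not part of the HHsearch output itself; the function names parse_hit_stats and _to_number are hypothetical, and treating the percentage in Identities as a fraction is an assumption made here for convenience.

```python
import re


def _to_number(raw):
    """Convert one value token to a number (hypothetical helper)."""
    if raw.endswith("%"):
        # Assumption: report percentages such as "23%" as a fraction (0.23).
        return float(raw[:-1]) / 100.0
    try:
        return int(raw)
    except ValueError:
        return float(raw)


def parse_hit_stats(line):
    """Parse the Key=value pairs of an HHsearch hit statistics line into a dict."""
    # Keys may contain a hyphen (e.g. "E-value"), so allow it in the key pattern.
    pairs = re.findall(r"([\w-]+)=(\S+)", line)
    return {key: _to_number(raw) for key, raw in pairs}


# The statistics line copied from the hit above.
line = ("Probab=99.29  E-value=5.3e-12  Score=108.67  Aligned_cols=154  "
        "Identities=23%  Similarity=0.402  Sum_probs=106.5")
stats = parse_hit_stats(line)
```

With this sketch, stats["E-value"] holds the E-value as a float and stats["Aligned_cols"] holds the number of aligned columns as an integer, which makes it straightforward to filter hits by a probability or E-value threshold.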