HHsearch alignment for GI: 254780166 and conserved domain: TIGR01249
>TIGR01249 pro_imino_pep_1 proline iminopeptidase; InterPro: IPR005944 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm.
Probab=99.96 E-value=2.1e-30 Score=199.82 Aligned_cols=232 Identities=21% Similarity=0.318 Sum_probs=159.1
Q ss_pred EEEEECCCCEEEEEEECCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCC
Q ss_conf 79993499499999964899878999987888800122179999999868989999647655422222222222212222
Q gi|254780166|r 6 KFFRSWRKYQFAFYDVGDKDAPTILLIHGLASSVQTNWLFSGWIQLLCDQGFRVIAFDNLGHGKSDKSYIENDYRLVFMA 85 (261)
Q Consensus 6 ~~~~~~dG~~l~y~~~g~~~~~~vv~iHG~~~~~~~~~~~~~~~~~l~~~g~~vi~~D~~G~G~S~~~~~~~~~s~~~~~ 85 (261)
T Consensus 7 G~L~V~d~H~LYye~~GnP~G~PV~~lHGGPGsGt~~~----~r~fFdpe~~rIvL~DQRGcGkS~p~a~~~eNtTWdLV 82 (310)
T TIGR01249 7 GYLKVSDNHQLYYEQSGNPDGKPVVFLHGGPGSGTDPE----CRRFFDPETYRIVLLDQRGCGKSTPHACLEENTTWDLV 82 (310)
T ss_pred CCEEECCEEEEEEECCCCCCCCEEEEEECCCCCCCCCC----CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCHHHH
T ss_conf 44230661354221067989954899756878998834----46453766358999830788898624332247705667
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEECC----------------CCCCCCCCHHHHHHHHHH
Q ss_conf 2222222222222222234434443101333210235652389628----------------856331103569999998
Q gi|254780166|r 86 ADAVSLLEHLGISKVHVMGYSMGARIACSMVLFYPSYVRSVILGGV----------------GSVLYDSDVVDWQSLIDS 149 (261)
Q Consensus 86 ~di~~~i~~l~~~~~~liGhS~Gg~ia~~~a~~~p~~v~~lvl~~~----------------~~~~~~~~~~~~~~~~~~ 149 (261)
T Consensus 83 ~DiEkLR~~L~I~~W~vFGGSWGStLALaYAq~HP~~v~~lvLRgiFL~R~~e~~w~~~~G~~~~~YP---~~w~~F~d~ 159 (310)
T TIGR01249 83 ADIEKLREKLGIKKWLVFGGSWGSTLALAYAQTHPEKVTGLVLRGIFLLREKELSWFYEGGLASMIYP---DAWQRFVDS 159 (310)
T ss_pred HHHHHHHHHCCCCCEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCC---HHHHHHHCC
T ss_conf 43999998628971488538778999999860162465355655676328667899972687023472---556654105
Q ss_pred HHCCCCCHH------------------HHHHHHHHCCCCC---C--------CCCCCHHHHHH------HHHHHC-----
Q ss_conf 631111000------------------0111000000003---4--------76654135677------765410-----
Q gi|254780166|r 150 FLLPSIDEV------------------QNPLGKKFRKFAD---L--------DPGNDLKALAS------CLSMIR----- 189 (261)
Q Consensus 150 ~~~~~~~~~------------------~~~~~~~~~~~~~---~--------~~~~~~~~~~~------~~~~~~----- 189 (261)
T Consensus 160 IP~~~r~sY~~lv~ayh~~l~~~De~~~~~aAkAW~~WE~~t~~L~~~~~~~~~aed~~~~la~ArlEnHYfVNkgFl~~ 239 (310)
T TIGR01249 160 IPENERNSYEQLVNAYHDRLQSEDEETKLAAAKAWVDWESATTLLRPENEIVSTAEDAKFSLALARLENHYFVNKGFLDS 239 (310)
T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHH
T ss_conf 87401365789999998740575379999998646667635655134714341114778999998775102204640534
Q ss_pred CCCCHHHHHCC-CCCEEEEEECCCCCCCCH--HHHHHHCCCCEEEEECCCCCCCCCCCH
Q ss_conf 12200121003-576069984788878807--999996799799998888738434848
Q gi|254780166|r 190 KPFCQDDLYRI-DVPVLIAVGSQDDLAGSP--QELMSFIPSSQYLNICRRDHLLAVGDK 245 (261)
Q Consensus 190 ~~~~~~~l~~i-~~P~l~i~G~~D~~~~~~--~~l~~~~p~~~~~~i~~~gH~~~~e~p 245 (261)
T Consensus 240 e~~lL~ni~~i~~i~~~iv~GRyDl~cPl~~awaL~kafPea~L~v~~~AGHsa-~dp~ 297 (310)
T TIGR01249 240 ENFLLDNISKIRNIPTVIVHGRYDLICPLQSAWALHKAFPEAELKVVNNAGHSA-FDPN 297 (310)
T ss_pred HHHHHHHHHHHCCCCEEEEECCEEHCCHHCCHHHHHHCCCCCEEEEECCCCCCC-CCHH
T ss_conf 688886677640687379844600002003544675218551466745788665-7654