HHsearch alignment for GI: 254780747 and conserved domain: TIGR00706
>TIGR00706 SppA_dom signal peptide peptidase SppA, 36K type; InterPro: IPR004635 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type.
Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate; these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belongs to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. This group of sequences represents both long and short forms of the bacterial SppA and homologs found in the archaea and plants. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signaling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion; GO: 0008233 peptidase activity, 0006508 proteolysis.
Probab=100.00 E-value=0 Score=420.79 Aligned_cols=206 Identities=34% Similarity=0.623 Sum_probs=201.0
Q ss_pred EEEEEEEEEEE--------------CCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHC--CCCCEE
Q ss_conf 28999976662--------------38699999999986189987999975888888899999999999841--478679
Q gi|254780747|r 37 HVARIAIRGQI--------------EDSQELIERIERISRDDSATALIVSLSSPGGSAYAGEAIFRAIQKVK--NRKPVI 100 (293)
Q Consensus 37 ~i~~i~i~G~I--------------~~~~~l~~~l~~a~~d~~ik~ivL~i~SpGG~~~~~~~i~~ai~~~k--~~kpvv 100 (293)
T Consensus 1 ~IA~~~v~G~I~~~~~~~~~~~~Dg~~~~~~~k~~~~~~~~~~~ka~~l~i~SPGG~V~~S~Eiy~~l~~~~k~~kkPVv 80 (224)
T TIGR00706 1 KIASLEVTGAIASDAALSILLFSDGVSPEDVLKKIKRIKDDKSIKALVLRIDSPGGTVVASEEIYEKLKKLKKEAKKPVV 80 (224)
T ss_pred CEEEEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEE
T ss_conf 93588841266303442101256899756799998877408970069998637999752268999999863453088589
Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCHHHHHHHHH
Q ss_conf 96033233223210001110001301353455565302102456777420422553155211234666789999887777
Q gi|254780747|r 101 TEVHEMAASAGYLISCASNIIVAAETSLVGSIGVLFQYPYVKPFLDKLGVSIKSVKSSPMKAEPSPFSEVNPKAVQMMQD 180 (293)
Q Consensus 101 a~~~~~~~S~~Y~iAs~ad~I~a~p~s~vGsiGv~~~~~~~~~ll~k~gi~~~~~~~g~~K~~~~p~~~~s~e~~~~~~~ 180 (293)
T Consensus 81 ~~~g~~aaSGGYYia~aa~~I~A~~~t~tGSIGVIl~~~n~~~L~~k~GI~~~~iK~G~yKd~~~~~R~lt~eE~~~lQ~ 160 (224)
T TIGR00706 81 ASMGGVAASGGYYIAMAADEIVANPGTITGSIGVILQGANVEKLLEKLGIEFEAIKSGEYKDIGSPTRELTPEERKILQS 160 (224)
T ss_pred EEECCCCHHHHHHHHHCCCEEEECCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHH
T ss_conf 98368322679999813882463477420203755203579999986491565665166567898757762999999999
Q ss_pred HHHHHHHHHHHHHHHCCC--CCHHHHHHHHCCCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHCC
Q ss_conf 766666778999985149--99889988734982378899877980623898999999997418
Q gi|254780747|r 181 VVDSSYHWFVRLVSESRN--IPYDKTLVLSDGRIWTGAEAKKVGLIDVVGGQEEVWQSLYALGV 242 (293)
Q Consensus 181 ~l~~~~~~f~~~Va~~R~--~~~~~~~~~~~g~~~~~~~A~~~GLvD~ig~~~~a~~~l~~~~~ 242 (293)
T Consensus 161 ~v~~~Y~~F~~~V~~~R~nkl~~~~vK~~AdGRvf~GrqA~~l~LVD~lG~~d~A~~~l~~L~g 224 (224)
T TIGR00706 161 LVNESYEQFVQVVAKGRNNKLSVEDVKKFADGRVFTGRQALKLRLVDKLGTLDDALKWLAKLAG 224 (224)
T ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCEECHHHHHHCCCEECCCCHHHHHHHHHHHCC
T ss_conf 8888875789999984167789788765206860104334311460012898999999997449