Query psy8663
Match_columns 161
No_of_seqs 192 out of 1146
Neff 6.2
Searched_HMMs 46136
Date Sat Aug 17 00:35:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8663.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8663hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3627|consensus 99.0 6.9E-10 1.5E-14 89.7 5.6 48 114-161 10-59 (256)
2 COG5640 Secreted trypsin-like 98.5 4.7E-08 1E-12 85.0 1.1 47 114-160 30-81 (413)
3 PF03761 DUF316: Domain of unk 97.8 2.8E-05 6.1E-10 64.4 4.9 54 107-160 30-89 (282)
4 COG3591 V8-like Glu-specific e 94.5 0.048 1E-06 45.8 4.0 35 126-160 47-84 (251)
5 TIGR02037 degP_htrA_DO peripla 27.9 44 0.00095 29.5 2.2 21 140-160 58-79 (428)
6 TIGR02038 protease_degS peripl 17.0 1.3E+02 0.0027 26.0 2.8 20 141-160 79-99 (351)
7 PF10459 Peptidase_S46: Peptid 11.5 1.5E+02 0.0032 28.6 1.8 19 142-160 49-68 (698)
8 PRK10942 serine endoprotease; 10.7 2.3E+02 0.0049 25.7 2.7 20 141-160 112-133 (473)
9 PRK10139 serine endoprotease; 10.5 2E+02 0.0044 25.9 2.3 21 140-160 90-112 (455)
10 cd07620 BAR_SH3BP1 The Bin/Amp 8.1 41 0.00088 28.5 -2.9 20 2-21 105-124 (257)
No 1
>KOG3627|consensus
Probab=98.98 E-value=6.9e-10 Score=89.73 Aligned_cols=48 Identities=46% Similarity=0.838 Sum_probs=44.2
Q ss_pred CCceeCCEEcCCCCCCeEEEEEeCC--eEeeEEEEeeCCeeeecccccCC
Q psy8663 114 KTRIVGGQVTYVHQYPWMALLMYKK--RFYCGATLINNLYVLTAAHCVHQ 161 (161)
Q Consensus 114 ~~rIvgG~~a~~ge~PW~vsL~~~~--~~~CGGsLIs~~~VLTAAHC~~~ 161 (161)
..||+||..+..+++||++++.... .++|||+||+++||||||||+.+
T Consensus 10 ~~~i~~g~~~~~~~~Pw~~~l~~~~~~~~~Cggsli~~~~vltaaHC~~~ 59 (256)
T KOG3627|consen 10 EGRIVGGTEAEPGSFPWQVSLQYGGNGRHLCGGSLISPRWVLTAAHCVKG 59 (256)
T ss_pred cCCEeCCccCCCCCCCCEEEEEECCCcceeeeeEEeeCCEEEEChhhCCC
Confidence 5799999999999999999999875 78999999999999999999863
No 2
>COG5640 Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.46 E-value=4.7e-08 Score=84.99 Aligned_cols=47 Identities=36% Similarity=0.747 Sum_probs=42.3
Q ss_pred CCceeCCEEcCCCCCCeEEEEEeC-----CeEeeEEEEeeCCeeeecccccC
Q psy8663 114 KTRIVGGQVTYVHQYPWMALLMYK-----KRFYCGATLINNLYVLTAAHCVH 160 (161)
Q Consensus 114 ~~rIvgG~~a~~ge~PW~vsL~~~-----~~~~CGGsLIs~~~VLTAAHC~~ 160 (161)
..||+||..+..++||.+|+|... ...+|||+++..|||||||||+.
T Consensus 30 s~rIigGs~Anag~~P~~VaLv~~isd~~s~tfCGgs~l~~RYvLTAAHC~~ 81 (413)
T COG5640 30 SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCAD 81 (413)
T ss_pred ceeEecCcccccccCchHHHHHhhcccccceeEeccceecceEEeeehhhcc
Confidence 579999999999999999998763 34689999999999999999985
No 3
>PF03761 DUF316: Domain of unknown function (DUF316) ; InterPro: IPR005514 This is a family of uncharacterised proteins from Caenorhabditis elegans.
Probab=97.81 E-value=2.8e-05 Score=64.41 Aligned_cols=54 Identities=28% Similarity=0.584 Sum_probs=42.9
Q ss_pred ccCCCC--CCCceeCCEEcCCCCCCeEEEEEeCC----eEeeEEEEeeCCeeeecccccC
Q psy8663 107 TCGAVN--KKTRIVGGQVTYVHQYPWMALLMYKK----RFYCGATLINNLYVLTAAHCVH 160 (161)
Q Consensus 107 ~CG~~~--~~~rIvgG~~a~~ge~PW~vsL~~~~----~~~CGGsLIs~~~VLTAAHC~~ 160 (161)
.||... ...++..|..+..++.||.+.+.... ..++.|+|||+|||||++||+.
T Consensus 30 ~CG~~~~~~~~~~~~g~~~~~~~~pW~v~v~~~~~~~~~~~~~gtlIS~RHiLtss~~~~ 89 (282)
T PF03761_consen 30 TCGKKKLPYPSKVFNGTPAESGEAPWAVSVYTKNHNEGNYFSTGTLISPRHILTSSHCVM 89 (282)
T ss_pred hcCCCCCCCcccccCCcccccCCCCCEEEEEeccCcccceecceEEeccCeEEEeeeEEE
Confidence 577432 34557888888889999999998743 3467999999999999999984
No 4
>COG3591 V8-like Glu-specific endopeptidase [Amino acid transport and metabolism]
Probab=94.51 E-value=0.048 Score=45.78 Aligned_cols=35 Identities=40% Similarity=0.684 Sum_probs=27.3
Q ss_pred CCCCeEEEEEe---CCeEeeEEEEeeCCeeeecccccC
Q psy8663 126 HQYPWMALLMY---KKRFYCGATLINNLYVLTAAHCVH 160 (161)
Q Consensus 126 ge~PW~vsL~~---~~~~~CGGsLIs~~~VLTAAHC~~ 160 (161)
..|||.+-..+ .+.+-|+++||+++-+|||+||+.
T Consensus 47 ~~~Py~av~~~~~~tG~~~~~~~lI~pntvLTa~Hc~~ 84 (251)
T COG3591 47 TQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCIY 84 (251)
T ss_pred CCCCcceeEEeecCCCcceeeEEEEcCceEEEeeeEEe
Confidence 46898766655 244557779999999999999985
No 5
>TIGR02037 degP_htrA_DO periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Probab=27.85 E-value=44 Score=29.49 Aligned_cols=21 Identities=29% Similarity=0.377 Sum_probs=17.5
Q ss_pred EeeEEEEeeCC-eeeecccccC
Q psy8663 140 FYCGATLINNL-YVLTAAHCVH 160 (161)
Q Consensus 140 ~~CGGsLIs~~-~VLTAAHC~~ 160 (161)
..+.|.+|+.. +|||.+|.+.
T Consensus 58 ~~GSGfii~~~G~IlTn~Hvv~ 79 (428)
T TIGR02037 58 GLGSGVIISADGYILTNNHVVD 79 (428)
T ss_pred ceeeEEEECCCCEEEEcHHHcC
Confidence 45889999875 9999999875
No 6
>TIGR02038 protease_degS periplasmic serine pepetdase DegS. This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E).
Probab=17.03 E-value=1.3e+02 Score=26.05 Aligned_cols=20 Identities=20% Similarity=0.463 Sum_probs=16.5
Q ss_pred eeEEEEeeCC-eeeecccccC
Q psy8663 141 YCGATLINNL-YVLTAAHCVH 160 (161)
Q Consensus 141 ~CGGsLIs~~-~VLTAAHC~~ 160 (161)
...|.+|++. +|||.+|.+.
T Consensus 79 ~GSG~vi~~~G~IlTn~HVV~ 99 (351)
T TIGR02038 79 LGSGVIMSKEGYILTNYHVIK 99 (351)
T ss_pred eEEEEEEeCCeEEEecccEeC
Confidence 4778899865 9999999874
No 7
>PF10459 Peptidase_S46: Peptidase S46; InterPro: IPR019500 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents S46 peptidases, where dipeptidyl-peptidase 7 (DPP-7) is the best-characterised member of this family. It is a serine peptidase that is located on the cell surface and is predicted to have two N-terminal transmembrane domains.
Probab=11.54 E-value=1.5e+02 Score=28.57 Aligned_cols=19 Identities=32% Similarity=0.628 Sum_probs=15.6
Q ss_pred eEEEEeeCC-eeeecccccC
Q psy8663 142 CGATLINNL-YVLTAAHCVH 160 (161)
Q Consensus 142 CGGsLIs~~-~VLTAAHC~~ 160 (161)
|.|++||+. -|||-.||..
T Consensus 49 CSgsfVS~~GLvlTNHHC~~ 68 (698)
T PF10459_consen 49 CSGSFVSPDGLVLTNHHCGY 68 (698)
T ss_pred eeEEEEcCCceEEecchhhh
Confidence 999999985 5677889974
No 8
>PRK10942 serine endoprotease; Provisional
Probab=10.73 E-value=2.3e+02 Score=25.71 Aligned_cols=20 Identities=30% Similarity=0.466 Sum_probs=16.7
Q ss_pred eeEEEEeeC--CeeeecccccC
Q psy8663 141 YCGATLINN--LYVLTAAHCVH 160 (161)
Q Consensus 141 ~CGGsLIs~--~~VLTAAHC~~ 160 (161)
...|.+|+. .+|||.+|.+.
T Consensus 112 ~GSG~ii~~~~G~IlTn~HVv~ 133 (473)
T PRK10942 112 LGSGVIIDADKGYVVTNNHVVD 133 (473)
T ss_pred eEEEEEEECCCCEEEeChhhcC
Confidence 578999984 69999999875
No 9
>PRK10139 serine endoprotease; Provisional
Probab=10.53 E-value=2e+02 Score=25.88 Aligned_cols=21 Identities=29% Similarity=0.404 Sum_probs=17.1
Q ss_pred EeeEEEEeeC--CeeeecccccC
Q psy8663 140 FYCGATLINN--LYVLTAAHCVH 160 (161)
Q Consensus 140 ~~CGGsLIs~--~~VLTAAHC~~ 160 (161)
....|.+|++ .+|||.+|.+.
T Consensus 90 ~~GSG~ii~~~~g~IlTn~HVv~ 112 (455)
T PRK10139 90 GLGSGVIIDAAKGYVLTNNHVIN 112 (455)
T ss_pred ceEEEEEEECCCCEEEeChHHhC
Confidence 3578999974 79999999875
No 10
>cd07620 BAR_SH3BP1 The Bin/Amphiphysin/Rvs (BAR) domain of SH3-domain Binding Protein 1. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. SH3-domain binding protein 1 (SH3BP1 or 3BP-1) is a Rac GTPase activating protein that inhibits Rac-mediated platelet-derived growth factor (PDGF)-induced membrane ruffling. SH3BP1 contains an N-terminal BAR domain followed by a GAP domain for Rho and Rac GTPases and a C-terminal proline-rich domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=8.12 E-value=41 Score=28.49 Aligned_cols=20 Identities=45% Similarity=0.964 Sum_probs=17.6
Q ss_pred ccccCCcCCccCcccccccc
Q psy8663 2 EQSVLQPLDDVQHDEVPEVL 21 (161)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~ 21 (161)
|+.||+||.+++..++|+++
T Consensus 105 e~~vl~PL~~L~e~dL~~I~ 124 (257)
T cd07620 105 EKDVLQPLNKLSEEDLPEIL 124 (257)
T ss_pred HHHHHHHHHHhHHhhHHHHH
Confidence 67899999999999998875
Done!