Query psy17192
Match_columns 74
No_of_seqs 114 out of 533
Neff 4.8
Searched_HMMs 46136
Date Sat Aug 17 00:12:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17192.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17192hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG3579 PepC Aminopeptidase C 100.0 3.3E-35 7.1E-40 225.7 6.6 71 4-74 374-444 (444)
2 PF03051 Peptidase_C1_2: Pepti 100.0 3.1E-34 6.7E-39 222.3 6.9 67 4-70 372-438 (438)
3 cd00585 Peptidase_C1B Peptidas 100.0 2.5E-32 5.5E-37 211.6 6.8 67 4-70 371-437 (437)
4 KOG4128|consensus 99.9 5.8E-28 1.3E-32 185.9 3.1 73 2-74 384-457 (457)
5 cd02619 Peptidase_C1 C1 Peptid 98.8 3.7E-09 8.1E-14 71.2 3.8 36 6-42 187-222 (223)
6 cd02248 Peptidase_C1A Peptidas 98.7 1.5E-08 3.2E-13 68.9 3.0 25 5-30 171-195 (210)
7 cd02620 Peptidase_C1A_Cathepsi 98.7 1.6E-08 3.5E-13 71.7 3.3 27 4-31 196-222 (236)
8 PF00112 Peptidase_C1: Papain 98.7 1.2E-08 2.6E-13 68.7 2.3 27 5-32 178-204 (219)
9 cd02698 Peptidase_C1A_Cathepsi 98.6 2.9E-08 6.3E-13 70.5 3.1 29 6-35 193-221 (239)
10 PTZ00203 cathepsin L protease; 98.6 4.6E-08 9.9E-13 74.2 3.8 27 4-31 298-324 (348)
11 cd02621 Peptidase_C1A_Cathepsi 98.5 5.8E-08 1.3E-12 68.8 3.2 25 6-31 202-226 (243)
12 smart00645 Pept_C1 Papain fami 98.5 9.1E-08 2E-12 65.3 3.4 26 5-31 133-158 (174)
13 PTZ00200 cysteine proteinase; 98.5 9E-08 1.9E-12 75.0 3.5 29 2-31 398-426 (448)
14 KOG1542|consensus 98.4 2E-07 4.3E-12 72.2 2.7 24 6-30 331-354 (372)
15 KOG1543|consensus 98.3 3.6E-07 7.8E-12 68.7 3.0 26 5-31 282-307 (325)
16 PTZ00021 falcipain-2; Provisio 98.2 8.2E-07 1.8E-11 70.6 3.3 23 9-31 446-468 (489)
17 COG4870 Cysteine protease [Pos 98.1 1E-06 2.2E-11 68.4 1.9 25 9-33 290-314 (372)
18 PTZ00364 dipeptidyl-peptidase 98.1 1.9E-06 4.1E-11 69.4 3.4 27 4-31 415-443 (548)
19 PTZ00049 cathepsin C-like prot 98.1 2.4E-06 5.1E-11 70.5 3.5 27 5-31 633-660 (693)
20 PTZ00462 Serine-repeat antigen 98.1 2.4E-06 5.2E-11 72.7 3.5 25 6-31 740-764 (1004)
21 KOG1544|consensus 96.6 0.0015 3.3E-08 51.4 2.7 25 5-29 417-442 (470)
22 PF04046 PSP: PSP; InterPro: 61.1 4.6 9.9E-05 23.3 0.9 20 47-66 4-23 (48)
23 smart00581 PSP proline-rich do 58.0 5.7 0.00012 23.5 1.0 21 46-66 7-27 (54)
24 PF01011 PQQ: PQQ enzyme repea 48.4 12 0.00025 19.4 1.2 18 1-19 15-32 (38)
25 PF05153 DUF706: Family of unk 43.1 6.8 0.00015 29.5 -0.4 23 34-56 162-185 (253)
26 PF10273 WGG: Pre-rRNA-process 41.9 4.7 0.0001 24.9 -1.2 32 13-49 20-51 (82)
27 PF06658 DUF1168: Protein of u 32.2 9.5 0.00021 26.3 -1.0 20 10-29 20-40 (142)
28 smart00695 DUSP Domain in ubiq 31.9 48 0.001 19.6 2.2 21 21-42 3-23 (86)
29 KOG4032|consensus 31.4 15 0.00032 26.6 -0.2 33 13-50 31-63 (184)
30 PF13586 DDE_Tnp_1_2: Transpos 30.8 14 0.0003 22.2 -0.3 13 8-20 45-57 (88)
31 PF06646 Mycoplasma_p37: High 26.4 32 0.0007 26.8 0.9 18 14-31 300-317 (331)
32 COG2513 PrpB PEP phosphonomuta 24.4 31 0.00068 26.4 0.5 25 48-72 6-32 (289)
33 KOG4003|consensus 24.1 31 0.00066 25.5 0.4 8 13-20 97-104 (223)
34 KOG0393|consensus 22.1 50 0.0011 23.8 1.1 26 34-71 38-63 (198)
35 TIGR03601 B_an_ocin probable h 22.1 57 0.0012 20.5 1.3 23 3-32 16-38 (79)
36 cd00044 CysPc Calpains, domain 21.3 1.8E+02 0.0039 21.4 4.0 29 9-37 253-307 (315)
37 PF06337 DUSP: DUSP domain; I 20.5 88 0.0019 18.7 1.9 22 24-45 2-23 (99)
38 PF10008 DUF2251: Uncharacteri 20.0 63 0.0014 21.1 1.2 13 19-31 17-29 (97)
No 1
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=100.00 E-value=3.3e-35 Score=225.74 Aligned_cols=71 Identities=46% Similarity=1.102 Sum_probs=69.5
Q ss_pred CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccCccCC
Q psy17192 4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGTLAQ 74 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg~l~~ 74 (74)
++|+|.|||||||||++.|.+|||+|||+||+||+|||||+|++||+++++++..|||+|+|||||||||.
T Consensus 374 ~~g~p~rwkVENSWG~d~G~~GyfvaSd~wmdEytyQIvV~k~~l~~e~~~a~~~epivL~pWDPMGALA~ 444 (444)
T COG3579 374 ETGNPLRWKVENSWGKDVGKKGYFVASDAWMDEYTYQIVVDKKFLPKEELAAYEEEPIVLAPWDPMGALAK 444 (444)
T ss_pred cCCCceeeEeecccccccCCCceEeehHhHhhhheeEEEEehhhCCHHHHHhhcCCCeecCCCCccccccC
Confidence 67899999999999999999999999999999999999999999999999999999999999999999985
No 2
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=100.00 E-value=3.1e-34 Score=222.27 Aligned_cols=67 Identities=52% Similarity=1.205 Sum_probs=61.0
Q ss_pred CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccC
Q psy17192 4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG 70 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg 70 (74)
++|+|++||||||||++.|++|||+||++||++|||+|||||++||++++++|++|||+||||||||
T Consensus 372 ~~g~p~~wkVeNSWG~~~g~kGy~~msd~wf~e~v~~vvV~K~~lp~~~~~~~~~~p~~l~~wdpmg 438 (438)
T PF03051_consen 372 EDGKPVRWKVENSWGTDNGDKGYFYMSDDWFDEYVYQVVVHKKYLPKELLKKLEKEPIVLPPWDPMG 438 (438)
T ss_dssp TTSSEEEEEEE-SBTTTSTBTTEEEEEHHHHHHHEEEEEEEGGGS-HHHHHHHHS--EEE-TTSCCC
T ss_pred cCCCeeEEEEEcCCCCCCCCCcEEEECHHHHHhhhheEEechhhCCHHHHHHhcCCCeECCCCCCCC
Confidence 6799999999999999999999999999999999999999999999999999999999999999998
No 3
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.97 E-value=2.5e-32 Score=211.60 Aligned_cols=67 Identities=61% Similarity=1.293 Sum_probs=65.2
Q ss_pred CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccC
Q psy17192 4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG 70 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg 70 (74)
++|+|.||+||||||+++|++|||+||++||++|||+|||||++||++++++|+++||+||||||||
T Consensus 371 ~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~wf~~y~~~ivV~kk~l~~~~~~~l~~~p~~L~~WDpmg 437 (437)
T cd00585 371 EDGKPVKWKVENSWGEKVGKKGYFVMSDDWFDEYVYQVVVDKKYLPEEVLDLLKQEPIVLPPWDPMG 437 (437)
T ss_pred CCCCcceEEEEcccCCCCCCCcceehhHHHHHhhheEEEeecccCCHHHHHHhcCCCEECCCCCCCC
Confidence 4588889999999999999999999999999999999999999999999999999999999999997
No 4
>KOG4128|consensus
Probab=99.94 E-value=5.8e-28 Score=185.87 Aligned_cols=73 Identities=53% Similarity=0.974 Sum_probs=70.3
Q ss_pred CCCCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhc-CCCeeeCCCCccCccCC
Q psy17192 2 DQETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFN-QEPTILPAWDPMGTLAQ 74 (74)
Q Consensus 2 d~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~-~~p~~l~~wDpmg~l~~ 74 (74)
|+.+|-+.+|+||||||++.|++||+.|+++||+||+|+||||.+++|++++.++. +|||+||+||||||||.
T Consensus 384 d~~~g~~~~~rVenswgkd~gkkg~~~mt~ewf~EY~feiVVd~k~vP~evl~~~~~~epi~LPaWDPMGALa~ 457 (457)
T KOG4128|consen 384 DPATGGLNEHRVENSWGKDLGKKGVNKMTAEWFREYAFEIVVDEKFVPSEVLHAARIHEPIELPAWDPMGALAN 457 (457)
T ss_pred CcccCCchhhhhhchhhhhccccchhhhhHHHHHhhheeEEeecccCcHHHHHHhhhcCCeeccccCchhhhcC
Confidence 67889999999999999999999999999999999999999999999999999997 89999999999999984
No 5
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=98.82 E-value=3.7e-09 Score=71.19 Aligned_cols=36 Identities=25% Similarity=0.396 Sum_probs=32.0
Q ss_pred CccceEEEecccCCCCCCceEEEEeHHHHhhhceeee
Q psy17192 6 EEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVV 42 (74)
Q Consensus 6 g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~iv 42 (74)
|+ .+|+|+||||+.+|++||+.|+.+-+.++++...
T Consensus 187 ~~-~~~i~~NSwG~~wg~~Gy~~i~~~~~~~~~~~~~ 222 (223)
T cd02619 187 GK-GAFIVKNSWGTDWGDNGYGRISYEDVYEMTFGAN 222 (223)
T ss_pred CC-CEEEEEeCCCCccccCCEEEEehhhhhhhhhccc
Confidence 55 5999999999999999999999999998887654
No 6
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=98.67 E-value=1.5e-08 Score=68.87 Aligned_cols=25 Identities=40% Similarity=0.795 Sum_probs=22.5
Q ss_pred CCccceEEEecccCCCCCCceEEEEe
Q psy17192 5 TEEPTKWRVENSWGEEQNHKGYILMT 30 (74)
Q Consensus 5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms 30 (74)
+|+ .+|+|+||||+++|++|||.|+
T Consensus 171 ~~~-~ywiv~NSWG~~WG~~Gy~~i~ 195 (210)
T cd02248 171 NGV-DYWIVKNSWGTSWGEKGYIRIA 195 (210)
T ss_pred CCc-eEEEEEcCCCCccccCcEEEEE
Confidence 354 5999999999999999999997
No 7
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=98.67 E-value=1.6e-08 Score=71.72 Aligned_cols=27 Identities=30% Similarity=0.667 Sum_probs=23.6
Q ss_pred CCCccceEEEecccCCCCCCceEEEEeH
Q psy17192 4 ETEEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
++|+ .+|+|+||||+++|++|||.++.
T Consensus 196 ~~g~-~YWivrNSWG~~WGe~Gy~ri~~ 222 (236)
T cd02620 196 ENGV-PYWLAANSWGTDWGENGYFRILR 222 (236)
T ss_pred cCCe-eEEEEEeCCCCCCCCCcEEEEEc
Confidence 3565 49999999999999999999974
No 8
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=98.66 E-value=1.2e-08 Score=68.71 Aligned_cols=27 Identities=30% Similarity=0.704 Sum_probs=22.7
Q ss_pred CCccceEEEecccCCCCCCceEEEEeHH
Q psy17192 5 TEEPTKWRVENSWGEEQNHKGYILMTSP 32 (74)
Q Consensus 5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~~ 32 (74)
.|+ .+|+|+||||+++|++|||.|+..
T Consensus 178 ~~~-~~wiv~NSWG~~WG~~Gy~~i~~~ 204 (219)
T PF00112_consen 178 NGK-GYWIVKNSWGTDWGDNGYFRISYD 204 (219)
T ss_dssp TTE-EEEEEE-SBTTTSTBTTEEEEESS
T ss_pred cce-eeEeeehhhCCccCCCeEEEEeeC
Confidence 345 599999999999999999999864
No 9
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=98.61 E-value=2.9e-08 Score=70.55 Aligned_cols=29 Identities=31% Similarity=0.693 Sum_probs=25.0
Q ss_pred CccceEEEecccCCCCCCceEEEEeHHHHh
Q psy17192 6 EEPTKWRVENSWGEEQNHKGYILMTSPWFK 35 (74)
Q Consensus 6 g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~ 35 (74)
|+ .+|+|+||||+++|++|||.|+..-+.
T Consensus 193 g~-~YWiikNSWG~~WGe~Gy~~i~rg~~~ 221 (239)
T cd02698 193 GV-EYWIVRNSWGEPWGERGWFRIVTSSYK 221 (239)
T ss_pred CC-EEEEEEcCCCcccCcCceEEEEccCCc
Confidence 66 499999999999999999999865543
No 10
>PTZ00203 cathepsin L protease; Provisional
Probab=98.59 E-value=4.6e-08 Score=74.23 Aligned_cols=27 Identities=44% Similarity=0.924 Sum_probs=23.5
Q ss_pred CCCccceEEEecccCCCCCCceEEEEeH
Q psy17192 4 ETEEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
++|.+ ||+|+||||+.||++|||.|..
T Consensus 298 ~~g~~-YWiikNSWG~~WGe~GY~ri~r 324 (348)
T PTZ00203 298 TGEVP-YWVIKNSWGEDWGEKGYVRVTM 324 (348)
T ss_pred CCCce-EEEEEcCCCCCcCcCceEEEEc
Confidence 45654 9999999999999999999964
No 11
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=98.55 E-value=5.8e-08 Score=68.80 Aligned_cols=25 Identities=40% Similarity=0.815 Sum_probs=22.4
Q ss_pred CccceEEEecccCCCCCCceEEEEeH
Q psy17192 6 EEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 6 g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
|. .+|+|+||||+.+|++|||.|..
T Consensus 202 g~-~YWiirNSWG~~WGe~Gy~~i~~ 226 (243)
T cd02621 202 GE-KYWIVKNSWGSSWGEKGYFKIRR 226 (243)
T ss_pred CC-cEEEEEcCCCCCCCcCCeEEEec
Confidence 55 59999999999999999999963
No 12
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=98.51 E-value=9.1e-08 Score=65.31 Aligned_cols=26 Identities=31% Similarity=0.755 Sum_probs=23.7
Q ss_pred CCccceEEEecccCCCCCCceEEEEeH
Q psy17192 5 TEEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
+|++ +|+|+||||+.+|++|||.|+.
T Consensus 133 ~g~~-yWii~NSwG~~WG~~G~~~i~~ 158 (174)
T smart00645 133 NGKD-YWIVKNSWGTDWGENGYFRIAR 158 (174)
T ss_pred CCee-EEEEECCCCCCcccCeEEEEEc
Confidence 6775 9999999999999999999974
No 13
>PTZ00200 cysteine proteinase; Provisional
Probab=98.50 E-value=9e-08 Score=75.04 Aligned_cols=29 Identities=31% Similarity=0.825 Sum_probs=24.5
Q ss_pred CCCCCccceEEEecccCCCCCCceEEEEeH
Q psy17192 2 DQETEEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 2 d~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
|.++|. .+|.|+||||++||++|||.|..
T Consensus 398 d~~~g~-~YWIIkNSWG~~WGe~GY~ri~r 426 (448)
T PTZ00200 398 DEKTKK-RYWIIKNSWGTDWGENGYMRLER 426 (448)
T ss_pred cCCCCC-ceEEEEcCCCCCcccCeeEEEEe
Confidence 345566 49999999999999999999974
No 14
>KOG1542|consensus
Probab=98.38 E-value=2e-07 Score=72.24 Aligned_cols=24 Identities=46% Similarity=0.955 Sum_probs=21.5
Q ss_pred CccceEEEecccCCCCCCceEEEEe
Q psy17192 6 EEPTKWRVENSWGEEQNHKGYILMT 30 (74)
Q Consensus 6 g~~~~wkVeNSWG~~~G~~Gy~~ms 30 (74)
.+| ||.|+||||+.||.+|||.+-
T Consensus 331 ~~P-YWIVKNSWG~~WGE~GY~~l~ 354 (372)
T KOG1542|consen 331 EKP-YWIVKNSWGTSWGEKGYYKLC 354 (372)
T ss_pred CCc-eEEEECCccccccccceEEEe
Confidence 466 999999999999999999873
No 15
>KOG1543|consensus
Probab=98.33 E-value=3.6e-07 Score=68.68 Aligned_cols=26 Identities=35% Similarity=0.749 Sum_probs=22.4
Q ss_pred CCccceEEEecccCCCCCCceEEEEeH
Q psy17192 5 TEEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
+|. .||.|+||||+.||++|||-|-.
T Consensus 282 ~~~-~YWivkNSWG~~WGe~Gy~ri~r 307 (325)
T KOG1543|consen 282 DGV-DYWIVKNSWGTDWGEKGYFRIAR 307 (325)
T ss_pred CCc-eeEEEEcCCCCCcccCceEEEec
Confidence 344 49999999999999999999853
No 16
>PTZ00021 falcipain-2; Provisional
Probab=98.24 E-value=8.2e-07 Score=70.57 Aligned_cols=23 Identities=35% Similarity=0.744 Sum_probs=21.4
Q ss_pred ceEEEecccCCCCCCceEEEEeH
Q psy17192 9 TKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 9 ~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
.+|+|+||||+.||++|||.|..
T Consensus 446 ~YWIVKNSWGt~WGE~GY~rI~r 468 (489)
T PTZ00021 446 YYYIIKNSWGESWGEKGFIRIET 468 (489)
T ss_pred CEEEEECCCCCCcccCeEEEEEc
Confidence 49999999999999999999974
No 17
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.14 E-value=1e-06 Score=68.43 Aligned_cols=25 Identities=24% Similarity=0.644 Sum_probs=22.6
Q ss_pred ceEEEecccCCCCCCceEEEEeHHH
Q psy17192 9 TKWRVENSWGEEQNHKGYILMTSPW 33 (74)
Q Consensus 9 ~~wkVeNSWG~~~G~~Gy~~ms~~w 33 (74)
.-|+|+||||+.+|.+|||+||..+
T Consensus 290 GAfiikNSWGt~wG~~GYfwisY~y 314 (372)
T COG4870 290 GAFIIKNSWGTNWGENGYFWISYYY 314 (372)
T ss_pred ceEEEECccccccccCceEEEEeee
Confidence 4699999999999999999999654
No 18
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=98.13 E-value=1.9e-06 Score=69.35 Aligned_cols=27 Identities=22% Similarity=0.430 Sum_probs=23.5
Q ss_pred CCCccceEEEecccCC--CCCCceEEEEeH
Q psy17192 4 ETEEPTKWRVENSWGE--EQNHKGYILMTS 31 (74)
Q Consensus 4 ~~g~~~~wkVeNSWG~--~~G~~Gy~~ms~ 31 (74)
++|. .||+|+||||+ ++|++|||.|..
T Consensus 415 e~G~-~YWIVKNSWGt~~~WGE~GYfRI~R 443 (548)
T PTZ00364 415 ENGG-DYWLVLDPWGSRRSWCDGGTRKIAR 443 (548)
T ss_pred CCCc-eEEEEECCCCCCCCcccCCeEEEEc
Confidence 3566 49999999999 999999999964
No 19
>PTZ00049 cathepsin C-like protein; Provisional
Probab=98.11 E-value=2.4e-06 Score=70.53 Aligned_cols=27 Identities=26% Similarity=0.604 Sum_probs=23.2
Q ss_pred CCcc-ceEEEecccCCCCCCceEEEEeH
Q psy17192 5 TEEP-TKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 5 ~g~~-~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
+|++ .||.|+||||+.||++|||.|..
T Consensus 633 nG~~~~YWIVRNSWGt~WGenGYfKI~R 660 (693)
T PTZ00049 633 NGKLYKYWIGRNSWGKNWGKEGYFKIIR 660 (693)
T ss_pred CCcccCEEEEECCCCCCcccCceEEEEc
Confidence 5642 59999999999999999999964
No 20
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=98.10 E-value=2.4e-06 Score=72.73 Aligned_cols=25 Identities=32% Similarity=0.740 Sum_probs=22.4
Q ss_pred CccceEEEecccCCCCCCceEEEEeH
Q psy17192 6 EEPTKWRVENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 6 g~~~~wkVeNSWG~~~G~~Gy~~ms~ 31 (74)
|+ .||.|+||||+.||++|||.|..
T Consensus 740 gk-~YWIVRNSWGt~WGEnGYFKI~r 764 (1004)
T PTZ00462 740 KK-SYWIVRNSWGKYWGDEGYFKVDM 764 (1004)
T ss_pred CC-ceEEEEcCCCCCcCCCeEEEEEe
Confidence 44 49999999999999999999974
No 21
>KOG1544|consensus
Probab=96.63 E-value=0.0015 Score=51.43 Aligned_cols=25 Identities=28% Similarity=0.696 Sum_probs=21.9
Q ss_pred CCc-cceEEEecccCCCCCCceEEEE
Q psy17192 5 TEE-PTKWRVENSWGEEQNHKGYILM 29 (74)
Q Consensus 5 ~g~-~~~wkVeNSWG~~~G~~Gy~~m 29 (74)
+|+ ..+|..-||||+.+|++|||.+
T Consensus 417 ~G~~~KyW~aANSWG~~WGE~GYFri 442 (470)
T KOG1544|consen 417 DGRTLKYWTAANSWGPAWGERGYFRI 442 (470)
T ss_pred CCCeeEEEEeecccccccccCceEEE
Confidence 454 4789999999999999999986
No 22
>PF04046 PSP: PSP; InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=61.14 E-value=4.6 Score=23.29 Aligned_cols=20 Identities=10% Similarity=0.360 Sum_probs=17.3
Q ss_pred CCCHHHHhhhcCCCeeeCCC
Q psy17192 47 YVPASVLDVFNQEPTILPAW 66 (74)
Q Consensus 47 ~l~~~~~~~l~~~p~~l~~w 66 (74)
.||++++++|.=.+-.+|||
T Consensus 4 ~lS~~LR~ALg~~~~~~PPw 23 (48)
T PF04046_consen 4 KLSDELREALGMQENDPPPW 23 (48)
T ss_pred ccCHHHHHHcCCCCCCCChH
Confidence 58999999998777778888
No 23
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=57.95 E-value=5.7 Score=23.46 Aligned_cols=21 Identities=14% Similarity=0.439 Sum_probs=18.0
Q ss_pred CCCCHHHHhhhcCCCeeeCCC
Q psy17192 46 KYVPASVLDVFNQEPTILPAW 66 (74)
Q Consensus 46 ~~l~~~~~~~l~~~p~~l~~w 66 (74)
-.||++++++|.=.+-.+|||
T Consensus 7 G~lS~~LR~ALG~~~~~pPPW 27 (54)
T smart00581 7 GRISDELREALGLPPGQPPPW 27 (54)
T ss_pred CcCCHHHHHHcCCCCCCCChH
Confidence 358999999998877788888
No 24
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=48.41 E-value=12 Score=19.35 Aligned_cols=18 Identities=17% Similarity=0.346 Sum_probs=14.2
Q ss_pred CCCCCCccceEEEecccCC
Q psy17192 1 MDQETEEPTKWRVENSWGE 19 (74)
Q Consensus 1 ~d~~~g~~~~wkVeNSWG~ 19 (74)
||.++|+. .|+.+++-..
T Consensus 15 lD~~TG~~-~W~~~~~~~~ 32 (38)
T PF01011_consen 15 LDAKTGKV-LWKFQTGPPV 32 (38)
T ss_dssp EETTTTSE-EEEEESSSGG
T ss_pred EECCCCCE-EEeeeCCCCC
Confidence 57889995 8999987544
No 25
>PF05153 DUF706: Family of unknown function (DUF706) ; InterPro: IPR007828 Inositol oxygenase (1.13.99.1 from EC) is involved in the biosynthesis of UDP-glucuronic acid (UDP-GlcA), providing nucleotide sugars for cell-wall polymers. It may be also involved in plant ascorbate biosynthesis [, ].; GO: 0005506 iron ion binding, 0050113 inositol oxygenase activity, 0019310 inositol catabolic process, 0055114 oxidation-reduction process, 0005737 cytoplasm; PDB: 2HUO_A 3BXD_A 2IBN_A.
Probab=43.08 E-value=6.8 Score=29.54 Aligned_cols=23 Identities=26% Similarity=0.595 Sum_probs=16.4
Q ss_pred Hhhhceeeee-cCCCCCHHHHhhh
Q psy17192 34 FKEYVFEVVV-DKKYVPASVLDVF 56 (74)
Q Consensus 34 F~eyv~~ivV-~K~~l~~~~~~~l 56 (74)
=+||+|+|+. |+..||++.+.++
T Consensus 162 HDEYlY~Vlk~n~~tLP~eaL~mI 185 (253)
T PF05153_consen 162 HDEYLYQVLKHNKSTLPEEALYMI 185 (253)
T ss_dssp HHHHHHHHHHHCT----HHHHHHH
T ss_pred chHHHHHHHHcccCccCHHHHHHH
Confidence 5899999998 9999999887765
No 26
>PF10273 WGG: Pre-rRNA-processing protein TSR2; InterPro: IPR019398 The pre-rRNA-processing protein TSR2 is required for 20S pre-rRNA processing []. This family contains a distinctive WGG motif.
Probab=41.93 E-value=4.7 Score=24.92 Aligned_cols=32 Identities=25% Similarity=0.548 Sum_probs=22.5
Q ss_pred EecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCC
Q psy17192 13 VENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVP 49 (74)
Q Consensus 13 VeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~ 49 (74)
|||+||-..+ .-..+||..++.+...+.+-+.
T Consensus 20 Ven~wGG~~s-----~~K~~~l~~~i~~~f~~~~~~~ 51 (82)
T PF10273_consen 20 VENGWGGPDS-----QEKADWLAEVIVDWFTENKDPD 51 (82)
T ss_pred HHhccCCccH-----HHHHHHHHHHHHHHHhcCCCCC
Confidence 7999996543 2345788888888877755544
No 27
>PF06658 DUF1168: Protein of unknown function (DUF1168); InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=32.22 E-value=9.5 Score=26.30 Aligned_cols=20 Identities=20% Similarity=0.107 Sum_probs=15.6
Q ss_pred eEEEecccCCCCC-CceEEEE
Q psy17192 10 KWRVENSWGEEQN-HKGYILM 29 (74)
Q Consensus 10 ~wkVeNSWG~~~G-~~Gy~~m 29 (74)
.=+|.|-||+..| ..|.|.+
T Consensus 20 pe~V~NV~GSSAGAGSGeFHv 40 (142)
T PF06658_consen 20 PEFVRNVQGSSAGAGSGEFHV 40 (142)
T ss_pred CeeeccccccccccCccHHHH
Confidence 4579999999877 4788765
No 28
>smart00695 DUSP Domain in ubiquitin-specific proteases.
Probab=31.85 E-value=48 Score=19.64 Aligned_cols=21 Identities=14% Similarity=0.566 Sum_probs=15.1
Q ss_pred CCCceEEEEeHHHHhhhceeee
Q psy17192 21 QNHKGYILMTSPWFKEYVFEVV 42 (74)
Q Consensus 21 ~G~~Gy~~ms~~wF~eyv~~iv 42 (74)
.|+. +|.+|.+||+++.--+.
T Consensus 3 ~g~~-~yiIs~~W~~~~~~~~~ 23 (86)
T smart00695 3 EGDT-WYLISTRWYRQWADFVE 23 (86)
T ss_pred CCCE-EEEEeHHHHHHHHHHhC
Confidence 3544 78999999998775333
No 29
>KOG4032|consensus
Probab=31.41 E-value=15 Score=26.57 Aligned_cols=33 Identities=30% Similarity=0.558 Sum_probs=21.9
Q ss_pred EecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCH
Q psy17192 13 VENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPA 50 (74)
Q Consensus 13 VeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~ 50 (74)
||||||...+. --..|+-.++++..+.-+-|..
T Consensus 31 VEng~GG~dsq-----ek~~w~~~~~~d~~~~~~dV~~ 63 (184)
T KOG4032|consen 31 VENGWGGRDSQ-----EKAKWITGVVVDYFTMNKDVKA 63 (184)
T ss_pred HHhccCCccHH-----HHHHHHHhhHhhhcccccchhH
Confidence 79999975442 2356888888877755444543
No 30
>PF13586 DDE_Tnp_1_2: Transposase DDE domain
Probab=30.84 E-value=14 Score=22.21 Aligned_cols=13 Identities=23% Similarity=0.511 Sum_probs=10.2
Q ss_pred cceEEEecccCCC
Q psy17192 8 PTKWRVENSWGEE 20 (74)
Q Consensus 8 ~~~wkVeNSWG~~ 20 (74)
+.+|.||+|.|--
T Consensus 45 ~~Rw~VEr~f~wl 57 (88)
T PF13586_consen 45 KRRWVVERTFAWL 57 (88)
T ss_pred ccceehhhhhHHH
Confidence 4799999998643
No 31
>PF06646 Mycoplasma_p37: High affinity transport system protein p37; InterPro: IPR010592 This family consists of several high affinity transport system protein p37 sequences, which are specific to Mycoplasma species. The p37 gene is part of an operon encoding two additional proteins, which are highly similar to components of the periplasmic binding-protein-dependent transport systems of Gram-negative bacteria. It has been suggested that p37 is part of a homologous, high-affinity transport system in Mycoplasma hyorhinis, a Gram-positive bacterium [].; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 3MYU_B 3EKI_A 3E79_A 3E78_A.
Probab=26.40 E-value=32 Score=26.85 Aligned_cols=18 Identities=28% Similarity=0.547 Sum_probs=11.2
Q ss_pred ecccCCCCCCceEEEEeH
Q psy17192 14 ENSWGEEQNHKGYILMTS 31 (74)
Q Consensus 14 eNSWG~~~G~~Gy~~ms~ 31 (74)
||.||...|+.||-.+++
T Consensus 300 ~ntYG~~~GyNgy~~i~~ 317 (331)
T PF06646_consen 300 ENTYGPYTGYNGYKKINN 317 (331)
T ss_dssp --SSGGGCT--EEEE-S-
T ss_pred ccccccccccccccccCc
Confidence 789999999999987765
No 32
>COG2513 PrpB PEP phosphonomutase and related enzymes [Carbohydrate transport and metabolism]
Probab=24.43 E-value=31 Score=26.40 Aligned_cols=25 Identities=32% Similarity=0.615 Sum_probs=19.0
Q ss_pred CCHHHHhhhcC-CCeeeC-CCCccCcc
Q psy17192 48 VPASVLDVFNQ-EPTILP-AWDPMGTL 72 (74)
Q Consensus 48 l~~~~~~~l~~-~p~~l~-~wDpmg~l 72 (74)
.....++++.+ .|+++| +|||+.++
T Consensus 6 ~~~~fR~l~~~~~~~~~pg~~d~~sA~ 32 (289)
T COG2513 6 PGAAFRALHASGDPLVLPGAWDAGSAL 32 (289)
T ss_pred HHHHHHHHHhCCCCEEecCCcCHHHHH
Confidence 34567788876 688888 99999865
No 33
>KOG4003|consensus
Probab=24.10 E-value=31 Score=25.52 Aligned_cols=8 Identities=50% Similarity=1.311 Sum_probs=7.4
Q ss_pred EecccCCC
Q psy17192 13 VENSWGEE 20 (74)
Q Consensus 13 VeNSWG~~ 20 (74)
||||||.+
T Consensus 97 v~ntwG~d 104 (223)
T KOG4003|consen 97 VKNTWGVD 104 (223)
T ss_pred hccCCCCC
Confidence 89999987
No 34
>KOG0393|consensus
Probab=22.09 E-value=50 Score=23.82 Aligned_cols=26 Identities=38% Similarity=0.753 Sum_probs=19.7
Q ss_pred HhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccCc
Q psy17192 34 FKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGT 71 (74)
Q Consensus 34 F~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg~ 71 (74)
|+.|.-+|.|+ +.++|.|-.||--|.
T Consensus 38 Fdnys~~v~V~------------dg~~v~L~LwDTAGq 63 (198)
T KOG0393|consen 38 FDNYSANVTVD------------DGKPVELGLWDTAGQ 63 (198)
T ss_pred EccceEEEEec------------CCCEEEEeeeecCCC
Confidence 57777777776 678899999987653
No 35
>TIGR03601 B_an_ocin probable heterocycle-containing bacteriocin, BA_2677 family. Numerous bacteria encode systems for producing bacteriocins by extensive modification of ribosomally produced precursors. These proteins are recognizable in part by proximity to the modification proteins, and in part by small size, with leader peptide-like N-terminal sequence followed by low-complexity sequence rich in Cys, Gly, and/or Ser. This protein family represents a probable member of the class, though previously unrecognized because it is not encoded adjacent to its modification proteins.
Probab=22.08 E-value=57 Score=20.53 Aligned_cols=23 Identities=17% Similarity=0.417 Sum_probs=16.5
Q ss_pred CCCCccceEEEecccCCCCCCceEEEEeHH
Q psy17192 3 QETEEPTKWRVENSWGEEQNHKGYILMTSP 32 (74)
Q Consensus 3 ~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~ 32 (74)
.+.|+.+||.-+|-. =||+|+++
T Consensus 16 y~~gnvvyWDqqnqy-------pyyyi~d~ 38 (79)
T TIGR03601 16 YQTGNVVYWDQQNQY-------PYYYIQDD 38 (79)
T ss_pred ccCCcEEEeccccCC-------cceehhhh
Confidence 377888999877753 57777664
No 36
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=21.30 E-value=1.8e+02 Score=21.36 Aligned_cols=29 Identities=24% Similarity=0.373 Sum_probs=20.2
Q ss_pred ceEEEecccCCC--------------------------CCCceEEEEeHHHHhhh
Q psy17192 9 TKWRVENSWGEE--------------------------QNHKGYILMTSPWFKEY 37 (74)
Q Consensus 9 ~~wkVeNSWG~~--------------------------~G~~Gy~~ms~~wF~ey 37 (74)
+.-+++|-||.. ..++|.|.|+-+=|-+|
T Consensus 253 ~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~~~df~~~ 307 (315)
T cd00044 253 RLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMSFEDFLRN 307 (315)
T ss_pred EEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEEhHHhhee
Confidence 568999999953 23589999985544443
No 37
>PF06337 DUSP: DUSP domain; InterPro: IPR006615 Deubiquitinating enzymes (DUB) form a large family of cysteine protease that can deconjugate ubiquitin or ubiquitin-like proteins (see PDOC00271 from PROSITEDOC) from ubiquitin-conjugated proteins. All DUBs contain a catalytic domain surrounded by one or more subdomains, some of which contribute to target recognition. The ~120-residue DUSP (domain present in ubiquitin-specific proteases) domain is one of these specific subdomains. Single or tandem DUSP domains are located both N- and C-terminal to the ubiquitin carboxyl-terminal hydrolase catalytic core domain (see PDOC00750 from PROSITEDOC) []. The DUSP domain displays a tripod-like AB3 fold with a three-helix bundle and a three-stranded anti-parallel beta-sheet resembling the legs and seat of the tripod. Conserved residues are predominantly involved in hydrophobic packing interactions within the three alpha-helices. The most conserved DUSP residues, forming the PGPI motif, are flanked by two long loops that vary both in length and sequence. The PGPI motif packs against the three-helix bundle and is highly ordered []. The function of the DUSP domain is unknown but it may play a role in protein/protein interaction or substrate recognition. This domain is associated with ubiquitin carboxyl-terminal hydrolase family 2 (IPR001394 from INTERPRO, MEROPS peptidase family C19). They are a family 100 to 200 kDa peptides which includes the Ubp1 ubiquitin peptidase from yeast; others include: Mammalian ubiquitin carboxyl-terminal hydrolase 4 (USP4), Mammalian ubiquitin carboxyl-terminal hydrolase 11 (USP11), Mammalian ubiquitin carboxyl-terminal hydrolase 15 (USP15), Mammalian ubiquitin carboxyl-terminal hydrolase 20 (USP20), Mammalian ubiquitin carboxyl-terminal hydrolase 32 (USP32), Vertebrate ubiquitin carboxyl-terminal hydrolase 33 (USP33), Vertebrate ubiquitin carboxyl-terminal hydrolase 48 (USP48). ; GO: 0004221 ubiquitin thiolesterase activity; PDB: 4A3O_B 3PPA_A 3LMN_A 1W6V_A 3T9L_A 4A3P_A 3PV1_A 3JYU_B.
Probab=20.46 E-value=88 Score=18.67 Aligned_cols=22 Identities=23% Similarity=0.581 Sum_probs=15.5
Q ss_pred ceEEEEeHHHHhhhceeeeecC
Q psy17192 24 KGYILMTSPWFKEYVFEVVVDK 45 (74)
Q Consensus 24 ~Gy~~ms~~wF~eyv~~ivV~K 45 (74)
+=+|.+|.+||+.+.-.+--..
T Consensus 2 ~~~ylIs~~W~~~W~~~v~~~~ 23 (99)
T PF06337_consen 2 DKWYLISSKWWRQWKSYVSYDN 23 (99)
T ss_dssp -EEEEEEHHHHHHHHHHHTTTT
T ss_pred CEEEEECHHHHHHHHHHhCCCC
Confidence 3478999999998876554333
No 38
>PF10008 DUF2251: Uncharacterized protein conserved in bacteria (DUF2251); InterPro: IPR014449 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.05 E-value=63 Score=21.12 Aligned_cols=13 Identities=23% Similarity=0.585 Sum_probs=10.1
Q ss_pred CCCCCceEEEEeH
Q psy17192 19 EEQNHKGYILMTS 31 (74)
Q Consensus 19 ~~~G~~Gy~~ms~ 31 (74)
++.|+-||||.=+
T Consensus 17 EDdGeTGYFYa~d 29 (97)
T PF10008_consen 17 EDDGETGYFYACD 29 (97)
T ss_pred EeCCCcEEEEEEe
Confidence 4679999999743
Done!