Query         psy17192
Match_columns 74
No_of_seqs    114 out of 533
Neff          4.8 
Searched_HMMs 46136
Date          Sat Aug 17 00:12:09 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy17192.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17192hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG3579 PepC Aminopeptidase C  100.0 3.3E-35 7.1E-40  225.7   6.6   71    4-74    374-444 (444)
  2 PF03051 Peptidase_C1_2:  Pepti 100.0 3.1E-34 6.7E-39  222.3   6.9   67    4-70    372-438 (438)
  3 cd00585 Peptidase_C1B Peptidas 100.0 2.5E-32 5.5E-37  211.6   6.8   67    4-70    371-437 (437)
  4 KOG4128|consensus               99.9 5.8E-28 1.3E-32  185.9   3.1   73    2-74    384-457 (457)
  5 cd02619 Peptidase_C1 C1 Peptid  98.8 3.7E-09 8.1E-14   71.2   3.8   36    6-42    187-222 (223)
  6 cd02248 Peptidase_C1A Peptidas  98.7 1.5E-08 3.2E-13   68.9   3.0   25    5-30    171-195 (210)
  7 cd02620 Peptidase_C1A_Cathepsi  98.7 1.6E-08 3.5E-13   71.7   3.3   27    4-31    196-222 (236)
  8 PF00112 Peptidase_C1:  Papain   98.7 1.2E-08 2.6E-13   68.7   2.3   27    5-32    178-204 (219)
  9 cd02698 Peptidase_C1A_Cathepsi  98.6 2.9E-08 6.3E-13   70.5   3.1   29    6-35    193-221 (239)
 10 PTZ00203 cathepsin L protease;  98.6 4.6E-08 9.9E-13   74.2   3.8   27    4-31    298-324 (348)
 11 cd02621 Peptidase_C1A_Cathepsi  98.5 5.8E-08 1.3E-12   68.8   3.2   25    6-31    202-226 (243)
 12 smart00645 Pept_C1 Papain fami  98.5 9.1E-08   2E-12   65.3   3.4   26    5-31    133-158 (174)
 13 PTZ00200 cysteine proteinase;   98.5   9E-08 1.9E-12   75.0   3.5   29    2-31    398-426 (448)
 14 KOG1542|consensus               98.4   2E-07 4.3E-12   72.2   2.7   24    6-30    331-354 (372)
 15 KOG1543|consensus               98.3 3.6E-07 7.8E-12   68.7   3.0   26    5-31    282-307 (325)
 16 PTZ00021 falcipain-2; Provisio  98.2 8.2E-07 1.8E-11   70.6   3.3   23    9-31    446-468 (489)
 17 COG4870 Cysteine protease [Pos  98.1   1E-06 2.2E-11   68.4   1.9   25    9-33    290-314 (372)
 18 PTZ00364 dipeptidyl-peptidase   98.1 1.9E-06 4.1E-11   69.4   3.4   27    4-31    415-443 (548)
 19 PTZ00049 cathepsin C-like prot  98.1 2.4E-06 5.1E-11   70.5   3.5   27    5-31    633-660 (693)
 20 PTZ00462 Serine-repeat antigen  98.1 2.4E-06 5.2E-11   72.7   3.5   25    6-31    740-764 (1004)
 21 KOG1544|consensus               96.6  0.0015 3.3E-08   51.4   2.7   25    5-29    417-442 (470)
 22 PF04046 PSP:  PSP;  InterPro:   61.1     4.6 9.9E-05   23.3   0.9   20   47-66      4-23  (48)
 23 smart00581 PSP proline-rich do  58.0     5.7 0.00012   23.5   1.0   21   46-66      7-27  (54)
 24 PF01011 PQQ:  PQQ enzyme repea  48.4      12 0.00025   19.4   1.2   18    1-19     15-32  (38)
 25 PF05153 DUF706:  Family of unk  43.1     6.8 0.00015   29.5  -0.4   23   34-56    162-185 (253)
 26 PF10273 WGG:  Pre-rRNA-process  41.9     4.7  0.0001   24.9  -1.2   32   13-49     20-51  (82)
 27 PF06658 DUF1168:  Protein of u  32.2     9.5 0.00021   26.3  -1.0   20   10-29     20-40  (142)
 28 smart00695 DUSP Domain in ubiq  31.9      48   0.001   19.6   2.2   21   21-42      3-23  (86)
 29 KOG4032|consensus               31.4      15 0.00032   26.6  -0.2   33   13-50     31-63  (184)
 30 PF13586 DDE_Tnp_1_2:  Transpos  30.8      14  0.0003   22.2  -0.3   13    8-20     45-57  (88)
 31 PF06646 Mycoplasma_p37:  High   26.4      32  0.0007   26.8   0.9   18   14-31    300-317 (331)
 32 COG2513 PrpB PEP phosphonomuta  24.4      31 0.00068   26.4   0.5   25   48-72      6-32  (289)
 33 KOG4003|consensus               24.1      31 0.00066   25.5   0.4    8   13-20     97-104 (223)
 34 KOG0393|consensus               22.1      50  0.0011   23.8   1.1   26   34-71     38-63  (198)
 35 TIGR03601 B_an_ocin probable h  22.1      57  0.0012   20.5   1.3   23    3-32     16-38  (79)
 36 cd00044 CysPc Calpains, domain  21.3 1.8E+02  0.0039   21.4   4.0   29    9-37    253-307 (315)
 37 PF06337 DUSP:  DUSP domain;  I  20.5      88  0.0019   18.7   1.9   22   24-45      2-23  (99)
 38 PF10008 DUF2251:  Uncharacteri  20.0      63  0.0014   21.1   1.2   13   19-31     17-29  (97)

No 1  
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=100.00  E-value=3.3e-35  Score=225.74  Aligned_cols=71  Identities=46%  Similarity=1.102  Sum_probs=69.5

Q ss_pred             CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccCccCC
Q psy17192          4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGTLAQ   74 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg~l~~   74 (74)
                      ++|+|.|||||||||++.|.+|||+|||+||+||+|||||+|++||+++++++..|||+|+|||||||||.
T Consensus       374 ~~g~p~rwkVENSWG~d~G~~GyfvaSd~wmdEytyQIvV~k~~l~~e~~~a~~~epivL~pWDPMGALA~  444 (444)
T COG3579         374 ETGNPLRWKVENSWGKDVGKKGYFVASDAWMDEYTYQIVVDKKFLPKEELAAYEEEPIVLAPWDPMGALAK  444 (444)
T ss_pred             cCCCceeeEeecccccccCCCceEeehHhHhhhheeEEEEehhhCCHHHHHhhcCCCeecCCCCccccccC
Confidence            67899999999999999999999999999999999999999999999999999999999999999999985


No 2  
>PF03051 Peptidase_C1_2:  Peptidase C1-like family This family is a subfamily of the Prosite entry;  InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=100.00  E-value=3.1e-34  Score=222.27  Aligned_cols=67  Identities=52%  Similarity=1.205  Sum_probs=61.0

Q ss_pred             CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccC
Q psy17192          4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG   70 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg   70 (74)
                      ++|+|++||||||||++.|++|||+||++||++|||+|||||++||++++++|++|||+||||||||
T Consensus       372 ~~g~p~~wkVeNSWG~~~g~kGy~~msd~wf~e~v~~vvV~K~~lp~~~~~~~~~~p~~l~~wdpmg  438 (438)
T PF03051_consen  372 EDGKPVRWKVENSWGTDNGDKGYFYMSDDWFDEYVYQVVVHKKYLPKELLKKLEKEPIVLPPWDPMG  438 (438)
T ss_dssp             TTSSEEEEEEE-SBTTTSTBTTEEEEEHHHHHHHEEEEEEEGGGS-HHHHHHHHS--EEE-TTSCCC
T ss_pred             cCCCeeEEEEEcCCCCCCCCCcEEEECHHHHHhhhheEEechhhCCHHHHHHhcCCCeECCCCCCCC
Confidence            6799999999999999999999999999999999999999999999999999999999999999998


No 3  
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.97  E-value=2.5e-32  Score=211.60  Aligned_cols=67  Identities=61%  Similarity=1.293  Sum_probs=65.2

Q ss_pred             CCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccC
Q psy17192          4 ETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMG   70 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg   70 (74)
                      ++|+|.||+||||||+++|++|||+||++||++|||+|||||++||++++++|+++||+||||||||
T Consensus       371 ~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~wf~~y~~~ivV~kk~l~~~~~~~l~~~p~~L~~WDpmg  437 (437)
T cd00585         371 EDGKPVKWKVENSWGEKVGKKGYFVMSDDWFDEYVYQVVVDKKYLPEEVLDLLKQEPIVLPPWDPMG  437 (437)
T ss_pred             CCCCcceEEEEcccCCCCCCCcceehhHHHHHhhheEEEeecccCCHHHHHHhcCCCEECCCCCCCC
Confidence            4588889999999999999999999999999999999999999999999999999999999999997


No 4  
>KOG4128|consensus
Probab=99.94  E-value=5.8e-28  Score=185.87  Aligned_cols=73  Identities=53%  Similarity=0.974  Sum_probs=70.3

Q ss_pred             CCCCCccceEEEecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCHHHHhhhc-CCCeeeCCCCccCccCC
Q psy17192          2 DQETEEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPASVLDVFN-QEPTILPAWDPMGTLAQ   74 (74)
Q Consensus         2 d~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~~~~~~l~-~~p~~l~~wDpmg~l~~   74 (74)
                      |+.+|-+.+|+||||||++.|++||+.|+++||+||+|+||||.+++|++++.++. +|||+||+||||||||.
T Consensus       384 d~~~g~~~~~rVenswgkd~gkkg~~~mt~ewf~EY~feiVVd~k~vP~evl~~~~~~epi~LPaWDPMGALa~  457 (457)
T KOG4128|consen  384 DPATGGLNEHRVENSWGKDLGKKGVNKMTAEWFREYAFEIVVDEKFVPSEVLHAARIHEPIELPAWDPMGALAN  457 (457)
T ss_pred             CcccCCchhhhhhchhhhhccccchhhhhHHHHHhhheeEEeecccCcHHHHHHhhhcCCeeccccCchhhhcC
Confidence            67889999999999999999999999999999999999999999999999999997 89999999999999984


No 5  
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=98.82  E-value=3.7e-09  Score=71.19  Aligned_cols=36  Identities=25%  Similarity=0.396  Sum_probs=32.0

Q ss_pred             CccceEEEecccCCCCCCceEEEEeHHHHhhhceeee
Q psy17192          6 EEPTKWRVENSWGEEQNHKGYILMTSPWFKEYVFEVV   42 (74)
Q Consensus         6 g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~eyv~~iv   42 (74)
                      |+ .+|+|+||||+.+|++||+.|+.+-+.++++...
T Consensus       187 ~~-~~~i~~NSwG~~wg~~Gy~~i~~~~~~~~~~~~~  222 (223)
T cd02619         187 GK-GAFIVKNSWGTDWGDNGYGRISYEDVYEMTFGAN  222 (223)
T ss_pred             CC-CEEEEEeCCCCccccCCEEEEehhhhhhhhhccc
Confidence            55 5999999999999999999999999998887654


No 6  
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=98.67  E-value=1.5e-08  Score=68.87  Aligned_cols=25  Identities=40%  Similarity=0.795  Sum_probs=22.5

Q ss_pred             CCccceEEEecccCCCCCCceEEEEe
Q psy17192          5 TEEPTKWRVENSWGEEQNHKGYILMT   30 (74)
Q Consensus         5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms   30 (74)
                      +|+ .+|+|+||||+++|++|||.|+
T Consensus       171 ~~~-~ywiv~NSWG~~WG~~Gy~~i~  195 (210)
T cd02248         171 NGV-DYWIVKNSWGTSWGEKGYIRIA  195 (210)
T ss_pred             CCc-eEEEEEcCCCCccccCcEEEEE
Confidence            354 5999999999999999999997


No 7  
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=98.67  E-value=1.6e-08  Score=71.72  Aligned_cols=27  Identities=30%  Similarity=0.667  Sum_probs=23.6

Q ss_pred             CCCccceEEEecccCCCCCCceEEEEeH
Q psy17192          4 ETEEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      ++|+ .+|+|+||||+++|++|||.++.
T Consensus       196 ~~g~-~YWivrNSWG~~WGe~Gy~ri~~  222 (236)
T cd02620         196 ENGV-PYWLAANSWGTDWGENGYFRILR  222 (236)
T ss_pred             cCCe-eEEEEEeCCCCCCCCCcEEEEEc
Confidence            3565 49999999999999999999974


No 8  
>PF00112 Peptidase_C1:  Papain family cysteine protease This is family C1 in the peptidase classification. ;  InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues.  The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate [].  The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=98.66  E-value=1.2e-08  Score=68.71  Aligned_cols=27  Identities=30%  Similarity=0.704  Sum_probs=22.7

Q ss_pred             CCccceEEEecccCCCCCCceEEEEeHH
Q psy17192          5 TEEPTKWRVENSWGEEQNHKGYILMTSP   32 (74)
Q Consensus         5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~~   32 (74)
                      .|+ .+|+|+||||+++|++|||.|+..
T Consensus       178 ~~~-~~wiv~NSWG~~WG~~Gy~~i~~~  204 (219)
T PF00112_consen  178 NGK-GYWIVKNSWGTDWGDNGYFRISYD  204 (219)
T ss_dssp             TTE-EEEEEE-SBTTTSTBTTEEEEESS
T ss_pred             cce-eeEeeehhhCCccCCCeEEEEeeC
Confidence            345 599999999999999999999864


No 9  
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=98.61  E-value=2.9e-08  Score=70.55  Aligned_cols=29  Identities=31%  Similarity=0.693  Sum_probs=25.0

Q ss_pred             CccceEEEecccCCCCCCceEEEEeHHHHh
Q psy17192          6 EEPTKWRVENSWGEEQNHKGYILMTSPWFK   35 (74)
Q Consensus         6 g~~~~wkVeNSWG~~~G~~Gy~~ms~~wF~   35 (74)
                      |+ .+|+|+||||+++|++|||.|+..-+.
T Consensus       193 g~-~YWiikNSWG~~WGe~Gy~~i~rg~~~  221 (239)
T cd02698         193 GV-EYWIVRNSWGEPWGERGWFRIVTSSYK  221 (239)
T ss_pred             CC-EEEEEEcCCCcccCcCceEEEEccCCc
Confidence            66 499999999999999999999865543


No 10 
>PTZ00203 cathepsin L protease; Provisional
Probab=98.59  E-value=4.6e-08  Score=74.23  Aligned_cols=27  Identities=44%  Similarity=0.924  Sum_probs=23.5

Q ss_pred             CCCccceEEEecccCCCCCCceEEEEeH
Q psy17192          4 ETEEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      ++|.+ ||+|+||||+.||++|||.|..
T Consensus       298 ~~g~~-YWiikNSWG~~WGe~GY~ri~r  324 (348)
T PTZ00203        298 TGEVP-YWVIKNSWGEDWGEKGYVRVTM  324 (348)
T ss_pred             CCCce-EEEEEcCCCCCcCcCceEEEEc
Confidence            45654 9999999999999999999964


No 11 
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=98.55  E-value=5.8e-08  Score=68.80  Aligned_cols=25  Identities=40%  Similarity=0.815  Sum_probs=22.4

Q ss_pred             CccceEEEecccCCCCCCceEEEEeH
Q psy17192          6 EEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         6 g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      |. .+|+|+||||+.+|++|||.|..
T Consensus       202 g~-~YWiirNSWG~~WGe~Gy~~i~~  226 (243)
T cd02621         202 GE-KYWIVKNSWGSSWGEKGYFKIRR  226 (243)
T ss_pred             CC-cEEEEEcCCCCCCCcCCeEEEec
Confidence            55 59999999999999999999963


No 12 
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=98.51  E-value=9.1e-08  Score=65.31  Aligned_cols=26  Identities=31%  Similarity=0.755  Sum_probs=23.7

Q ss_pred             CCccceEEEecccCCCCCCceEEEEeH
Q psy17192          5 TEEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      +|++ +|+|+||||+.+|++|||.|+.
T Consensus       133 ~g~~-yWii~NSwG~~WG~~G~~~i~~  158 (174)
T smart00645      133 NGKD-YWIVKNSWGTDWGENGYFRIAR  158 (174)
T ss_pred             CCee-EEEEECCCCCCcccCeEEEEEc
Confidence            6775 9999999999999999999974


No 13 
>PTZ00200 cysteine proteinase; Provisional
Probab=98.50  E-value=9e-08  Score=75.04  Aligned_cols=29  Identities=31%  Similarity=0.825  Sum_probs=24.5

Q ss_pred             CCCCCccceEEEecccCCCCCCceEEEEeH
Q psy17192          2 DQETEEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         2 d~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      |.++|. .+|.|+||||++||++|||.|..
T Consensus       398 d~~~g~-~YWIIkNSWG~~WGe~GY~ri~r  426 (448)
T PTZ00200        398 DEKTKK-RYWIIKNSWGTDWGENGYMRLER  426 (448)
T ss_pred             cCCCCC-ceEEEEcCCCCCcccCeeEEEEe
Confidence            345566 49999999999999999999974


No 14 
>KOG1542|consensus
Probab=98.38  E-value=2e-07  Score=72.24  Aligned_cols=24  Identities=46%  Similarity=0.955  Sum_probs=21.5

Q ss_pred             CccceEEEecccCCCCCCceEEEEe
Q psy17192          6 EEPTKWRVENSWGEEQNHKGYILMT   30 (74)
Q Consensus         6 g~~~~wkVeNSWG~~~G~~Gy~~ms   30 (74)
                      .+| ||.|+||||+.||.+|||.+-
T Consensus       331 ~~P-YWIVKNSWG~~WGE~GY~~l~  354 (372)
T KOG1542|consen  331 EKP-YWIVKNSWGTSWGEKGYYKLC  354 (372)
T ss_pred             CCc-eEEEECCccccccccceEEEe
Confidence            466 999999999999999999873


No 15 
>KOG1543|consensus
Probab=98.33  E-value=3.6e-07  Score=68.68  Aligned_cols=26  Identities=35%  Similarity=0.749  Sum_probs=22.4

Q ss_pred             CCccceEEEecccCCCCCCceEEEEeH
Q psy17192          5 TEEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         5 ~g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      +|. .||.|+||||+.||++|||-|-.
T Consensus       282 ~~~-~YWivkNSWG~~WGe~Gy~ri~r  307 (325)
T KOG1543|consen  282 DGV-DYWIVKNSWGTDWGEKGYFRIAR  307 (325)
T ss_pred             CCc-eeEEEEcCCCCCcccCceEEEec
Confidence            344 49999999999999999999853


No 16 
>PTZ00021 falcipain-2; Provisional
Probab=98.24  E-value=8.2e-07  Score=70.57  Aligned_cols=23  Identities=35%  Similarity=0.744  Sum_probs=21.4

Q ss_pred             ceEEEecccCCCCCCceEEEEeH
Q psy17192          9 TKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         9 ~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      .+|+|+||||+.||++|||.|..
T Consensus       446 ~YWIVKNSWGt~WGE~GY~rI~r  468 (489)
T PTZ00021        446 YYYIIKNSWGESWGEKGFIRIET  468 (489)
T ss_pred             CEEEEECCCCCCcccCeEEEEEc
Confidence            49999999999999999999974


No 17 
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.14  E-value=1e-06  Score=68.43  Aligned_cols=25  Identities=24%  Similarity=0.644  Sum_probs=22.6

Q ss_pred             ceEEEecccCCCCCCceEEEEeHHH
Q psy17192          9 TKWRVENSWGEEQNHKGYILMTSPW   33 (74)
Q Consensus         9 ~~wkVeNSWG~~~G~~Gy~~ms~~w   33 (74)
                      .-|+|+||||+.+|.+|||+||..+
T Consensus       290 GAfiikNSWGt~wG~~GYfwisY~y  314 (372)
T COG4870         290 GAFIIKNSWGTNWGENGYFWISYYY  314 (372)
T ss_pred             ceEEEECccccccccCceEEEEeee
Confidence            4699999999999999999999654


No 18 
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=98.13  E-value=1.9e-06  Score=69.35  Aligned_cols=27  Identities=22%  Similarity=0.430  Sum_probs=23.5

Q ss_pred             CCCccceEEEecccCC--CCCCceEEEEeH
Q psy17192          4 ETEEPTKWRVENSWGE--EQNHKGYILMTS   31 (74)
Q Consensus         4 ~~g~~~~wkVeNSWG~--~~G~~Gy~~ms~   31 (74)
                      ++|. .||+|+||||+  ++|++|||.|..
T Consensus       415 e~G~-~YWIVKNSWGt~~~WGE~GYfRI~R  443 (548)
T PTZ00364        415 ENGG-DYWLVLDPWGSRRSWCDGGTRKIAR  443 (548)
T ss_pred             CCCc-eEEEEECCCCCCCCcccCCeEEEEc
Confidence            3566 49999999999  999999999964


No 19 
>PTZ00049 cathepsin C-like protein; Provisional
Probab=98.11  E-value=2.4e-06  Score=70.53  Aligned_cols=27  Identities=26%  Similarity=0.604  Sum_probs=23.2

Q ss_pred             CCcc-ceEEEecccCCCCCCceEEEEeH
Q psy17192          5 TEEP-TKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         5 ~g~~-~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      +|++ .||.|+||||+.||++|||.|..
T Consensus       633 nG~~~~YWIVRNSWGt~WGenGYfKI~R  660 (693)
T PTZ00049        633 NGKLYKYWIGRNSWGKNWGKEGYFKIIR  660 (693)
T ss_pred             CCcccCEEEEECCCCCCcccCceEEEEc
Confidence            5642 59999999999999999999964


No 20 
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=98.10  E-value=2.4e-06  Score=72.73  Aligned_cols=25  Identities=32%  Similarity=0.740  Sum_probs=22.4

Q ss_pred             CccceEEEecccCCCCCCceEEEEeH
Q psy17192          6 EEPTKWRVENSWGEEQNHKGYILMTS   31 (74)
Q Consensus         6 g~~~~wkVeNSWG~~~G~~Gy~~ms~   31 (74)
                      |+ .||.|+||||+.||++|||.|..
T Consensus       740 gk-~YWIVRNSWGt~WGEnGYFKI~r  764 (1004)
T PTZ00462        740 KK-SYWIVRNSWGKYWGDEGYFKVDM  764 (1004)
T ss_pred             CC-ceEEEEcCCCCCcCCCeEEEEEe
Confidence            44 49999999999999999999974


No 21 
>KOG1544|consensus
Probab=96.63  E-value=0.0015  Score=51.43  Aligned_cols=25  Identities=28%  Similarity=0.696  Sum_probs=21.9

Q ss_pred             CCc-cceEEEecccCCCCCCceEEEE
Q psy17192          5 TEE-PTKWRVENSWGEEQNHKGYILM   29 (74)
Q Consensus         5 ~g~-~~~wkVeNSWG~~~G~~Gy~~m   29 (74)
                      +|+ ..+|..-||||+.+|++|||.+
T Consensus       417 ~G~~~KyW~aANSWG~~WGE~GYFri  442 (470)
T KOG1544|consen  417 DGRTLKYWTAANSWGPAWGERGYFRI  442 (470)
T ss_pred             CCCeeEEEEeecccccccccCceEEE
Confidence            454 4789999999999999999986


No 22 
>PF04046 PSP:  PSP;  InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=61.14  E-value=4.6  Score=23.29  Aligned_cols=20  Identities=10%  Similarity=0.360  Sum_probs=17.3

Q ss_pred             CCCHHHHhhhcCCCeeeCCC
Q psy17192         47 YVPASVLDVFNQEPTILPAW   66 (74)
Q Consensus        47 ~l~~~~~~~l~~~p~~l~~w   66 (74)
                      .||++++++|.=.+-.+|||
T Consensus         4 ~lS~~LR~ALg~~~~~~PPw   23 (48)
T PF04046_consen    4 KLSDELREALGMQENDPPPW   23 (48)
T ss_pred             ccCHHHHHHcCCCCCCCChH
Confidence            58999999998777778888


No 23 
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=57.95  E-value=5.7  Score=23.46  Aligned_cols=21  Identities=14%  Similarity=0.439  Sum_probs=18.0

Q ss_pred             CCCCHHHHhhhcCCCeeeCCC
Q psy17192         46 KYVPASVLDVFNQEPTILPAW   66 (74)
Q Consensus        46 ~~l~~~~~~~l~~~p~~l~~w   66 (74)
                      -.||++++++|.=.+-.+|||
T Consensus         7 G~lS~~LR~ALG~~~~~pPPW   27 (54)
T smart00581        7 GRISDELREALGLPPGQPPPW   27 (54)
T ss_pred             CcCCHHHHHHcCCCCCCCChH
Confidence            358999999998877788888


No 24 
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=48.41  E-value=12  Score=19.35  Aligned_cols=18  Identities=17%  Similarity=0.346  Sum_probs=14.2

Q ss_pred             CCCCCCccceEEEecccCC
Q psy17192          1 MDQETEEPTKWRVENSWGE   19 (74)
Q Consensus         1 ~d~~~g~~~~wkVeNSWG~   19 (74)
                      ||.++|+. .|+.+++-..
T Consensus        15 lD~~TG~~-~W~~~~~~~~   32 (38)
T PF01011_consen   15 LDAKTGKV-LWKFQTGPPV   32 (38)
T ss_dssp             EETTTTSE-EEEEESSSGG
T ss_pred             EECCCCCE-EEeeeCCCCC
Confidence            57889995 8999987544


No 25 
>PF05153 DUF706:  Family of unknown function (DUF706) ;  InterPro: IPR007828 Inositol oxygenase (1.13.99.1 from EC) is involved in the biosynthesis of UDP-glucuronic acid (UDP-GlcA), providing nucleotide sugars for cell-wall polymers. It may be also involved in plant ascorbate biosynthesis [, ].; GO: 0005506 iron ion binding, 0050113 inositol oxygenase activity, 0019310 inositol catabolic process, 0055114 oxidation-reduction process, 0005737 cytoplasm; PDB: 2HUO_A 3BXD_A 2IBN_A.
Probab=43.08  E-value=6.8  Score=29.54  Aligned_cols=23  Identities=26%  Similarity=0.595  Sum_probs=16.4

Q ss_pred             Hhhhceeeee-cCCCCCHHHHhhh
Q psy17192         34 FKEYVFEVVV-DKKYVPASVLDVF   56 (74)
Q Consensus        34 F~eyv~~ivV-~K~~l~~~~~~~l   56 (74)
                      =+||+|+|+. |+..||++.+.++
T Consensus       162 HDEYlY~Vlk~n~~tLP~eaL~mI  185 (253)
T PF05153_consen  162 HDEYLYQVLKHNKSTLPEEALYMI  185 (253)
T ss_dssp             HHHHHHHHHHHCT----HHHHHHH
T ss_pred             chHHHHHHHHcccCccCHHHHHHH
Confidence            5899999998 9999999887765


No 26 
>PF10273 WGG:  Pre-rRNA-processing protein TSR2;  InterPro: IPR019398 The pre-rRNA-processing protein TSR2 is required for 20S pre-rRNA processing []. This family contains a distinctive WGG motif. 
Probab=41.93  E-value=4.7  Score=24.92  Aligned_cols=32  Identities=25%  Similarity=0.548  Sum_probs=22.5

Q ss_pred             EecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCC
Q psy17192         13 VENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVP   49 (74)
Q Consensus        13 VeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~   49 (74)
                      |||+||-..+     .-..+||..++.+...+.+-+.
T Consensus        20 Ven~wGG~~s-----~~K~~~l~~~i~~~f~~~~~~~   51 (82)
T PF10273_consen   20 VENGWGGPDS-----QEKADWLAEVIVDWFTENKDPD   51 (82)
T ss_pred             HHhccCCccH-----HHHHHHHHHHHHHHHhcCCCCC
Confidence            7999996543     2345788888888877755544


No 27 
>PF06658 DUF1168:  Protein of unknown function (DUF1168);  InterPro: IPR009548 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=32.22  E-value=9.5  Score=26.30  Aligned_cols=20  Identities=20%  Similarity=0.107  Sum_probs=15.6

Q ss_pred             eEEEecccCCCCC-CceEEEE
Q psy17192         10 KWRVENSWGEEQN-HKGYILM   29 (74)
Q Consensus        10 ~wkVeNSWG~~~G-~~Gy~~m   29 (74)
                      .=+|.|-||+..| ..|.|.+
T Consensus        20 pe~V~NV~GSSAGAGSGeFHv   40 (142)
T PF06658_consen   20 PEFVRNVQGSSAGAGSGEFHV   40 (142)
T ss_pred             CeeeccccccccccCccHHHH
Confidence            4579999999877 4788765


No 28 
>smart00695 DUSP Domain in ubiquitin-specific proteases.
Probab=31.85  E-value=48  Score=19.64  Aligned_cols=21  Identities=14%  Similarity=0.566  Sum_probs=15.1

Q ss_pred             CCCceEEEEeHHHHhhhceeee
Q psy17192         21 QNHKGYILMTSPWFKEYVFEVV   42 (74)
Q Consensus        21 ~G~~Gy~~ms~~wF~eyv~~iv   42 (74)
                      .|+. +|.+|.+||+++.--+.
T Consensus         3 ~g~~-~yiIs~~W~~~~~~~~~   23 (86)
T smart00695        3 EGDT-WYLISTRWYRQWADFVE   23 (86)
T ss_pred             CCCE-EEEEeHHHHHHHHHHhC
Confidence            3544 78999999998775333


No 29 
>KOG4032|consensus
Probab=31.41  E-value=15  Score=26.57  Aligned_cols=33  Identities=30%  Similarity=0.558  Sum_probs=21.9

Q ss_pred             EecccCCCCCCceEEEEeHHHHhhhceeeeecCCCCCH
Q psy17192         13 VENSWGEEQNHKGYILMTSPWFKEYVFEVVVDKKYVPA   50 (74)
Q Consensus        13 VeNSWG~~~G~~Gy~~ms~~wF~eyv~~ivV~K~~l~~   50 (74)
                      ||||||...+.     --..|+-.++++..+.-+-|..
T Consensus        31 VEng~GG~dsq-----ek~~w~~~~~~d~~~~~~dV~~   63 (184)
T KOG4032|consen   31 VENGWGGRDSQ-----EKAKWITGVVVDYFTMNKDVKA   63 (184)
T ss_pred             HHhccCCccHH-----HHHHHHHhhHhhhcccccchhH
Confidence            79999975442     2356888888877755444543


No 30 
>PF13586 DDE_Tnp_1_2:  Transposase DDE domain
Probab=30.84  E-value=14  Score=22.21  Aligned_cols=13  Identities=23%  Similarity=0.511  Sum_probs=10.2

Q ss_pred             cceEEEecccCCC
Q psy17192          8 PTKWRVENSWGEE   20 (74)
Q Consensus         8 ~~~wkVeNSWG~~   20 (74)
                      +.+|.||+|.|--
T Consensus        45 ~~Rw~VEr~f~wl   57 (88)
T PF13586_consen   45 KRRWVVERTFAWL   57 (88)
T ss_pred             ccceehhhhhHHH
Confidence            4799999998643


No 31 
>PF06646 Mycoplasma_p37:  High affinity transport system protein p37;  InterPro: IPR010592 This family consists of several high affinity transport system protein p37 sequences, which are specific to Mycoplasma species. The p37 gene is part of an operon encoding two additional proteins, which are highly similar to components of the periplasmic binding-protein-dependent transport systems of Gram-negative bacteria. It has been suggested that p37 is part of a homologous, high-affinity transport system in Mycoplasma hyorhinis, a Gram-positive bacterium [].; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 3MYU_B 3EKI_A 3E79_A 3E78_A.
Probab=26.40  E-value=32  Score=26.85  Aligned_cols=18  Identities=28%  Similarity=0.547  Sum_probs=11.2

Q ss_pred             ecccCCCCCCceEEEEeH
Q psy17192         14 ENSWGEEQNHKGYILMTS   31 (74)
Q Consensus        14 eNSWG~~~G~~Gy~~ms~   31 (74)
                      ||.||...|+.||-.+++
T Consensus       300 ~ntYG~~~GyNgy~~i~~  317 (331)
T PF06646_consen  300 ENTYGPYTGYNGYKKINN  317 (331)
T ss_dssp             --SSGGGCT--EEEE-S-
T ss_pred             ccccccccccccccccCc
Confidence            789999999999987765


No 32 
>COG2513 PrpB PEP phosphonomutase and related enzymes [Carbohydrate transport and metabolism]
Probab=24.43  E-value=31  Score=26.40  Aligned_cols=25  Identities=32%  Similarity=0.615  Sum_probs=19.0

Q ss_pred             CCHHHHhhhcC-CCeeeC-CCCccCcc
Q psy17192         48 VPASVLDVFNQ-EPTILP-AWDPMGTL   72 (74)
Q Consensus        48 l~~~~~~~l~~-~p~~l~-~wDpmg~l   72 (74)
                      .....++++.+ .|+++| +|||+.++
T Consensus         6 ~~~~fR~l~~~~~~~~~pg~~d~~sA~   32 (289)
T COG2513           6 PGAAFRALHASGDPLVLPGAWDAGSAL   32 (289)
T ss_pred             HHHHHHHHHhCCCCEEecCCcCHHHHH
Confidence            34567788876 688888 99999865


No 33 
>KOG4003|consensus
Probab=24.10  E-value=31  Score=25.52  Aligned_cols=8  Identities=50%  Similarity=1.311  Sum_probs=7.4

Q ss_pred             EecccCCC
Q psy17192         13 VENSWGEE   20 (74)
Q Consensus        13 VeNSWG~~   20 (74)
                      ||||||.+
T Consensus        97 v~ntwG~d  104 (223)
T KOG4003|consen   97 VKNTWGVD  104 (223)
T ss_pred             hccCCCCC
Confidence            89999987


No 34 
>KOG0393|consensus
Probab=22.09  E-value=50  Score=23.82  Aligned_cols=26  Identities=38%  Similarity=0.753  Sum_probs=19.7

Q ss_pred             HhhhceeeeecCCCCCHHHHhhhcCCCeeeCCCCccCc
Q psy17192         34 FKEYVFEVVVDKKYVPASVLDVFNQEPTILPAWDPMGT   71 (74)
Q Consensus        34 F~eyv~~ivV~K~~l~~~~~~~l~~~p~~l~~wDpmg~   71 (74)
                      |+.|.-+|.|+            +.++|.|-.||--|.
T Consensus        38 Fdnys~~v~V~------------dg~~v~L~LwDTAGq   63 (198)
T KOG0393|consen   38 FDNYSANVTVD------------DGKPVELGLWDTAGQ   63 (198)
T ss_pred             EccceEEEEec------------CCCEEEEeeeecCCC
Confidence            57777777776            678899999987653


No 35 
>TIGR03601 B_an_ocin probable heterocycle-containing bacteriocin, BA_2677 family. Numerous bacteria encode systems for producing bacteriocins by extensive modification of ribosomally produced precursors. These proteins are recognizable in part by proximity to the modification proteins, and in part by small size, with leader peptide-like N-terminal sequence followed by low-complexity sequence rich in Cys, Gly, and/or Ser. This protein family represents a probable member of the class, though previously unrecognized because it is not encoded adjacent to its modification proteins.
Probab=22.08  E-value=57  Score=20.53  Aligned_cols=23  Identities=17%  Similarity=0.417  Sum_probs=16.5

Q ss_pred             CCCCccceEEEecccCCCCCCceEEEEeHH
Q psy17192          3 QETEEPTKWRVENSWGEEQNHKGYILMTSP   32 (74)
Q Consensus         3 ~~~g~~~~wkVeNSWG~~~G~~Gy~~ms~~   32 (74)
                      .+.|+.+||.-+|-.       =||+|+++
T Consensus        16 y~~gnvvyWDqqnqy-------pyyyi~d~   38 (79)
T TIGR03601        16 YQTGNVVYWDQQNQY-------PYYYIQDD   38 (79)
T ss_pred             ccCCcEEEeccccCC-------cceehhhh
Confidence            377888999877753       57777664


No 36 
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=21.30  E-value=1.8e+02  Score=21.36  Aligned_cols=29  Identities=24%  Similarity=0.373  Sum_probs=20.2

Q ss_pred             ceEEEecccCCC--------------------------CCCceEEEEeHHHHhhh
Q psy17192          9 TKWRVENSWGEE--------------------------QNHKGYILMTSPWFKEY   37 (74)
Q Consensus         9 ~~wkVeNSWG~~--------------------------~G~~Gy~~ms~~wF~ey   37 (74)
                      +.-+++|-||..                          ..++|.|.|+-+=|-+|
T Consensus       253 ~lv~lrNPWg~~~w~G~ws~~~~~w~~~~~~~~~~~~~~~~dG~Fwm~~~df~~~  307 (315)
T cd00044         253 RLLRLRNPWGVGEWWGGWSDDSSEWWVIDAERKKLLLSGKDDGEFWMSFEDFLRN  307 (315)
T ss_pred             EEEEecCCccCCCccCCCCCCCchhccChHHHHHhcCCCCCCCEEEEEhHHhhee
Confidence            568999999953                          23589999985544443


No 37 
>PF06337 DUSP:  DUSP domain;  InterPro: IPR006615 Deubiquitinating enzymes (DUB) form a large family of cysteine protease that can deconjugate ubiquitin or ubiquitin-like proteins (see PDOC00271 from PROSITEDOC) from ubiquitin-conjugated proteins. All DUBs contain a catalytic domain surrounded by one or more subdomains, some of which contribute to target recognition. The ~120-residue DUSP (domain present in ubiquitin-specific proteases) domain is one of these specific subdomains. Single or tandem DUSP domains are located both N- and C-terminal to the ubiquitin carboxyl-terminal hydrolase catalytic core domain (see PDOC00750 from PROSITEDOC) [].  The DUSP domain displays a tripod-like AB3 fold with a three-helix bundle and a three-stranded anti-parallel beta-sheet resembling the legs and seat of the tripod. Conserved residues are predominantly involved in hydrophobic packing interactions within the three alpha-helices. The most conserved DUSP residues, forming the PGPI motif, are flanked by two long loops that vary both in length and sequence. The PGPI motif packs against the three-helix bundle and is highly ordered [].  The function of the DUSP domain is unknown but it may play a role in protein/protein interaction or substrate recognition. This domain is associated with ubiquitin carboxyl-terminal hydrolase family 2 (IPR001394 from INTERPRO, MEROPS peptidase family C19). They are a family 100 to 200 kDa peptides which includes the Ubp1 ubiquitin peptidase from yeast; others include:   Mammalian ubiquitin carboxyl-terminal hydrolase 4 (USP4), Mammalian ubiquitin carboxyl-terminal hydrolase 11 (USP11),  Mammalian ubiquitin carboxyl-terminal hydrolase 15 (USP15),  Mammalian ubiquitin carboxyl-terminal hydrolase 20 (USP20),  Mammalian ubiquitin carboxyl-terminal hydrolase 32 (USP32),  Vertebrate ubiquitin carboxyl-terminal hydrolase 33 (USP33),  Vertebrate ubiquitin carboxyl-terminal hydrolase 48 (USP48).  ; GO: 0004221 ubiquitin thiolesterase activity; PDB: 4A3O_B 3PPA_A 3LMN_A 1W6V_A 3T9L_A 4A3P_A 3PV1_A 3JYU_B.
Probab=20.46  E-value=88  Score=18.67  Aligned_cols=22  Identities=23%  Similarity=0.581  Sum_probs=15.5

Q ss_pred             ceEEEEeHHHHhhhceeeeecC
Q psy17192         24 KGYILMTSPWFKEYVFEVVVDK   45 (74)
Q Consensus        24 ~Gy~~ms~~wF~eyv~~ivV~K   45 (74)
                      +=+|.+|.+||+.+.-.+--..
T Consensus         2 ~~~ylIs~~W~~~W~~~v~~~~   23 (99)
T PF06337_consen    2 DKWYLISSKWWRQWKSYVSYDN   23 (99)
T ss_dssp             -EEEEEEHHHHHHHHHHHTTTT
T ss_pred             CEEEEECHHHHHHHHHHhCCCC
Confidence            3478999999998876554333


No 38 
>PF10008 DUF2251:  Uncharacterized protein conserved in bacteria (DUF2251);  InterPro: IPR014449 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=20.05  E-value=63  Score=21.12  Aligned_cols=13  Identities=23%  Similarity=0.585  Sum_probs=10.1

Q ss_pred             CCCCCceEEEEeH
Q psy17192         19 EEQNHKGYILMTS   31 (74)
Q Consensus        19 ~~~G~~Gy~~ms~   31 (74)
                      ++.|+-||||.=+
T Consensus        17 EDdGeTGYFYa~d   29 (97)
T PF10008_consen   17 EDDGETGYFYACD   29 (97)
T ss_pred             EeCCCcEEEEEEe
Confidence            4679999999743


Done!