Query psy14159
Match_columns 264
No_of_seqs 155 out of 704
Neff 5.6
Searched_HMMs 46136
Date Fri Aug 16 20:57:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14159.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14159hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0778|consensus 100.0 6.4E-37 1.4E-41 298.1 9.9 137 105-261 317-457 (511)
2 COG5160 ULP1 Protease, Ulp1 fa 100.0 1E-32 2.2E-37 266.8 10.5 137 97-253 369-510 (578)
3 PLN03189 Protease specific for 100.0 1.6E-32 3.4E-37 265.8 11.6 149 96-260 277-435 (490)
4 PF02902 Peptidase_C48: Ulp1 p 99.8 2.3E-20 5E-25 161.2 11.2 143 120-262 1-157 (216)
5 KOG0779|consensus 99.6 1.8E-16 3.8E-21 159.5 3.8 108 106-221 353-461 (595)
6 KOG3246|consensus 99.6 3.2E-15 6.9E-20 132.3 10.4 114 97-233 10-134 (223)
7 PF03290 Peptidase_C57: Vaccin 68.5 5.5 0.00012 39.0 3.6 37 196-232 229-267 (423)
8 PRK11836 deubiquitinase; Provi 60.7 8.2 0.00018 36.8 3.1 24 194-217 215-238 (403)
9 PRK14848 deubiquitinase SseL; 56.6 7.9 0.00017 36.1 2.2 30 19-48 25-54 (317)
10 KOG0779|consensus 45.1 7.5 0.00016 40.1 0.2 98 118-222 42-151 (595)
11 PF06543 Lac_bphage_repr: Lact 36.2 27 0.00059 24.4 1.8 18 26-43 29-46 (49)
12 PHA02130 hypothetical protein 28.8 12 0.00026 28.0 -1.0 37 181-217 14-51 (81)
13 COG2947 Uncharacterized conser 21.6 55 0.0012 28.2 1.5 61 3-70 78-148 (156)
No 1
>KOG0778|consensus
Probab=100.00 E-value=6.4e-37 Score=298.11 Aligned_cols=137 Identities=28% Similarity=0.548 Sum_probs=124.0
Q ss_pred CeeeehhhhhccCCCCCccHHHHHHHHHHHHHhhcCCCCCCceEEechHHHHhhccCCccccCCCCCCCccchhhhhhcc
Q psy14159 105 GISINTEDYACLDKEQFINDVIIDFYLGFLLQEMATPHIREKTHIFSTFFYKRLTQKPTRKISGSIEGDPHISAAKKRHS 184 (264)
Q Consensus 105 gi~It~~Dl~~L~~~~wLND~IInFYL~~L~~~~~~~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d~~l~~~~~~~~ 184 (264)
+|.||.+|+.||.+++||||+||||||++|.++.......++||+||||||++|... +|+
T Consensus 317 ~i~It~~dl~tl~~~~WLNDevINfYm~ll~ers~~~~~yp~~h~FnTFFy~kL~~~--------------------gy~ 376 (511)
T KOG0778|consen 317 NIDITGKDLQTLRPGNWLNDEVINFYMELLKERSKKDSKYPKVHAFNTFFYTKLVGR--------------------GYA 376 (511)
T ss_pred cccccHHHHhhccCccchhHHHHHHHHHHHHhhccccCCCceEEEEechhhhhhhhc--------------------chH
Confidence 699999999999999999999999999999998865545889999999999999854 699
Q ss_pred ceecccccCCCCccCeEEEeecCCCceeEEEEEcCC--cccccCCCCChHHHHHHHHHHHHHH--hhhcCcccCCCCeEE
Q psy14159 185 RVQSWTKNINVFEKDFIVVPINSNAHWFLAIICFPG--LKGCETPEGRAVEECQRFRSLRKRE--RIKLDEMQKTGRTLY 260 (264)
Q Consensus 185 ~V~rWtkkvdIF~kd~I~IPIN~~~HW~LaVI~~p~--l~~~dS~~~~~~~~~~~~~~~~~~~--~~k~~~~~~~~w~~~ 260 (264)
.|+||||+||||++|+||||||.+.||+||||+... +.|+|||++.+...+..++.|++.| .|++.+||..||+++
T Consensus 377 ~VkRWTk~v~if~~d~i~vPIH~~vHW~l~vid~r~k~i~y~DS~~~~~nr~~~aL~~Yl~~E~~~k~~~~~d~s~w~~~ 456 (511)
T KOG0778|consen 377 GVKRWTKKVDIFDKDIIFVPIHLGVHWCLAVIDLREKTIEYYDSLGGGPNRICDALAKYLQDESRDKSKKDFDVSGWTIE 456 (511)
T ss_pred HHHhHhhccCccccceeEeeeecCceEEEEEEEcccceEEEeeccCCCCcchHHHHHHHHHHHHhhhhcCCCCccchhhh
Confidence 999999999999999999999999999999999876 8999999866555568888888888 488999999999986
Q ss_pred e
Q psy14159 261 R 261 (264)
Q Consensus 261 ~ 261 (264)
-
T Consensus 457 ~ 457 (511)
T KOG0778|consen 457 F 457 (511)
T ss_pred h
Confidence 4
No 2
>COG5160 ULP1 Protease, Ulp1 family [Posttranslational modification, protein turnover, chaperones]
Probab=99.98 E-value=1e-32 Score=266.83 Aligned_cols=137 Identities=32% Similarity=0.551 Sum_probs=121.9
Q ss_pred eEEeeCCC---CeeeehhhhhccCCCCCccHHHHHHHHHHHHHhhcCCCCCCceEEechHHHHhhccCCccccCCCCCCC
Q psy14159 97 TIYIYKPT---GISINTEDYACLDKEQFINDVIIDFYLGFLLQEMATPHIREKTHIFSTFFYKRLTQKPTRKISGSIEGD 173 (264)
Q Consensus 97 ~il~yp~~---gi~It~~Dl~~L~~~~wLND~IInFYL~~L~~~~~~~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d 173 (264)
.+.+||.. +|+||..|+.||.+++||||+||||||+||.........+++||+||||||++|.+.
T Consensus 369 ~~~~~~~~~~~~i~IT~~D~~~L~~~~wLNDtIIdFy~k~ls~~sk~~s~~~~vh~FnTFFYT~Lsrr------------ 436 (578)
T COG5160 369 KSSCYPFNDRSSIEITNQDFKRLRNGDWLNDTIIDFYMKLLSKISKNTSKREQVHLFNTFFYTKLSRR------------ 436 (578)
T ss_pred ccccccccCcCcceeehHhhhhhccccchhhHHHHHHHHHHHHhccCcccccceEEeehhhHHHHHHH------------
Confidence 34468865 799999999999999999999999999999777666667899999999999999864
Q ss_pred ccchhhhhhccceecccccCCCCccCeEEEeecCCCceeEEEEEcCC--cccccCCCCChHHHHHHHHHHHHHHhhhcCc
Q psy14159 174 PHISAAKKRHSRVQSWTKNINVFEKDFIVVPINSNAHWFLAIICFPG--LKGCETPEGRAVEECQRFRSLRKRERIKLDE 251 (264)
Q Consensus 174 ~~l~~~~~~~~~V~rWtkkvdIF~kd~I~IPIN~~~HW~LaVI~~p~--l~~~dS~~~~~~~~~~~~~~~~~~~~~k~~~ 251 (264)
+|++|+||++++|||++++||||||...||+|||||.+. +.++||+++.+-..+..+++|.-.|.|...+
T Consensus 437 --------Gy~gVrrW~kk~dif~~k~I~iPIni~~HW~l~II~~~~~~i~~~DSLan~~~~v~~~L~~Y~ldE~k~~~~ 508 (578)
T COG5160 437 --------GYSGVRRWTKKTDIFSKKYIFIPINISYHWFLAIIDNPKKNILYFDSLANTHDPVLEFLRSYLLDEYKIQHD 508 (578)
T ss_pred --------HhHHHHHHHhccCccccceEEEEecccceEEEEEeecCcceeEEecccccCcHHHHHHHHHHHHHHHhcccC
Confidence 699999999999999999999999999999999999886 8899999998888888888888888665544
Q ss_pred cc
Q psy14159 252 MQ 253 (264)
Q Consensus 252 ~~ 253 (264)
-+
T Consensus 509 k~ 510 (578)
T COG5160 509 KD 510 (578)
T ss_pred Cc
Confidence 43
No 3
>PLN03189 Protease specific for SMALL UBIQUITIN-RELATED MODIFIER (SUMO); Provisional
Probab=99.98 E-value=1.6e-32 Score=265.83 Aligned_cols=149 Identities=26% Similarity=0.436 Sum_probs=125.1
Q ss_pred ceEEe-eCCCCeeeehhhhhccCCCCCccHHHHHHHHHHHHHhhcC-CCCCCceEEechHHHHhhccCCccccCCCCCCC
Q psy14159 96 QTIYI-YKPTGISINTEDYACLDKEQFINDVIIDFYLGFLLQEMAT-PHIREKTHIFSTFFYKRLTQKPTRKISGSIEGD 173 (264)
Q Consensus 96 ~~il~-yp~~gi~It~~Dl~~L~~~~wLND~IInFYL~~L~~~~~~-~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d 173 (264)
.++++ +.+.+++||++|+.||.|++||||+||||||++|.++... +....++|+||||||++|.+.+.
T Consensus 277 ~~vlvs~~~~~i~IT~~DL~~L~Pg~WLNDeVINfYm~LL~er~~~~p~~~~k~h~FNTFFytkL~~~~~---------- 346 (490)
T PLN03189 277 RKVLVTHENSNIDITGEILRCLKPGAWLNDEVINLYLELLKEREAREPKKFLKCHFFNTFFYKKLVSGKS---------- 346 (490)
T ss_pred cceeeecCCCceEEEHHHhhccCCCCccCHHHHHHHHHHHHHhhhcCcccccceEEEehHHHHHHhhcCC----------
Confidence 34444 4456899999999999999999999999999999876432 22357899999999999987531
Q ss_pred ccchhhhhhccceeccccc----CCCCccCeEEEeecCCCceeEEEEEcCC--cccccCCCCChHHHHHHHHHHHHHH--
Q psy14159 174 PHISAAKKRHSRVQSWTKN----INVFEKDFIVVPINSNAHWFLAIICFPG--LKGCETPEGRAVEECQRFRSLRKRE-- 245 (264)
Q Consensus 174 ~~l~~~~~~~~~V~rWtkk----vdIF~kd~I~IPIN~~~HW~LaVI~~p~--l~~~dS~~~~~~~~~~~~~~~~~~~-- 245 (264)
..+|.+|+|||++ +++|++|+||||||.+.||+|+||+++. +.+|||+++.....++.++.|...|
T Consensus 347 ------~ygY~~VrRWTk~kKigv~Lfs~D~IFIPIh~n~HWsLaVId~k~k~I~yyDSLgg~~~~vL~~L~rYL~~E~k 420 (490)
T PLN03189 347 ------GYDYKAVRRWTTQKKLGYHLIDCDKIFVPIHQEIHWTLAVINKKDQKFQYLDSLKGRDPKILDALAKYYVDEVK 420 (490)
T ss_pred ------cCChHHHHHHhhhcccccccccCceEEeeeecCCeeEEEEEEcCCCeEEEEeCCCCCCHHHHHHHHHHHHHHHh
Confidence 1268999999974 5799999999999999999999999986 8999999888777888888888777
Q ss_pred hhhcCcccCCCCeEE
Q psy14159 246 RIKLDEMQKTGRTLY 260 (264)
Q Consensus 246 ~~k~~~~~~~~w~~~ 260 (264)
.+++.+++.++|++.
T Consensus 421 dK~g~d~D~s~W~~~ 435 (490)
T PLN03189 421 DKSEKDIDVSSWEQE 435 (490)
T ss_pred hhcCCCcchhcceec
Confidence 367789999999864
No 4
>PF02902 Peptidase_C48: Ulp1 protease family, C-terminal catalytic domain This family belongs to family C48 of the peptidase classification.; InterPro: IPR003653 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins contain cysteine peptidases belonging to MEROPS peptidase family C48 (Ulp1 endopeptidase family, clan CE). The protein fold of the peptidase domain for members of this family resembles that of adenain, the type example for clan CE. This group of sequences also contains a number of hypothetical proteins, which have not yet been characterised, and non-peptidase homologues. These are proteins that have either been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of the peptidases in the family. The Ulp1 endopeptidase family contain the deubiquitinating enzymes (DUB) that can de-conjugate ubiquitin or ubiquitin-like proteins from ubiquitin-conjugated proteins. They can be classified in 3 families according to sequence homology [, ]: Ubiquitin carboxyl-terminal hydrolase (UCH) (see PDOC00127 from PROSITEDOC), Ubiquitin-specific processing protease (UBP) (see PDOC00750 from PROSITEDOC), and ubiquitin-like protease (ULP) specific for de-conjugating ubiquitin-like proteins. In contrast to the UBP pathway, which is very redundant (16 UBP enzymes in yeast), there are few ubiquitin-like proteases (only one in yeast, Ulp1). Ulp1 catalyses two critical functions in the SUMO/Smt3 pathway via its cysteine protease activity. Ulp1 processes the Smt3 C-terminal sequence (-GGATY) to its mature form (-GG), and it de-conjugates Smt3 from the lysine epsilon-amino group of the target protein []. Crystal structure of yeast Ulp1 bound to Smt3 [] revealed that the catalytic and interaction interface is situated in a shallow and narrow cleft where conserved residues recognise the Gly-Gly motif at the C-terminal extremity of Smt3 protein. Ulp1 adopts a novel architecture despite some structural similarity with other cysteine protease. The secondary structure is composed of seven alpha helices and seven beta strands. The catalytic domain includes the central alpha helix, beta-strands 4 to 6, and the catalytic triad (Cys-His-Asp). This profile is directed against the C-terminal part of ULP proteins that displays full proteolytic activity [].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1EUV_A 2HL8_A 2HKP_A 2HL9_A 1XT9_A 2BKQ_C 2BKR_A 2IO1_E 1TH0_B 1TGZ_A ....
Probab=99.83 E-value=2.3e-20 Score=161.21 Aligned_cols=143 Identities=28% Similarity=0.488 Sum_probs=94.2
Q ss_pred CCccHHHHHHHHHHHHHhhc-CCCCCCceEEechHHHHhhccCCccccCCCCCCCccchhhhhhccceecccccC---CC
Q psy14159 120 QFINDVIIDFYLGFLLQEMA-TPHIREKTHIFSTFFYKRLTQKPTRKISGSIEGDPHISAAKKRHSRVQSWTKNI---NV 195 (264)
Q Consensus 120 ~wLND~IInFYL~~L~~~~~-~~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d~~l~~~~~~~~~V~rWtkkv---dI 195 (264)
+||||.|||||+++|.++.. .....+++++|+|+|+++|.........+.....-........+.++.+|++.. ++
T Consensus 1 ~wLnd~iId~y~~~l~~~~~~~~~~~~~~~~~~~~f~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 80 (216)
T PF02902_consen 1 EWLNDSIIDFYLEYLRHRLESENKNSKRVHFFSSFFYQKLTECFKQFKKDKERSKFKWNESKDFYNGVQRWFRKKNKKNL 80 (216)
T ss_dssp --EEHHHHHHHHHHHHHHTCCTHHTSTTEEEE-THHHHHHH----HHHCCH--S-----HHHHCHHHCGGGGTTCCTSTG
T ss_pred CcCCHHHHHHHHHHHHHhhccCccCCCcEEEECceeeeccccccccccccccccccchhhhhHhhhhhhhhhhccccccc
Confidence 69999999999999986543 334578999999999999984322111110000000011245678999999887 99
Q ss_pred CccCeEEEeecC-CCceeEEEEEcCC--cccccCCCCChH-----HHHHHHHHHHHHH--hhhcCcccCCCCeEEec
Q psy14159 196 FEKDFIVVPINS-NAHWFLAIICFPG--LKGCETPEGRAV-----EECQRFRSLRKRE--RIKLDEMQKTGRTLYRL 262 (264)
Q Consensus 196 F~kd~I~IPIN~-~~HW~LaVI~~p~--l~~~dS~~~~~~-----~~~~~~~~~~~~~--~~k~~~~~~~~w~~~~~ 262 (264)
+++|+|+||||. +.||+|+||+.|. +.++||+++... ..+..+..+...+ .......+...|+..+.
T Consensus 81 ~~~~~i~iPin~~~~HW~l~vi~~~~~~i~~~DSl~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 157 (216)
T PF02902_consen 81 FDKDYIFIPININNNHWVLLVIDLPKKRIYVYDSLGSSNNDKRYKRVIENIIPFLKREYKKKEGRDPDKSPFKIVRP 157 (216)
T ss_dssp GGSSEEEEEEEETTTEEEEEEEETTTTEEEEE-TTSTSSH-HHHHHHHHHHHHHHHHHHHHHHSSCT-TTTCEEEEE
T ss_pred cccCEEEEEEechhhccceeEEcccccEEEEEeccccccccccchhhhhhhhhhhhhccccccccccccceeeeccc
Confidence 999999999998 9999999999996 889999965544 3344554444444 34444556777776654
No 5
>KOG0779|consensus
Probab=99.62 E-value=1.8e-16 Score=159.48 Aligned_cols=108 Identities=41% Similarity=0.838 Sum_probs=94.4
Q ss_pred eeeehhhhhccCCCCCccHHHHHHHHHHHHHhhcC-CCCCCceEEechHHHHhhccCCccccCCCCCCCccchhhhhhcc
Q psy14159 106 ISINTEDYACLDKEQFINDVIIDFYLGFLLQEMAT-PHIREKTHIFSTFFYKRLTQKPTRKISGSIEGDPHISAAKKRHS 184 (264)
Q Consensus 106 i~It~~Dl~~L~~~~wLND~IInFYL~~L~~~~~~-~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d~~l~~~~~~~~ 184 (264)
+.++..|+.||+++.+|||.|+|||++|+....+. .+....+|+|++|||.++.+...++..+ .....++.
T Consensus 353 ~~v~~~Dl~cl~~~e~L~d~i~dfyl~~i~~~~l~~~~~s~~~h~f~tffyk~l~~~~~~~~~d--------~~~~~~~~ 424 (595)
T KOG0779|consen 353 HQVNNNDLVCLEEGEFLNDTIKDFYLEYIRQYLLSQKELSNDRHIFSTFFYKRLCRKLRQKSND--------QIQDNRAV 424 (595)
T ss_pred cceeccchhhccccchhhhhhhhhhhHHHHHhhhcccccCcchhhhhhHHHhhhhhhhhhhhhh--------hhhccccC
Confidence 48999999999999999999999999999998876 6678899999999999998875432111 22345788
Q ss_pred ceecccccCCCCccCeEEEeecCCCceeEEEEEcCCc
Q psy14159 185 RVQSWTKNINVFEKDFIVVPINSNAHWFLAIICFPGL 221 (264)
Q Consensus 185 ~V~rWtkkvdIF~kd~I~IPIN~~~HW~LaVI~~p~l 221 (264)
++++|++.+|+|.++||++|+|+..||+|++||+|+.
T Consensus 425 ~~~~~~~~~d~~~k~yi~~P~~E~~hw~laiic~p~~ 461 (595)
T KOG0779|consen 425 RLRTWTRHFDLFNKDYVFVPTHERFHWKLAIICNPDL 461 (595)
T ss_pred ceeeeeeccccccceeEEecCchHhhhhccccccCcc
Confidence 9999999999999999999999999999999999973
No 6
>KOG3246|consensus
Probab=99.61 E-value=3.2e-15 Score=132.31 Aligned_cols=114 Identities=23% Similarity=0.454 Sum_probs=86.8
Q ss_pred eEEeeCCCCeeeehhhhhccCCCCCccHHHHHHHHHHHHHhhcCCCCCCceEEechHHHHhhccCCccccCCCCCCCccc
Q psy14159 97 TIYIYKPTGISINTEDYACLDKEQFINDVIIDFYLGFLLQEMATPHIREKTHIFSTFFYKRLTQKPTRKISGSIEGDPHI 176 (264)
Q Consensus 97 ~il~yp~~gi~It~~Dl~~L~~~~wLND~IInFYL~~L~~~~~~~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d~~l 176 (264)
.+|.|- .+.++.+|+..|++..||||.+|+||.+||.+..... .+..|++++=.-..|...+ +
T Consensus 10 v~Lsy~--dv~Lr~sDVdlL~~p~wlnD~~I~F~~e~l~~~~~~s--~~~~~ll~P~~t~~l~~~~-----~-------- 72 (223)
T KOG3246|consen 10 VVLSYF--DVSLRQSDVDLLQPPEWLNDRLIDFYYEYLEHRRSRS--EPDLHLLRPSLTFFLRHAP-----N-------- 72 (223)
T ss_pred eEEEee--ehhhhhhhhhhcCCCchhhhhHHHHHHHHHHHhhccc--CcchhccCHHHHHHHHhCC-----C--------
Confidence 677777 8999999999999999999999999999999986543 3346888753333333221 1
Q ss_pred hhhhhhccceecccccCCCCccCeEEEeecCC---------CceeEEEEEcCC--cccccCCCCChHH
Q psy14159 177 SAAKKRHSRVQSWTKNINVFEKDFIVVPINSN---------AHWFLAIICFPG--LKGCETPEGRAVE 233 (264)
Q Consensus 177 ~~~~~~~~~V~rWtkkvdIF~kd~I~IPIN~~---------~HW~LaVI~~p~--l~~~dS~~~~~~~ 233 (264)
-+.++......++++|++||+|||.+ +||+|.|+.-|. +.+|||+.|.+.+
T Consensus 73 ------~~e~~~~~~pl~l~~k~~iflpiNDn~~~~~~~GGsHWSLLV~sr~~~~f~hyDS~~n~nt~ 134 (223)
T KOG3246|consen 73 ------PEEIAMVLDPLDLNDKDFIFLPINDNSNVTRASGGSHWSLLVFSRPDGKFYHYDSLSNGNTK 134 (223)
T ss_pred ------cHHHHHhcChhhcCCCceEEEEecCCCcccccCCCcceEEEEEEeeCCcEEEeecccCCCcH
Confidence 12344555677999999999999963 699999999885 7899999554433
No 7
>PF03290 Peptidase_C57: Vaccinia virus I7 processing peptidase; InterPro: IPR004970 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This is a group of cysteine peptidases which constitute MEROPS peptidase family C57 (clan CE). The type example is vaccinia virus I7 processing peptidase (vaccinia virus); protein I7 is expressed in the late phase of infection [].
Probab=68.48 E-value=5.5 Score=38.99 Aligned_cols=37 Identities=11% Similarity=0.099 Sum_probs=29.0
Q ss_pred CccCeEEEeecCCCceeEEEEEcCC--cccccCCCCChH
Q psy14159 196 FEKDFIVVPINSNAHWFLAIICFPG--LKGCETPEGRAV 232 (264)
Q Consensus 196 F~kd~I~IPIN~~~HW~LaVI~~p~--l~~~dS~~~~~~ 232 (264)
-++.|+.+|.+...||.++|++-.. ...|||-|+.+.
T Consensus 229 ~~~RyvmFgfcY~~Hwkc~IfDk~~~~v~FydSgG~~P~ 267 (423)
T PF03290_consen 229 SKKRYVMFGFCYMSHWKCCIFDKEKKIVYFYDSGGNIPE 267 (423)
T ss_pred ccccEEEeeeeehhcceEEEEeccccEEEEEcCCCCCHH
Confidence 3578999999999999999998654 457777655443
No 8
>PRK11836 deubiquitinase; Provisional
Probab=60.70 E-value=8.2 Score=36.77 Aligned_cols=24 Identities=29% Similarity=0.679 Sum_probs=20.0
Q ss_pred CCCccCeEEEeecCCCceeEEEEE
Q psy14159 194 NVFEKDFIVVPINSNAHWFLAIIC 217 (264)
Q Consensus 194 dIF~kd~I~IPIN~~~HW~LaVI~ 217 (264)
.++-++.=++|||.+.||.|++..
T Consensus 215 ~~~~k~~elFpINtg~HWil~~l~ 238 (403)
T PRK11836 215 PSWPKEVQLFPINTGGHWILVSLQ 238 (403)
T ss_pred CCCcccceEEEecCCCcEEEEEeH
Confidence 456678889999999999999763
No 9
>PRK14848 deubiquitinase SseL; Provisional
Probab=56.60 E-value=7.9 Score=36.13 Aligned_cols=30 Identities=23% Similarity=0.410 Sum_probs=23.3
Q ss_pred eeEEEcCCCCChHHHHHHHHHhhccCCCCC
Q psy14159 19 RRITLLPEKIPDEVKCILSDIYSKTSSGRS 48 (264)
Q Consensus 19 ~~i~l~~~~~~~~~~~~l~~~~~~~~~~~~ 48 (264)
+-|.|-.|.+..+++.+|.++|+...+|..
T Consensus 25 ~NiALr~D~~G~~~E~iLfdlfsgk~~~~~ 54 (317)
T PRK14848 25 CNLALRNDDLGHKVEKLLFDLFSGKRSGSP 54 (317)
T ss_pred HHHhhccccccchHHHHHHHHHcCCccCCc
Confidence 345566788999999999999997765544
No 10
>KOG0779|consensus
Probab=45.06 E-value=7.5 Score=40.09 Aligned_cols=98 Identities=19% Similarity=0.170 Sum_probs=68.5
Q ss_pred CCCCccHHHHHHHHHHHHHhhcCCCCCCceEEechHHHHhhccCCccccCCCCCCCccchhhhhhccceecc--------
Q psy14159 118 KEQFINDVIIDFYLGFLLQEMATPHIREKTHIFSTFFYKRLTQKPTRKISGSIEGDPHISAAKKRHSRVQSW-------- 189 (264)
Q Consensus 118 ~~~wLND~IInFYL~~L~~~~~~~~~~~~v~iFsSFFy~kL~~~~~~~~~~~~e~d~~l~~~~~~~~~V~rW-------- 189 (264)
...++||.++.+++............+..-+.|.+.+...+........ ......++....+.+|
T Consensus 42 ~~~~~~~i~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k-------i~~~~~~~~~~~~~~~~~~~~~~~ 114 (595)
T KOG0779|consen 42 RPNYLNDIIIDLVLLKDDNTLNDEYTRELSILFILAVKKGLTSLTEPLK-------IAVLEEQKPHAVVPKKIISVSELR 114 (595)
T ss_pred CcccccccccchheeccccCCCccccceeeecccCCccccccccccccc-------cccccccCCccccccccccchhcc
Confidence 4668999999999987776655444577778888888888876543321 1112234556778888
Q ss_pred ----cccCCCCccCeEEEeecCCCceeEEEEEcCCcc
Q psy14159 190 ----TKNINVFEKDFIVVPINSNAHWFLAIICFPGLK 222 (264)
Q Consensus 190 ----tkkvdIF~kd~I~IPIN~~~HW~LaVI~~p~l~ 222 (264)
++.+...++.....|++...||+++.++++.+.
T Consensus 115 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (595)
T KOG0779|consen 115 NTNLTEGTFYSDYLNRIKNIFESVRKFFNQVSYPKFM 151 (595)
T ss_pred ccCCcccceecccccccccccchhcchhhhccccccc
Confidence 555555556667778888999999999987644
No 11
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=36.20 E-value=27 Score=24.44 Aligned_cols=18 Identities=33% Similarity=0.468 Sum_probs=16.4
Q ss_pred CCCChHHHHHHHHHhhcc
Q psy14159 26 EKIPDEVKCILSDIYSKT 43 (264)
Q Consensus 26 ~~~~~~~~~~l~~~~~~~ 43 (264)
.|||+++|.+++-+|++.
T Consensus 29 rPltdevK~a~k~i~~~~ 46 (49)
T PF06543_consen 29 RPLTDEVKEAMKLIFGKR 46 (49)
T ss_pred eeCCHHHHHHHHHHHhhh
Confidence 589999999999999965
No 12
>PHA02130 hypothetical protein
Probab=28.77 E-value=12 Score=28.02 Aligned_cols=37 Identities=19% Similarity=0.545 Sum_probs=30.8
Q ss_pred hhccceecccc-cCCCCccCeEEEeecCCCceeEEEEE
Q psy14159 181 KRHSRVQSWTK-NINVFEKDFIVVPINSNAHWFLAIIC 217 (264)
Q Consensus 181 ~~~~~V~rWtk-kvdIF~kd~I~IPIN~~~HW~LaVI~ 217 (264)
+.++.++.|.. +.|-++-|++=||.-...||-|+-..
T Consensus 14 ks~~sl~~wl~~~~dswdddil~ipfkstv~w~lcp~~ 51 (81)
T PHA02130 14 KSWESLREWLDERFDSWDDDILSIPFKSTVYWDLCPYA 51 (81)
T ss_pred HHHHHHHHHHHhcccccccchhcccccceeeeccCcch
Confidence 35677888885 67889999999999999999998543
No 13
>COG2947 Uncharacterized conserved protein [Function unknown]
Probab=21.60 E-value=55 Score=28.16 Aligned_cols=61 Identities=16% Similarity=0.209 Sum_probs=35.8
Q ss_pred CCCCCccCCCCCCCCceeEEEc-------CCCCChHHHHHHHH---HhhccCCCCCCCCCCCcccccCCHHHHhhhhh
Q psy14159 3 KNSKFYFDPSSPDETHRRITLL-------PEKIPDEVKCILSD---IYSKTSSGRSEEPQDGPCLEELSIKEANNILV 70 (264)
Q Consensus 3 ~~~~~~~dp~~~~~~~~~i~l~-------~~~~~~~~~~~l~~---~~~~~~~~~~~~~~~~~~~~~l~~~e~~~~~~ 70 (264)
++..|||||.++.++-|--+.. ++++|.++.....+ +-.-..++++ -+.-++..|..+|+.
T Consensus 78 d~~spYyDPka~~e~pRW~~Vdv~~v~~~~~~vtL~~lK~~~~~~~~~~l~~g~RL-------SV~PVt~~ew~~i~~ 148 (156)
T COG2947 78 DPASPYYDPKATPEDPRWYCVDVRFVRKLPRPVTLKELKANPELAEMSLLVKGNRL-------SVQPVTPEEWKEILR 148 (156)
T ss_pred CCCCcccCcccccCCCCeeEEeeHHHhhcCCCccHHHHhcCcchhhhhhhhccCee-------eeeeCCHHHHHHHHH
Confidence 4678999999988877754443 45677665433221 1101111233 256678888888763
Done!