Query psy15801
Match_columns 318
No_of_seqs 255 out of 1557
Neff 7.7
Searched_HMMs 46136
Date Fri Aug 16 21:41:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15801.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15801hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0960|consensus 99.9 8.9E-25 1.9E-29 202.8 11.8 139 141-279 29-167 (467)
2 TIGR02110 PQQ_syn_pqqF coenzym 99.9 1.3E-23 2.8E-28 214.0 16.3 131 148-278 2-134 (696)
3 PF00675 Peptidase_M16: Insuli 99.9 8.3E-23 1.8E-27 171.2 14.6 122 156-277 1-123 (149)
4 COG0612 PqqL Predicted Zn-depe 99.9 2.2E-22 4.8E-27 197.1 13.7 137 144-280 15-152 (438)
5 PRK15101 protease3; Provisiona 99.9 5.1E-22 1.1E-26 211.8 15.3 135 143-277 41-177 (961)
6 KOG2067|consensus 99.8 3.4E-21 7.3E-26 179.3 9.2 134 144-277 23-156 (472)
7 KOG2583|consensus 99.8 1E-18 2.3E-23 162.8 13.0 134 145-278 22-156 (429)
8 KOG2067|consensus 99.7 1.1E-17 2.3E-22 156.1 9.9 131 2-132 277-432 (472)
9 COG1025 Ptr Secreted/periplasm 99.7 2.4E-17 5.1E-22 168.6 12.0 133 142-274 20-154 (937)
10 KOG0959|consensus 99.7 4.8E-16 1E-20 161.2 12.5 137 142-278 24-163 (974)
11 KOG0960|consensus 99.6 1.5E-15 3.2E-20 141.7 7.7 126 2-128 283-430 (467)
12 PRK15101 protease3; Provisiona 99.6 6.1E-14 1.3E-18 149.9 17.5 213 56-275 372-650 (961)
13 PTZ00432 falcilysin; Provision 99.4 7.9E-13 1.7E-17 142.6 12.2 111 145-259 91-207 (1119)
14 PTZ00432 falcilysin; Provision 99.3 3.3E-10 7.2E-15 122.4 20.3 120 152-272 666-803 (1119)
15 KOG2019|consensus 98.8 1.4E-06 3E-11 87.3 21.4 259 4-270 328-696 (998)
16 COG1026 Predicted Zn-dependent 98.7 1.4E-06 3E-11 90.8 21.2 262 2-272 291-665 (978)
17 PF08367 M16C_assoc: Peptidase 98.4 1.8E-05 3.9E-10 72.1 16.9 167 56-258 15-193 (248)
18 COG0612 PqqL Predicted Zn-depe 98.3 1.8E-06 3.9E-11 84.7 8.3 121 5-132 273-418 (438)
19 KOG2583|consensus 98.1 6.9E-06 1.5E-10 77.7 6.8 121 2-127 264-401 (429)
20 COG1026 Predicted Zn-dependent 97.4 0.00044 9.5E-09 72.6 7.7 109 150-262 26-137 (978)
21 KOG0961|consensus 96.9 0.0032 6.8E-08 63.8 8.1 107 153-263 28-136 (1022)
22 PF05193 Peptidase_M16_C: Pept 96.6 0.0028 6.2E-08 53.0 4.4 71 2-80 93-184 (184)
23 TIGR02110 PQQ_syn_pqqF coenzym 93.4 8.8 0.00019 40.3 19.0 198 60-266 102-346 (696)
24 PF03410 Peptidase_M44: Protei 92.3 0.16 3.5E-06 49.8 4.2 88 149-250 2-90 (590)
25 KOG2019|consensus 92.3 0.18 3.8E-06 51.7 4.5 107 152-262 60-169 (998)
26 KOG0959|consensus 91.7 18 0.0004 39.2 18.8 221 44-274 591-861 (974)
27 PHA03081 putative metalloprote 90.1 0.29 6.3E-06 48.1 3.5 98 149-260 2-100 (595)
28 PF05193 Peptidase_M16_C: Pept 89.6 0.75 1.6E-05 38.1 5.3 109 153-266 66-184 (184)
29 COG1025 Ptr Secreted/periplasm 85.7 5.1 0.00011 42.9 9.6 117 153-274 509-629 (937)
30 KOG0961|consensus 82.6 4.9 0.00011 41.6 7.6 124 2-127 826-962 (1022)
31 PF10278 Med19: Mediator of RN 76.6 1 2.2E-05 38.8 0.6 24 204-227 40-63 (178)
32 PF07946 DUF1682: Protein of u 71.9 22 0.00047 33.7 8.4 49 223-271 213-269 (321)
33 KOG0345|consensus 65.7 11 0.00025 37.4 5.1 15 225-239 424-438 (567)
34 KOG2422|consensus 58.9 2 4.3E-05 43.4 -1.4 7 283-289 88-94 (665)
35 PF09186 DUF1949: Domain of un 51.2 69 0.0015 21.1 6.9 46 205-250 9-54 (56)
36 PF08496 Peptidase_S49_N: Pept 49.9 22 0.00048 30.0 3.7 11 264-274 52-62 (155)
37 KOG3794|consensus 48.6 37 0.0008 32.8 5.3 27 53-79 22-48 (453)
38 smart00874 B5 tRNA synthetase 47.4 32 0.00069 24.4 3.8 50 201-250 18-67 (71)
39 TIGR00869 sec62 protein transl 45.0 13 0.00027 33.6 1.6 7 118-124 72-78 (232)
40 PF03484 B5: tRNA synthetase B 43.3 63 0.0014 23.0 4.8 49 201-250 18-66 (70)
41 PF08496 Peptidase_S49_N: Pept 40.1 22 0.00047 30.0 2.2 15 259-273 51-65 (155)
42 PRK14552 C/D box methylation g 39.8 4.7 0.0001 39.6 -2.1 15 233-247 330-344 (414)
43 PRK14552 C/D box methylation g 39.6 5.1 0.00011 39.3 -2.0 16 267-282 372-387 (414)
44 KOG3794|consensus 37.9 23 0.0005 34.2 2.2 7 152-158 117-123 (453)
45 cd04914 ACT_AKi-DapG-BS_1 ACT 37.0 91 0.002 21.9 4.8 49 205-254 17-65 (67)
46 PF00675 Peptidase_M16: Insuli 35.7 1.2E+02 0.0027 24.3 6.1 54 55-108 87-140 (149)
47 PF15289 RFXA_RFXANK_bdg: Regu 33.1 14 0.00031 29.7 0.0 16 276-291 19-34 (124)
48 PF08432 Vfa1: AAA-ATPase Vps4 32.4 16 0.00035 31.6 0.2 34 251-284 50-85 (182)
49 PF05890 Ebp2: Eukaryotic rRNA 31.6 88 0.0019 28.9 4.9 27 245-275 110-136 (271)
50 KOG2138|consensus 30.9 26 0.00057 36.4 1.4 14 242-255 774-787 (883)
51 PF09851 SHOCT: Short C-termin 29.9 1.3E+02 0.0027 17.9 4.2 17 65-81 14-30 (31)
52 KOG4043|consensus 29.9 40 0.00086 28.6 2.1 11 185-195 68-78 (214)
53 PF12122 DUF3582: Protein of u 28.1 2E+02 0.0044 22.3 5.7 49 204-253 13-62 (101)
54 PRK06063 DNA polymerase III su 28.0 1.3E+02 0.0029 28.2 5.6 69 193-266 237-308 (313)
55 cd04923 ACT_AK-LysC-DapG-like_ 25.6 2E+02 0.0044 18.8 5.4 44 205-249 18-61 (63)
56 PF14257 DUF4349: Domain of un 25.2 2E+02 0.0044 26.0 6.2 44 205-248 66-117 (262)
57 PF13840 ACT_7: ACT domain ; P 24.9 2.2E+02 0.0048 19.7 5.1 37 209-247 29-65 (65)
58 KOG4043|consensus 24.8 55 0.0012 27.8 2.1 11 171-181 119-129 (214)
59 KOG2573|consensus 24.8 24 0.00051 34.3 -0.1 21 225-245 390-410 (498)
60 COG1327 Predicted transcriptio 24.7 4E+02 0.0086 22.5 7.1 70 53-123 64-149 (156)
61 cd04936 ACT_AKii-LysC-BS-like_ 24.5 2.1E+02 0.0046 18.7 5.4 44 205-249 18-61 (63)
62 PHA01735 hypothetical protein 22.4 1.3E+02 0.0027 21.9 3.2 32 56-87 18-49 (76)
63 KOG3089|consensus 22.3 68 0.0015 28.8 2.3 42 235-276 4-45 (271)
64 PF08288 PIGA: PIGA (GPI ancho 21.9 71 0.0015 24.4 2.0 15 148-162 9-23 (90)
65 KOG2138|consensus 21.5 63 0.0014 33.7 2.1 13 67-79 488-500 (883)
66 PF07499 RuvA_C: RuvA, C-termi 21.1 2.1E+02 0.0046 18.6 4.0 28 55-82 2-29 (47)
67 KOG2573|consensus 20.3 1.1E+02 0.0025 29.8 3.4 9 68-76 137-145 (498)
68 KOG0487|consensus 20.1 64 0.0014 30.4 1.8 50 239-288 245-297 (308)
69 COG5137 Histone chaperone invo 20.1 5.4E+02 0.012 23.1 7.3 62 142-210 77-143 (279)
No 1
>KOG0960|consensus
Probab=99.92 E-value=8.9e-25 Score=202.77 Aligned_cols=139 Identities=21% Similarity=0.284 Sum_probs=133.7
Q ss_pred CCCCCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeE
Q psy15801 141 LFSPDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYA 220 (318)
Q Consensus 141 ~~~~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~ 220 (318)
...|+.+.++|+||++|.+.++..++++|+++|++||++|.+.++|.+||++||.|+||.+++...+..+++.+|+.+|+
T Consensus 29 ~~~P~t~vttL~NGlrVaTE~~~a~TATVGVwidaGSR~EnekNNG~ahFLEhlaFKGT~~Rs~~alElEieniGahLNA 108 (467)
T KOG0960|consen 29 LSVPETEVTTLPNGLRVATEHNSASTATVGVWIDAGSRFENEKNNGTAHFLEHLAFKGTKNRSQAALELEIENIGAHLNA 108 (467)
T ss_pred ccCCcceEEEcCCCcEEEeccCCCcceEEEEEeccCccccccccccHHHHHHHHHhcCCCcchhHHHHHHHHHHHHHhcc
Confidence 34578899999999999999998899999999999999999999999999999999999999999999999999999999
Q ss_pred EecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChhh
Q psy15801 221 TVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGNE 279 (318)
Q Consensus 221 ~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~~ 279 (318)
+++||+|.|+..++++++++++++|+||++|..+.+.+|+++|..|+.|+++.+.+.++
T Consensus 109 ytSReqT~yyakal~~dv~kavdiLaDIlqns~L~~s~IerER~vILrEmqevd~~~~e 167 (467)
T KOG0960|consen 109 YTSREQTVYYAKALSKDVPKAVDILADILQNSKLEESAIERERDVILREMQEVDKNHQE 167 (467)
T ss_pred cccccceeeehhhccccchHHHHHHHHHHHhCccchhHHHHHHHHHHHHHHHHHhhhhH
Confidence 99999999999999999999999999999999999999999999999999998877664
No 2
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=99.91 E-value=1.3e-23 Score=214.05 Aligned_cols=131 Identities=17% Similarity=0.152 Sum_probs=124.9
Q ss_pred EEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHHHhCCeEeEEecCc
Q psy15801 148 SGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQQAGGSLYATVDRE 225 (318)
Q Consensus 148 ~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~~lG~~l~~~~~~d 225 (318)
..+|+|||+|+++++ ..|++++.++|++|+++|+.+.+|++||++||+|+||++++.. +|.+.++.+||++|+++++|
T Consensus 2 ~~tL~NGLrVllv~~p~~p~vav~l~v~aGS~~Ep~~~~GLAHfLEHMLFkGT~~~~~~~~i~~~le~lGG~lNA~Ts~d 81 (696)
T TIGR02110 2 RITLPNGLRVHLYHQPDAKRAAALLRVAAGSHDEPSAWPGLAHFLEHLLFLGGERFQGDDRLMPWVQRQGGQVNATTLER 81 (696)
T ss_pred eEEcCCCCEEEEEECCCCCEEEEEEEEeeccCCCCCCCCcHHHHHHHHHhcCCCCCCcHHHHHHHHHHhCCeEEEEEcCC
Confidence 468999999999998 5799999999999999999999999999999999999999985 89999999999999999999
Q ss_pred eEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChh
Q psy15801 226 GIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGN 278 (318)
Q Consensus 226 ~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~ 278 (318)
+|.|++++++++++.+|++|+|++.+|.|++++|+++|.++++|+++..+++.
T Consensus 82 ~T~y~~~v~~~~l~~aL~lLaD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp~ 134 (696)
T TIGR02110 82 TTAFFFELPAAALAAGLARLCDMLARPLLTAEDQQREREVLEAEYIAWQNDAD 134 (696)
T ss_pred eEEEEEEecHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCHH
Confidence 99999999999999999999999999999999999999999999998776553
No 3
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=99.90 E-value=8.3e-23 Score=171.24 Aligned_cols=122 Identities=22% Similarity=0.382 Sum_probs=117.8
Q ss_pred EEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEEEEEec
Q psy15801 156 KVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAV 234 (318)
Q Consensus 156 kv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l 234 (318)
+|++.++ ..|+++++++|++|+++|+.+++|++|+++||+++||.+++..+|.+.++.+|+.+++++++|++.|+++++
T Consensus 1 ~V~~~~~~~~~~~~~~l~~~~Gs~~e~~~~~G~a~ll~~l~~~gs~~~~~~~l~~~l~~~G~~~~~~t~~d~t~~~~~~~ 80 (149)
T PF00675_consen 1 KVVLVEDPGSPVVSVSLVFKAGSRYEPPGKPGLAHLLEHLLFRGSKKYSSDELQEELESLGASFNASTSRDSTSYSASVL 80 (149)
T ss_dssp EEEEEESTTSSEEEEEEEES-SGGGSCTTTTTHHHHHHHHTTSBBSSSBHHHHHHHHHHTTCEEEEEEESSEEEEEEEEE
T ss_pred CEEEEEcCCCCEEEEEEEEeeccCCCCCCCCchhhhhhhhcccccchhhhhhhHHHhhhhccccceEecccceEEEEEEe
Confidence 6889998 789999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801 235 RKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG 277 (318)
Q Consensus 235 ~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~ 277 (318)
+++++.+|++|++++.+|.|++++|+++|..+..++++...++
T Consensus 81 ~~~~~~~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~ 123 (149)
T PF00675_consen 81 SEDLEKALELLADMLFNPSFDEEEFEREREQILQEIEEIKENP 123 (149)
T ss_dssp GGGHHHHHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHH
T ss_pred cccchhHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCH
Confidence 9999999999999999999999999999999999999988776
No 4
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=99.88 E-value=2.2e-22 Score=197.09 Aligned_cols=137 Identities=22% Similarity=0.278 Sum_probs=131.0
Q ss_pred CCcEEEEcCCCcEEEEEecC-CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEe
Q psy15801 144 PDIQSGTSYNKVKVVSVNET-SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATV 222 (318)
Q Consensus 144 ~~~~~~~L~NGlkv~~~~~~-~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~ 222 (318)
+.++..+|+||+++++.+++ .|.+++.++|++|+++|+.+..|+|||++||+|+||++++..+|.+.++.+||.+++++
T Consensus 15 ~~~~~~~L~nGl~~~~~~~~~~~~vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~~~~~i~~~~~~~G~~~na~t 94 (438)
T COG0612 15 PGLQVFTLPNGLRVITYPNPTAPTVSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAELAEAFEKLGGQLNAFT 94 (438)
T ss_pred ccceEEEcCCCCEEEEEeCCCCCEEEEEEEEeecccCCCCCcccHHHHHHHHHccCCCCCChHHHHHHHHHhcCeeeccc
Confidence 45899999999999999995 89999999999999999999999999999999999999998899999999999999999
Q ss_pred cCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChhhH
Q psy15801 223 DREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGNEK 280 (318)
Q Consensus 223 ~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~~~ 280 (318)
++|+|.|++++++++++.+|++++|++.+|.|++++|+++|.++++||++..+++..-
T Consensus 95 s~d~t~y~~~~l~~~~~~~l~llad~l~~p~f~~~~~e~Ek~vil~ei~~~~d~p~~~ 152 (438)
T COG0612 95 SFDYTVYYLSVLPDNLDKALDLLADILLNPTFDEEEVEREKGVILEEIRMRQDDPDDL 152 (438)
T ss_pred cchhhhhhhhhchhhhHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHhhccCchHH
Confidence 9999999999999999999999999999999999999999999999999998876543
No 5
>PRK15101 protease3; Provisional
Probab=99.88 E-value=5.1e-22 Score=211.76 Aligned_cols=135 Identities=13% Similarity=0.175 Sum_probs=127.4
Q ss_pred CCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCC-HHHHHHHHHHhCCeEeE
Q psy15801 143 SPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFT-HFGIIRHVQQAGGSLYA 220 (318)
Q Consensus 143 ~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s-~~~i~~~l~~lG~~l~~ 220 (318)
+..++..+|+|||+|+++++ ..|.+++++.|++|+++||.+.+|+|||++||+|+||.+++ ..+|.+.++.+||++|+
T Consensus 41 ~~~~~~~~L~NGL~v~l~~~~~~~~~~~~l~v~~Gs~~ep~~~~GlAHflEHmlf~GT~~~p~~~~~~~~l~~~Gg~~NA 120 (961)
T PRK15101 41 PRQYQAIRLDNGMTVLLVSDPQAVKSLAALALPVGSLEDPDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSHNA 120 (961)
T ss_pred ccceEEEEeCCCCEEEEEeCCCCcceeEEEEeCcCCCCCCCCCCchHHHHHHHHhcCCccCCCcchHHHHHHHhCCCccc
Confidence 45678889999999999998 57999999999999999999999999999999999999996 67999999999999999
Q ss_pred EecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801 221 TVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG 277 (318)
Q Consensus 221 ~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~ 277 (318)
+|+.|+|.|++++++++++.+|++|++++.+|.|+++++++++..+.+|++....++
T Consensus 121 ~T~~d~T~y~~~~~~~~l~~aL~~~ad~~~~P~f~~~~~erE~~~v~~E~~~~~~~~ 177 (961)
T PRK15101 121 STASYRTAFYLEVENDALPPAVDRLADAIAEPLLDPKNADRERNAVNAELTMARSRD 177 (961)
T ss_pred eECCCceEEEEEcCHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHhcCCH
Confidence 999999999999999999999999999999999999999999999999998665543
No 6
>KOG2067|consensus
Probab=99.84 E-value=3.4e-21 Score=179.26 Aligned_cols=134 Identities=25% Similarity=0.325 Sum_probs=129.0
Q ss_pred CCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEec
Q psy15801 144 PDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVD 223 (318)
Q Consensus 144 ~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~ 223 (318)
.+.++++|+||++|.+.+++++.++++++++.|+++|.+...|++||++.|.|++|.+++..+|...||.+||.++++++
T Consensus 23 ~~~kvttL~NGlkvase~~pg~f~~vGlyIdsGsrYE~~~~~GisH~lerLAF~ST~~~~~~ei~~~LE~~GGn~~cqsS 102 (472)
T KOG2067|consen 23 SNTKVTTLPNGLKVASENTPGQFCTVGLYIDSGSRYEAKYFSGISHFLERLAFKSTERFSSKEILAELEKLGGNCDCQSS 102 (472)
T ss_pred ccceeeecCCccEEeccCCCCCceEEEEEEecCccccCcCcccHHHHHHHHhhccccCCcHHHHHHHHHHhCCccccccc
Confidence 47789999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801 224 REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG 277 (318)
Q Consensus 224 ~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~ 277 (318)
+|++.|.+++.+++++.++.+|+|.+++|.|.++|++..+..+..|+++....+
T Consensus 103 Retm~Yaas~~~~~v~sm~~lLadtV~~P~~~d~ev~~~~~~v~~E~~el~~~P 156 (472)
T KOG2067|consen 103 RETMMYAASADSDGVDSMVELLADTVLNPKFTDQEVEEARRAVKYEIEELWMRP 156 (472)
T ss_pred HhhhHHHHHhhhcccHHHHHHHHHHHhcccccHHHHHHHHHhhhheccccccCc
Confidence 999999999999999999999999999999999999999999999999876544
No 7
>KOG2583|consensus
Probab=99.79 E-value=1e-18 Score=162.85 Aligned_cols=134 Identities=30% Similarity=0.492 Sum_probs=127.3
Q ss_pred CcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecC
Q psy15801 145 DIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDR 224 (318)
Q Consensus 145 ~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~ 224 (318)
......|.|||+|...+++.|+.++.+.|++|||+|+..++|++|+++...++.|+.++...|.+.++.+||.++.+.+|
T Consensus 22 ~~~~~kl~ngL~Vas~e~~~~is~l~l~~~AGSRYe~~~~~G~sHllr~f~g~~Tq~~sal~ivr~se~~GG~Lss~~tR 101 (429)
T KOG2583|consen 22 ISKTTKLVNGLTVASREAPTAISSLSLAFRAGSRYEPADQQGLSHLLRNFVGRDTQERSALKIVRESEQLGGTLSSTATR 101 (429)
T ss_pred hhhhhccccceEEEeccCCCcceEEEEEEecCccCCccccccHHHHHHHhcccCccccchhhhhhhhHhhCceeeeeeec
Confidence 34567899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhh-hhHHHHHhhCCCChh
Q psy15801 225 EGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLT-PRLKYDRLTRPSQGN 278 (318)
Q Consensus 225 d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k-~~i~~ei~~~~~~~~ 278 (318)
|++.|++++++++++-.|.+|++++.+|.|.+||++++. ..|..++....++..
T Consensus 102 e~~~~tvt~lrd~~~~~l~~L~~V~~~paFkPwEl~D~~~~ti~~~l~~~t~~~~ 156 (429)
T KOG2583|consen 102 ELIGLTVTFLRDDLEYYLSLLGDVLDAPAFKPWELEDVVLATIDADLAYQTPYTI 156 (429)
T ss_pred ceEEEEEEEecccHHHHHHHHHHhhcccCcCchhhhhhhhhhhHHHhhhcChHHH
Confidence 999999999999999999999999999999999999998 888888888777554
No 8
>KOG2067|consensus
Probab=99.73 E-value=1.1e-17 Score=156.08 Aligned_cols=131 Identities=24% Similarity=0.360 Sum_probs=109.1
Q ss_pred CchhHHHHHHHHHHhcCC-CCCCCCCCCCccccc----cc-------CCCCCCccccccccCc---------hHHHHHHH
Q psy15801 2 SAKEALAFAVLKYALGAG-PAKTGKNQRSTNSKG----LS-------RTPGRESLDESHHFSP---------PQALQASI 60 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~g-~~~k~~~~~~~l~~~----~~-------~~~~~~~~~~~a~~~~---------~~~~~~~l 60 (318)
+++|+++++|||.+|||| +||.+|+|.|++++. +. |....+.|++++++.. .++++-+.
T Consensus 277 ~deD~v~~avLq~lmGGGGSFSAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfgi~~s~~P~~a~~aveli~ 356 (472)
T KOG2067|consen 277 NDEDFVALAVLQMLMGGGGSFSAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFGIYASAPPQAANDAVELIA 356 (472)
T ss_pred CChhHHHHHHHHHHhcCCcccCCCCCCcchHHHHHHHHHhhhHHHHHhhhhhccccCCceeEEeccCCHHHHHHHHHHHH
Confidence 578999999999999975 589999998877652 22 3344566888888822 34555556
Q ss_pred HHHH--cCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccc--ccccc
Q psy15801 61 DVVR--GAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVS--AKRSF 132 (318)
Q Consensus 61 ~~l~--~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~--~~r~~ 132 (318)
.+|. .++++.+||+|||+|+++.++|+|||.--..|++|||.+.+|.+..+.+++..|++||++||. +.+.+
T Consensus 357 ~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~g~rk~p~e~~~~Ie~lt~~DI~rva~kvl 432 (472)
T KOG2067|consen 357 KEMINMAGGVTQEELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTTGERKPPDEFIKKIEQLTPSDISRVASKVL 432 (472)
T ss_pred HHHHHHhCCCCHHHHHHHHHHHHHHHHhcccccchhHHHHhHHHHhccCcCCHHHHHHHHHhcCHHHHHHHHHHHh
Confidence 6666 367999999999999999999999999999999999999999999999999999999999998 34544
No 9
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=99.72 E-value=2.4e-17 Score=168.59 Aligned_cols=133 Identities=12% Similarity=0.134 Sum_probs=125.6
Q ss_pred CCCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCH-HHHHHHHHHhCCeEe
Q psy15801 142 FSPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTH-FGIIRHVQQAGGSLY 219 (318)
Q Consensus 142 ~~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~-~~i~~~l~~lG~~l~ 219 (318)
....++..+|||||+++++++ ..+...+.|.|++|+..||.+.+|+|||++||+|-||++++. ..+...|..+||+.|
T Consensus 20 d~r~y~~I~LpNGl~~LlisDP~a~ks~aAL~V~vGs~~DP~e~~GLAHflEHmlfmGseKYP~~~~f~~fLskhgGs~N 99 (937)
T COG1025 20 DDRKYRAIKLPNGLRALLVSDPQADKSSAALVVPVGSFDDPEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSHN 99 (937)
T ss_pred cCcceeEEECCCCceEEEecCCCCCccceeEEeecCCCCChhhcccHHHHHHHHHHhcCccCCCccchHHHHHHcCCccc
Confidence 346788999999999999998 578999999999999999999999999999999999999975 599999999999999
Q ss_pred EEecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801 220 ATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP 274 (318)
Q Consensus 220 ~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~ 274 (318)
++|..+.|.|++++..++|+.+|+.|++++.+|.|+++.+++|+..+.+|.....
T Consensus 100 A~T~~~~T~fyFeV~~~al~~ALDrFa~ff~~PLf~~e~~dRE~~AV~sE~~~~~ 154 (937)
T COG1025 100 ASTAGERTAFYFEVENDALEGALDRFADFFIEPLFNKEALDRERNAVNSEFTMNL 154 (937)
T ss_pred cccCCCceeEEEEecHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHHHHhcCc
Confidence 9999999999999999999999999999999999999999999999999987654
No 10
>KOG0959|consensus
Probab=99.66 E-value=4.8e-16 Score=161.22 Aligned_cols=137 Identities=12% Similarity=0.157 Sum_probs=125.1
Q ss_pred CCCCcEEEEcCCCcEEEEEecC-CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHHHhCCeEe
Q psy15801 142 FSPDIQSGTSYNKVKVVSVNET-SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQQAGGSLY 219 (318)
Q Consensus 142 ~~~~~~~~~L~NGlkv~~~~~~-~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~~lG~~l~ 219 (318)
....+...+|+|||++++++++ ....++.+-+.+||..||.+.+|||||++||+|.||++++.+ ++...+.+.||+-|
T Consensus 24 d~r~yr~~~L~Ngl~alLisDp~tD~ssaal~V~vGS~~DP~dl~GLAHF~EHMlFmGS~KYP~En~y~~~lsk~gGssN 103 (974)
T KOG0959|consen 24 DTREYRGIELTNGLRALLISDPKTDKSSAALDVKVGSFSDPEDLQGLAHFCEHMLFMGSEKYPDENEYSKFLSKNGGSSN 103 (974)
T ss_pred CccceeEEEecCCceEEEecCCCCCccceeeeeeccccCCccccccHHHHHHHHHhhccccCCCcchhHHHHHhcCCccc
Confidence 3456788999999999999884 567889999999999999999999999999999999999754 78888889999999
Q ss_pred EEecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCC-Chh
Q psy15801 220 ATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPS-QGN 278 (318)
Q Consensus 220 ~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~-~~~ 278 (318)
+.|..++|.|++.+..++|+.||+.|++++.+|.|.++.++|++..+..|.+.... +.|
T Consensus 104 A~T~~e~T~y~F~V~~~~l~~ALDrFaqFf~~Plf~~~a~eREv~AVdSE~~~nl~~D~w 163 (974)
T KOG0959|consen 104 AYTDSEHTNYYFDVQHDHLEGALDRFAQFFSDPLFNKSATEREVGAVDSEHEKNLNSDGW 163 (974)
T ss_pred cccccccceEEEecchHHHHHHHHHHHHHhhCcccChHHHHHHHHHHHHHHHhccCcchh
Confidence 99999999999999999999999999999999999999999999999999877553 443
No 11
>KOG0960|consensus
Probab=99.60 E-value=1.5e-15 Score=141.68 Aligned_cols=126 Identities=20% Similarity=0.263 Sum_probs=102.6
Q ss_pred CchhHHHHHHHHHHhcCCCCCC--CCCCCCcccccccCCCCC-------CccccccccCc----------hHHHHHH---
Q psy15801 2 SAKEALAFAVLKYALGAGPAKT--GKNQRSTNSKGLSRTPGR-------ESLDESHHFSP----------PQALQAS--- 59 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~g~~~k--~~~~~~~l~~~~~~~~~~-------~~~~~~a~~~~----------~~~~~~~--- 59 (318)
+|||+++++|.+.|+|+|.+.. +.+.+++|++.+....++ ..|.++|+... ++.++.+
T Consensus 283 ~~pD~~~l~van~iiG~wdr~~g~g~~~~s~La~~~~~~~l~~sfqsFnt~YkDTGLwG~y~V~~~~~~iddl~~~vl~e 362 (467)
T KOG0960|consen 283 AHPDYFALMVANTIIGNWDRTEGGGRNLSSRLAQKIQQDQLCHSFQSFNTSYKDTGLWGIYFVTDNLTMIDDLIHSVLKE 362 (467)
T ss_pred CCccHHHHHHHHHHhhhhhcccCCccCCccHHHHHHHHHHHHHHHhhhhcccccccceeEEEEecChhhHHHHHHHHHHH
Confidence 5899999999999999998653 334488888877654433 22444444421 4444444
Q ss_pred HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCcccccc
Q psy15801 60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSA 128 (318)
Q Consensus 60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~ 128 (318)
|.+|.. .+|+.||+|||++|+.+++..||+..-+|++||+|++++|++.++.++.++|++||+.||+.
T Consensus 363 W~rL~~-~vteaEV~RAKn~Lkt~Lll~ldgttpi~ediGrqlL~~Grri~l~El~~rId~vt~~~Vr~ 430 (467)
T KOG0960|consen 363 WMRLAT-SVTEAEVERAKNQLKTNLLLSLDGTTPIAEDIGRQLLTYGRRIPLAELEARIDAVTAKDVRE 430 (467)
T ss_pred HHHHHh-hccHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhcCCcCChHHHHHHHhhccHHHHHH
Confidence 666665 79999999999999999999999999999999999999999999999999999999999983
No 12
>PRK15101 protease3; Provisional
Probab=99.57 E-value=6.1e-14 Score=149.92 Aligned_cols=213 Identities=9% Similarity=0.041 Sum_probs=158.2
Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHh-cccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccccc-----
Q psy15801 56 LQASIDVVRGAGIKDADIARGKALLKRHLADS-FETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSAK----- 129 (318)
Q Consensus 56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~-les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~~----- 129 (318)
+...|+.|+..+++++|++++|+.+..++... .......+..++..+ ....+.........++.+++++|+..
T Consensus 372 i~~~i~~l~~~g~~~~el~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~i~~~~~~l~ 450 (961)
T PRK15101 372 IFSYLNLLREKGIDKSYFDELAHVLDLDFRYPSITRDMDYIEWLADTM-LRVPVEHTLDAPYIADRYDPKAIKARLAEMT 450 (961)
T ss_pred HHHHHHHHHhcCCcHHHHHHHHHHHhccccCCCCCChHHHHHHHHHHh-hhCCHHHheeCchhhhcCCHHHHHHHHhhcC
Confidence 33447777788999999999999999887443 223333445555432 22222223333455666777777620
Q ss_pred -------------------cccccc------------------cc-----CcC-------C-------CCCCcEEEEcCC
Q psy15801 130 -------------------RSFAAQ------------------PA-----TKA-------L-------FSPDIQSGTSYN 153 (318)
Q Consensus 130 -------------------r~~~~~------------------~~-----p~~-------~-------~~~~~~~~~L~N 153 (318)
.+|..+ .. |.. + ....++...++|
T Consensus 451 ~~n~~i~~~~~~~~~~~~~~~~~~~Y~~~~i~~~~~~~~~~~~~~~~l~lP~~n~fip~~~~~~~~~~~~~~p~~i~~~~ 530 (961)
T PRK15101 451 PQNARIWYISPQEPHNKTAYFVDAPYQVDKISEQTFADWQQKAQNIALSLPELNPYIPDDFSLIKADKAYKHPELIVDEP 530 (961)
T ss_pred HhHEEEEEEeCCCCCCccccccCCcceeecCCHHHHHHHhcCCCCccCCCCCCCCccCCCCeeccCCCCCCCCeEEEcCC
Confidence 011110 00 110 0 012367788899
Q ss_pred CcEEEEEecC----CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEE
Q psy15801 154 KVKVVSVNET----SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVY 229 (318)
Q Consensus 154 Glkv~~~~~~----~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~ 229 (318)
|++|++.+++ .|.+.+.+.|.+|...+++...|++.|+..|+ ..+..++....+..|.+++.. +.+.+.+
T Consensus 531 g~~vw~~~d~~f~~~Pk~~i~~~~~~~~~~~~~~~~~l~~L~~~ll-----~~~l~e~~y~a~~aG~~~~~~-~~~g~~i 604 (961)
T PRK15101 531 GLRVVYMPSQYFADEPKADISLVLRNPKAMDSARNQVLFALNDYLA-----GLALDQLSNQASVGGISFSTN-ANNGLMV 604 (961)
T ss_pred CeEEEEeCCCccccCCCEEEEEEEeCCCccCCHHHHHHHHHHHHHH-----HHHHHHHhchHHhcCcEEEEc-cCCCEEE
Confidence 9999999874 59999999999999999999999999999997 345567788888899999999 7999999
Q ss_pred EEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCC
Q psy15801 230 TIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPS 275 (318)
Q Consensus 230 ~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~ 275 (318)
++++++++++.+|+++.+++.+|.|++++|+++|..++.+++....
T Consensus 605 ~v~g~s~~l~~ll~~l~d~l~~~~~~~~~fe~~k~~~~~~l~~~~~ 650 (961)
T PRK15101 605 NANGYTQRLPQLLQALLEGYFSFTPTEEQLAQAKSWYREQLDSAEK 650 (961)
T ss_pred EEEecChhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhhhcc
Confidence 9999999999999999999999999999999999999999987553
No 13
>PTZ00432 falcilysin; Provisional
Probab=99.42 E-value=7.9e-13 Score=142.56 Aligned_cols=111 Identities=11% Similarity=0.097 Sum_probs=98.5
Q ss_pred CcEEEEcCCCcEEEEEecCC---CEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhC--CeEe
Q psy15801 145 DIQSGTSYNKVKVVSVNETS---PIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAG--GSLY 219 (318)
Q Consensus 145 ~~~~~~L~NGlkv~~~~~~~---p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG--~~l~ 219 (318)
.+...-.+||++|++++++. +.+.++++|++|+..+ .|++|+++||+|.||.+++..++...++..| +.+|
T Consensus 91 ~~~~~H~~nGl~vl~~~~~d~~~~~~~f~i~f~T~~~d~----~G~aH~LEH~~f~GS~k~p~~~~~~~l~~~gl~~~lN 166 (1119)
T PTZ00432 91 ATVYSHKKTGLQVISLKTNDSSGKEMCFDFYVPTPPHND----KGIPHILEHSVLSGSKKYNYKDSFSLLVQGGFNSFLN 166 (1119)
T ss_pred EEEEEEcCCCCEEEEEecCCCccceeEEEEEecCCCCCC----cchhHHHHHHHhCCCCCCCcccHHHHHHhcCcCCCcc
Confidence 34555678999999988643 5899999999998554 5999999999999999999999999998865 8899
Q ss_pred EEecCceEEEEEEecCC-cHHHHHHHHHHhhccCCCCHHHH
Q psy15801 220 ATVDREGIVYTIEAVRK-EIGHVHKFLASVVGKTEFRPWEV 259 (318)
Q Consensus 220 ~~~~~d~~~~~~~~l~~-~l~~~l~ll~d~l~~P~f~e~e~ 259 (318)
+.|+.|+|.|.+.++.+ ++..+|++++|++.+|.|+++++
T Consensus 167 A~T~~D~T~Y~~~~~~e~d~~~~ldv~~d~v~~P~~~~~~~ 207 (1119)
T PTZ00432 167 AYTFKDRTSYLFASTNEKDFYNTADVYMDSVFQPNILEDKD 207 (1119)
T ss_pred ccCCCCceEEEeccCCHHHHHHHHHHHHHHHhCcCcccccc
Confidence 99999999999999885 79999999999999999999864
No 14
>PTZ00432 falcilysin; Provisional
Probab=99.26 E-value=3.3e-10 Score=122.43 Aligned_cols=120 Identities=10% Similarity=0.049 Sum_probs=91.8
Q ss_pred CCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEE----ec---
Q psy15801 152 YNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYAT----VD--- 223 (318)
Q Consensus 152 ~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~----~~--- 223 (318)
.+|+.|+..+- .+.++.+.++|+.....+ ...+=+.-|...+..-||.+++..++...++..-|.++++ ++
T Consensus 666 ~~~~~~~~~~~~TnGi~y~~~~fdl~~l~~-e~~~yl~L~~~~l~~~gT~~~s~~el~~~i~~~tGg~~~~~~~~~~~~~ 744 (1119)
T PTZ00432 666 GGSVTVLVHPIESRGILYLDFAFSLDSLTV-DELKYLNLFKALLKENGTDKLSSEEFTYKREKNLGGLSASTAFYSETNN 744 (1119)
T ss_pred CCCcceEEEecCCCCeEEEEEEecCCCCCH-HHHhhHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCCeEEEEEEeccccc
Confidence 57899988876 577999999999986543 2334444444444456999999999999999987777654 22
Q ss_pred ---------CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH-HhhhhhhHHHHHhh
Q psy15801 224 ---------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE-VSDLTPRLKYDRLT 272 (318)
Q Consensus 224 ---------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e-~~~~k~~i~~ei~~ 272 (318)
...+.+++.++.++++.+++|+.+++.++.|++.+ +..+..+.++++.+
T Consensus 745 ~~~~~~~~~~~~~~v~~k~l~~~~~~~~~l~~eil~~~~f~d~~rl~~il~~~~~~~~~ 803 (1119)
T PTZ00432 745 LTYDDPYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKT 803 (1119)
T ss_pred cccCcccccceEEEEEEEEhhhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHH
Confidence 34689999999999999999999999999998766 65555555555544
No 15
>KOG2019|consensus
Probab=98.77 E-value=1.4e-06 Score=87.25 Aligned_cols=259 Identities=14% Similarity=0.140 Sum_probs=171.8
Q ss_pred hhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCcccccc-ccC--------------c-------hHHHHHHHH
Q psy15801 4 KEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESH-HFS--------------P-------PQALQASID 61 (318)
Q Consensus 4 ~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a-~~~--------------~-------~~~~~~~l~ 61 (318)
=+.+||.||..+|-+|+ +||+|+++.+.+....|++.. +.. + ..++..++.
T Consensus 328 ~etfaL~~L~~Ll~~gp-------sSp~yk~LiESGLGtEfsvnsG~~~~t~~~~fsVGLqGvseediekve~lV~~t~~ 400 (998)
T KOG2019|consen 328 YETFALKVLSHLLLDGP-------SSPFYKALIESGLGTEFSVNSGYEDTTLQPQFSVGLQGVSEEDIEKVEELVMNTFN 400 (998)
T ss_pred HHHHHHHHHHHHhcCCC-------ccHHHHHHHHcCCCcccccCCCCCcccccceeeeeeccccHHHHHHHHHHHHHHHH
Confidence 36789999999999999 899999998776553233222 221 1 335555677
Q ss_pred HHHcCCCCHHHHHHHHHHHHHHHHHhcccc-hhHHHHHHHHHHHhCCCCChHHHHHHhhcc---Ccc---cc-c------
Q psy15801 62 VVRGAGIKDADIARGKALLKRHLADSFETL-DSTVDSVVRQAVTTGVVKSLPDLLAEVEAV---SSS---DV-S------ 127 (318)
Q Consensus 62 ~l~~~~~t~~EL~raK~~lk~~~~~~les~-~~~a~~lg~~~~~~g~~~~~~~~~~~I~~V---T~e---dV-~------ 127 (318)
.|...+++.+-++....+++.++--..-+- -+++..+...|+..++|.....+-+.|..+ -.+ .| +
T Consensus 401 ~lae~gfd~drieAil~qiEislk~qst~fGL~L~~~i~~~W~~d~DPfE~Lk~~~~L~~lk~~l~ek~~~lfq~lIkkY 480 (998)
T KOG2019|consen 401 KLAETGFDNDRIEAILHQIEISLKHQSTGFGLSLMQSIISKWINDMDPFEPLKFEEQLKKLKQRLAEKSKKLFQPLIKKY 480 (998)
T ss_pred HHHHhccchHHHHHHHHHhhhhhhccccchhHHHHHHHhhhhccCCCccchhhhhhHHHHHHHHHhhhchhHHHHHHHHH
Confidence 787778999999999999887665443221 224666777787777765544444443332 111 00 0
Q ss_pred --ccc-cc-----ccc----------------------------------------ccCcC--------C-C-CCCc---
Q psy15801 128 --AKR-SF-----AAQ----------------------------------------PATKA--------L-F-SPDI--- 146 (318)
Q Consensus 128 --~~r-~~-----~~~----------------------------------------~~p~~--------~-~-~~~~--- 146 (318)
.+. ++ +++ ..|.. + . |+.+
T Consensus 481 ilnn~h~~t~smqpd~e~~~~~~~eE~tkL~ek~~alteeD~~ei~k~~~eL~~kQ~tp~dlsClPtL~vsDIp~~~~~~ 560 (998)
T KOG2019|consen 481 ILNNPHCFTFSMQPDPEFAEKLEQEEATKLEEKKAALTEEDLAEIAKAGEELREKQSTPEDLSCLPTLNVSDIPKTIPYT 560 (998)
T ss_pred HhcCCceEEEEecCCchhhHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHhhCCccccccccccccccCCCCCCcc
Confidence 011 11 100 01221 0 0 1111
Q ss_pred E-EEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEe--
Q psy15801 147 Q-SGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATV-- 222 (318)
Q Consensus 147 ~-~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~-- 222 (318)
. ....-||++|..... .+.++.+.++++.|+.-+. -.+=+.-|...++..||...+..|+++.+...-|.++++.
T Consensus 561 ~~~v~dingvkv~~~dl~tngi~Y~r~~~~l~~~p~e-L~PylPlfc~sll~lGt~~lsf~el~qqI~rkTGGiS~~p~~ 639 (998)
T KOG2019|consen 561 KLEVGDINGVKVQRCDLFTNGITYTRVVFDLNSLPEE-LLPYLPLFCQSLLNLGTGDLSFVELEQQIGRKTGGISVSPLV 639 (998)
T ss_pred ceeeeeccCceeEEeeccCCceEEEEEeeccccCcHH-hhcchHHHHHHHHhcCCCcccHHHHHHHhhhhcCceeeccee
Confidence 1 223459999988877 7889999999999996543 3567888889999999999999999999999877776442
Q ss_pred --cC------ceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH-HhhhhhhHHHHH
Q psy15801 223 --DR------EGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE-VSDLTPRLKYDR 270 (318)
Q Consensus 223 --~~------d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e-~~~~k~~i~~ei 270 (318)
+. -.+.|+..++..+.+..++|...++.++.|.+.+ |.....+..+++
T Consensus 640 ~s~~~~d~p~~~i~~~~~~l~rn~~dlfel~n~il~e~~f~n~dkfkvlvk~s~s~~ 696 (998)
T KOG2019|consen 640 SSDDGMDEPELGIVFSGSMLDRNADDLFELWNKILQETCFTNQDKFKVLVKQSASRM 696 (998)
T ss_pred ccCCCCCccceeEEechhhhcCChhHHHHHHHHHhcccCcccHHHHHHHHHHHHHHh
Confidence 11 2256677788899999999999999999999654 444444444443
No 16
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=98.74 E-value=1.4e-06 Score=90.80 Aligned_cols=262 Identities=14% Similarity=0.144 Sum_probs=177.4
Q ss_pred CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCccccccccC---------------c-------hHHHHHH
Q psy15801 2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESHHFS---------------P-------PQALQAS 59 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a~~~---------------~-------~~~~~~~ 59 (318)
|.-|..|+.||..+|-++. ++|+++++.+.+.. ...++++.. + .+.+-.+
T Consensus 291 d~~~~lal~vL~~iLl~~~-------asPl~~~liesglg-~~~~~g~~~~~~~~~~f~v~~~gv~~ek~~~~k~lV~~~ 362 (978)
T COG1026 291 DAEDSLALEVLEEILLDSA-------ASPLTQALIESGLG-FADVSGSYDSDLKETIFSVGLKGVSEEKIAKLKNLVLST 362 (978)
T ss_pred cHHHHHHHHHHHHHHccCc-------ccHHHHHHHHcCCC-cccccceeccccceeEEEEEecCCCHHHHHHHHHHHHHH
Confidence 3457899999999999888 78999999877654 122333221 1 4456666
Q ss_pred HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchh--HHHHHHHHHHHhCCCCChHH---HHHHhhc-cCccc-ccc----
Q psy15801 60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDS--TVDSVVRQAVTTGVVKSLPD---LLAEVEA-VSSSD-VSA---- 128 (318)
Q Consensus 60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~--~a~~lg~~~~~~g~~~~~~~---~~~~I~~-VT~ed-V~~---- 128 (318)
++.+...+++.+.|+.+..+++.+.-..-...-+ +......+++..|+|..-.. ++++|.. +...+ ...
T Consensus 363 L~~l~~~gi~~~~ie~~~~q~E~s~ke~~s~pfgl~l~~~~~~gw~~G~dp~~~Lr~~~~~~~Lr~~le~~~~fe~LI~k 442 (978)
T COG1026 363 LKELVKNGIDKKLIEAILHQLEFSLKEVKSYPFGLGLMFRSLYGWLNGGDPEDSLRFLDYLQNLREKLEKGPYFEKLIRK 442 (978)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHhhhhhcCCCccHHHHHHhccccccCCChhhhhhhHHHHHHHHHhhhcChHHHHHHHH
Confidence 8888888899999999999999888776444444 45556667777777655444 4444433 22222 110
Q ss_pred ----cccc------cc----------------------------------------cccCcC-----------C-C-C--
Q psy15801 129 ----KRSF------AA----------------------------------------QPATKA-----------L-F-S-- 143 (318)
Q Consensus 129 ----~r~~------~~----------------------------------------~~~p~~-----------~-~-~-- 143 (318)
+.++ ++ +..|.. . . |
T Consensus 443 y~l~N~h~~~v~~~Ps~~~~~~~ekee~e~L~~~~~~l~de~~~ki~~~~~~lke~Q~~~dse~~~~~lP~l~~~dvp~~ 522 (978)
T COG1026 443 YFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDP 522 (978)
T ss_pred HhhcCCccEEEEEecChHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHhhcCCCchhhhhhccccchhcCCCc
Confidence 1111 00 012211 0 0 1
Q ss_pred -CCc-EEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeE
Q psy15801 144 -PDI-QSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYA 220 (318)
Q Consensus 144 -~~~-~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~ 220 (318)
+.. -...-.|..+|+..+. ++.++.+.++|+.. .-.....+=+.-|...|...||.+++..++..+++..-|.+++
T Consensus 523 ~~k~~l~~~~~~~~~v~~~~~~tn~i~yl~~~~~~~-~l~~~llpyL~L~~~~l~~lgt~~~~y~e~~~~i~~~TGgis~ 601 (978)
T COG1026 523 IEKTSLETEVSNEAKVLHHDLFTNGITYLRLYFDLD-MLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGISV 601 (978)
T ss_pred ccccceeeeccCCcceEEeecCCCCeEEEEEEeecC-CCChhhhhhHHHHHHHHHhcCCCCcCHHHHHHHHHHHhCCcee
Confidence 122 2223347888888877 78999999999994 4444566778878888888999999999999999998777664
Q ss_pred Eec-----------CceEEEEEEecCCcHHHHHHHHHHhhccCCCC-HHHHhhhhhhHHHHHhh
Q psy15801 221 TVD-----------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFR-PWEVSDLTPRLKYDRLT 272 (318)
Q Consensus 221 ~~~-----------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~-e~e~~~~k~~i~~ei~~ 272 (318)
..+ +..+.|++.++.+..+.+++++.+++.++.|+ .+-+..+..+.++.+..
T Consensus 602 ~~~~~~~~~~~~~~~~~~~i~~K~l~~k~~~~~~~i~~~l~~~~F~D~~Rlkell~q~~~~l~~ 665 (978)
T COG1026 602 SLSVDTDPGDDGEYRPSFSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTS 665 (978)
T ss_pred eEeeccCCCccccccceEEEEEEehhhhhhHHHHHHHHHHhcCCcCcHHHHHHHHHHHHhhhHH
Confidence 432 25678999999999999999999999999995 44444444444444444
No 17
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=98.41 E-value=1.8e-05 Score=72.07 Aligned_cols=167 Identities=13% Similarity=0.180 Sum_probs=105.7
Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccccccccccc
Q psy15801 56 LQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSAKRSFAAQ 135 (318)
Q Consensus 56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~~r~~~~~ 135 (318)
..+-++++++ .+|++|+++...+.+.-..++- . ++-++-|-.++.+||...-.
T Consensus 15 e~~~L~~~k~-~Ls~~e~~~i~~~~~~L~~~Q~-~---------------------~d~~~~LP~L~~~Di~~~~~---- 67 (248)
T PF08367_consen 15 EKEKLAAYKA-SLSEEEKEKIIEQTKELKERQE-A---------------------EDDLATLPTLSLSDIPREIE---- 67 (248)
T ss_dssp HHHHHHHHHH-CS-HHHHHHHHHHHHHHHHHHC-S-----------------------HHTTS----GGGS-SS------
T ss_pred HHHHHHHHHh-hCCHHHHHHHHHHHHHHHHHhC-C---------------------hhhHHHHccccHHhcCCCCC----
Confidence 3344666655 5888888888777554222211 1 11334566777888763211
Q ss_pred ccCcCCCCCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHh
Q psy15801 136 PATKALFSPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQA 214 (318)
Q Consensus 136 ~~p~~~~~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~l 214 (318)
...+.... .+|++|++.+. .+.++.+.++|+.+.... ...+=+.-|..-|-.-||.+++..++...+..+
T Consensus 68 -------~~~~~~~~-~~~~~v~~~~~~TnGI~Y~~l~fdl~~l~~-e~l~yl~Ll~~ll~~lgT~~~sy~el~~~i~~~ 138 (248)
T PF08367_consen 68 -------KIPLEVEK-LGGIPVLFHEQPTNGIVYVRLYFDLSDLPE-EDLPYLPLLTDLLGELGTKNYSYEELSNEIDLY 138 (248)
T ss_dssp -----------EECC-CTTCEEEEEE---TTEEEEEEEEE-TTS-C-CCHCCHHHHHHHCCCS-BSSS-HHHHHHHHHHH
T ss_pred -------CCCceeee-cCCccEEEEEcCCCCeEEEEEEecCCCCCH-HHHHhHHHHHHHHHhCCCCCCCHHHHHHHHHHh
Confidence 11223333 37999998877 788999999999985443 345667777665556799999999999999998
Q ss_pred CCeEeEEec-----------CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH
Q psy15801 215 GGSLYATVD-----------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE 258 (318)
Q Consensus 215 G~~l~~~~~-----------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e 258 (318)
=|.++++.. .-++.|++.|+.++++++++|+.+++.+|.|++.+
T Consensus 139 tGGis~~~~~~~~~~~~~~~~~~l~is~k~L~~~~~~~~~ll~eil~~~~f~d~~ 193 (248)
T PF08367_consen 139 TGGISFSIEVYTDYDDDDKYRPYLVISAKCLDEKLDEAFELLSEILTETDFDDKE 193 (248)
T ss_dssp SSEEEEEEEEEEEECTECCCEEEEEEEEEEEGGGHHHHHHHHHHHHHCB-TT-HH
T ss_pred CCCeEEEeeeccCCCCccceeEEEEEEEEeHhhhHHHHHHHHHHHHhccCCCcHH
Confidence 777765541 14567888999999999999999999999998875
No 18
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=98.31 E-value=1.8e-06 Score=84.73 Aligned_cols=121 Identities=24% Similarity=0.342 Sum_probs=90.2
Q ss_pred hHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCC-C-------Ccccccccc------C---c---hHHHHHHHHHHH
Q psy15801 5 EALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPG-R-------ESLDESHHF------S---P---PQALQASIDVVR 64 (318)
Q Consensus 5 d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~-~-------~~~~~~a~~------~---~---~~~~~~~l~~l~ 64 (318)
|++++.|+..+||++. .|+|.+.+.+... + ..+..++++ . + ...+.+++..++
T Consensus 273 ~~~~~~l~~~llgg~~-------~SrLf~~~re~~glay~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 345 (438)
T COG0612 273 DYAALLLLNGLLGGGF-------SSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKTAELVEEILKALK 345 (438)
T ss_pred hhHHHHHHHHHhCCCc-------chHHHHHHHHhcCceeeeccccccccccCCceEEEEecCCChhhHHHHHHHHHHHHH
Confidence 7899999999999885 5566655433221 1 111111111 1 1 335556666666
Q ss_pred cC---CCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCcccccc--cccc
Q psy15801 65 GA---GIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSA--KRSF 132 (318)
Q Consensus 65 ~~---~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~--~r~~ 132 (318)
.. .+|+++++++|..+...++...++...+++.++......+.+.++.++...|++||++||.+ ++++
T Consensus 346 ~~~~~~~t~~~~~~~k~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vt~~dv~~~a~~~~ 418 (438)
T COG0612 346 KGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLITLEELLERIEAVTLEDVNAVAKKLL 418 (438)
T ss_pred HHhccCCCHHHHHHHHHHHHHHhhhccCCHHHHHHHHHHHHHhcCCccCHHHHHHHHHhcCHHHHHHHHHHhc
Confidence 54 39999999999999999999999999999999988877789999999999999999999983 4444
No 19
>KOG2583|consensus
Probab=98.09 E-value=6.9e-06 Score=77.74 Aligned_cols=121 Identities=21% Similarity=0.194 Sum_probs=82.1
Q ss_pred CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCC--------CCCccccccccCc---------hHHHHHHHHHHH
Q psy15801 2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTP--------GRESLDESHHFSP---------PQALQASIDVVR 64 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~--------~~~~~~~~a~~~~---------~~~~~~~l~~l~ 64 (318)
|+++..++.|+.++||.++.+|.+ .|.+.++..+.. ...+|++++++.- ++++..++..++
T Consensus 264 ~~k~~~a~av~~~~Lg~~~~~k~~--t~~~~~aa~~a~~~~~s~sA~~a~ysDsGL~gv~~~~~~~~a~~~v~s~v~~lk 341 (429)
T KOG2583|consen 264 NLKVLAAQAVLLAALGNSAPVKRG--TGLLSEAAGAAGEQGASASAFNAPYSDSGLFGVYVSAQGSQAGKVVSSEVKKLK 341 (429)
T ss_pred chHHHHHHHHHHHHHhcccccccc--cchHHHHHhhccccCceeeeecccccCCceEEEEEEecCccHHHHHHHHHHHHH
Confidence 689999999999999999988888 444554332221 2345888888732 677888888888
Q ss_pred cCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccc
Q psy15801 65 GAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVS 127 (318)
Q Consensus 65 ~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~ 127 (318)
+..+++-|..+.++-.+.-....+.+..++-...+.+. .. +..+++++++|++||+.||.
T Consensus 342 s~~~~~id~~~~~a~~~~l~~~~~ss~~a~~~~~~~~a--~~-~~~~d~~i~~id~Vt~sdV~ 401 (429)
T KOG2583|consen 342 SALVSDIDNAKVKAAIKALKASYLSSVEALELATGSQA--NL-VSEPDAFIQQIDKVTASDVQ 401 (429)
T ss_pred HHHhcCCcchHHHHHHHHHHHHhhcchHHHHHhhHHHh--cC-CCChHHHHHHhccccHHHHH
Confidence 65555555555555555544444444444444444333 22 23889999999999999998
No 20
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=97.39 E-value=0.00044 Score=72.64 Aligned_cols=109 Identities=13% Similarity=0.138 Sum_probs=88.9
Q ss_pred EcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHH--HHhCCeEeEEecCceE
Q psy15801 150 TSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHV--QQAGGSLYATVDREGI 227 (318)
Q Consensus 150 ~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l--~~lG~~l~~~~~~d~~ 227 (318)
.-+.|++++.+.+..|-...+++|+.-+..+ .|++|.++|+.+.|+.+++-.+.--.+ ..++-.+|+.|+-|.|
T Consensus 26 H~~TGa~l~hi~~~d~~~vFsi~F~T~p~ds----tGVaHiLEHtvlcGS~kYPvkdPF~~ml~rSLntF~NA~T~~D~T 101 (978)
T COG1026 26 HEKTGAELAHIKNEDPNNVFSIAFKTEPHDS----TGVAHILEHTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKT 101 (978)
T ss_pred eccCCceEEEecCCCcCceEEEEeecCCCCC----CCcchHHHHHhhhCCCCCCCCChHHHHHHHhHHHHHhhccCCCcc
Confidence 3468999999988777777888888865554 499999999999999999865443222 2345558999999999
Q ss_pred EEEEEec-CCcHHHHHHHHHHhhccCCCCHHHHhhh
Q psy15801 228 VYTIEAV-RKEIGHVHKFLASVVGKTEFRPWEVSDL 262 (318)
Q Consensus 228 ~~~~~~l-~~~l~~~l~ll~d~l~~P~f~e~e~~~~ 262 (318)
.|-++.. .+++-..|.+..|.+.+|...++.|-++
T Consensus 102 ~YP~sS~~~~Df~NLl~VYlDavf~PlL~~e~F~QE 137 (978)
T COG1026 102 VYPASSANEKDFYNLLSVYLDAVFHPLLTKESFLQE 137 (978)
T ss_pred eeeccccCcchHHHHHHHHHHhhhCcccchHHHhhh
Confidence 9999755 5689999999999999999999988776
No 21
>KOG0961|consensus
Probab=96.92 E-value=0.0032 Score=63.79 Aligned_cols=107 Identities=10% Similarity=0.140 Sum_probs=89.3
Q ss_pred CCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHH-hCCeEeEEecCceEEEEE
Q psy15801 153 NKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQ-AGGSLYATVDREGIVYTI 231 (318)
Q Consensus 153 NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~-lG~~l~~~~~~d~~~~~~ 231 (318)
-|++|+....+.+++.-.+.|..-...+. |+.|-++||.|-|+++++..-+-+.+.. .=|++|+-++-|+|.|++
T Consensus 28 Tkl~va~~~~pts~vhG~f~v~TEa~~d~----G~PHTLEHL~FMGSKkYP~kGvLd~~anr~l~dtNAwTDtD~T~YtL 103 (1022)
T KOG0961|consen 28 TKLRVAIGEVPTSMVHGAFSVVTEADSDD----GLPHTLEHLVFMGSKKYPFKGVLDVIANRCLADTNAWTDTDHTAYTL 103 (1022)
T ss_pred cceEEEEeecCCcceeeeEEeeeeecCCC----CCchhHHHHhhhccccCCcccHHHHhhcchhcccccccccCcceEEe
Confidence 38889888887778888888887666655 9999999999999999998766555443 335699999999999999
Q ss_pred Eec-CCcHHHHHHHHHHhhccCCCCHHHHhhhh
Q psy15801 232 EAV-RKEIGHVHKFLASVVGKTEFRPWEVSDLT 263 (318)
Q Consensus 232 ~~l-~~~l~~~l~ll~d~l~~P~f~e~e~~~~k 263 (318)
+.. .+.|-+.|..+.|-++.|...++.|-.+.
T Consensus 104 Stag~dGFlklLPvy~dHiL~P~Ltdeaf~TEV 136 (1022)
T KOG0961|consen 104 STAGSDGFLKLLPVYIDHILTPMLTDEAFATEV 136 (1022)
T ss_pred ecccccchHHHhHHHHHhhcCcccchhhhhhhe
Confidence 855 56799999999999999999999988763
No 22
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=96.57 E-value=0.0028 Score=53.03 Aligned_cols=71 Identities=25% Similarity=0.382 Sum_probs=46.1
Q ss_pred CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCcccccccc---------------Cc---h---HHHHHHH
Q psy15801 2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESHHF---------------SP---P---QALQASI 60 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a~~---------------~~---~---~~~~~~l 60 (318)
+++|..++.+|..+||++. .++|...+-+..... |++.+.+ .+ . +.+...+
T Consensus 93 ~~~~~~~~~~l~~~l~~~~-------~s~l~~~lr~~~~l~-y~v~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l 164 (184)
T PF05193_consen 93 DSKDYFALNLLSSLLGNGM-------SSRLFQELREKQGLA-YSVSASNSSYRDSGLFSISFQVTPENLDEAIEAILQEL 164 (184)
T ss_dssp TSTTHHHHHHHHHHHHCST-------TSHHHHHHHTTTTSE-SEEEEEEEEESSEEEEEEEEEEEGGGHHHHHHHHHHHH
T ss_pred ccchhhHHHHHHHHHhcCc-------cchhHHHHHhccccc-eEEEeeeeccccceEEEEEEEcCcccHHHHHHHHHHHH
Confidence 6789999999999999996 566666654222110 1111111 11 2 2344447
Q ss_pred HHHHcCCCCHHHHHHHHHHH
Q psy15801 61 DVVRGAGIKDADIARGKALL 80 (318)
Q Consensus 61 ~~l~~~~~t~~EL~raK~~l 80 (318)
..+...+++++|++++|++|
T Consensus 165 ~~l~~~~~s~~el~~~k~~L 184 (184)
T PF05193_consen 165 KRLREGGISEEELERAKNQL 184 (184)
T ss_dssp HHHHHHCS-HHHHHHHHHHH
T ss_pred HHHHHcCCCHHHHHHHHhcC
Confidence 77777789999999999986
No 23
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=93.44 E-value=8.8 Score=40.26 Aligned_cols=198 Identities=10% Similarity=0.050 Sum_probs=101.5
Q ss_pred HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCC-----hHHHHHHhhccCcccccc--cccc
Q psy15801 60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKS-----LPDLLAEVEAVSSSDVSA--KRSF 132 (318)
Q Consensus 60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~-----~~~~~~~I~~VT~edV~~--~r~~ 132 (318)
.+.+..--++++|+++.|..+..++.+..++...++...-...++.+.+.. ..+-+..|..+|.+||+. +++|
T Consensus 102 aD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp~~~~~~~l~~~l~~~HPy~~~~iGt~esL~~it~~t~edL~~F~~~~Y 181 (696)
T TIGR02110 102 CDMLARPLLTAEDQQREREVLEAEYIAWQNDADTLREAALLDALQAGHPLRRFHAGSRDSLALPNTAFQQALRDFHRRHY 181 (696)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHcCCCCCCCCCCCCCHHHHhCcccchHHHHHHHHHHhc
Confidence 333444559999999999999999999888888777665444444333322 233333443455889984 3333
Q ss_pred ccc---------cc---------------CcCCCC-CCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccH
Q psy15801 133 AAQ---------PA---------------TKALFS-PDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGI 187 (318)
Q Consensus 133 ~~~---------~~---------------p~~~~~-~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gl 187 (318)
... .. |....+ +.........+ ++.+.....+.+.+.+.+++-+..++. .
T Consensus 182 ~p~NmvLvIvGdvs~eel~~l~e~~f~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~~q~~l~~~~p~~~~~d~~----a 256 (696)
T TIGR02110 182 QAGNMQLWLQGPQSLDELEQLAARFGASLAAGGECAQAPPAPLLRFD-RLTLAGGSEPRLWLLFALAGLPATARD----N 256 (696)
T ss_pred chhcEEEEEEeCCCHHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCc-eeEEEecCcceEEEEEeecCCCCCChH----H
Confidence 210 00 000000 00000011122 233333344667777777765554432 2
Q ss_pred HHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEec-----Cc--eEEEEEEe---cCCcHHHHHHHHHHhh----cc-C
Q psy15801 188 THVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVD-----RE--GIVYTIEA---VRKEIGHVHKFLASVV----GK-T 252 (318)
Q Consensus 188 a~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~-----~d--~~~~~~~~---l~~~l~~~l~ll~d~l----~~-P 252 (318)
..++.+++..|...+ +...|...|-..++++. .. .+.+++.+ ..++.+.+...+.++| .+ +
T Consensus 257 l~lL~~iLg~g~sSr----L~~~LRe~GLaysV~s~~~~~~~g~~lf~I~~~lt~~~~~~~~~v~~~i~~~L~~L~~~~~ 332 (696)
T TIGR02110 257 VTLLCEFLQDEAPGG----LLAQLRERGLAESVAATWLYQDAGQALLALEFSARCISAAAAQQIEQLLTQWLGALAEQTW 332 (696)
T ss_pred HHHHHHHhCCCcchH----HHHHHHHCCCEEEEEEeccccCCCCcEEEEEEEEcCCCccCHHHHHHHHHHHHHHHHhcCC
Confidence 456677666553332 33334445555444432 12 33444554 1336666665554443 33 1
Q ss_pred CCCHHHHhhhhhhH
Q psy15801 253 EFRPWEVSDLTPRL 266 (318)
Q Consensus 253 ~f~e~e~~~~k~~i 266 (318)
.-..+|+.+++.+-
T Consensus 333 ~~~~eel~rlk~~~ 346 (696)
T TIGR02110 333 AEQLEHYAQLAQRR 346 (696)
T ss_pred CCCHHHHHHHHHhh
Confidence 34577888877763
No 24
>PF03410 Peptidase_M44: Protein G1; InterPro: IPR005072 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M44 (clan ME). The active site residues for members of this family and family M16 occur in the motif HXXEHProtein. The type example is the vaccinia virus-type metalloendopeptidase G1 from vaccinia virus, it is a metalloendopeptidase expressed by many Poxviridae which appears to play a role in the maturation of viral proteins.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0019067 viral assembly, maturation, egress, and release
Probab=92.33 E-value=0.16 Score=49.77 Aligned_cols=88 Identities=16% Similarity=0.101 Sum_probs=61.0
Q ss_pred EEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801 149 GTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV 228 (318)
Q Consensus 149 ~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~ 228 (318)
..|+||++|+.-+.-..-+.+++. ..|-..|-.+-.|+|||++|+|-. ++..-+ --|++|+|.+.+
T Consensus 2 IvL~NGVRiFin~~M~KDIYlGIs-~FGFe~DI~~iLGiAHLLEHILIs----FD~~~F---------~ANASTaRsYMS 67 (590)
T PF03410_consen 2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----FDSSKF---------LANASTARSYMS 67 (590)
T ss_pred eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----cchHHh---------hcccchhhhhhh
Confidence 368999999998774444555443 566667777889999999999853 222111 247889999999
Q ss_pred EEEEecCCc-HHHHHHHHHHhhc
Q psy15801 229 YTIEAVRKE-IGHVHKFLASVVG 250 (318)
Q Consensus 229 ~~~~~l~~~-l~~~l~ll~d~l~ 250 (318)
|....++.. .-+++.-+..++.
T Consensus 68 FWC~si~g~~~~DAvrtliSWFF 90 (590)
T PF03410_consen 68 FWCKSIRGRTYIDAVRTLISWFF 90 (590)
T ss_pred hhhhhccCCChhHHHHHHHHHhh
Confidence 988877654 4555665655543
No 25
>KOG2019|consensus
Probab=92.31 E-value=0.18 Score=51.71 Aligned_cols=107 Identities=11% Similarity=0.172 Sum_probs=81.4
Q ss_pred CCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHH-HhCCeEeEEecCceEEE
Q psy15801 152 YNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQ-QAGGSLYATVDREGIVY 229 (318)
Q Consensus 152 ~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~-~lG~~l~~~~~~d~~~~ 229 (318)
+-|.+++.+.+..+--.+++.|+.-+..++ ||.|.++|....|+.+++-. =|.+.|. .+.-.+|+++.-|+|.|
T Consensus 60 ~Tgae~lhl~reD~N~vFsI~FrTpp~dst----GiPHILEHtvLCGS~KYPvrdPFfkmLnrSLatFmNAfT~pD~T~y 135 (998)
T KOG2019|consen 60 KTGAEVLHLDREDENNVFSIVFRTPPKDST----GIPHILEHTVLCGSRKYPVRDPFFKMLNRSLATFMNAFTAPDYTFY 135 (998)
T ss_pred CCCceeEeeccCCCCceeEEEeecCCCccC----CCchhhhhheeeccCcCcccChHHHHHHHHHHHHHhhccCCCccee
Confidence 569999998874333334566776655554 99999999999999999743 3444443 34555899999999999
Q ss_pred EEE-ecCCcHHHHHHHHHHhhccCCCCHHHHhhh
Q psy15801 230 TIE-AVRKEIGHVHKFLASVVGKTEFRPWEVSDL 262 (318)
Q Consensus 230 ~~~-~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~ 262 (318)
=+. .-+.+|-...++..|....|.+.+.+|.++
T Consensus 136 PfattN~kDf~NL~dVYLDAtffPklr~~dF~QE 169 (998)
T KOG2019|consen 136 PFATTNTKDFYNLRDVYLDATFFPKLRKLDFQQE 169 (998)
T ss_pred ecccCChHHHHHHHHHhhhcccchHHHhhhhhhh
Confidence 886 456789999999999999999888887765
No 26
>KOG0959|consensus
Probab=91.69 E-value=18 Score=39.21 Aligned_cols=221 Identities=15% Similarity=0.138 Sum_probs=130.7
Q ss_pred cccccccCc-hHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH-hcccchhHHHHHHHHHHHhCCCCChHHHHHHhhcc
Q psy15801 44 LDESHHFSP-PQALQASIDVVRGAGIKDADIARGKALLKRHLAD-SFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAV 121 (318)
Q Consensus 44 ~~~~a~~~~-~~~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~-~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~V 121 (318)
..++||+.- .-.+..+++.+..-.++.+.+.-+|..++..+-. .++.....|..+-... .....-+..+.+++++.|
T Consensus 591 ~~v~Gfnekl~~ll~~~~~~~~~f~~~~~rf~iike~~~~~~~n~~~~~p~~~a~~~~~ll-l~~~~W~~~e~~~al~~~ 669 (974)
T KOG0959|consen 591 LRVSGFNEKLPLLLEKVVQMMANFELDEDRFEIIKELLKRELRNHAFDNPYQLANDYLLLL-LEESIWSKEELLEALDDV 669 (974)
T ss_pred EEEeccCcccHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhhhhccHHHHHHHHHHHH-hhccccchHHHHHHhhcc
Confidence 456777754 3344555554444469999999999999988888 5666666676655333 333344778889999999
Q ss_pred Ccccccc------ccccc------c---------------c---ccCcC---C--CCCCcEEEEcCCCcEEEEEec---C
Q psy15801 122 SSSDVSA------KRSFA------A---------------Q---PATKA---L--FSPDIQSGTSYNKVKVVSVNE---T 163 (318)
Q Consensus 122 T~edV~~------~r~~~------~---------------~---~~p~~---~--~~~~~~~~~L~NGlkv~~~~~---~ 163 (318)
|.+|+.. .+.+- . . ..|.. + ....-...+|++|...++... .
T Consensus 670 ~le~~~~F~~~~~~~~~~e~~i~GN~te~~A~~l~~~v~d~l~~~~~~~~p~~~~~~~~~~~~~lp~G~~~~~~~~~n~~ 749 (974)
T KOG0959|consen 670 TLEDLESFISEFLQPFHLELLIHGNLTEKEALQLLKSVLDILKSAAPNSRPLFRSEHLPRREIQLPNGDYYFYRHLLNKT 749 (974)
T ss_pred cHHHHHHHHHHHhhhhheEEEEecCcchHHHHHHHHHHHhhhhccCCCCccccccccCcccceeccCCceEEEEcccccC
Confidence 9999983 11110 0 0 01111 0 123456789999999987654 2
Q ss_pred CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEE----ecCceEEEEEEec--CCc
Q psy15801 164 SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYAT----VDREGIVYTIEAV--RKE 237 (318)
Q Consensus 164 ~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~----~~~d~~~~~~~~l--~~~ 237 (318)
-+-..+.+++..|.... ...-+..|+.+++-.-+ +++-+.-+++|=.+... .+-..+.+.++.. .++
T Consensus 750 ~~ns~i~~~~Q~~~~~~--~~~~~~~L~~~li~ep~-----Fd~LRTkeqLGYiv~~~~r~~~G~~~~~i~Vqs~~~~~~ 822 (974)
T KOG0959|consen 750 DDNSCIEVYYQIGVQDT--RDNAVLGLLEQLIKEPA-----FDQLRTKEQLGYIVSTGVRLNYGTVGLQITVQSEKSVDY 822 (974)
T ss_pred CCCceEEEEEEcccchh--HHHHHHHHHHHHhccch-----HHhhhhHHhhCeEeeeeeeeecCcceeEEEEccCCCchH
Confidence 35666777777644432 24467777776655422 56778888888544433 3333444444332 112
Q ss_pred HH----HHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801 238 IG----HVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP 274 (318)
Q Consensus 238 l~----~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~ 274 (318)
++ ..++-+.+.+. ..++++|+..+..++..+.+..
T Consensus 823 le~rIe~fl~~~~~~i~--~m~~e~Fe~~~~~lI~~~~ek~ 861 (974)
T KOG0959|consen 823 LEERIESFLETFLEEIV--EMSDEEFEKHKSGLIASKLEKP 861 (974)
T ss_pred HHHHHHHHHHHHHHHHH--hcchhhhhhhHHHHHHHHhhcC
Confidence 22 22333333332 2567777777766666655543
No 27
>PHA03081 putative metalloprotease; Provisional
Probab=90.11 E-value=0.29 Score=48.13 Aligned_cols=98 Identities=15% Similarity=0.067 Sum_probs=65.9
Q ss_pred EEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801 149 GTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV 228 (318)
Q Consensus 149 ~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~ 228 (318)
..|+||++|+.-+.-..-+.+++. ..|-..|-.+-.|+|||++|++-. ++..-+ --++++.+.+.+
T Consensus 2 i~~~ngvr~f~~~~m~kdiy~gi~-~fgfe~di~~~lg~ahllehili~----fd~~~f---------~anast~r~yms 67 (595)
T PHA03081 2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----FDSSKF---------VANASTARSYMS 67 (595)
T ss_pred eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----cchHHh---------cccchhhhhhHh
Confidence 368999999988774444445442 556666777889999999998853 221111 246788888888
Q ss_pred EEEEecCCc-HHHHHHHHHHhhccCCCCHHHHh
Q psy15801 229 YTIEAVRKE-IGHVHKFLASVVGKTEFRPWEVS 260 (318)
Q Consensus 229 ~~~~~l~~~-l~~~l~ll~d~l~~P~f~e~e~~ 260 (318)
|....++.. ..+++.-+..++..+.--.+.|.
T Consensus 68 fwc~sirg~~y~DAvrtliSWFF~~~~Lr~~F~ 100 (595)
T PHA03081 68 FWCKSIRGRSYIDAIRTLISWFFDNGKLKDNFS 100 (595)
T ss_pred HhhHhhcCCchHHHHHHHHHHhccCCccccccc
Confidence 888776643 46777777777766653333333
No 28
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=89.56 E-value=0.75 Score=38.06 Aligned_cols=109 Identities=12% Similarity=0.148 Sum_probs=63.3
Q ss_pred CCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHH-HhCC--eEeEEe----cCc
Q psy15801 153 NKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQ-QAGG--SLYATV----DRE 225 (318)
Q Consensus 153 NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~-~lG~--~l~~~~----~~d 225 (318)
.+..++..+...+...+.+.+++...... .......++.+++..+... -|...+. ..|. .++++. +..
T Consensus 66 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~~l~~~l~~~~~s----~l~~~lr~~~~l~y~v~~~~~~~~~~~ 140 (184)
T PF05193_consen 66 QGKEIVIPSKDESQSIVSIAFPGPPIKDS-KDYFALNLLSSLLGNGMSS----RLFQELREKQGLAYSVSASNSSYRDSG 140 (184)
T ss_dssp SEEEEEEEESSSSSEEEEEEEEEEETGTS-TTHHHHHHHHHHHHCSTTS----HHHHHHHTTTTSESEEEEEEEEESSEE
T ss_pred ccccccccccccccccccccccccccccc-chhhHHHHHHHHHhcCccc----hhHHHHHhccccceEEEeeeeccccce
Confidence 44555555544366666666666555222 2446777888888877333 2444444 4443 333332 224
Q ss_pred eEEEEEEecCCcHHHHHHHHHHhhc---cCCCCHHHHhhhhhhH
Q psy15801 226 GIVYTIEAVRKEIGHVHKFLASVVG---KTEFRPWEVSDLTPRL 266 (318)
Q Consensus 226 ~~~~~~~~l~~~l~~~l~ll~d~l~---~P~f~e~e~~~~k~~i 266 (318)
.+.+++.+.+.+++.+++.+.+.+. .-.|+++||++.+.++
T Consensus 141 ~~~i~~~~~~~~~~~~~~~~~~~l~~l~~~~~s~~el~~~k~~L 184 (184)
T PF05193_consen 141 LFSISFQVTPENLDEAIEAILQELKRLREGGISEEELERAKNQL 184 (184)
T ss_dssp EEEEEEEEEGGGHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHH
T ss_pred EEEEEEEcCcccHHHHHHHHHHHHHHHHHcCCCHHHHHHHHhcC
Confidence 4666777777788777776655432 2359999999988763
No 29
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=85.68 E-value=5.1 Score=42.88 Aligned_cols=117 Identities=13% Similarity=0.156 Sum_probs=87.6
Q ss_pred CCcEEEEEe-c--CC-CEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801 153 NKVKVVSVN-E--TS-PIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV 228 (318)
Q Consensus 153 NGlkv~~~~-~--~~-p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~ 228 (318)
.|.++++.. + .. |...+.+.|+.=.....+...-+..|...|++--...++ ......|-+++...+.+.+.
T Consensus 509 ~~~~~wy~~~d~F~~~PK~~v~~~irsp~~~~s~r~~Vl~~l~~~la~dal~~~~-----y~A~~aG~sfs~~~~~~Gl~ 583 (937)
T COG1025 509 PNLRLWYLKEDYFAVEPKASVSLAIRSPHASRSPRNQVLTELYAYLANDALDKLS-----YQASLAGLSFSLAANSNGLD 583 (937)
T ss_pred CCceEEEecCCccccCCcceeEEEEeCcccccCHHHHHHHHHHHHHHHHHHHhhh-----hHHHhcceEEEeecCCCceE
Confidence 366666654 3 34 888899999764444444444455555666554333332 23455688899999999999
Q ss_pred EEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801 229 YTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP 274 (318)
Q Consensus 229 ~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~ 274 (318)
+++++.++.++..+..+.+.+..-.++++-|+..|.++.++++...
T Consensus 584 ltisGft~~lp~L~~~~l~~l~~~~~~~~~f~~~K~~~~~~~~~a~ 629 (937)
T COG1025 584 LTISGFTQRLPQLLRAFLDGLFSLPVDEDRFEQAKSQLSEELKNAL 629 (937)
T ss_pred EEeeccccchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHhhh
Confidence 9999999999999999999999888899999999999999887654
No 30
>KOG0961|consensus
Probab=82.56 E-value=4.9 Score=41.59 Aligned_cols=124 Identities=13% Similarity=0.006 Sum_probs=68.0
Q ss_pred CchhHHHHHHHHHHhcC--CCCCCCCCCCCccccccc-CCCCCCccccccccCc------hHHHHHHHHHHHc--CCCCH
Q psy15801 2 SAKEALAFAVLKYALGA--GPAKTGKNQRSTNSKGLS-RTPGRESLDESHHFSP------PQALQASIDVVRG--AGIKD 70 (318)
Q Consensus 2 ~~~d~~al~vL~~iLG~--g~~~k~~~~~~~l~~~~~-~~~~~~~~~~~a~~~~------~~~~~~~l~~l~~--~~~t~ 70 (318)
++|++++.++....||. ||+=+.-.|.|..|-+-. -..... .....++.+ -.....++..+.+ +.++.
T Consensus 826 ~dpel~~~~l~~~YL~~~eGPfW~~IRG~GLAYGanm~~~~d~~-~~~~~iyr~ad~~kaye~~rdiV~~~vsG~~e~s~ 904 (1022)
T KOG0961|consen 826 NDPELIPAMLFGQYLSQCEGPFWRAIRGDGLAYGANMFVKPDRK-QITLSIYRCADPAKAYERTRDIVRKIVSGSGEISK 904 (1022)
T ss_pred CCcchhHHHHHHHHHHhcccchhhhhcccchhccceeEEeccCC-EEEEEeecCCcHHHHHHHHHHHHHHHhcCceeecH
Confidence 57899999999999997 654222223333332110 000000 011122222 2234455777775 46999
Q ss_pred HHHHHHHHHHHHHHHHhcccc-hhHHHHHHHHHHHhCCC-CChHHHHHHhhccCccccc
Q psy15801 71 ADIARGKALLKRHLADSFETL-DSTVDSVVRQAVTTGVV-KSLPDLLAEVEAVSSSDVS 127 (318)
Q Consensus 71 ~EL~raK~~lk~~~~~~les~-~~~a~~lg~~~~~~g~~-~~~~~~~~~I~~VT~edV~ 127 (318)
.|++-||...-+ .++.+|+. -.-|-.+..-....+.+ ..-..++.+|.+||.+|..
T Consensus 905 ~~~egAk~s~~~-~~~~~Eng~~~~a~~~~~l~~~~q~~~~fn~~~leri~nvT~~~~~ 962 (1022)
T KOG0961|consen 905 AEFEGAKRSTVF-EMMKRENGTVSGAAKISILNNFRQTPHPFNIDLLERIWNVTSEEMV 962 (1022)
T ss_pred HHhccchHHHHH-HHHHHhccceechHHHHHHHHHHhcCCcccHHHHHHHHHhhHHHHH
Confidence 999999988654 44456631 11121221112233332 3446889999999999997
No 31
>PF10278 Med19: Mediator of RNA pol II transcription subunit 19 ; InterPro: IPR019403 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med19 represents a family of conserved proteins which are members of the multi-protein co-activator Mediator complex []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=76.61 E-value=1 Score=38.80 Aligned_cols=24 Identities=4% Similarity=0.061 Sum_probs=12.8
Q ss_pred HHHHHHHHHHhCCeEeEEecCceE
Q psy15801 204 HFGIIRHVQQAGGSLYATVDREGI 227 (318)
Q Consensus 204 ~~~i~~~l~~lG~~l~~~~~~d~~ 227 (318)
.+++...|--+=|.++..++.|..
T Consensus 40 ke~lSsFLp~lpG~~~~~~~~d~s 63 (178)
T PF10278_consen 40 KESLSSFLPHLPGNIDLPGSQDNS 63 (178)
T ss_pred hhhHHHHccCCCCcccCCCCcCcc
Confidence 445555555555555555544433
No 32
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=71.93 E-value=22 Score=33.67 Aligned_cols=49 Identities=6% Similarity=0.123 Sum_probs=30.7
Q ss_pred cCceEEEEEEecCCcH-----HHHHHH---HHHhhccCCCCHHHHhhhhhhHHHHHh
Q psy15801 223 DREGIVYTIEAVRKEI-----GHVHKF---LASVVGKTEFRPWEVSDLTPRLKYDRL 271 (318)
Q Consensus 223 ~~d~~~~~~~~l~~~l-----~~~l~l---l~d~l~~P~f~e~e~~~~k~~i~~ei~ 271 (318)
..-.+.++++.++..- ...+.+ |.|.+.+-.++++...+....-..+++
T Consensus 213 ~~~~l~~~~~lp~~~~~~~~~~~l~~~v~~l~D~~~~~~l~~e~~~K~~k~R~~~~~ 269 (321)
T PF07946_consen 213 PKKRLIFSFRLPSSSDDMEALEPLLKLVFYLIDKLARFKLSPEAKKKAKKNREEEEE 269 (321)
T ss_pred cCcEEEEEEEeCCCcccHHHHHHHHHHHHHHHHHhheeeeCHHHHHHHHHHHHHHHH
Confidence 4456778888887542 333333 356678888999988876444433333
No 33
>KOG0345|consensus
Probab=65.67 E-value=11 Score=37.35 Aligned_cols=15 Identities=13% Similarity=0.275 Sum_probs=7.7
Q ss_pred ceEEEEEEecCCcHH
Q psy15801 225 EGIVYTIEAVRKEIG 239 (318)
Q Consensus 225 d~~~~~~~~l~~~l~ 239 (318)
++.+|-++.-.=++.
T Consensus 424 H~cs~Ifr~kdLd~~ 438 (567)
T KOG0345|consen 424 HHCSYIFRLKDLDLG 438 (567)
T ss_pred cceeEEEeecCCcHH
Confidence 455555555444443
No 34
>KOG2422|consensus
Probab=58.95 E-value=2 Score=43.40 Aligned_cols=7 Identities=86% Similarity=0.947 Sum_probs=2.6
Q ss_pred HhhHHHH
Q psy15801 283 KKNKKKK 289 (318)
Q Consensus 283 ~~~~~~~ 289 (318)
-|||+||
T Consensus 88 sk~k~KK 94 (665)
T KOG2422|consen 88 SKNKKKK 94 (665)
T ss_pred hccccch
Confidence 3343333
No 35
>PF09186 DUF1949: Domain of unknown function (DUF1949); InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement []. This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=51.23 E-value=69 Score=21.13 Aligned_cols=46 Identities=13% Similarity=0.170 Sum_probs=38.5
Q ss_pred HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801 205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG 250 (318)
Q Consensus 205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~ 250 (318)
..+...++..|+.+.-..-.+.+.+.+.++.+..+.....|.++..
T Consensus 9 ~~v~~~l~~~~~~i~~~~y~~~V~~~v~v~~~~~~~f~~~l~~~t~ 54 (56)
T PF09186_consen 9 GKVERLLEQNGIEIVDEDYTDDVTLTVAVPEEEVEEFKAQLTDLTS 54 (56)
T ss_dssp HHHHHHHHHTTTEEEEEEECTTEEEEEEEECCCHHHHHHHHHHHTT
T ss_pred HHHHHHHHHCCCEEEcceecceEEEEEEECHHHHHHHHHHHHHHcC
Confidence 4678889999999865555566999999999999999998888753
No 36
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=49.94 E-value=22 Score=29.98 Aligned_cols=11 Identities=9% Similarity=-0.112 Sum_probs=4.4
Q ss_pred hhHHHHHhhCC
Q psy15801 264 PRLKYDRLTRP 274 (318)
Q Consensus 264 ~~i~~ei~~~~ 274 (318)
..+...++..-
T Consensus 52 ~~~k~~L~~al 62 (155)
T PF08496_consen 52 EDMKEQLKQAL 62 (155)
T ss_pred HHHHHHHHHHh
Confidence 33334444433
No 37
>KOG3794|consensus
Probab=48.61 E-value=37 Score=32.85 Aligned_cols=27 Identities=7% Similarity=-0.017 Sum_probs=17.4
Q ss_pred hHHHHHHHHHHHcCCCCHHHHHHHHHH
Q psy15801 53 PQALQASIDVVRGAGIKDADIARGKAL 79 (318)
Q Consensus 53 ~~~~~~~l~~l~~~~~t~~EL~raK~~ 79 (318)
...++-+-+.+......++||.+.-.+
T Consensus 22 ~k~vW~AeQ~~s~~~kkqEEL~~qyek 48 (453)
T KOG3794|consen 22 IRKVWMAEQKISYDKKKQEELRRQYEK 48 (453)
T ss_pred hhHHHHHhhhcccccchHHHHHHHHHH
Confidence 445555555555667888999887433
No 38
>smart00874 B5 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits.
Probab=47.41 E-value=32 Score=24.35 Aligned_cols=50 Identities=14% Similarity=0.139 Sum_probs=33.2
Q ss_pred CCCHHHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801 201 EFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG 250 (318)
Q Consensus 201 ~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~ 250 (318)
..+..++.+.|.++|..+......+.+.+.+-.-+.++....+++.|+++
T Consensus 18 ~i~~~ei~~~L~~lg~~~~~~~~~~~~~v~~P~~R~Di~~~~DliEei~r 67 (71)
T smart00874 18 DLSAEEIEEILKRLGFEVEVSGDDDTLEVTVPSYRFDILIEADLIEEVAR 67 (71)
T ss_pred CCCHHHHHHHHHHCCCeEEecCCCCeEEEECCCCccccCcccHHHHHHHH
Confidence 57899999999999998865322344444444444566666676666654
No 39
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=45.00 E-value=13 Score=33.56 Aligned_cols=7 Identities=29% Similarity=0.491 Sum_probs=2.8
Q ss_pred hhccCcc
Q psy15801 118 VEAVSSS 124 (318)
Q Consensus 118 I~~VT~e 124 (318)
.+.+..+
T Consensus 72 ~~k~~~~ 78 (232)
T TIGR00869 72 VDKLPSQ 78 (232)
T ss_pred EEecccc
Confidence 3444443
No 40
>PF03484 B5: tRNA synthetase B5 domain; InterPro: IPR005147 Domain B5 is found in phenylalanine-tRNA synthetase beta subunits. This domain has been shown to bind DNA through a winged helix-turn-helix motif []. Phenylalanine-tRNA synthetase may influence common cellular processes via DNA binding, in addition to its aminoacylation function.; GO: 0000287 magnesium ion binding, 0003723 RNA binding, 0005524 ATP binding, 0006432 phenylalanyl-tRNA aminoacylation; PDB: 2AKW_B 1B70_B 1B7Y_B 2ALY_B 2IY5_B 2AMC_B 3PCO_D 2CXI_C 1JJC_B 1EIY_B ....
Probab=43.32 E-value=63 Score=23.02 Aligned_cols=49 Identities=16% Similarity=0.279 Sum_probs=34.5
Q ss_pred CCCHHHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801 201 EFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG 250 (318)
Q Consensus 201 ~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~ 250 (318)
..+..++.+.|..+|..+... +.+...+..=.-+.++....++..|+++
T Consensus 18 ~i~~~~i~~~L~~lg~~~~~~-~~~~~~v~vP~~R~Di~~~~DliEEiaR 66 (70)
T PF03484_consen 18 DISPEEIIKILKRLGFKVEKI-DGDTLEVTVPSYRFDIEHEEDLIEEIAR 66 (70)
T ss_dssp ---HHHHHHHHHHTT-EEEE--CTTEEEEEEETTSTT-SSHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHCCCEEEEC-CCCEEEEEcCCCcCCcCcccHHHHHHHH
Confidence 778999999999999998877 5566666666667788888888877764
No 41
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=40.08 E-value=22 Score=30.05 Aligned_cols=15 Identities=27% Similarity=0.388 Sum_probs=5.9
Q ss_pred HhhhhhhHHHHHhhC
Q psy15801 259 VSDLTPRLKYDRLTR 273 (318)
Q Consensus 259 ~~~~k~~i~~ei~~~ 273 (318)
++..++.+...+-..
T Consensus 51 Y~~~k~~L~~all~k 65 (155)
T PF08496_consen 51 YEDMKEQLKQALLDK 65 (155)
T ss_pred HHHHHHHHHHHhcCH
Confidence 333444444443333
No 42
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=39.79 E-value=4.7 Score=39.56 Aligned_cols=15 Identities=20% Similarity=0.244 Sum_probs=8.2
Q ss_pred ecCCcHHHHHHHHHH
Q psy15801 233 AVRKEIGHVHKFLAS 247 (318)
Q Consensus 233 ~l~~~l~~~l~ll~d 247 (318)
+++.+-..+..+|+.
T Consensus 330 ~p~~~rgkiaR~lA~ 344 (414)
T PRK14552 330 SPWWQRGKIARALAA 344 (414)
T ss_pred CCHHHHHHHHHHHHH
Confidence 334445666666653
No 43
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=39.63 E-value=5.1 Score=39.34 Aligned_cols=16 Identities=13% Similarity=0.056 Sum_probs=7.6
Q ss_pred HHHHhhCCCChhhHHH
Q psy15801 267 KYDRLTRPSQGNEKKK 282 (318)
Q Consensus 267 ~~ei~~~~~~~~~~~~ 282 (318)
+.+|+..-+.|+..|.
T Consensus 372 i~~i~~k~~~Pp~~k~ 387 (414)
T PRK14552 372 IEEIKEKYPKPPKKKR 387 (414)
T ss_pred HHHHHHhcCCCCCCCc
Confidence 4444444455555543
No 44
>KOG3794|consensus
Probab=37.91 E-value=23 Score=34.17 Aligned_cols=7 Identities=14% Similarity=-0.059 Sum_probs=3.3
Q ss_pred CCCcEEE
Q psy15801 152 YNKVKVV 158 (318)
Q Consensus 152 ~NGlkv~ 158 (318)
|=|+.|-
T Consensus 117 PFGiqVR 123 (453)
T KOG3794|consen 117 PFGIQVR 123 (453)
T ss_pred ccceEee
Confidence 3455553
No 45
>cd04914 ACT_AKi-DapG-BS_1 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the N-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria, bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and
Probab=36.96 E-value=91 Score=21.90 Aligned_cols=49 Identities=8% Similarity=-0.070 Sum_probs=36.2
Q ss_pred HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhccCCC
Q psy15801 205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEF 254 (318)
Q Consensus 205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f 254 (318)
.+|.+.|...|.+++.-.-. .-.+++++...+++.+.+++.++=..|..
T Consensus 17 a~if~~La~~~InvDmI~~~-~~~isFtv~~~d~~~~~~il~~~~~~~~~ 65 (67)
T cd04914 17 QRVFKALANAGISVDLINVS-PEEVIFTVDGEVAEKAVDILEKMGLDPSV 65 (67)
T ss_pred HHHHHHHHHcCCcEEEEEec-CCCEEEEEchhhHHHHHHHHHHcCCceEe
Confidence 45777788888888876322 22588999999999999988887666643
No 46
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=35.66 E-value=1.2e+02 Score=24.28 Aligned_cols=54 Identities=13% Similarity=0.169 Sum_probs=37.9
Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCC
Q psy15801 55 ALQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVV 108 (318)
Q Consensus 55 ~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~ 108 (318)
+++-..+.+..-.++++++++.|..+..++....++...++...-...++.|.+
T Consensus 87 ~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~~~~~~~~l~~~~f~~~p 140 (149)
T PF00675_consen 87 ALELLADMLFNPSFDEEEFEREREQILQEIEEIKENPQELAFEKLHSAAFRGHP 140 (149)
T ss_dssp HHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHTTSG
T ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHhccCC
Confidence 344444455555699999999999999988888777766666655455555554
No 47
>PF15289 RFXA_RFXANK_bdg: Regulatory factor X-associated C-terminal binding domain; PDB: 2KW3_C.
Probab=33.14 E-value=14 Score=29.66 Aligned_cols=16 Identities=38% Similarity=0.244 Sum_probs=0.0
Q ss_pred ChhhHHHHhhHHHHHH
Q psy15801 276 QGNEKKKKKNKKKKKK 291 (318)
Q Consensus 276 ~~~~~~~~~~~~~~~~ 291 (318)
-||+||+.+||.-|-|
T Consensus 19 kpwmck~hRNK~yk~k 34 (124)
T PF15289_consen 19 KPWMCKKHRNKMYKDK 34 (124)
T ss_dssp ----------------
T ss_pred cchhhhhhccccchhh
Confidence 3899999999885444
No 48
>PF08432 Vfa1: AAA-ATPase Vps4-associated protein 1; InterPro: IPR013640 This is a family of fungal proteins of unknown function.
Probab=32.44 E-value=16 Score=31.60 Aligned_cols=34 Identities=18% Similarity=0.035 Sum_probs=18.8
Q ss_pred cCCCCHHHHhhh--hhhHHHHHhhCCCChhhHHHHh
Q psy15801 251 KTEFRPWEVSDL--TPRLKYDRLTRPSQGNEKKKKK 284 (318)
Q Consensus 251 ~P~f~e~e~~~~--k~~i~~ei~~~~~~~~~~~~~~ 284 (318)
.|..+++..+.. +..+..+|+...-.+-++.+++
T Consensus 50 ~p~~~~~~~~~~~k~~el~~eiekvkke~Eekq~~k 85 (182)
T PF08432_consen 50 TPIYDEEYVEAKKKKKELEEEIEKVKKEYEEKQKWK 85 (182)
T ss_pred CCCCCHHHHHHHhhHHHHHHHHHHHHHHHHHHhHHh
Confidence 477777665543 4555666666544444444433
No 49
>PF05890 Ebp2: Eukaryotic rRNA processing protein EBP2; InterPro: IPR008610 This family consists of several eukaryotic rRNA processing protein EBP2 sequences. Ebp2p is required for the maturation of 25S rRNA and 60S subunit assembly. Ebp2p may be one of the target proteins of Rrs1p for executing the signal to regulate ribosome biogenesis [].
Probab=31.63 E-value=88 Score=28.91 Aligned_cols=27 Identities=11% Similarity=0.126 Sum_probs=20.1
Q ss_pred HHHhhccCCCCHHHHhhhhhhHHHHHhhCCC
Q psy15801 245 LASVVGKTEFRPWEVSDLTPRLKYDRLTRPS 275 (318)
Q Consensus 245 l~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~ 275 (318)
||+|+. +++.+++++..++.+-..+..
T Consensus 110 fAEMvK----SD~HM~KVr~kLl~~~~~ie~ 136 (271)
T PF05890_consen 110 FAEMVK----SDEHMEKVRQKLLKEQKRIEA 136 (271)
T ss_pred hHHHhc----CHHHHHHHHHHHHHHHHHHHH
Confidence 566666 688899999998877665554
No 50
>KOG2138|consensus
Probab=30.91 E-value=26 Score=36.38 Aligned_cols=14 Identities=0% Similarity=-0.278 Sum_probs=7.3
Q ss_pred HHHHHHhhccCCCC
Q psy15801 242 HKFLASVVGKTEFR 255 (318)
Q Consensus 242 l~ll~d~l~~P~f~ 255 (318)
+++-.+-...|+++
T Consensus 774 ~e~~~~e~fgprlp 787 (883)
T KOG2138|consen 774 MEIDEREEFGPRLP 787 (883)
T ss_pred hccChhhhcCCCCC
Confidence 33334445567776
No 51
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=29.93 E-value=1.3e+02 Score=17.94 Aligned_cols=17 Identities=24% Similarity=0.259 Sum_probs=14.8
Q ss_pred cCCCCHHHHHHHHHHHH
Q psy15801 65 GAGIKDADIARGKALLK 81 (318)
Q Consensus 65 ~~~~t~~EL~raK~~lk 81 (318)
.|-+|++|.++.|+.+.
T Consensus 14 ~G~IseeEy~~~k~~ll 30 (31)
T PF09851_consen 14 KGEISEEEYEQKKARLL 30 (31)
T ss_pred cCCCCHHHHHHHHHHHh
Confidence 57799999999998874
No 52
>KOG4043|consensus
Probab=29.88 E-value=40 Score=28.61 Aligned_cols=11 Identities=18% Similarity=0.060 Sum_probs=6.1
Q ss_pred ccHHHHHHHHH
Q psy15801 185 LGITHVLRSAA 195 (318)
Q Consensus 185 ~Gla~ll~~ml 195 (318)
.-|+.|+.|+.
T Consensus 68 EkiaSFL~~I~ 78 (214)
T KOG4043|consen 68 EKIASFLPHII 78 (214)
T ss_pred HHHHHHhhccC
Confidence 34666666554
No 53
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=28.06 E-value=2e+02 Score=22.29 Aligned_cols=49 Identities=10% Similarity=0.146 Sum_probs=33.5
Q ss_pred HHHHHHHHHHhCCeEeEEecCce-EEEEEEecCCcHHHHHHHHHHhhccCC
Q psy15801 204 HFGIIRHVQQAGGSLYATVDREG-IVYTIEAVRKEIGHVHKFLASVVGKTE 253 (318)
Q Consensus 204 ~~~i~~~l~~lG~~l~~~~~~d~-~~~~~~~l~~~l~~~l~ll~d~l~~P~ 253 (318)
+..|.+.|...|..+......+. +.+.+. -.++++.+-..|...+.||.
T Consensus 13 AqaF~DYl~sqgI~~~i~~~~~~~~~lwl~-de~~~~~a~~el~~Fl~nP~ 62 (101)
T PF12122_consen 13 AQAFIDYLASQGIELQIEPEGQGQFALWLH-DEEHLEQAEQELEEFLQNPN 62 (101)
T ss_dssp HHHHHHHHHHTT--EEEE-SSSE--EEEES--GGGHHHHHHHHHHHHHS-S
T ss_pred HHHHHHHHHHCCCeEEEEECCCCceEEEEe-CHHHHHHHHHHHHHHHHCCC
Confidence 45777888888977777764443 555555 66899999999999999995
No 54
>PRK06063 DNA polymerase III subunit epsilon; Provisional
Probab=27.98 E-value=1.3e+02 Score=28.18 Aligned_cols=69 Identities=16% Similarity=0.141 Sum_probs=50.6
Q ss_pred HHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEEEEEe---cCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhH
Q psy15801 193 SAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVYTIEA---VRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRL 266 (318)
Q Consensus 193 ~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~---l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i 266 (318)
...|.|+-.++..++.+.+..+||.+..++++...++-.-- .+.-+.+|-+ +-=|..+|++|-+....+
T Consensus 237 ~~v~TG~l~~~R~e~~~~~~~~G~~v~~sVs~~t~~lv~g~~~~~ssK~~kA~~-----~gi~ii~e~~f~~ll~~~ 308 (313)
T PRK06063 237 RVALSAEVSRTHEELVERILHAGLAYSDSVDRDTSLVVCNDPAPEQGKGYHARQ-----LGVPVLDEAAFLELLRAV 308 (313)
T ss_pred EEEEecCCCCCHHHHHHHHHHcCCEecCccccCccEEEECCCCCcccHHHHHHH-----cCCccccHHHHHHHHHhh
Confidence 45678888889999999999999999999988666554421 2234555555 345788899988776544
No 55
>cd04923 ACT_AK-LysC-DapG-like_2 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, as well as, the second and fourth, of four, ACT domains present in cyanobacteria AK. Also included are the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase isoenzyme AKI found in Bacilli (B. subtilis strain 168), Clostridia, and Actinobacteria bacterial species. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=25.55 E-value=2e+02 Score=18.83 Aligned_cols=44 Identities=9% Similarity=0.067 Sum_probs=33.6
Q ss_pred HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhh
Q psy15801 205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVV 249 (318)
Q Consensus 205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l 249 (318)
..+...+...|..+...+.. ...+++.+...+.+.++..+.+.+
T Consensus 18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~~~~~~~~~l~~~l 61 (63)
T cd04923 18 AKMFKALAEAGINIEMISTS-EIKISCLVDEDDAEKAVRALHEAF 61 (63)
T ss_pred HHHHHHHHHCCCCEEEEEcc-CCeEEEEEeHHHHHHHHHHHHHHh
Confidence 35667788888888776643 477888888899999988887665
No 56
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=25.16 E-value=2e+02 Score=25.98 Aligned_cols=44 Identities=16% Similarity=0.157 Sum_probs=35.5
Q ss_pred HHHHHHHHHhCCeEeEEec--------CceEEEEEEecCCcHHHHHHHHHHh
Q psy15801 205 FGIIRHVQQAGGSLYATVD--------REGIVYTIEAVRKEIGHVHKFLASV 248 (318)
Q Consensus 205 ~~i~~~l~~lG~~l~~~~~--------~d~~~~~~~~l~~~l~~~l~ll~d~ 248 (318)
.+|.+.+..+||.+..... .....+++.+|++.|+..++-|..+
T Consensus 66 ~~i~~~~~~~gG~i~~~~~~~~~~~~~~~~~~ltiRVP~~~~~~~l~~l~~~ 117 (262)
T PF14257_consen 66 KKIENLVESYGGYIESSSSSSSGGSDDERSASLTIRVPADKFDSFLDELSEL 117 (262)
T ss_pred HHHHHHHHHcCCEEEEEeeecccCCCCcceEEEEEEECHHHHHHHHHHHhcc
Confidence 4677888999998765544 4578999999999999999988753
No 57
>PF13840 ACT_7: ACT domain ; PDB: 3S1T_A 1ZHV_A 3AB4_K 3AB2_O 2DTJ_A 3AAW_A 2RE1_B 3MAH_A 1ZVP_D.
Probab=24.93 E-value=2.2e+02 Score=19.73 Aligned_cols=37 Identities=14% Similarity=0.213 Sum_probs=27.2
Q ss_pred HHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHH
Q psy15801 209 RHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLAS 247 (318)
Q Consensus 209 ~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d 247 (318)
..|...|.++..-+ .+...++-+..+++++++..|.+
T Consensus 29 ~~La~~~I~i~~is--S~~~~~ilV~~~~~~~A~~~L~~ 65 (65)
T PF13840_consen 29 SALAEAGINIFMIS--SEISISILVKEEDLEKAVEALHE 65 (65)
T ss_dssp HHHHHTTS-ECEEE--ESSEEEEEEEGGGHHHHHHHHHH
T ss_pred HHHHHCCCCEEEEE--EeeeEEEEEeHHHHHHHHHHhcC
Confidence 44566777776666 66677788889999999998754
No 58
>KOG4043|consensus
Probab=24.78 E-value=55 Score=27.77 Aligned_cols=11 Identities=27% Similarity=0.311 Sum_probs=5.0
Q ss_pred EEEecCCCCCC
Q psy15801 171 VAFKAGARYET 181 (318)
Q Consensus 171 l~~~~Gs~~e~ 181 (318)
+-+.+|+..|.
T Consensus 119 FkLh~GPl~Eq 129 (214)
T KOG4043|consen 119 FKLHAGPLDEQ 129 (214)
T ss_pred eecCCCCchHH
Confidence 33445555443
No 59
>KOG2573|consensus
Probab=24.77 E-value=24 Score=34.35 Aligned_cols=21 Identities=14% Similarity=0.181 Sum_probs=10.8
Q ss_pred ceEEEEEEecCCcHHHHHHHH
Q psy15801 225 EGIVYTIEAVRKEIGHVHKFL 245 (318)
Q Consensus 225 d~~~~~~~~l~~~l~~~l~ll 245 (318)
+-|+.....++.++++=|+++
T Consensus 390 ~pts~fGe~Lr~qVEeRL~fy 410 (498)
T KOG2573|consen 390 DPTSVFGEKLREQVEERLEFY 410 (498)
T ss_pred CCchHHHHHHHHHHHHHHHhh
Confidence 334444455555566655543
No 60
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=24.72 E-value=4e+02 Score=22.48 Aligned_cols=70 Identities=17% Similarity=0.181 Sum_probs=46.5
Q ss_pred hHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHH----------------HHHHHhCCCCChHHHHH
Q psy15801 53 PQALQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVV----------------RQAVTTGVVKSLPDLLA 116 (318)
Q Consensus 53 ~~~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg----------------~~~~~~g~~~~~~~~~~ 116 (318)
..++.-++.++.+.+++.+.|+++.+.++..+...-++- --...|| +.+-.|.+...+.++.+
T Consensus 64 ~Kl~~gl~~A~~KRpVs~e~ie~~v~~ie~~Lr~~g~~E-V~S~~IG~~VM~~Lk~lD~VAYvRFASVYr~F~dv~~F~e 142 (156)
T COG1327 64 EKLRRGLIRACEKRPVSSEQIEEAVSHIERQLRSSGERE-VPSKEIGELVMEELKKLDEVAYVRFASVYRSFKDVDDFEE 142 (156)
T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHHhcchhhhhhhhhHhcccCCHHHHHH
Confidence 556777777888889999999999999998888776542 1112222 13334555566666666
Q ss_pred HhhccCc
Q psy15801 117 EVEAVSS 123 (318)
Q Consensus 117 ~I~~VT~ 123 (318)
.|..++.
T Consensus 143 ~i~~l~~ 149 (156)
T COG1327 143 EIEELTK 149 (156)
T ss_pred HHHHHHh
Confidence 6666544
No 61
>cd04936 ACT_AKii-LysC-BS-like_2 ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis strain 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive AK isoenzymes. The B. subtilis strain 168 AKII is induced by methionine and repressed and inhibited by lysine. Although C. glutamicum is known to contain a single AK, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In corynebacteria and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regu
Probab=24.50 E-value=2.1e+02 Score=18.72 Aligned_cols=44 Identities=9% Similarity=0.043 Sum_probs=33.7
Q ss_pred HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhh
Q psy15801 205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVV 249 (318)
Q Consensus 205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l 249 (318)
..+...+...|..+...+.. ...+++.+...+.+.++..+.+.+
T Consensus 18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~d~~~~~~~l~~~~ 61 (63)
T cd04936 18 AKMFEALAEAGINIEMISTS-EIKISCLIDEDDAEKAVRALHEAF 61 (63)
T ss_pred HHHHHHHHHCCCcEEEEEcc-CceEEEEEeHHHHHHHHHHHHHHh
Confidence 35667788888888777643 477888899899999988887655
No 62
>PHA01735 hypothetical protein
Probab=22.38 E-value=1.3e+02 Score=21.87 Aligned_cols=32 Identities=13% Similarity=0.138 Sum_probs=23.6
Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHh
Q psy15801 56 LQASIDVVRGAGIKDADIARGKALLKRHLADS 87 (318)
Q Consensus 56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~ 87 (318)
..+.+.+++.+..|..+|..|..-++.+=+.+
T Consensus 18 t~El~~RiksgeATtaDL~AA~d~Lk~NdItg 49 (76)
T PHA01735 18 TNELLSRIKSGEATTADLRAACDWLKSNDITG 49 (76)
T ss_pred HHHHHHHHhcCcccHHHHHHHHHHHHHCCCce
Confidence 34558888888899999988887777654433
No 63
>KOG3089|consensus
Probab=22.31 E-value=68 Score=28.83 Aligned_cols=42 Identities=5% Similarity=0.059 Sum_probs=26.5
Q ss_pred CCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCC
Q psy15801 235 RKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQ 276 (318)
Q Consensus 235 ~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~ 276 (318)
+++++..+..=.|.+..-.|+.++-..-...+..-.-+.+++
T Consensus 4 ~d~ldD~l~yq~d~vheV~v~~e~~~qPtkkve~f~i~qddD 45 (271)
T KOG3089|consen 4 PDDLDDGLAYQQDTVHEVPVPSEKTKQPTKKVECFLIQQDDD 45 (271)
T ss_pred cccccchhhhhhccccccccCccccCCCcchhheeecccCCc
Confidence 456777777777777777788877666554444334444444
No 64
>PF08288 PIGA: PIGA (GPI anchor biosynthesis); InterPro: IPR013234 This domain is found on phosphatidylinositol N-acetylglucosaminyltransferase proteins. These proteins are involved in GPI anchor biosynthesis and are associated with the disease paroxysmal nocturnal haemoglobinuria [].; GO: 0006506 GPI anchor biosynthetic process
Probab=21.94 E-value=71 Score=24.39 Aligned_cols=15 Identities=20% Similarity=-0.035 Sum_probs=12.6
Q ss_pred EEEcCCCcEEEEEec
Q psy15801 148 SGTSYNKVKVVSVNE 162 (318)
Q Consensus 148 ~~~L~NGlkv~~~~~ 162 (318)
+.-|.||+||+++|.
T Consensus 9 vRyltngLKVYYlP~ 23 (90)
T PF08288_consen 9 VRYLTNGLKVYYLPL 23 (90)
T ss_pred eEEcCCCeEEEeecc
Confidence 456899999999885
No 65
>KOG2138|consensus
Probab=21.52 E-value=63 Score=33.73 Aligned_cols=13 Identities=8% Similarity=0.028 Sum_probs=6.8
Q ss_pred CCCHHHHHHHHHH
Q psy15801 67 GIKDADIARGKAL 79 (318)
Q Consensus 67 ~~t~~EL~raK~~ 79 (318)
.+|-.|-+|++-.
T Consensus 488 ~~smsewere~er 500 (883)
T KOG2138|consen 488 DPSMSEWERERER 500 (883)
T ss_pred CcchhHHHHhhhh
Confidence 3455555565554
No 66
>PF07499 RuvA_C: RuvA, C-terminal domain; InterPro: IPR011114 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure []. In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB. The domain represents the C-terminal domain III of RuvA. This domain plays a significant role in the ATP-dependent branch migration of the hetero-duplex through direct contact with RuvB []. Within the Holliday junction, this domain makes no interaction with the DNA.; GO: 0005524 ATP binding, 0009378 four-way junction helicase activity, 0006281 DNA repair, 0006310 DNA recombination, 0009379 Holliday junction helicase complex; PDB: 1HJP_A 1CUK_A 1C7Y_A 1IXS_A 1IXR_B 1BVS_E 2ZTC_A 2ZTD_B 2H5X_A.
Probab=21.14 E-value=2.1e+02 Score=18.57 Aligned_cols=28 Identities=18% Similarity=0.317 Sum_probs=22.6
Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHHHHH
Q psy15801 55 ALQASIDVVRGAGIKDADIARGKALLKR 82 (318)
Q Consensus 55 ~~~~~l~~l~~~~~t~~EL~raK~~lk~ 82 (318)
...+++.+|..-|++..|+.++...+..
T Consensus 2 ~~~d~~~AL~~LGy~~~e~~~av~~~~~ 29 (47)
T PF07499_consen 2 ALEDALEALISLGYSKAEAQKAVSKLLE 29 (47)
T ss_dssp HHHHHHHHHHHTTS-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHhhc
Confidence 4567788888889999999999988765
No 67
>KOG2573|consensus
Probab=20.27 E-value=1.1e+02 Score=29.85 Aligned_cols=9 Identities=11% Similarity=0.512 Sum_probs=3.7
Q ss_pred CCHHHHHHH
Q psy15801 68 IKDADIARG 76 (318)
Q Consensus 68 ~t~~EL~ra 76 (318)
+++-++++|
T Consensus 137 L~~~d~~ka 145 (498)
T KOG2573|consen 137 LDPGDLEKA 145 (498)
T ss_pred CCCccHHHH
Confidence 444444433
No 68
>KOG0487|consensus
Probab=20.11 E-value=64 Score=30.38 Aligned_cols=50 Identities=12% Similarity=-0.010 Sum_probs=35.0
Q ss_pred HHHHHHHHHhhccCCCCH---HHHhhhhhhHHHHHhhCCCChhhHHHHhhHHH
Q psy15801 239 GHVHKFLASVVGKTEFRP---WEVSDLTPRLKYDRLTRPSQGNEKKKKKNKKK 288 (318)
Q Consensus 239 ~~~l~ll~d~l~~P~f~e---~e~~~~k~~i~~ei~~~~~~~~~~~~~~~~~~ 288 (318)
-+.++|--|.|.|-=... .||.+.......++..+..+.|||.||-|+..
T Consensus 245 ~QtlELEkEFlfN~YitkeKR~ElSr~lNLTeRQVKIWFQNRRMK~KK~~re~ 297 (308)
T KOG0487|consen 245 HQTLELEKEFLFNMYITKEKRLELSRTLNLTERQVKIWFQNRRMKEKKVNREN 297 (308)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHhcccchhheeeeehhhhhHHhhhhhhh
Confidence 567777777776544333 36666666667778888889999999887643
No 69
>COG5137 Histone chaperone involved in gene silencing [Transcription / Chromatin structure and dynamics]
Probab=20.06 E-value=5.4e+02 Score=23.06 Aligned_cols=62 Identities=18% Similarity=0.171 Sum_probs=40.7
Q ss_pred CCCCcEEEEcCC--CcEEEEEec---CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHH
Q psy15801 142 FSPDIQSGTSYN--KVKVVSVNE---TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRH 210 (318)
Q Consensus 142 ~~~~~~~~~L~N--Glkv~~~~~---~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~ 210 (318)
.||.+....|+. |++|+++.. ...+|.++.+++.|- +||..+.....-.-..+.....+.+.
T Consensus 77 DpP~v~liPlse~~gvTvILlsc~Y~g~eFvRvGYyVnney-------~gi~~~e~~d~E~p~~kvd~~~v~r~ 143 (279)
T COG5137 77 DPPDVNLIPLSEMFGVTVILLSCRYKGQEFVRVGYYVNNEY-------PGITKLEKSDVEEPSEKVDEEDVERE 143 (279)
T ss_pred CCCCccccchhhhhceeEEEEEEeecCceeEEEEEEeccCC-------cchhhhhhhhhcCCchhcCHHHHHHH
Confidence 456666665654 999998875 467999999998764 48866665555444555544444443
Done!