Query         psy15801
Match_columns 318
No_of_seqs    255 out of 1557
Neff          7.7 
Searched_HMMs 46136
Date          Fri Aug 16 21:41:43 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15801.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15801hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0960|consensus               99.9 8.9E-25 1.9E-29  202.8  11.8  139  141-279    29-167 (467)
  2 TIGR02110 PQQ_syn_pqqF coenzym  99.9 1.3E-23 2.8E-28  214.0  16.3  131  148-278     2-134 (696)
  3 PF00675 Peptidase_M16:  Insuli  99.9 8.3E-23 1.8E-27  171.2  14.6  122  156-277     1-123 (149)
  4 COG0612 PqqL Predicted Zn-depe  99.9 2.2E-22 4.8E-27  197.1  13.7  137  144-280    15-152 (438)
  5 PRK15101 protease3; Provisiona  99.9 5.1E-22 1.1E-26  211.8  15.3  135  143-277    41-177 (961)
  6 KOG2067|consensus               99.8 3.4E-21 7.3E-26  179.3   9.2  134  144-277    23-156 (472)
  7 KOG2583|consensus               99.8   1E-18 2.3E-23  162.8  13.0  134  145-278    22-156 (429)
  8 KOG2067|consensus               99.7 1.1E-17 2.3E-22  156.1   9.9  131    2-132   277-432 (472)
  9 COG1025 Ptr Secreted/periplasm  99.7 2.4E-17 5.1E-22  168.6  12.0  133  142-274    20-154 (937)
 10 KOG0959|consensus               99.7 4.8E-16   1E-20  161.2  12.5  137  142-278    24-163 (974)
 11 KOG0960|consensus               99.6 1.5E-15 3.2E-20  141.7   7.7  126    2-128   283-430 (467)
 12 PRK15101 protease3; Provisiona  99.6 6.1E-14 1.3E-18  149.9  17.5  213   56-275   372-650 (961)
 13 PTZ00432 falcilysin; Provision  99.4 7.9E-13 1.7E-17  142.6  12.2  111  145-259    91-207 (1119)
 14 PTZ00432 falcilysin; Provision  99.3 3.3E-10 7.2E-15  122.4  20.3  120  152-272   666-803 (1119)
 15 KOG2019|consensus               98.8 1.4E-06   3E-11   87.3  21.4  259    4-270   328-696 (998)
 16 COG1026 Predicted Zn-dependent  98.7 1.4E-06   3E-11   90.8  21.2  262    2-272   291-665 (978)
 17 PF08367 M16C_assoc:  Peptidase  98.4 1.8E-05 3.9E-10   72.1  16.9  167   56-258    15-193 (248)
 18 COG0612 PqqL Predicted Zn-depe  98.3 1.8E-06 3.9E-11   84.7   8.3  121    5-132   273-418 (438)
 19 KOG2583|consensus               98.1 6.9E-06 1.5E-10   77.7   6.8  121    2-127   264-401 (429)
 20 COG1026 Predicted Zn-dependent  97.4 0.00044 9.5E-09   72.6   7.7  109  150-262    26-137 (978)
 21 KOG0961|consensus               96.9  0.0032 6.8E-08   63.8   8.1  107  153-263    28-136 (1022)
 22 PF05193 Peptidase_M16_C:  Pept  96.6  0.0028 6.2E-08   53.0   4.4   71    2-80     93-184 (184)
 23 TIGR02110 PQQ_syn_pqqF coenzym  93.4     8.8 0.00019   40.3  19.0  198   60-266   102-346 (696)
 24 PF03410 Peptidase_M44:  Protei  92.3    0.16 3.5E-06   49.8   4.2   88  149-250     2-90  (590)
 25 KOG2019|consensus               92.3    0.18 3.8E-06   51.7   4.5  107  152-262    60-169 (998)
 26 KOG0959|consensus               91.7      18  0.0004   39.2  18.8  221   44-274   591-861 (974)
 27 PHA03081 putative metalloprote  90.1    0.29 6.3E-06   48.1   3.5   98  149-260     2-100 (595)
 28 PF05193 Peptidase_M16_C:  Pept  89.6    0.75 1.6E-05   38.1   5.3  109  153-266    66-184 (184)
 29 COG1025 Ptr Secreted/periplasm  85.7     5.1 0.00011   42.9   9.6  117  153-274   509-629 (937)
 30 KOG0961|consensus               82.6     4.9 0.00011   41.6   7.6  124    2-127   826-962 (1022)
 31 PF10278 Med19:  Mediator of RN  76.6       1 2.2E-05   38.8   0.6   24  204-227    40-63  (178)
 32 PF07946 DUF1682:  Protein of u  71.9      22 0.00047   33.7   8.4   49  223-271   213-269 (321)
 33 KOG0345|consensus               65.7      11 0.00025   37.4   5.1   15  225-239   424-438 (567)
 34 KOG2422|consensus               58.9       2 4.3E-05   43.4  -1.4    7  283-289    88-94  (665)
 35 PF09186 DUF1949:  Domain of un  51.2      69  0.0015   21.1   6.9   46  205-250     9-54  (56)
 36 PF08496 Peptidase_S49_N:  Pept  49.9      22 0.00048   30.0   3.7   11  264-274    52-62  (155)
 37 KOG3794|consensus               48.6      37  0.0008   32.8   5.3   27   53-79     22-48  (453)
 38 smart00874 B5 tRNA synthetase   47.4      32 0.00069   24.4   3.8   50  201-250    18-67  (71)
 39 TIGR00869 sec62 protein transl  45.0      13 0.00027   33.6   1.6    7  118-124    72-78  (232)
 40 PF03484 B5:  tRNA synthetase B  43.3      63  0.0014   23.0   4.8   49  201-250    18-66  (70)
 41 PF08496 Peptidase_S49_N:  Pept  40.1      22 0.00047   30.0   2.2   15  259-273    51-65  (155)
 42 PRK14552 C/D box methylation g  39.8     4.7  0.0001   39.6  -2.1   15  233-247   330-344 (414)
 43 PRK14552 C/D box methylation g  39.6     5.1 0.00011   39.3  -2.0   16  267-282   372-387 (414)
 44 KOG3794|consensus               37.9      23  0.0005   34.2   2.2    7  152-158   117-123 (453)
 45 cd04914 ACT_AKi-DapG-BS_1 ACT   37.0      91   0.002   21.9   4.8   49  205-254    17-65  (67)
 46 PF00675 Peptidase_M16:  Insuli  35.7 1.2E+02  0.0027   24.3   6.1   54   55-108    87-140 (149)
 47 PF15289 RFXA_RFXANK_bdg:  Regu  33.1      14 0.00031   29.7   0.0   16  276-291    19-34  (124)
 48 PF08432 Vfa1:  AAA-ATPase Vps4  32.4      16 0.00035   31.6   0.2   34  251-284    50-85  (182)
 49 PF05890 Ebp2:  Eukaryotic rRNA  31.6      88  0.0019   28.9   4.9   27  245-275   110-136 (271)
 50 KOG2138|consensus               30.9      26 0.00057   36.4   1.4   14  242-255   774-787 (883)
 51 PF09851 SHOCT:  Short C-termin  29.9 1.3E+02  0.0027   17.9   4.2   17   65-81     14-30  (31)
 52 KOG4043|consensus               29.9      40 0.00086   28.6   2.1   11  185-195    68-78  (214)
 53 PF12122 DUF3582:  Protein of u  28.1   2E+02  0.0044   22.3   5.7   49  204-253    13-62  (101)
 54 PRK06063 DNA polymerase III su  28.0 1.3E+02  0.0029   28.2   5.6   69  193-266   237-308 (313)
 55 cd04923 ACT_AK-LysC-DapG-like_  25.6   2E+02  0.0044   18.8   5.4   44  205-249    18-61  (63)
 56 PF14257 DUF4349:  Domain of un  25.2   2E+02  0.0044   26.0   6.2   44  205-248    66-117 (262)
 57 PF13840 ACT_7:  ACT domain ; P  24.9 2.2E+02  0.0048   19.7   5.1   37  209-247    29-65  (65)
 58 KOG4043|consensus               24.8      55  0.0012   27.8   2.1   11  171-181   119-129 (214)
 59 KOG2573|consensus               24.8      24 0.00051   34.3  -0.1   21  225-245   390-410 (498)
 60 COG1327 Predicted transcriptio  24.7   4E+02  0.0086   22.5   7.1   70   53-123    64-149 (156)
 61 cd04936 ACT_AKii-LysC-BS-like_  24.5 2.1E+02  0.0046   18.7   5.4   44  205-249    18-61  (63)
 62 PHA01735 hypothetical protein   22.4 1.3E+02  0.0027   21.9   3.2   32   56-87     18-49  (76)
 63 KOG3089|consensus               22.3      68  0.0015   28.8   2.3   42  235-276     4-45  (271)
 64 PF08288 PIGA:  PIGA (GPI ancho  21.9      71  0.0015   24.4   2.0   15  148-162     9-23  (90)
 65 KOG2138|consensus               21.5      63  0.0014   33.7   2.1   13   67-79    488-500 (883)
 66 PF07499 RuvA_C:  RuvA, C-termi  21.1 2.1E+02  0.0046   18.6   4.0   28   55-82      2-29  (47)
 67 KOG2573|consensus               20.3 1.1E+02  0.0025   29.8   3.4    9   68-76    137-145 (498)
 68 KOG0487|consensus               20.1      64  0.0014   30.4   1.8   50  239-288   245-297 (308)
 69 COG5137 Histone chaperone invo  20.1 5.4E+02   0.012   23.1   7.3   62  142-210    77-143 (279)

No 1  
>KOG0960|consensus
Probab=99.92  E-value=8.9e-25  Score=202.77  Aligned_cols=139  Identities=21%  Similarity=0.284  Sum_probs=133.7

Q ss_pred             CCCCCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeE
Q psy15801        141 LFSPDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYA  220 (318)
Q Consensus       141 ~~~~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~  220 (318)
                      ...|+.+.++|+||++|.+.++..++++|+++|++||++|.+.++|.+||++||.|+||.+++...+..+++.+|+.+|+
T Consensus        29 ~~~P~t~vttL~NGlrVaTE~~~a~TATVGVwidaGSR~EnekNNG~ahFLEhlaFKGT~~Rs~~alElEieniGahLNA  108 (467)
T KOG0960|consen   29 LSVPETEVTTLPNGLRVATEHNSASTATVGVWIDAGSRFENEKNNGTAHFLEHLAFKGTKNRSQAALELEIENIGAHLNA  108 (467)
T ss_pred             ccCCcceEEEcCCCcEEEeccCCCcceEEEEEeccCccccccccccHHHHHHHHHhcCCCcchhHHHHHHHHHHHHHhcc
Confidence            34578899999999999999998899999999999999999999999999999999999999999999999999999999


Q ss_pred             EecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChhh
Q psy15801        221 TVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGNE  279 (318)
Q Consensus       221 ~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~~  279 (318)
                      +++||+|.|+..++++++++++++|+||++|..+.+.+|+++|..|+.|+++.+.+.++
T Consensus       109 ytSReqT~yyakal~~dv~kavdiLaDIlqns~L~~s~IerER~vILrEmqevd~~~~e  167 (467)
T KOG0960|consen  109 YTSREQTVYYAKALSKDVPKAVDILADILQNSKLEESAIERERDVILREMQEVDKNHQE  167 (467)
T ss_pred             cccccceeeehhhccccchHHHHHHHHHHHhCccchhHHHHHHHHHHHHHHHHHhhhhH
Confidence            99999999999999999999999999999999999999999999999999998877664


No 2  
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=99.91  E-value=1.3e-23  Score=214.05  Aligned_cols=131  Identities=17%  Similarity=0.152  Sum_probs=124.9

Q ss_pred             EEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHHHhCCeEeEEecCc
Q psy15801        148 SGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQQAGGSLYATVDRE  225 (318)
Q Consensus       148 ~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~~lG~~l~~~~~~d  225 (318)
                      ..+|+|||+|+++++ ..|++++.++|++|+++|+.+.+|++||++||+|+||++++.. +|.+.++.+||++|+++++|
T Consensus         2 ~~tL~NGLrVllv~~p~~p~vav~l~v~aGS~~Ep~~~~GLAHfLEHMLFkGT~~~~~~~~i~~~le~lGG~lNA~Ts~d   81 (696)
T TIGR02110         2 RITLPNGLRVHLYHQPDAKRAAALLRVAAGSHDEPSAWPGLAHFLEHLLFLGGERFQGDDRLMPWVQRQGGQVNATTLER   81 (696)
T ss_pred             eEEcCCCCEEEEEECCCCCEEEEEEEEeeccCCCCCCCCcHHHHHHHHHhcCCCCCCcHHHHHHHHHHhCCeEEEEEcCC
Confidence            468999999999998 5799999999999999999999999999999999999999985 89999999999999999999


Q ss_pred             eEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChh
Q psy15801        226 GIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGN  278 (318)
Q Consensus       226 ~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~  278 (318)
                      +|.|++++++++++.+|++|+|++.+|.|++++|+++|.++++|+++..+++.
T Consensus        82 ~T~y~~~v~~~~l~~aL~lLaD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp~  134 (696)
T TIGR02110        82 TTAFFFELPAAALAAGLARLCDMLARPLLTAEDQQREREVLEAEYIAWQNDAD  134 (696)
T ss_pred             eEEEEEEecHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCHH
Confidence            99999999999999999999999999999999999999999999998776553


No 3  
>PF00675 Peptidase_M16:  Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ;  InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include:  Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC)  These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=99.90  E-value=8.3e-23  Score=171.24  Aligned_cols=122  Identities=22%  Similarity=0.382  Sum_probs=117.8

Q ss_pred             EEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEEEEEec
Q psy15801        156 KVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAV  234 (318)
Q Consensus       156 kv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l  234 (318)
                      +|++.++ ..|+++++++|++|+++|+.+++|++|+++||+++||.+++..+|.+.++.+|+.+++++++|++.|+++++
T Consensus         1 ~V~~~~~~~~~~~~~~l~~~~Gs~~e~~~~~G~a~ll~~l~~~gs~~~~~~~l~~~l~~~G~~~~~~t~~d~t~~~~~~~   80 (149)
T PF00675_consen    1 KVVLVEDPGSPVVSVSLVFKAGSRYEPPGKPGLAHLLEHLLFRGSKKYSSDELQEELESLGASFNASTSRDSTSYSASVL   80 (149)
T ss_dssp             EEEEEESTTSSEEEEEEEES-SGGGSCTTTTTHHHHHHHHTTSBBSSSBHHHHHHHHHHTTCEEEEEEESSEEEEEEEEE
T ss_pred             CEEEEEcCCCCEEEEEEEEeeccCCCCCCCCchhhhhhhhcccccchhhhhhhHHHhhhhccccceEecccceEEEEEEe
Confidence            6889998 789999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801        235 RKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG  277 (318)
Q Consensus       235 ~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~  277 (318)
                      +++++.+|++|++++.+|.|++++|+++|..+..++++...++
T Consensus        81 ~~~~~~~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~  123 (149)
T PF00675_consen   81 SEDLEKALELLADMLFNPSFDEEEFEREREQILQEIEEIKENP  123 (149)
T ss_dssp             GGGHHHHHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHH
T ss_pred             cccchhHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCH
Confidence            9999999999999999999999999999999999999988776


No 4  
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=99.88  E-value=2.2e-22  Score=197.09  Aligned_cols=137  Identities=22%  Similarity=0.278  Sum_probs=131.0

Q ss_pred             CCcEEEEcCCCcEEEEEecC-CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEe
Q psy15801        144 PDIQSGTSYNKVKVVSVNET-SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATV  222 (318)
Q Consensus       144 ~~~~~~~L~NGlkv~~~~~~-~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~  222 (318)
                      +.++..+|+||+++++.+++ .|.+++.++|++|+++|+.+..|+|||++||+|+||++++..+|.+.++.+||.+++++
T Consensus        15 ~~~~~~~L~nGl~~~~~~~~~~~~vs~~~~v~~Gs~~e~~~~~G~AH~lehm~fkgt~~~~~~~i~~~~~~~G~~~na~t   94 (438)
T COG0612          15 PGLQVFTLPNGLRVITYPNPTAPTVSLDVWVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAELAEAFEKLGGQLNAFT   94 (438)
T ss_pred             ccceEEEcCCCCEEEEEeCCCCCEEEEEEEEeecccCCCCCcccHHHHHHHHHccCCCCCChHHHHHHHHHhcCeeeccc
Confidence            45899999999999999995 89999999999999999999999999999999999999998899999999999999999


Q ss_pred             cCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCChhhH
Q psy15801        223 DREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQGNEK  280 (318)
Q Consensus       223 ~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~~~~  280 (318)
                      ++|+|.|++++++++++.+|++++|++.+|.|++++|+++|.++++||++..+++..-
T Consensus        95 s~d~t~y~~~~l~~~~~~~l~llad~l~~p~f~~~~~e~Ek~vil~ei~~~~d~p~~~  152 (438)
T COG0612          95 SFDYTVYYLSVLPDNLDKALDLLADILLNPTFDEEEVEREKGVILEEIRMRQDDPDDL  152 (438)
T ss_pred             cchhhhhhhhhchhhhHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHhhccCchHH
Confidence            9999999999999999999999999999999999999999999999999998876543


No 5  
>PRK15101 protease3; Provisional
Probab=99.88  E-value=5.1e-22  Score=211.76  Aligned_cols=135  Identities=13%  Similarity=0.175  Sum_probs=127.4

Q ss_pred             CCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCC-HHHHHHHHHHhCCeEeE
Q psy15801        143 SPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFT-HFGIIRHVQQAGGSLYA  220 (318)
Q Consensus       143 ~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s-~~~i~~~l~~lG~~l~~  220 (318)
                      +..++..+|+|||+|+++++ ..|.+++++.|++|+++||.+.+|+|||++||+|+||.+++ ..+|.+.++.+||++|+
T Consensus        41 ~~~~~~~~L~NGL~v~l~~~~~~~~~~~~l~v~~Gs~~ep~~~~GlAHflEHmlf~GT~~~p~~~~~~~~l~~~Gg~~NA  120 (961)
T PRK15101         41 PRQYQAIRLDNGMTVLLVSDPQAVKSLAALALPVGSLEDPDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSHNA  120 (961)
T ss_pred             ccceEEEEeCCCCEEEEEeCCCCcceeEEEEeCcCCCCCCCCCCchHHHHHHHHhcCCccCCCcchHHHHHHHhCCCccc
Confidence            45678889999999999998 57999999999999999999999999999999999999996 67999999999999999


Q ss_pred             EecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801        221 TVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG  277 (318)
Q Consensus       221 ~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~  277 (318)
                      +|+.|+|.|++++++++++.+|++|++++.+|.|+++++++++..+.+|++....++
T Consensus       121 ~T~~d~T~y~~~~~~~~l~~aL~~~ad~~~~P~f~~~~~erE~~~v~~E~~~~~~~~  177 (961)
T PRK15101        121 STASYRTAFYLEVENDALPPAVDRLADAIAEPLLDPKNADRERNAVNAELTMARSRD  177 (961)
T ss_pred             eECCCceEEEEEcCHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHhcCCH
Confidence            999999999999999999999999999999999999999999999999998665543


No 6  
>KOG2067|consensus
Probab=99.84  E-value=3.4e-21  Score=179.26  Aligned_cols=134  Identities=25%  Similarity=0.325  Sum_probs=129.0

Q ss_pred             CCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEec
Q psy15801        144 PDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVD  223 (318)
Q Consensus       144 ~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~  223 (318)
                      .+.++++|+||++|.+.+++++.++++++++.|+++|.+...|++||++.|.|++|.+++..+|...||.+||.++++++
T Consensus        23 ~~~kvttL~NGlkvase~~pg~f~~vGlyIdsGsrYE~~~~~GisH~lerLAF~ST~~~~~~ei~~~LE~~GGn~~cqsS  102 (472)
T KOG2067|consen   23 SNTKVTTLPNGLKVASENTPGQFCTVGLYIDSGSRYEAKYFSGISHFLERLAFKSTERFSSKEILAELEKLGGNCDCQSS  102 (472)
T ss_pred             ccceeeecCCccEEeccCCCCCceEEEEEEecCccccCcCcccHHHHHHHHhhccccCCcHHHHHHHHHHhCCccccccc
Confidence            47789999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCCh
Q psy15801        224 REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQG  277 (318)
Q Consensus       224 ~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~~  277 (318)
                      +|++.|.+++.+++++.++.+|+|.+++|.|.++|++..+..+..|+++....+
T Consensus       103 Retm~Yaas~~~~~v~sm~~lLadtV~~P~~~d~ev~~~~~~v~~E~~el~~~P  156 (472)
T KOG2067|consen  103 RETMMYAASADSDGVDSMVELLADTVLNPKFTDQEVEEARRAVKYEIEELWMRP  156 (472)
T ss_pred             HhhhHHHHHhhhcccHHHHHHHHHHHhcccccHHHHHHHHHhhhheccccccCc
Confidence            999999999999999999999999999999999999999999999999876544


No 7  
>KOG2583|consensus
Probab=99.79  E-value=1e-18  Score=162.85  Aligned_cols=134  Identities=30%  Similarity=0.492  Sum_probs=127.3

Q ss_pred             CcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecC
Q psy15801        145 DIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDR  224 (318)
Q Consensus       145 ~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~  224 (318)
                      ......|.|||+|...+++.|+.++.+.|++|||+|+..++|++|+++...++.|+.++...|.+.++.+||.++.+.+|
T Consensus        22 ~~~~~kl~ngL~Vas~e~~~~is~l~l~~~AGSRYe~~~~~G~sHllr~f~g~~Tq~~sal~ivr~se~~GG~Lss~~tR  101 (429)
T KOG2583|consen   22 ISKTTKLVNGLTVASREAPTAISSLSLAFRAGSRYEPADQQGLSHLLRNFVGRDTQERSALKIVRESEQLGGTLSSTATR  101 (429)
T ss_pred             hhhhhccccceEEEeccCCCcceEEEEEEecCccCCccccccHHHHHHHhcccCccccchhhhhhhhHhhCceeeeeeec
Confidence            34567899999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhh-hhHHHHHhhCCCChh
Q psy15801        225 EGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLT-PRLKYDRLTRPSQGN  278 (318)
Q Consensus       225 d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k-~~i~~ei~~~~~~~~  278 (318)
                      |++.|++++++++++-.|.+|++++.+|.|.+||++++. ..|..++....++..
T Consensus       102 e~~~~tvt~lrd~~~~~l~~L~~V~~~paFkPwEl~D~~~~ti~~~l~~~t~~~~  156 (429)
T KOG2583|consen  102 ELIGLTVTFLRDDLEYYLSLLGDVLDAPAFKPWELEDVVLATIDADLAYQTPYTI  156 (429)
T ss_pred             ceEEEEEEEecccHHHHHHHHHHhhcccCcCchhhhhhhhhhhHHHhhhcChHHH
Confidence            999999999999999999999999999999999999998 888888888777554


No 8  
>KOG2067|consensus
Probab=99.73  E-value=1.1e-17  Score=156.08  Aligned_cols=131  Identities=24%  Similarity=0.360  Sum_probs=109.1

Q ss_pred             CchhHHHHHHHHHHhcCC-CCCCCCCCCCccccc----cc-------CCCCCCccccccccCc---------hHHHHHHH
Q psy15801          2 SAKEALAFAVLKYALGAG-PAKTGKNQRSTNSKG----LS-------RTPGRESLDESHHFSP---------PQALQASI   60 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~g-~~~k~~~~~~~l~~~----~~-------~~~~~~~~~~~a~~~~---------~~~~~~~l   60 (318)
                      +++|+++++|||.+|||| +||.+|+|.|++++.    +.       |....+.|++++++..         .++++-+.
T Consensus       277 ~deD~v~~avLq~lmGGGGSFSAGGPGKGMySrLY~~vLNry~wv~sctAfnhsy~DtGlfgi~~s~~P~~a~~aveli~  356 (472)
T KOG2067|consen  277 NDEDFVALAVLQMLMGGGGSFSAGGPGKGMYSRLYLNVLNRYHWVYSCTAFNHSYSDTGLFGIYASAPPQAANDAVELIA  356 (472)
T ss_pred             CChhHHHHHHHHHHhcCCcccCCCCCCcchHHHHHHHHHhhhHHHHHhhhhhccccCCceeEEeccCCHHHHHHHHHHHH
Confidence            578999999999999975 589999998877652    22       3344566888888822         34555556


Q ss_pred             HHHH--cCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccc--ccccc
Q psy15801         61 DVVR--GAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVS--AKRSF  132 (318)
Q Consensus        61 ~~l~--~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~--~~r~~  132 (318)
                      .+|.  .++++.+||+|||+|+++.++|+|||.--..|++|||.+.+|.+..+.+++..|++||++||.  +.+.+
T Consensus       357 ~e~~~~~~~v~~~el~RAK~qlkS~LlMNLESR~V~~EDvGRQVL~~g~rk~p~e~~~~Ie~lt~~DI~rva~kvl  432 (472)
T KOG2067|consen  357 KEMINMAGGVTQEELERAKTQLKSMLLMNLESRPVAFEDVGRQVLTTGERKPPDEFIKKIEQLTPSDISRVASKVL  432 (472)
T ss_pred             HHHHHHhCCCCHHHHHHHHHHHHHHHHhcccccchhHHHHhHHHHhccCcCCHHHHHHHHHhcCHHHHHHHHHHHh
Confidence            6666  367999999999999999999999999999999999999999999999999999999999998  34544


No 9  
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=99.72  E-value=2.4e-17  Score=168.59  Aligned_cols=133  Identities=12%  Similarity=0.134  Sum_probs=125.6

Q ss_pred             CCCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCH-HHHHHHHHHhCCeEe
Q psy15801        142 FSPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTH-FGIIRHVQQAGGSLY  219 (318)
Q Consensus       142 ~~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~-~~i~~~l~~lG~~l~  219 (318)
                      ....++..+|||||+++++++ ..+...+.|.|++|+..||.+.+|+|||++||+|-||++++. ..+...|..+||+.|
T Consensus        20 d~r~y~~I~LpNGl~~LlisDP~a~ks~aAL~V~vGs~~DP~e~~GLAHflEHmlfmGseKYP~~~~f~~fLskhgGs~N   99 (937)
T COG1025          20 DDRKYRAIKLPNGLRALLVSDPQADKSSAALVVPVGSFDDPEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSHN   99 (937)
T ss_pred             cCcceeEEECCCCceEEEecCCCCCccceeEEeecCCCCChhhcccHHHHHHHHHHhcCccCCCccchHHHHHHcCCccc
Confidence            346788999999999999998 578999999999999999999999999999999999999975 599999999999999


Q ss_pred             EEecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801        220 ATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP  274 (318)
Q Consensus       220 ~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~  274 (318)
                      ++|..+.|.|++++..++|+.+|+.|++++.+|.|+++.+++|+..+.+|.....
T Consensus       100 A~T~~~~T~fyFeV~~~al~~ALDrFa~ff~~PLf~~e~~dRE~~AV~sE~~~~~  154 (937)
T COG1025         100 ASTAGERTAFYFEVENDALEGALDRFADFFIEPLFNKEALDRERNAVNSEFTMNL  154 (937)
T ss_pred             cccCCCceeEEEEecHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHHHHhcCc
Confidence            9999999999999999999999999999999999999999999999999987654


No 10 
>KOG0959|consensus
Probab=99.66  E-value=4.8e-16  Score=161.22  Aligned_cols=137  Identities=12%  Similarity=0.157  Sum_probs=125.1

Q ss_pred             CCCCcEEEEcCCCcEEEEEecC-CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHHHhCCeEe
Q psy15801        142 FSPDIQSGTSYNKVKVVSVNET-SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQQAGGSLY  219 (318)
Q Consensus       142 ~~~~~~~~~L~NGlkv~~~~~~-~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~~lG~~l~  219 (318)
                      ....+...+|+|||++++++++ ....++.+-+.+||..||.+.+|||||++||+|.||++++.+ ++...+.+.||+-|
T Consensus        24 d~r~yr~~~L~Ngl~alLisDp~tD~ssaal~V~vGS~~DP~dl~GLAHF~EHMlFmGS~KYP~En~y~~~lsk~gGssN  103 (974)
T KOG0959|consen   24 DTREYRGIELTNGLRALLISDPKTDKSSAALDVKVGSFSDPEDLQGLAHFCEHMLFMGSEKYPDENEYSKFLSKNGGSSN  103 (974)
T ss_pred             CccceeEEEecCCceEEEecCCCCCccceeeeeeccccCCccccccHHHHHHHHHhhccccCCCcchhHHHHHhcCCccc
Confidence            3456788999999999999884 567889999999999999999999999999999999999754 78888889999999


Q ss_pred             EEecCceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCC-Chh
Q psy15801        220 ATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPS-QGN  278 (318)
Q Consensus       220 ~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~-~~~  278 (318)
                      +.|..++|.|++.+..++|+.||+.|++++.+|.|.++.++|++..+..|.+.... +.|
T Consensus       104 A~T~~e~T~y~F~V~~~~l~~ALDrFaqFf~~Plf~~~a~eREv~AVdSE~~~nl~~D~w  163 (974)
T KOG0959|consen  104 AYTDSEHTNYYFDVQHDHLEGALDRFAQFFSDPLFNKSATEREVGAVDSEHEKNLNSDGW  163 (974)
T ss_pred             cccccccceEEEecchHHHHHHHHHHHHHhhCcccChHHHHHHHHHHHHHHHhccCcchh
Confidence            99999999999999999999999999999999999999999999999999877553 443


No 11 
>KOG0960|consensus
Probab=99.60  E-value=1.5e-15  Score=141.68  Aligned_cols=126  Identities=20%  Similarity=0.263  Sum_probs=102.6

Q ss_pred             CchhHHHHHHHHHHhcCCCCCC--CCCCCCcccccccCCCCC-------CccccccccCc----------hHHHHHH---
Q psy15801          2 SAKEALAFAVLKYALGAGPAKT--GKNQRSTNSKGLSRTPGR-------ESLDESHHFSP----------PQALQAS---   59 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~g~~~k--~~~~~~~l~~~~~~~~~~-------~~~~~~a~~~~----------~~~~~~~---   59 (318)
                      +|||+++++|.+.|+|+|.+..  +.+.+++|++.+....++       ..|.++|+...          ++.++.+   
T Consensus       283 ~~pD~~~l~van~iiG~wdr~~g~g~~~~s~La~~~~~~~l~~sfqsFnt~YkDTGLwG~y~V~~~~~~iddl~~~vl~e  362 (467)
T KOG0960|consen  283 AHPDYFALMVANTIIGNWDRTEGGGRNLSSRLAQKIQQDQLCHSFQSFNTSYKDTGLWGIYFVTDNLTMIDDLIHSVLKE  362 (467)
T ss_pred             CCccHHHHHHHHHHhhhhhcccCCccCCccHHHHHHHHHHHHHHHhhhhcccccccceeEEEEecChhhHHHHHHHHHHH
Confidence            5899999999999999998653  334488888877654433       22444444421          4444444   


Q ss_pred             HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCcccccc
Q psy15801         60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSA  128 (318)
Q Consensus        60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~  128 (318)
                      |.+|.. .+|+.||+|||++|+.+++..||+..-+|++||+|++++|++.++.++.++|++||+.||+.
T Consensus       363 W~rL~~-~vteaEV~RAKn~Lkt~Lll~ldgttpi~ediGrqlL~~Grri~l~El~~rId~vt~~~Vr~  430 (467)
T KOG0960|consen  363 WMRLAT-SVTEAEVERAKNQLKTNLLLSLDGTTPIAEDIGRQLLTYGRRIPLAELEARIDAVTAKDVRE  430 (467)
T ss_pred             HHHHHh-hccHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhcCCcCChHHHHHHHhhccHHHHHH
Confidence            666665 79999999999999999999999999999999999999999999999999999999999983


No 12 
>PRK15101 protease3; Provisional
Probab=99.57  E-value=6.1e-14  Score=149.92  Aligned_cols=213  Identities=9%  Similarity=0.041  Sum_probs=158.2

Q ss_pred             HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHh-cccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccccc-----
Q psy15801         56 LQASIDVVRGAGIKDADIARGKALLKRHLADS-FETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSAK-----  129 (318)
Q Consensus        56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~-les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~~-----  129 (318)
                      +...|+.|+..+++++|++++|+.+..++... .......+..++..+ ....+.........++.+++++|+..     
T Consensus       372 i~~~i~~l~~~g~~~~el~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~i~~~~~~l~  450 (961)
T PRK15101        372 IFSYLNLLREKGIDKSYFDELAHVLDLDFRYPSITRDMDYIEWLADTM-LRVPVEHTLDAPYIADRYDPKAIKARLAEMT  450 (961)
T ss_pred             HHHHHHHHHhcCCcHHHHHHHHHHHhccccCCCCCChHHHHHHHHHHh-hhCCHHHheeCchhhhcCCHHHHHHHHhhcC
Confidence            33447777788999999999999999887443 223333445555432 22222223333455666777777620     


Q ss_pred             -------------------cccccc------------------cc-----CcC-------C-------CCCCcEEEEcCC
Q psy15801        130 -------------------RSFAAQ------------------PA-----TKA-------L-------FSPDIQSGTSYN  153 (318)
Q Consensus       130 -------------------r~~~~~------------------~~-----p~~-------~-------~~~~~~~~~L~N  153 (318)
                                         .+|..+                  ..     |..       +       ....++...++|
T Consensus       451 ~~n~~i~~~~~~~~~~~~~~~~~~~Y~~~~i~~~~~~~~~~~~~~~~l~lP~~n~fip~~~~~~~~~~~~~~p~~i~~~~  530 (961)
T PRK15101        451 PQNARIWYISPQEPHNKTAYFVDAPYQVDKISEQTFADWQQKAQNIALSLPELNPYIPDDFSLIKADKAYKHPELIVDEP  530 (961)
T ss_pred             HhHEEEEEEeCCCCCCccccccCCcceeecCCHHHHHHHhcCCCCccCCCCCCCCccCCCCeeccCCCCCCCCeEEEcCC
Confidence                               011110                  00     110       0       012367788899


Q ss_pred             CcEEEEEecC----CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEE
Q psy15801        154 KVKVVSVNET----SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVY  229 (318)
Q Consensus       154 Glkv~~~~~~----~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~  229 (318)
                      |++|++.+++    .|.+.+.+.|.+|...+++...|++.|+..|+     ..+..++....+..|.+++.. +.+.+.+
T Consensus       531 g~~vw~~~d~~f~~~Pk~~i~~~~~~~~~~~~~~~~~l~~L~~~ll-----~~~l~e~~y~a~~aG~~~~~~-~~~g~~i  604 (961)
T PRK15101        531 GLRVVYMPSQYFADEPKADISLVLRNPKAMDSARNQVLFALNDYLA-----GLALDQLSNQASVGGISFSTN-ANNGLMV  604 (961)
T ss_pred             CeEEEEeCCCccccCCCEEEEEEEeCCCccCCHHHHHHHHHHHHHH-----HHHHHHHhchHHhcCcEEEEc-cCCCEEE
Confidence            9999999874    59999999999999999999999999999997     345567788888899999999 7999999


Q ss_pred             EEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCC
Q psy15801        230 TIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPS  275 (318)
Q Consensus       230 ~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~  275 (318)
                      ++++++++++.+|+++.+++.+|.|++++|+++|..++.+++....
T Consensus       605 ~v~g~s~~l~~ll~~l~d~l~~~~~~~~~fe~~k~~~~~~l~~~~~  650 (961)
T PRK15101        605 NANGYTQRLPQLLQALLEGYFSFTPTEEQLAQAKSWYREQLDSAEK  650 (961)
T ss_pred             EEEecChhHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhhhcc
Confidence            9999999999999999999999999999999999999999987553


No 13 
>PTZ00432 falcilysin; Provisional
Probab=99.42  E-value=7.9e-13  Score=142.56  Aligned_cols=111  Identities=11%  Similarity=0.097  Sum_probs=98.5

Q ss_pred             CcEEEEcCCCcEEEEEecCC---CEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhC--CeEe
Q psy15801        145 DIQSGTSYNKVKVVSVNETS---PIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAG--GSLY  219 (318)
Q Consensus       145 ~~~~~~L~NGlkv~~~~~~~---p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG--~~l~  219 (318)
                      .+...-.+||++|++++++.   +.+.++++|++|+..+    .|++|+++||+|.||.+++..++...++..|  +.+|
T Consensus        91 ~~~~~H~~nGl~vl~~~~~d~~~~~~~f~i~f~T~~~d~----~G~aH~LEH~~f~GS~k~p~~~~~~~l~~~gl~~~lN  166 (1119)
T PTZ00432         91 ATVYSHKKTGLQVISLKTNDSSGKEMCFDFYVPTPPHND----KGIPHILEHSVLSGSKKYNYKDSFSLLVQGGFNSFLN  166 (1119)
T ss_pred             EEEEEEcCCCCEEEEEecCCCccceeEEEEEecCCCCCC----cchhHHHHHHHhCCCCCCCcccHHHHHHhcCcCCCcc
Confidence            34555678999999988643   5899999999998554    5999999999999999999999999998865  8899


Q ss_pred             EEecCceEEEEEEecCC-cHHHHHHHHHHhhccCCCCHHHH
Q psy15801        220 ATVDREGIVYTIEAVRK-EIGHVHKFLASVVGKTEFRPWEV  259 (318)
Q Consensus       220 ~~~~~d~~~~~~~~l~~-~l~~~l~ll~d~l~~P~f~e~e~  259 (318)
                      +.|+.|+|.|.+.++.+ ++..+|++++|++.+|.|+++++
T Consensus       167 A~T~~D~T~Y~~~~~~e~d~~~~ldv~~d~v~~P~~~~~~~  207 (1119)
T PTZ00432        167 AYTFKDRTSYLFASTNEKDFYNTADVYMDSVFQPNILEDKD  207 (1119)
T ss_pred             ccCCCCceEEEeccCCHHHHHHHHHHHHHHHhCcCcccccc
Confidence            99999999999999885 79999999999999999999864


No 14 
>PTZ00432 falcilysin; Provisional
Probab=99.26  E-value=3.3e-10  Score=122.43  Aligned_cols=120  Identities=10%  Similarity=0.049  Sum_probs=91.8

Q ss_pred             CCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEE----ec---
Q psy15801        152 YNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYAT----VD---  223 (318)
Q Consensus       152 ~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~----~~---  223 (318)
                      .+|+.|+..+- .+.++.+.++|+.....+ ...+=+.-|...+..-||.+++..++...++..-|.++++    ++   
T Consensus       666 ~~~~~~~~~~~~TnGi~y~~~~fdl~~l~~-e~~~yl~L~~~~l~~~gT~~~s~~el~~~i~~~tGg~~~~~~~~~~~~~  744 (1119)
T PTZ00432        666 GGSVTVLVHPIESRGILYLDFAFSLDSLTV-DELKYLNLFKALLKENGTDKLSSEEFTYKREKNLGGLSASTAFYSETNN  744 (1119)
T ss_pred             CCCcceEEEecCCCCeEEEEEEecCCCCCH-HHHhhHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCCeEEEEEEeccccc
Confidence            57899988876 577999999999986543 2334444444444456999999999999999987777654    22   


Q ss_pred             ---------CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH-HhhhhhhHHHHHhh
Q psy15801        224 ---------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE-VSDLTPRLKYDRLT  272 (318)
Q Consensus       224 ---------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e-~~~~k~~i~~ei~~  272 (318)
                               ...+.+++.++.++++.+++|+.+++.++.|++.+ +..+..+.++++.+
T Consensus       745 ~~~~~~~~~~~~~~v~~k~l~~~~~~~~~l~~eil~~~~f~d~~rl~~il~~~~~~~~~  803 (1119)
T PTZ00432        745 LTYDDPYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKT  803 (1119)
T ss_pred             cccCcccccceEEEEEEEEhhhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHH
Confidence                     34689999999999999999999999999998766 65555555555544


No 15 
>KOG2019|consensus
Probab=98.77  E-value=1.4e-06  Score=87.25  Aligned_cols=259  Identities=14%  Similarity=0.140  Sum_probs=171.8

Q ss_pred             hhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCcccccc-ccC--------------c-------hHHHHHHHH
Q psy15801          4 KEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESH-HFS--------------P-------PQALQASID   61 (318)
Q Consensus         4 ~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a-~~~--------------~-------~~~~~~~l~   61 (318)
                      =+.+||.||..+|-+|+       +||+|+++.+.+....|++.. +..              +       ..++..++.
T Consensus       328 ~etfaL~~L~~Ll~~gp-------sSp~yk~LiESGLGtEfsvnsG~~~~t~~~~fsVGLqGvseediekve~lV~~t~~  400 (998)
T KOG2019|consen  328 YETFALKVLSHLLLDGP-------SSPFYKALIESGLGTEFSVNSGYEDTTLQPQFSVGLQGVSEEDIEKVEELVMNTFN  400 (998)
T ss_pred             HHHHHHHHHHHHhcCCC-------ccHHHHHHHHcCCCcccccCCCCCcccccceeeeeeccccHHHHHHHHHHHHHHHH
Confidence            36789999999999999       899999998776553233222 221              1       335555677


Q ss_pred             HHHcCCCCHHHHHHHHHHHHHHHHHhcccc-hhHHHHHHHHHHHhCCCCChHHHHHHhhcc---Ccc---cc-c------
Q psy15801         62 VVRGAGIKDADIARGKALLKRHLADSFETL-DSTVDSVVRQAVTTGVVKSLPDLLAEVEAV---SSS---DV-S------  127 (318)
Q Consensus        62 ~l~~~~~t~~EL~raK~~lk~~~~~~les~-~~~a~~lg~~~~~~g~~~~~~~~~~~I~~V---T~e---dV-~------  127 (318)
                      .|...+++.+-++....+++.++--..-+- -+++..+...|+..++|.....+-+.|..+   -.+   .| +      
T Consensus       401 ~lae~gfd~drieAil~qiEislk~qst~fGL~L~~~i~~~W~~d~DPfE~Lk~~~~L~~lk~~l~ek~~~lfq~lIkkY  480 (998)
T KOG2019|consen  401 KLAETGFDNDRIEAILHQIEISLKHQSTGFGLSLMQSIISKWINDMDPFEPLKFEEQLKKLKQRLAEKSKKLFQPLIKKY  480 (998)
T ss_pred             HHHHhccchHHHHHHHHHhhhhhhccccchhHHHHHHHhhhhccCCCccchhhhhhHHHHHHHHHhhhchhHHHHHHHHH
Confidence            787778999999999999887665443221 224666777787777765544444443332   111   00 0      


Q ss_pred             --ccc-cc-----ccc----------------------------------------ccCcC--------C-C-CCCc---
Q psy15801        128 --AKR-SF-----AAQ----------------------------------------PATKA--------L-F-SPDI---  146 (318)
Q Consensus       128 --~~r-~~-----~~~----------------------------------------~~p~~--------~-~-~~~~---  146 (318)
                        .+. ++     +++                                        ..|..        + . |+.+   
T Consensus       481 ilnn~h~~t~smqpd~e~~~~~~~eE~tkL~ek~~alteeD~~ei~k~~~eL~~kQ~tp~dlsClPtL~vsDIp~~~~~~  560 (998)
T KOG2019|consen  481 ILNNPHCFTFSMQPDPEFAEKLEQEEATKLEEKKAALTEEDLAEIAKAGEELREKQSTPEDLSCLPTLNVSDIPKTIPYT  560 (998)
T ss_pred             HhcCCceEEEEecCCchhhHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHhhCCccccccccccccccCCCCCCcc
Confidence              011 11     100                                        01221        0 0 1111   


Q ss_pred             E-EEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEe--
Q psy15801        147 Q-SGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATV--  222 (318)
Q Consensus       147 ~-~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~--  222 (318)
                      . ....-||++|..... .+.++.+.++++.|+.-+. -.+=+.-|...++..||...+..|+++.+...-|.++++.  
T Consensus       561 ~~~v~dingvkv~~~dl~tngi~Y~r~~~~l~~~p~e-L~PylPlfc~sll~lGt~~lsf~el~qqI~rkTGGiS~~p~~  639 (998)
T KOG2019|consen  561 KLEVGDINGVKVQRCDLFTNGITYTRVVFDLNSLPEE-LLPYLPLFCQSLLNLGTGDLSFVELEQQIGRKTGGISVSPLV  639 (998)
T ss_pred             ceeeeeccCceeEEeeccCCceEEEEEeeccccCcHH-hhcchHHHHHHHHhcCCCcccHHHHHHHhhhhcCceeeccee
Confidence            1 223459999988877 7889999999999996543 3567888889999999999999999999999877776442  


Q ss_pred             --cC------ceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH-HhhhhhhHHHHH
Q psy15801        223 --DR------EGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE-VSDLTPRLKYDR  270 (318)
Q Consensus       223 --~~------d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e-~~~~k~~i~~ei  270 (318)
                        +.      -.+.|+..++..+.+..++|...++.++.|.+.+ |.....+..+++
T Consensus       640 ~s~~~~d~p~~~i~~~~~~l~rn~~dlfel~n~il~e~~f~n~dkfkvlvk~s~s~~  696 (998)
T KOG2019|consen  640 SSDDGMDEPELGIVFSGSMLDRNADDLFELWNKILQETCFTNQDKFKVLVKQSASRM  696 (998)
T ss_pred             ccCCCCCccceeEEechhhhcCChhHHHHHHHHHhcccCcccHHHHHHHHHHHHHHh
Confidence              11      2256677788899999999999999999999654 444444444443


No 16 
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=98.74  E-value=1.4e-06  Score=90.80  Aligned_cols=262  Identities=14%  Similarity=0.144  Sum_probs=177.4

Q ss_pred             CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCccccccccC---------------c-------hHHHHHH
Q psy15801          2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESHHFS---------------P-------PQALQAS   59 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a~~~---------------~-------~~~~~~~   59 (318)
                      |.-|..|+.||..+|-++.       ++|+++++.+.+.. ...++++..               +       .+.+-.+
T Consensus       291 d~~~~lal~vL~~iLl~~~-------asPl~~~liesglg-~~~~~g~~~~~~~~~~f~v~~~gv~~ek~~~~k~lV~~~  362 (978)
T COG1026         291 DAEDSLALEVLEEILLDSA-------ASPLTQALIESGLG-FADVSGSYDSDLKETIFSVGLKGVSEEKIAKLKNLVLST  362 (978)
T ss_pred             cHHHHHHHHHHHHHHccCc-------ccHHHHHHHHcCCC-cccccceeccccceeEEEEEecCCCHHHHHHHHHHHHHH
Confidence            3457899999999999888       78999999877654 122333221               1       4456666


Q ss_pred             HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchh--HHHHHHHHHHHhCCCCChHH---HHHHhhc-cCccc-ccc----
Q psy15801         60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDS--TVDSVVRQAVTTGVVKSLPD---LLAEVEA-VSSSD-VSA----  128 (318)
Q Consensus        60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~--~a~~lg~~~~~~g~~~~~~~---~~~~I~~-VT~ed-V~~----  128 (318)
                      ++.+...+++.+.|+.+..+++.+.-..-...-+  +......+++..|+|..-..   ++++|.. +...+ ...    
T Consensus       363 L~~l~~~gi~~~~ie~~~~q~E~s~ke~~s~pfgl~l~~~~~~gw~~G~dp~~~Lr~~~~~~~Lr~~le~~~~fe~LI~k  442 (978)
T COG1026         363 LKELVKNGIDKKLIEAILHQLEFSLKEVKSYPFGLGLMFRSLYGWLNGGDPEDSLRFLDYLQNLREKLEKGPYFEKLIRK  442 (978)
T ss_pred             HHHHHHhcCCHHHHHHHHHHHHHhhhhhcCCCccHHHHHHhccccccCCChhhhhhhHHHHHHHHHhhhcChHHHHHHHH
Confidence            8888888899999999999999888776444444  45556667777777655444   4444433 22222 110    


Q ss_pred             ----cccc------cc----------------------------------------cccCcC-----------C-C-C--
Q psy15801        129 ----KRSF------AA----------------------------------------QPATKA-----------L-F-S--  143 (318)
Q Consensus       129 ----~r~~------~~----------------------------------------~~~p~~-----------~-~-~--  143 (318)
                          +.++      ++                                        +..|..           . . |  
T Consensus       443 y~l~N~h~~~v~~~Ps~~~~~~~ekee~e~L~~~~~~l~de~~~ki~~~~~~lke~Q~~~dse~~~~~lP~l~~~dvp~~  522 (978)
T COG1026         443 YFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLKLGDVPDP  522 (978)
T ss_pred             HhhcCCccEEEEEecChHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHhhcCCCchhhhhhccccchhcCCCc
Confidence                1111      00                                        012211           0 0 1  


Q ss_pred             -CCc-EEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeE
Q psy15801        144 -PDI-QSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYA  220 (318)
Q Consensus       144 -~~~-~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~  220 (318)
                       +.. -...-.|..+|+..+. ++.++.+.++|+.. .-.....+=+.-|...|...||.+++..++..+++..-|.+++
T Consensus       523 ~~k~~l~~~~~~~~~v~~~~~~tn~i~yl~~~~~~~-~l~~~llpyL~L~~~~l~~lgt~~~~y~e~~~~i~~~TGgis~  601 (978)
T COG1026         523 IEKTSLETEVSNEAKVLHHDLFTNGITYLRLYFDLD-MLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGISV  601 (978)
T ss_pred             ccccceeeeccCCcceEEeecCCCCeEEEEEEeecC-CCChhhhhhHHHHHHHHHhcCCCCcCHHHHHHHHHHHhCCcee
Confidence             122 2223347888888877 78999999999994 4444566778878888888999999999999999998777664


Q ss_pred             Eec-----------CceEEEEEEecCCcHHHHHHHHHHhhccCCCC-HHHHhhhhhhHHHHHhh
Q psy15801        221 TVD-----------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFR-PWEVSDLTPRLKYDRLT  272 (318)
Q Consensus       221 ~~~-----------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~-e~e~~~~k~~i~~ei~~  272 (318)
                      ..+           +..+.|++.++.+..+.+++++.+++.++.|+ .+-+..+..+.++.+..
T Consensus       602 ~~~~~~~~~~~~~~~~~~~i~~K~l~~k~~~~~~~i~~~l~~~~F~D~~Rlkell~q~~~~l~~  665 (978)
T COG1026         602 SLSVDTDPGDDGEYRPSFSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTS  665 (978)
T ss_pred             eEeeccCCCccccccceEEEEEEehhhhhhHHHHHHHHHHhcCCcCcHHHHHHHHHHHHhhhHH
Confidence            432           25678999999999999999999999999995 44444444444444444


No 17 
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=98.41  E-value=1.8e-05  Score=72.07  Aligned_cols=167  Identities=13%  Similarity=0.180  Sum_probs=105.7

Q ss_pred             HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccccccccccc
Q psy15801         56 LQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSAKRSFAAQ  135 (318)
Q Consensus        56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~~r~~~~~  135 (318)
                      ..+-++++++ .+|++|+++...+.+.-..++- .                     ++-++-|-.++.+||...-.    
T Consensus        15 e~~~L~~~k~-~Ls~~e~~~i~~~~~~L~~~Q~-~---------------------~d~~~~LP~L~~~Di~~~~~----   67 (248)
T PF08367_consen   15 EKEKLAAYKA-SLSEEEKEKIIEQTKELKERQE-A---------------------EDDLATLPTLSLSDIPREIE----   67 (248)
T ss_dssp             HHHHHHHHHH-CS-HHHHHHHHHHHHHHHHHHC-S-----------------------HHTTS----GGGS-SS------
T ss_pred             HHHHHHHHHh-hCCHHHHHHHHHHHHHHHHHhC-C---------------------hhhHHHHccccHHhcCCCCC----
Confidence            3344666655 5888888888777554222211 1                     11334566777888763211    


Q ss_pred             ccCcCCCCCCcEEEEcCCCcEEEEEec-CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHh
Q psy15801        136 PATKALFSPDIQSGTSYNKVKVVSVNE-TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQA  214 (318)
Q Consensus       136 ~~p~~~~~~~~~~~~L~NGlkv~~~~~-~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~l  214 (318)
                             ...+.... .+|++|++.+. .+.++.+.++|+.+.... ...+=+.-|..-|-.-||.+++..++...+..+
T Consensus        68 -------~~~~~~~~-~~~~~v~~~~~~TnGI~Y~~l~fdl~~l~~-e~l~yl~Ll~~ll~~lgT~~~sy~el~~~i~~~  138 (248)
T PF08367_consen   68 -------KIPLEVEK-LGGIPVLFHEQPTNGIVYVRLYFDLSDLPE-EDLPYLPLLTDLLGELGTKNYSYEELSNEIDLY  138 (248)
T ss_dssp             -----------EECC-CTTCEEEEEE---TTEEEEEEEEE-TTS-C-CCHCCHHHHHHHCCCS-BSSS-HHHHHHHHHHH
T ss_pred             -------CCCceeee-cCCccEEEEEcCCCCeEEEEEEecCCCCCH-HHHHhHHHHHHHHHhCCCCCCCHHHHHHHHHHh
Confidence                   11223333 37999998877 788999999999985443 345667777665556799999999999999998


Q ss_pred             CCeEeEEec-----------CceEEEEEEecCCcHHHHHHHHHHhhccCCCCHHH
Q psy15801        215 GGSLYATVD-----------REGIVYTIEAVRKEIGHVHKFLASVVGKTEFRPWE  258 (318)
Q Consensus       215 G~~l~~~~~-----------~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e  258 (318)
                      =|.++++..           .-++.|++.|+.++++++++|+.+++.+|.|++.+
T Consensus       139 tGGis~~~~~~~~~~~~~~~~~~l~is~k~L~~~~~~~~~ll~eil~~~~f~d~~  193 (248)
T PF08367_consen  139 TGGISFSIEVYTDYDDDDKYRPYLVISAKCLDEKLDEAFELLSEILTETDFDDKE  193 (248)
T ss_dssp             SSEEEEEEEEEEEECTECCCEEEEEEEEEEEGGGHHHHHHHHHHHHHCB-TT-HH
T ss_pred             CCCeEEEeeeccCCCCccceeEEEEEEEEeHhhhHHHHHHHHHHHHhccCCCcHH
Confidence            777765541           14567888999999999999999999999998875


No 18 
>COG0612 PqqL Predicted Zn-dependent peptidases [General function prediction only]
Probab=98.31  E-value=1.8e-06  Score=84.73  Aligned_cols=121  Identities=24%  Similarity=0.342  Sum_probs=90.2

Q ss_pred             hHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCC-C-------Ccccccccc------C---c---hHHHHHHHHHHH
Q psy15801          5 EALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPG-R-------ESLDESHHF------S---P---PQALQASIDVVR   64 (318)
Q Consensus         5 d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~-~-------~~~~~~a~~------~---~---~~~~~~~l~~l~   64 (318)
                      |++++.|+..+||++.       .|+|.+.+.+... +       ..+..++++      .   +   ...+.+++..++
T Consensus       273 ~~~~~~l~~~llgg~~-------~SrLf~~~re~~glay~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~  345 (438)
T COG0612         273 DYAALLLLNGLLGGGF-------SSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKTAELVEEILKALK  345 (438)
T ss_pred             hhHHHHHHHHHhCCCc-------chHHHHHHHHhcCceeeeccccccccccCCceEEEEecCCChhhHHHHHHHHHHHHH
Confidence            7899999999999885       5566655433221 1       111111111      1   1   335556666666


Q ss_pred             cC---CCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCcccccc--cccc
Q psy15801         65 GA---GIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVSA--KRSF  132 (318)
Q Consensus        65 ~~---~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~~--~r~~  132 (318)
                      ..   .+|+++++++|..+...++...++...+++.++......+.+.++.++...|++||++||.+  ++++
T Consensus       346 ~~~~~~~t~~~~~~~k~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vt~~dv~~~a~~~~  418 (438)
T COG0612         346 KGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLITLEELLERIEAVTLEDVNAVAKKLL  418 (438)
T ss_pred             HHhccCCCHHHHHHHHHHHHHHhhhccCCHHHHHHHHHHHHHhcCCccCHHHHHHHHHhcCHHHHHHHHHHhc
Confidence            54   39999999999999999999999999999999988877789999999999999999999983  4444


No 19 
>KOG2583|consensus
Probab=98.09  E-value=6.9e-06  Score=77.74  Aligned_cols=121  Identities=21%  Similarity=0.194  Sum_probs=82.1

Q ss_pred             CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCC--------CCCccccccccCc---------hHHHHHHHHHHH
Q psy15801          2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTP--------GRESLDESHHFSP---------PQALQASIDVVR   64 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~--------~~~~~~~~a~~~~---------~~~~~~~l~~l~   64 (318)
                      |+++..++.|+.++||.++.+|.+  .|.+.++..+..        ...+|++++++.-         ++++..++..++
T Consensus       264 ~~k~~~a~av~~~~Lg~~~~~k~~--t~~~~~aa~~a~~~~~s~sA~~a~ysDsGL~gv~~~~~~~~a~~~v~s~v~~lk  341 (429)
T KOG2583|consen  264 NLKVLAAQAVLLAALGNSAPVKRG--TGLLSEAAGAAGEQGASASAFNAPYSDSGLFGVYVSAQGSQAGKVVSSEVKKLK  341 (429)
T ss_pred             chHHHHHHHHHHHHHhcccccccc--cchHHHHHhhccccCceeeeecccccCCceEEEEEEecCccHHHHHHHHHHHHH
Confidence            689999999999999999988888  444554332221        2345888888732         677888888888


Q ss_pred             cCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCChHHHHHHhhccCccccc
Q psy15801         65 GAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAVSSSDVS  127 (318)
Q Consensus        65 ~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~VT~edV~  127 (318)
                      +..+++-|..+.++-.+.-....+.+..++-...+.+.  .. +..+++++++|++||+.||.
T Consensus       342 s~~~~~id~~~~~a~~~~l~~~~~ss~~a~~~~~~~~a--~~-~~~~d~~i~~id~Vt~sdV~  401 (429)
T KOG2583|consen  342 SALVSDIDNAKVKAAIKALKASYLSSVEALELATGSQA--NL-VSEPDAFIQQIDKVTASDVQ  401 (429)
T ss_pred             HHHhcCCcchHHHHHHHHHHHHhhcchHHHHHhhHHHh--cC-CCChHHHHHHhccccHHHHH
Confidence            65555555555555555544444444444444444333  22 23889999999999999998


No 20 
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=97.39  E-value=0.00044  Score=72.64  Aligned_cols=109  Identities=13%  Similarity=0.138  Sum_probs=88.9

Q ss_pred             EcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHH--HHhCCeEeEEecCceE
Q psy15801        150 TSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHV--QQAGGSLYATVDREGI  227 (318)
Q Consensus       150 ~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l--~~lG~~l~~~~~~d~~  227 (318)
                      .-+.|++++.+.+..|-...+++|+.-+..+    .|++|.++|+.+.|+.+++-.+.--.+  ..++-.+|+.|+-|.|
T Consensus        26 H~~TGa~l~hi~~~d~~~vFsi~F~T~p~ds----tGVaHiLEHtvlcGS~kYPvkdPF~~ml~rSLntF~NA~T~~D~T  101 (978)
T COG1026          26 HEKTGAELAHIKNEDPNNVFSIAFKTEPHDS----TGVAHILEHTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKT  101 (978)
T ss_pred             eccCCceEEEecCCCcCceEEEEeecCCCCC----CCcchHHHHHhhhCCCCCCCCChHHHHHHHhHHHHHhhccCCCcc
Confidence            3468999999988777777888888865554    499999999999999999865443222  2345558999999999


Q ss_pred             EEEEEec-CCcHHHHHHHHHHhhccCCCCHHHHhhh
Q psy15801        228 VYTIEAV-RKEIGHVHKFLASVVGKTEFRPWEVSDL  262 (318)
Q Consensus       228 ~~~~~~l-~~~l~~~l~ll~d~l~~P~f~e~e~~~~  262 (318)
                      .|-++.. .+++-..|.+..|.+.+|...++.|-++
T Consensus       102 ~YP~sS~~~~Df~NLl~VYlDavf~PlL~~e~F~QE  137 (978)
T COG1026         102 VYPASSANEKDFYNLLSVYLDAVFHPLLTKESFLQE  137 (978)
T ss_pred             eeeccccCcchHHHHHHHHHHhhhCcccchHHHhhh
Confidence            9999755 5689999999999999999999988776


No 21 
>KOG0961|consensus
Probab=96.92  E-value=0.0032  Score=63.79  Aligned_cols=107  Identities=10%  Similarity=0.140  Sum_probs=89.3

Q ss_pred             CCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHH-hCCeEeEEecCceEEEEE
Q psy15801        153 NKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQ-AGGSLYATVDREGIVYTI  231 (318)
Q Consensus       153 NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~-lG~~l~~~~~~d~~~~~~  231 (318)
                      -|++|+....+.+++.-.+.|..-...+.    |+.|-++||.|-|+++++..-+-+.+.. .=|++|+-++-|+|.|++
T Consensus        28 Tkl~va~~~~pts~vhG~f~v~TEa~~d~----G~PHTLEHL~FMGSKkYP~kGvLd~~anr~l~dtNAwTDtD~T~YtL  103 (1022)
T KOG0961|consen   28 TKLRVAIGEVPTSMVHGAFSVVTEADSDD----GLPHTLEHLVFMGSKKYPFKGVLDVIANRCLADTNAWTDTDHTAYTL  103 (1022)
T ss_pred             cceEEEEeecCCcceeeeEEeeeeecCCC----CCchhHHHHhhhccccCCcccHHHHhhcchhcccccccccCcceEEe
Confidence            38889888887778888888887666655    9999999999999999998766555443 335699999999999999


Q ss_pred             Eec-CCcHHHHHHHHHHhhccCCCCHHHHhhhh
Q psy15801        232 EAV-RKEIGHVHKFLASVVGKTEFRPWEVSDLT  263 (318)
Q Consensus       232 ~~l-~~~l~~~l~ll~d~l~~P~f~e~e~~~~k  263 (318)
                      +.. .+.|-+.|..+.|-++.|...++.|-.+.
T Consensus       104 Stag~dGFlklLPvy~dHiL~P~Ltdeaf~TEV  136 (1022)
T KOG0961|consen  104 STAGSDGFLKLLPVYIDHILTPMLTDEAFATEV  136 (1022)
T ss_pred             ecccccchHHHhHHHHHhhcCcccchhhhhhhe
Confidence            855 56799999999999999999999988763


No 22 
>PF05193 Peptidase_M16_C:  Peptidase M16 inactive domain;  InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.  The peptidases in this group of sequences include:  Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC)  These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=96.57  E-value=0.0028  Score=53.03  Aligned_cols=71  Identities=25%  Similarity=0.382  Sum_probs=46.1

Q ss_pred             CchhHHHHHHHHHHhcCCCCCCCCCCCCcccccccCCCCCCcccccccc---------------Cc---h---HHHHHHH
Q psy15801          2 SAKEALAFAVLKYALGAGPAKTGKNQRSTNSKGLSRTPGRESLDESHHF---------------SP---P---QALQASI   60 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~g~~~k~~~~~~~l~~~~~~~~~~~~~~~~a~~---------------~~---~---~~~~~~l   60 (318)
                      +++|..++.+|..+||++.       .++|...+-+..... |++.+.+               .+   .   +.+...+
T Consensus        93 ~~~~~~~~~~l~~~l~~~~-------~s~l~~~lr~~~~l~-y~v~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l  164 (184)
T PF05193_consen   93 DSKDYFALNLLSSLLGNGM-------SSRLFQELREKQGLA-YSVSASNSSYRDSGLFSISFQVTPENLDEAIEAILQEL  164 (184)
T ss_dssp             TSTTHHHHHHHHHHHHCST-------TSHHHHHHHTTTTSE-SEEEEEEEEESSEEEEEEEEEEEGGGHHHHHHHHHHHH
T ss_pred             ccchhhHHHHHHHHHhcCc-------cchhHHHHHhccccc-eEEEeeeeccccceEEEEEEEcCcccHHHHHHHHHHHH
Confidence            6789999999999999996       566666654222110 1111111               11   2   2344447


Q ss_pred             HHHHcCCCCHHHHHHHHHHH
Q psy15801         61 DVVRGAGIKDADIARGKALL   80 (318)
Q Consensus        61 ~~l~~~~~t~~EL~raK~~l   80 (318)
                      ..+...+++++|++++|++|
T Consensus       165 ~~l~~~~~s~~el~~~k~~L  184 (184)
T PF05193_consen  165 KRLREGGISEEELERAKNQL  184 (184)
T ss_dssp             HHHHHHCS-HHHHHHHHHHH
T ss_pred             HHHHHcCCCHHHHHHHHhcC
Confidence            77777789999999999986


No 23 
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=93.44  E-value=8.8  Score=40.26  Aligned_cols=198  Identities=10%  Similarity=0.050  Sum_probs=101.5

Q ss_pred             HHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCCCC-----hHHHHHHhhccCcccccc--cccc
Q psy15801         60 IDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVVKS-----LPDLLAEVEAVSSSDVSA--KRSF  132 (318)
Q Consensus        60 l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~~~-----~~~~~~~I~~VT~edV~~--~r~~  132 (318)
                      .+.+..--++++|+++.|..+..++.+..++...++...-...++.+.+..     ..+-+..|..+|.+||+.  +++|
T Consensus       102 aD~l~~P~f~eeeierEr~vvl~Ei~~~~ddp~~~~~~~l~~~l~~~HPy~~~~iGt~esL~~it~~t~edL~~F~~~~Y  181 (696)
T TIGR02110       102 CDMLARPLLTAEDQQREREVLEAEYIAWQNDADTLREAALLDALQAGHPLRRFHAGSRDSLALPNTAFQQALRDFHRRHY  181 (696)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHcCCCCCCCCCCCCCHHHHhCcccchHHHHHHHHHHhc
Confidence            333444559999999999999999999888888777665444444333322     233333443455889984  3333


Q ss_pred             ccc---------cc---------------CcCCCC-CCcEEEEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccH
Q psy15801        133 AAQ---------PA---------------TKALFS-PDIQSGTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGI  187 (318)
Q Consensus       133 ~~~---------~~---------------p~~~~~-~~~~~~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gl  187 (318)
                      ...         ..               |....+ +.........+ ++.+.....+.+.+.+.+++-+..++.    .
T Consensus       182 ~p~NmvLvIvGdvs~eel~~l~e~~f~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~~q~~l~~~~p~~~~~d~~----a  256 (696)
T TIGR02110       182 QAGNMQLWLQGPQSLDELEQLAARFGASLAAGGECAQAPPAPLLRFD-RLTLAGGSEPRLWLLFALAGLPATARD----N  256 (696)
T ss_pred             chhcEEEEEEeCCCHHHHHHHHHHHhCCCCCCCCCCCCCCCCCCCCc-eeEEEecCcceEEEEEeecCCCCCChH----H
Confidence            210         00               000000 00000011122 233333344667777777765554432    2


Q ss_pred             HHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEec-----Cc--eEEEEEEe---cCCcHHHHHHHHHHhh----cc-C
Q psy15801        188 THVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVD-----RE--GIVYTIEA---VRKEIGHVHKFLASVV----GK-T  252 (318)
Q Consensus       188 a~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~-----~d--~~~~~~~~---l~~~l~~~l~ll~d~l----~~-P  252 (318)
                      ..++.+++..|...+    +...|...|-..++++.     ..  .+.+++.+   ..++.+.+...+.++|    .+ +
T Consensus       257 l~lL~~iLg~g~sSr----L~~~LRe~GLaysV~s~~~~~~~g~~lf~I~~~lt~~~~~~~~~v~~~i~~~L~~L~~~~~  332 (696)
T TIGR02110       257 VTLLCEFLQDEAPGG----LLAQLRERGLAESVAATWLYQDAGQALLALEFSARCISAAAAQQIEQLLTQWLGALAEQTW  332 (696)
T ss_pred             HHHHHHHhCCCcchH----HHHHHHHCCCEEEEEEeccccCCCCcEEEEEEEEcCCCccCHHHHHHHHHHHHHHHHhcCC
Confidence            456677666553332    33334445555444432     12  33444554   1336666665554443    33 1


Q ss_pred             CCCHHHHhhhhhhH
Q psy15801        253 EFRPWEVSDLTPRL  266 (318)
Q Consensus       253 ~f~e~e~~~~k~~i  266 (318)
                      .-..+|+.+++.+-
T Consensus       333 ~~~~eel~rlk~~~  346 (696)
T TIGR02110       333 AEQLEHYAQLAQRR  346 (696)
T ss_pred             CCCHHHHHHHHHhh
Confidence            34577888877763


No 24 
>PF03410 Peptidase_M44:  Protein G1;  InterPro: IPR005072 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M44 (clan ME). The active site residues for members of this family and family M16 occur in the motif HXXEHProtein. The type example is the vaccinia virus-type metalloendopeptidase G1 from vaccinia virus, it is a metalloendopeptidase expressed by many Poxviridae which appears to play a role in the maturation of viral proteins.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0019067 viral assembly, maturation, egress, and release
Probab=92.33  E-value=0.16  Score=49.77  Aligned_cols=88  Identities=16%  Similarity=0.101  Sum_probs=61.0

Q ss_pred             EEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801        149 GTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV  228 (318)
Q Consensus       149 ~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~  228 (318)
                      ..|+||++|+.-+.-..-+.+++. ..|-..|-.+-.|+|||++|+|-.    ++..-+         --|++|+|.+.+
T Consensus         2 IvL~NGVRiFin~~M~KDIYlGIs-~FGFe~DI~~iLGiAHLLEHILIs----FD~~~F---------~ANASTaRsYMS   67 (590)
T PF03410_consen    2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----FDSSKF---------LANASTARSYMS   67 (590)
T ss_pred             eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----cchHHh---------hcccchhhhhhh
Confidence            368999999998774444555443 566667777889999999999853    222111         247889999999


Q ss_pred             EEEEecCCc-HHHHHHHHHHhhc
Q psy15801        229 YTIEAVRKE-IGHVHKFLASVVG  250 (318)
Q Consensus       229 ~~~~~l~~~-l~~~l~ll~d~l~  250 (318)
                      |....++.. .-+++.-+..++.
T Consensus        68 FWC~si~g~~~~DAvrtliSWFF   90 (590)
T PF03410_consen   68 FWCKSIRGRTYIDAVRTLISWFF   90 (590)
T ss_pred             hhhhhccCCChhHHHHHHHHHhh
Confidence            988877654 4555665655543


No 25 
>KOG2019|consensus
Probab=92.31  E-value=0.18  Score=51.71  Aligned_cols=107  Identities=11%  Similarity=0.172  Sum_probs=81.4

Q ss_pred             CCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHH-HHHHHHH-HhCCeEeEEecCceEEE
Q psy15801        152 YNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHF-GIIRHVQ-QAGGSLYATVDREGIVY  229 (318)
Q Consensus       152 ~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~-~i~~~l~-~lG~~l~~~~~~d~~~~  229 (318)
                      +-|.+++.+.+..+--.+++.|+.-+..++    ||.|.++|....|+.+++-. =|.+.|. .+.-.+|+++.-|+|.|
T Consensus        60 ~Tgae~lhl~reD~N~vFsI~FrTpp~dst----GiPHILEHtvLCGS~KYPvrdPFfkmLnrSLatFmNAfT~pD~T~y  135 (998)
T KOG2019|consen   60 KTGAEVLHLDREDENNVFSIVFRTPPKDST----GIPHILEHTVLCGSRKYPVRDPFFKMLNRSLATFMNAFTAPDYTFY  135 (998)
T ss_pred             CCCceeEeeccCCCCceeEEEeecCCCccC----CCchhhhhheeeccCcCcccChHHHHHHHHHHHHHhhccCCCccee
Confidence            569999998874333334566776655554    99999999999999999743 3444443 34555899999999999


Q ss_pred             EEE-ecCCcHHHHHHHHHHhhccCCCCHHHHhhh
Q psy15801        230 TIE-AVRKEIGHVHKFLASVVGKTEFRPWEVSDL  262 (318)
Q Consensus       230 ~~~-~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~  262 (318)
                      =+. .-+.+|-...++..|....|.+.+.+|.++
T Consensus       136 PfattN~kDf~NL~dVYLDAtffPklr~~dF~QE  169 (998)
T KOG2019|consen  136 PFATTNTKDFYNLRDVYLDATFFPKLRKLDFQQE  169 (998)
T ss_pred             ecccCChHHHHHHHHHhhhcccchHHHhhhhhhh
Confidence            886 456789999999999999999888887765


No 26 
>KOG0959|consensus
Probab=91.69  E-value=18  Score=39.21  Aligned_cols=221  Identities=15%  Similarity=0.138  Sum_probs=130.7

Q ss_pred             cccccccCc-hHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH-hcccchhHHHHHHHHHHHhCCCCChHHHHHHhhcc
Q psy15801         44 LDESHHFSP-PQALQASIDVVRGAGIKDADIARGKALLKRHLAD-SFETLDSTVDSVVRQAVTTGVVKSLPDLLAEVEAV  121 (318)
Q Consensus        44 ~~~~a~~~~-~~~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~-~les~~~~a~~lg~~~~~~g~~~~~~~~~~~I~~V  121 (318)
                      ..++||+.- .-.+..+++.+..-.++.+.+.-+|..++..+-. .++.....|..+-... .....-+..+.+++++.|
T Consensus       591 ~~v~Gfnekl~~ll~~~~~~~~~f~~~~~rf~iike~~~~~~~n~~~~~p~~~a~~~~~ll-l~~~~W~~~e~~~al~~~  669 (974)
T KOG0959|consen  591 LRVSGFNEKLPLLLEKVVQMMANFELDEDRFEIIKELLKRELRNHAFDNPYQLANDYLLLL-LEESIWSKEELLEALDDV  669 (974)
T ss_pred             EEEeccCcccHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhhhhccHHHHHHHHHHHH-hhccccchHHHHHHhhcc
Confidence            456777754 3344555554444469999999999999988888 5666666676655333 333344778889999999


Q ss_pred             Ccccccc------ccccc------c---------------c---ccCcC---C--CCCCcEEEEcCCCcEEEEEec---C
Q psy15801        122 SSSDVSA------KRSFA------A---------------Q---PATKA---L--FSPDIQSGTSYNKVKVVSVNE---T  163 (318)
Q Consensus       122 T~edV~~------~r~~~------~---------------~---~~p~~---~--~~~~~~~~~L~NGlkv~~~~~---~  163 (318)
                      |.+|+..      .+.+-      .               .   ..|..   +  ....-...+|++|...++...   .
T Consensus       670 ~le~~~~F~~~~~~~~~~e~~i~GN~te~~A~~l~~~v~d~l~~~~~~~~p~~~~~~~~~~~~~lp~G~~~~~~~~~n~~  749 (974)
T KOG0959|consen  670 TLEDLESFISEFLQPFHLELLIHGNLTEKEALQLLKSVLDILKSAAPNSRPLFRSEHLPRREIQLPNGDYYFYRHLLNKT  749 (974)
T ss_pred             cHHHHHHHHHHHhhhhheEEEEecCcchHHHHHHHHHHHhhhhccCCCCccccccccCcccceeccCCceEEEEcccccC
Confidence            9999983      11110      0               0   01111   0  123456789999999987654   2


Q ss_pred             CCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEE----ecCceEEEEEEec--CCc
Q psy15801        164 SPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYAT----VDREGIVYTIEAV--RKE  237 (318)
Q Consensus       164 ~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~----~~~d~~~~~~~~l--~~~  237 (318)
                      -+-..+.+++..|....  ...-+..|+.+++-.-+     +++-+.-+++|=.+...    .+-..+.+.++..  .++
T Consensus       750 ~~ns~i~~~~Q~~~~~~--~~~~~~~L~~~li~ep~-----Fd~LRTkeqLGYiv~~~~r~~~G~~~~~i~Vqs~~~~~~  822 (974)
T KOG0959|consen  750 DDNSCIEVYYQIGVQDT--RDNAVLGLLEQLIKEPA-----FDQLRTKEQLGYIVSTGVRLNYGTVGLQITVQSEKSVDY  822 (974)
T ss_pred             CCCceEEEEEEcccchh--HHHHHHHHHHHHhccch-----HHhhhhHHhhCeEeeeeeeeecCcceeEEEEccCCCchH
Confidence            35666777777644432  24467777776655422     56778888888544433    3333444444332  112


Q ss_pred             HH----HHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801        238 IG----HVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP  274 (318)
Q Consensus       238 l~----~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~  274 (318)
                      ++    ..++-+.+.+.  ..++++|+..+..++..+.+..
T Consensus       823 le~rIe~fl~~~~~~i~--~m~~e~Fe~~~~~lI~~~~ek~  861 (974)
T KOG0959|consen  823 LEERIESFLETFLEEIV--EMSDEEFEKHKSGLIASKLEKP  861 (974)
T ss_pred             HHHHHHHHHHHHHHHHH--hcchhhhhhhHHHHHHHHhhcC
Confidence            22    22333333332  2567777777766666655543


No 27 
>PHA03081 putative metalloprotease; Provisional
Probab=90.11  E-value=0.29  Score=48.13  Aligned_cols=98  Identities=15%  Similarity=0.067  Sum_probs=65.9

Q ss_pred             EEcCCCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801        149 GTSYNKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV  228 (318)
Q Consensus       149 ~~L~NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~  228 (318)
                      ..|+||++|+.-+.-..-+.+++. ..|-..|-.+-.|+|||++|++-.    ++..-+         --++++.+.+.+
T Consensus         2 i~~~ngvr~f~~~~m~kdiy~gi~-~fgfe~di~~~lg~ahllehili~----fd~~~f---------~anast~r~yms   67 (595)
T PHA03081          2 IVLSNGVRIFINPSMKKDIYLGIS-NFGFENDIGEILGIAHLLEHILIS----FDSSKF---------VANASTARSYMS   67 (595)
T ss_pred             eEecCceEEEecCccccceEEeec-ccccccchHHHHhHHHHHHHHeee----cchHHh---------cccchhhhhhHh
Confidence            368999999988774444445442 556666777889999999998853    221111         246788888888


Q ss_pred             EEEEecCCc-HHHHHHHHHHhhccCCCCHHHHh
Q psy15801        229 YTIEAVRKE-IGHVHKFLASVVGKTEFRPWEVS  260 (318)
Q Consensus       229 ~~~~~l~~~-l~~~l~ll~d~l~~P~f~e~e~~  260 (318)
                      |....++.. ..+++.-+..++..+.--.+.|.
T Consensus        68 fwc~sirg~~y~DAvrtliSWFF~~~~Lr~~F~  100 (595)
T PHA03081         68 FWCKSIRGRSYIDAIRTLISWFFDNGKLKDNFS  100 (595)
T ss_pred             HhhHhhcCCchHHHHHHHHHHhccCCccccccc
Confidence            888776643 46777777777766653333333


No 28 
>PF05193 Peptidase_M16_C:  Peptidase M16 inactive domain;  InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.  The peptidases in this group of sequences include:  Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC)  These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=89.56  E-value=0.75  Score=38.06  Aligned_cols=109  Identities=12%  Similarity=0.148  Sum_probs=63.3

Q ss_pred             CCcEEEEEecCCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHH-HhCC--eEeEEe----cCc
Q psy15801        153 NKVKVVSVNETSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQ-QAGG--SLYATV----DRE  225 (318)
Q Consensus       153 NGlkv~~~~~~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~-~lG~--~l~~~~----~~d  225 (318)
                      .+..++..+...+...+.+.+++...... .......++.+++..+...    -|...+. ..|.  .++++.    +..
T Consensus        66 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~~l~~~l~~~~~s----~l~~~lr~~~~l~y~v~~~~~~~~~~~  140 (184)
T PF05193_consen   66 QGKEIVIPSKDESQSIVSIAFPGPPIKDS-KDYFALNLLSSLLGNGMSS----RLFQELREKQGLAYSVSASNSSYRDSG  140 (184)
T ss_dssp             SEEEEEEEESSSSSEEEEEEEEEEETGTS-TTHHHHHHHHHHHHCSTTS----HHHHHHHTTTTSESEEEEEEEEESSEE
T ss_pred             ccccccccccccccccccccccccccccc-chhhHHHHHHHHHhcCccc----hhHHHHHhccccceEEEeeeeccccce
Confidence            44555555544366666666666555222 2446777888888877333    2444444 4443  333332    224


Q ss_pred             eEEEEEEecCCcHHHHHHHHHHhhc---cCCCCHHHHhhhhhhH
Q psy15801        226 GIVYTIEAVRKEIGHVHKFLASVVG---KTEFRPWEVSDLTPRL  266 (318)
Q Consensus       226 ~~~~~~~~l~~~l~~~l~ll~d~l~---~P~f~e~e~~~~k~~i  266 (318)
                      .+.+++.+.+.+++.+++.+.+.+.   .-.|+++||++.+.++
T Consensus       141 ~~~i~~~~~~~~~~~~~~~~~~~l~~l~~~~~s~~el~~~k~~L  184 (184)
T PF05193_consen  141 LFSISFQVTPENLDEAIEAILQELKRLREGGISEEELERAKNQL  184 (184)
T ss_dssp             EEEEEEEEEGGGHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHH
T ss_pred             EEEEEEEcCcccHHHHHHHHHHHHHHHHHcCCCHHHHHHHHhcC
Confidence            4666777777788777776655432   2359999999988763


No 29 
>COG1025 Ptr Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones]
Probab=85.68  E-value=5.1  Score=42.88  Aligned_cols=117  Identities=13%  Similarity=0.156  Sum_probs=87.6

Q ss_pred             CCcEEEEEe-c--CC-CEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEE
Q psy15801        153 NKVKVVSVN-E--TS-PIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIV  228 (318)
Q Consensus       153 NGlkv~~~~-~--~~-p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~  228 (318)
                      .|.++++.. +  .. |...+.+.|+.=.....+...-+..|...|++--...++     ......|-+++...+.+.+.
T Consensus       509 ~~~~~wy~~~d~F~~~PK~~v~~~irsp~~~~s~r~~Vl~~l~~~la~dal~~~~-----y~A~~aG~sfs~~~~~~Gl~  583 (937)
T COG1025         509 PNLRLWYLKEDYFAVEPKASVSLAIRSPHASRSPRNQVLTELYAYLANDALDKLS-----YQASLAGLSFSLAANSNGLD  583 (937)
T ss_pred             CCceEEEecCCccccCCcceeEEEEeCcccccCHHHHHHHHHHHHHHHHHHHhhh-----hHHHhcceEEEeecCCCceE
Confidence            366666654 3  34 888899999764444444444455555666554333332     23455688899999999999


Q ss_pred             EEEEecCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCC
Q psy15801        229 YTIEAVRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRP  274 (318)
Q Consensus       229 ~~~~~l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~  274 (318)
                      +++++.++.++..+..+.+.+..-.++++-|+..|.++.++++...
T Consensus       584 ltisGft~~lp~L~~~~l~~l~~~~~~~~~f~~~K~~~~~~~~~a~  629 (937)
T COG1025         584 LTISGFTQRLPQLLRAFLDGLFSLPVDEDRFEQAKSQLSEELKNAL  629 (937)
T ss_pred             EEeeccccchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHhhh
Confidence            9999999999999999999999888899999999999999887654


No 30 
>KOG0961|consensus
Probab=82.56  E-value=4.9  Score=41.59  Aligned_cols=124  Identities=13%  Similarity=0.006  Sum_probs=68.0

Q ss_pred             CchhHHHHHHHHHHhcC--CCCCCCCCCCCccccccc-CCCCCCccccccccCc------hHHHHHHHHHHHc--CCCCH
Q psy15801          2 SAKEALAFAVLKYALGA--GPAKTGKNQRSTNSKGLS-RTPGRESLDESHHFSP------PQALQASIDVVRG--AGIKD   70 (318)
Q Consensus         2 ~~~d~~al~vL~~iLG~--g~~~k~~~~~~~l~~~~~-~~~~~~~~~~~a~~~~------~~~~~~~l~~l~~--~~~t~   70 (318)
                      ++|++++.++....||.  ||+=+.-.|.|..|-+-. -..... .....++.+      -.....++..+.+  +.++.
T Consensus       826 ~dpel~~~~l~~~YL~~~eGPfW~~IRG~GLAYGanm~~~~d~~-~~~~~iyr~ad~~kaye~~rdiV~~~vsG~~e~s~  904 (1022)
T KOG0961|consen  826 NDPELIPAMLFGQYLSQCEGPFWRAIRGDGLAYGANMFVKPDRK-QITLSIYRCADPAKAYERTRDIVRKIVSGSGEISK  904 (1022)
T ss_pred             CCcchhHHHHHHHHHHhcccchhhhhcccchhccceeEEeccCC-EEEEEeecCCcHHHHHHHHHHHHHHHhcCceeecH
Confidence            57899999999999997  654222223333332110 000000 011122222      2234455777775  46999


Q ss_pred             HHHHHHHHHHHHHHHHhcccc-hhHHHHHHHHHHHhCCC-CChHHHHHHhhccCccccc
Q psy15801         71 ADIARGKALLKRHLADSFETL-DSTVDSVVRQAVTTGVV-KSLPDLLAEVEAVSSSDVS  127 (318)
Q Consensus        71 ~EL~raK~~lk~~~~~~les~-~~~a~~lg~~~~~~g~~-~~~~~~~~~I~~VT~edV~  127 (318)
                      .|++-||...-+ .++.+|+. -.-|-.+..-....+.+ ..-..++.+|.+||.+|..
T Consensus       905 ~~~egAk~s~~~-~~~~~Eng~~~~a~~~~~l~~~~q~~~~fn~~~leri~nvT~~~~~  962 (1022)
T KOG0961|consen  905 AEFEGAKRSTVF-EMMKRENGTVSGAAKISILNNFRQTPHPFNIDLLERIWNVTSEEMV  962 (1022)
T ss_pred             HHhccchHHHHH-HHHHHhccceechHHHHHHHHHHhcCCcccHHHHHHHHHhhHHHHH
Confidence            999999988654 44456631 11121221112233332 3446889999999999997


No 31 
>PF10278 Med19:  Mediator of RNA pol II transcription subunit 19 ;  InterPro: IPR019403 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins.  The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11.  The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation.   The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22.  The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4.  The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16.  The CDK8 module contains: MED12, MED13, CCNC and CDK8.   Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med19 represents a family of conserved proteins which are members of the multi-protein co-activator Mediator complex []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=76.61  E-value=1  Score=38.80  Aligned_cols=24  Identities=4%  Similarity=0.061  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHhCCeEeEEecCceE
Q psy15801        204 HFGIIRHVQQAGGSLYATVDREGI  227 (318)
Q Consensus       204 ~~~i~~~l~~lG~~l~~~~~~d~~  227 (318)
                      .+++...|--+=|.++..++.|..
T Consensus        40 ke~lSsFLp~lpG~~~~~~~~d~s   63 (178)
T PF10278_consen   40 KESLSSFLPHLPGNIDLPGSQDNS   63 (178)
T ss_pred             hhhHHHHccCCCCcccCCCCcCcc
Confidence            445555555555555555544433


No 32 
>PF07946 DUF1682:  Protein of unknown function (DUF1682);  InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found. 
Probab=71.93  E-value=22  Score=33.67  Aligned_cols=49  Identities=6%  Similarity=0.123  Sum_probs=30.7

Q ss_pred             cCceEEEEEEecCCcH-----HHHHHH---HHHhhccCCCCHHHHhhhhhhHHHHHh
Q psy15801        223 DREGIVYTIEAVRKEI-----GHVHKF---LASVVGKTEFRPWEVSDLTPRLKYDRL  271 (318)
Q Consensus       223 ~~d~~~~~~~~l~~~l-----~~~l~l---l~d~l~~P~f~e~e~~~~k~~i~~ei~  271 (318)
                      ..-.+.++++.++..-     ...+.+   |.|.+.+-.++++...+....-..+++
T Consensus       213 ~~~~l~~~~~lp~~~~~~~~~~~l~~~v~~l~D~~~~~~l~~e~~~K~~k~R~~~~~  269 (321)
T PF07946_consen  213 PKKRLIFSFRLPSSSDDMEALEPLLKLVFYLIDKLARFKLSPEAKKKAKKNREEEEE  269 (321)
T ss_pred             cCcEEEEEEEeCCCcccHHHHHHHHHHHHHHHHHhheeeeCHHHHHHHHHHHHHHHH
Confidence            4456778888887542     333333   356678888999988876444433333


No 33 
>KOG0345|consensus
Probab=65.67  E-value=11  Score=37.35  Aligned_cols=15  Identities=13%  Similarity=0.275  Sum_probs=7.7

Q ss_pred             ceEEEEEEecCCcHH
Q psy15801        225 EGIVYTIEAVRKEIG  239 (318)
Q Consensus       225 d~~~~~~~~l~~~l~  239 (318)
                      ++.+|-++.-.=++.
T Consensus       424 H~cs~Ifr~kdLd~~  438 (567)
T KOG0345|consen  424 HHCSYIFRLKDLDLG  438 (567)
T ss_pred             cceeEEEeecCCcHH
Confidence            455555555444443


No 34 
>KOG2422|consensus
Probab=58.95  E-value=2  Score=43.40  Aligned_cols=7  Identities=86%  Similarity=0.947  Sum_probs=2.6

Q ss_pred             HhhHHHH
Q psy15801        283 KKNKKKK  289 (318)
Q Consensus       283 ~~~~~~~  289 (318)
                      -|||+||
T Consensus        88 sk~k~KK   94 (665)
T KOG2422|consen   88 SKNKKKK   94 (665)
T ss_pred             hccccch
Confidence            3343333


No 35 
>PF09186 DUF1949:  Domain of unknown function (DUF1949);  InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement [].   This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=51.23  E-value=69  Score=21.13  Aligned_cols=46  Identities=13%  Similarity=0.170  Sum_probs=38.5

Q ss_pred             HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801        205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG  250 (318)
Q Consensus       205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~  250 (318)
                      ..+...++..|+.+.-..-.+.+.+.+.++.+..+.....|.++..
T Consensus         9 ~~v~~~l~~~~~~i~~~~y~~~V~~~v~v~~~~~~~f~~~l~~~t~   54 (56)
T PF09186_consen    9 GKVERLLEQNGIEIVDEDYTDDVTLTVAVPEEEVEEFKAQLTDLTS   54 (56)
T ss_dssp             HHHHHHHHHTTTEEEEEEECTTEEEEEEEECCCHHHHHHHHHHHTT
T ss_pred             HHHHHHHHHCCCEEEcceecceEEEEEEECHHHHHHHHHHHHHHcC
Confidence            4678889999999865555566999999999999999998888753


No 36 
>PF08496 Peptidase_S49_N:  Peptidase family S49 N-terminal;  InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=49.94  E-value=22  Score=29.98  Aligned_cols=11  Identities=9%  Similarity=-0.112  Sum_probs=4.4

Q ss_pred             hhHHHHHhhCC
Q psy15801        264 PRLKYDRLTRP  274 (318)
Q Consensus       264 ~~i~~ei~~~~  274 (318)
                      ..+...++..-
T Consensus        52 ~~~k~~L~~al   62 (155)
T PF08496_consen   52 EDMKEQLKQAL   62 (155)
T ss_pred             HHHHHHHHHHh
Confidence            33334444433


No 37 
>KOG3794|consensus
Probab=48.61  E-value=37  Score=32.85  Aligned_cols=27  Identities=7%  Similarity=-0.017  Sum_probs=17.4

Q ss_pred             hHHHHHHHHHHHcCCCCHHHHHHHHHH
Q psy15801         53 PQALQASIDVVRGAGIKDADIARGKAL   79 (318)
Q Consensus        53 ~~~~~~~l~~l~~~~~t~~EL~raK~~   79 (318)
                      ...++-+-+.+......++||.+.-.+
T Consensus        22 ~k~vW~AeQ~~s~~~kkqEEL~~qyek   48 (453)
T KOG3794|consen   22 IRKVWMAEQKISYDKKKQEELRRQYEK   48 (453)
T ss_pred             hhHHHHHhhhcccccchHHHHHHHHHH
Confidence            445555555555667888999887433


No 38 
>smart00874 B5 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits.
Probab=47.41  E-value=32  Score=24.35  Aligned_cols=50  Identities=14%  Similarity=0.139  Sum_probs=33.2

Q ss_pred             CCCHHHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801        201 EFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG  250 (318)
Q Consensus       201 ~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~  250 (318)
                      ..+..++.+.|.++|..+......+.+.+.+-.-+.++....+++.|+++
T Consensus        18 ~i~~~ei~~~L~~lg~~~~~~~~~~~~~v~~P~~R~Di~~~~DliEei~r   67 (71)
T smart00874       18 DLSAEEIEEILKRLGFEVEVSGDDDTLEVTVPSYRFDILIEADLIEEVAR   67 (71)
T ss_pred             CCCHHHHHHHHHHCCCeEEecCCCCeEEEECCCCccccCcccHHHHHHHH
Confidence            57899999999999998865322344444444444566666676666654


No 39 
>TIGR00869 sec62 protein translocation protein, Sec62 family. protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins has been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions.
Probab=45.00  E-value=13  Score=33.56  Aligned_cols=7  Identities=29%  Similarity=0.491  Sum_probs=2.8

Q ss_pred             hhccCcc
Q psy15801        118 VEAVSSS  124 (318)
Q Consensus       118 I~~VT~e  124 (318)
                      .+.+..+
T Consensus        72 ~~k~~~~   78 (232)
T TIGR00869        72 VDKLPSQ   78 (232)
T ss_pred             EEecccc
Confidence            3444443


No 40 
>PF03484 B5:  tRNA synthetase B5 domain;  InterPro: IPR005147 Domain B5 is found in phenylalanine-tRNA synthetase beta subunits. This domain has been shown to bind DNA through a winged helix-turn-helix motif []. Phenylalanine-tRNA synthetase may influence common cellular processes via DNA binding, in addition to its aminoacylation function.; GO: 0000287 magnesium ion binding, 0003723 RNA binding, 0005524 ATP binding, 0006432 phenylalanyl-tRNA aminoacylation; PDB: 2AKW_B 1B70_B 1B7Y_B 2ALY_B 2IY5_B 2AMC_B 3PCO_D 2CXI_C 1JJC_B 1EIY_B ....
Probab=43.32  E-value=63  Score=23.02  Aligned_cols=49  Identities=16%  Similarity=0.279  Sum_probs=34.5

Q ss_pred             CCCHHHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhc
Q psy15801        201 EFTHFGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVG  250 (318)
Q Consensus       201 ~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~  250 (318)
                      ..+..++.+.|..+|..+... +.+...+..=.-+.++....++..|+++
T Consensus        18 ~i~~~~i~~~L~~lg~~~~~~-~~~~~~v~vP~~R~Di~~~~DliEEiaR   66 (70)
T PF03484_consen   18 DISPEEIIKILKRLGFKVEKI-DGDTLEVTVPSYRFDIEHEEDLIEEIAR   66 (70)
T ss_dssp             ---HHHHHHHHHHTT-EEEE--CTTEEEEEEETTSTT-SSHHHHHHHHHH
T ss_pred             CCCHHHHHHHHHHCCCEEEEC-CCCEEEEEcCCCcCCcCcccHHHHHHHH
Confidence            778999999999999998877 5566666666667788888888877764


No 41 
>PF08496 Peptidase_S49_N:  Peptidase family S49 N-terminal;  InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=40.08  E-value=22  Score=30.05  Aligned_cols=15  Identities=27%  Similarity=0.388  Sum_probs=5.9

Q ss_pred             HhhhhhhHHHHHhhC
Q psy15801        259 VSDLTPRLKYDRLTR  273 (318)
Q Consensus       259 ~~~~k~~i~~ei~~~  273 (318)
                      ++..++.+...+-..
T Consensus        51 Y~~~k~~L~~all~k   65 (155)
T PF08496_consen   51 YEDMKEQLKQALLDK   65 (155)
T ss_pred             HHHHHHHHHHHhcCH
Confidence            333444444443333


No 42 
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=39.79  E-value=4.7  Score=39.56  Aligned_cols=15  Identities=20%  Similarity=0.244  Sum_probs=8.2

Q ss_pred             ecCCcHHHHHHHHHH
Q psy15801        233 AVRKEIGHVHKFLAS  247 (318)
Q Consensus       233 ~l~~~l~~~l~ll~d  247 (318)
                      +++.+-..+..+|+.
T Consensus       330 ~p~~~rgkiaR~lA~  344 (414)
T PRK14552        330 SPWWQRGKIARALAA  344 (414)
T ss_pred             CCHHHHHHHHHHHHH
Confidence            334445666666653


No 43 
>PRK14552 C/D box methylation guide ribonucleoprotein complex aNOP56 subunit; Provisional
Probab=39.63  E-value=5.1  Score=39.34  Aligned_cols=16  Identities=13%  Similarity=0.056  Sum_probs=7.6

Q ss_pred             HHHHhhCCCChhhHHH
Q psy15801        267 KYDRLTRPSQGNEKKK  282 (318)
Q Consensus       267 ~~ei~~~~~~~~~~~~  282 (318)
                      +.+|+..-+.|+..|.
T Consensus       372 i~~i~~k~~~Pp~~k~  387 (414)
T PRK14552        372 IEEIKEKYPKPPKKKR  387 (414)
T ss_pred             HHHHHHhcCCCCCCCc
Confidence            4444444455555543


No 44 
>KOG3794|consensus
Probab=37.91  E-value=23  Score=34.17  Aligned_cols=7  Identities=14%  Similarity=-0.059  Sum_probs=3.3

Q ss_pred             CCCcEEE
Q psy15801        152 YNKVKVV  158 (318)
Q Consensus       152 ~NGlkv~  158 (318)
                      |=|+.|-
T Consensus       117 PFGiqVR  123 (453)
T KOG3794|consen  117 PFGIQVR  123 (453)
T ss_pred             ccceEee
Confidence            3455553


No 45 
>cd04914 ACT_AKi-DapG-BS_1 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the N-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria, bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and 
Probab=36.96  E-value=91  Score=21.90  Aligned_cols=49  Identities=8%  Similarity=-0.070  Sum_probs=36.2

Q ss_pred             HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhhccCCC
Q psy15801        205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVVGKTEF  254 (318)
Q Consensus       205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l~~P~f  254 (318)
                      .+|.+.|...|.+++.-.-. .-.+++++...+++.+.+++.++=..|..
T Consensus        17 a~if~~La~~~InvDmI~~~-~~~isFtv~~~d~~~~~~il~~~~~~~~~   65 (67)
T cd04914          17 QRVFKALANAGISVDLINVS-PEEVIFTVDGEVAEKAVDILEKMGLDPSV   65 (67)
T ss_pred             HHHHHHHHHcCCcEEEEEec-CCCEEEEEchhhHHHHHHHHHHcCCceEe
Confidence            45777788888888876322 22588999999999999988887666643


No 46 
>PF00675 Peptidase_M16:  Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ;  InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include:  Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC)  These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=35.66  E-value=1.2e+02  Score=24.28  Aligned_cols=54  Identities=13%  Similarity=0.169  Sum_probs=37.9

Q ss_pred             HHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHHHhCCC
Q psy15801         55 ALQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVVRQAVTTGVV  108 (318)
Q Consensus        55 ~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg~~~~~~g~~  108 (318)
                      +++-..+.+..-.++++++++.|..+..++....++...++...-...++.|.+
T Consensus        87 ~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~~~~~~~~~~~~~l~~~~f~~~p  140 (149)
T PF00675_consen   87 ALELLADMLFNPSFDEEEFEREREQILQEIEEIKENPQELAFEKLHSAAFRGHP  140 (149)
T ss_dssp             HHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHTTSG
T ss_pred             HHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHhccCC
Confidence            344444455555699999999999999988888777766666655455555554


No 47 
>PF15289 RFXA_RFXANK_bdg:  Regulatory factor X-associated C-terminal binding domain; PDB: 2KW3_C.
Probab=33.14  E-value=14  Score=29.66  Aligned_cols=16  Identities=38%  Similarity=0.244  Sum_probs=0.0

Q ss_pred             ChhhHHHHhhHHHHHH
Q psy15801        276 QGNEKKKKKNKKKKKK  291 (318)
Q Consensus       276 ~~~~~~~~~~~~~~~~  291 (318)
                      -||+||+.+||.-|-|
T Consensus        19 kpwmck~hRNK~yk~k   34 (124)
T PF15289_consen   19 KPWMCKKHRNKMYKDK   34 (124)
T ss_dssp             ----------------
T ss_pred             cchhhhhhccccchhh
Confidence            3899999999885444


No 48 
>PF08432 Vfa1:  AAA-ATPase Vps4-associated protein 1;  InterPro: IPR013640 This is a family of fungal proteins of unknown function. 
Probab=32.44  E-value=16  Score=31.60  Aligned_cols=34  Identities=18%  Similarity=0.035  Sum_probs=18.8

Q ss_pred             cCCCCHHHHhhh--hhhHHHHHhhCCCChhhHHHHh
Q psy15801        251 KTEFRPWEVSDL--TPRLKYDRLTRPSQGNEKKKKK  284 (318)
Q Consensus       251 ~P~f~e~e~~~~--k~~i~~ei~~~~~~~~~~~~~~  284 (318)
                      .|..+++..+..  +..+..+|+...-.+-++.+++
T Consensus        50 ~p~~~~~~~~~~~k~~el~~eiekvkke~Eekq~~k   85 (182)
T PF08432_consen   50 TPIYDEEYVEAKKKKKELEEEIEKVKKEYEEKQKWK   85 (182)
T ss_pred             CCCCCHHHHHHHhhHHHHHHHHHHHHHHHHHHhHHh
Confidence            477777665543  4555666666544444444433


No 49 
>PF05890 Ebp2:  Eukaryotic rRNA processing protein EBP2;  InterPro: IPR008610 This family consists of several eukaryotic rRNA processing protein EBP2 sequences. Ebp2p is required for the maturation of 25S rRNA and 60S subunit assembly. Ebp2p may be one of the target proteins of Rrs1p for executing the signal to regulate ribosome biogenesis [].
Probab=31.63  E-value=88  Score=28.91  Aligned_cols=27  Identities=11%  Similarity=0.126  Sum_probs=20.1

Q ss_pred             HHHhhccCCCCHHHHhhhhhhHHHHHhhCCC
Q psy15801        245 LASVVGKTEFRPWEVSDLTPRLKYDRLTRPS  275 (318)
Q Consensus       245 l~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~  275 (318)
                      ||+|+.    +++.+++++..++.+-..+..
T Consensus       110 fAEMvK----SD~HM~KVr~kLl~~~~~ie~  136 (271)
T PF05890_consen  110 FAEMVK----SDEHMEKVRQKLLKEQKRIEA  136 (271)
T ss_pred             hHHHhc----CHHHHHHHHHHHHHHHHHHHH
Confidence            566666    688899999998877665554


No 50 
>KOG2138|consensus
Probab=30.91  E-value=26  Score=36.38  Aligned_cols=14  Identities=0%  Similarity=-0.278  Sum_probs=7.3

Q ss_pred             HHHHHHhhccCCCC
Q psy15801        242 HKFLASVVGKTEFR  255 (318)
Q Consensus       242 l~ll~d~l~~P~f~  255 (318)
                      +++-.+-...|+++
T Consensus       774 ~e~~~~e~fgprlp  787 (883)
T KOG2138|consen  774 MEIDEREEFGPRLP  787 (883)
T ss_pred             hccChhhhcCCCCC
Confidence            33334445567776


No 51 
>PF09851 SHOCT:  Short C-terminal domain;  InterPro: IPR018649  This family of hypothetical prokaryotic proteins has no known function. 
Probab=29.93  E-value=1.3e+02  Score=17.94  Aligned_cols=17  Identities=24%  Similarity=0.259  Sum_probs=14.8

Q ss_pred             cCCCCHHHHHHHHHHHH
Q psy15801         65 GAGIKDADIARGKALLK   81 (318)
Q Consensus        65 ~~~~t~~EL~raK~~lk   81 (318)
                      .|-+|++|.++.|+.+.
T Consensus        14 ~G~IseeEy~~~k~~ll   30 (31)
T PF09851_consen   14 KGEISEEEYEQKKARLL   30 (31)
T ss_pred             cCCCCHHHHHHHHHHHh
Confidence            57799999999998874


No 52 
>KOG4043|consensus
Probab=29.88  E-value=40  Score=28.61  Aligned_cols=11  Identities=18%  Similarity=0.060  Sum_probs=6.1

Q ss_pred             ccHHHHHHHHH
Q psy15801        185 LGITHVLRSAA  195 (318)
Q Consensus       185 ~Gla~ll~~ml  195 (318)
                      .-|+.|+.|+.
T Consensus        68 EkiaSFL~~I~   78 (214)
T KOG4043|consen   68 EKIASFLPHII   78 (214)
T ss_pred             HHHHHHhhccC
Confidence            34666666554


No 53 
>PF12122 DUF3582:  Protein of unknown function (DUF3582);  InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].  This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important.  The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=28.06  E-value=2e+02  Score=22.29  Aligned_cols=49  Identities=10%  Similarity=0.146  Sum_probs=33.5

Q ss_pred             HHHHHHHHHHhCCeEeEEecCce-EEEEEEecCCcHHHHHHHHHHhhccCC
Q psy15801        204 HFGIIRHVQQAGGSLYATVDREG-IVYTIEAVRKEIGHVHKFLASVVGKTE  253 (318)
Q Consensus       204 ~~~i~~~l~~lG~~l~~~~~~d~-~~~~~~~l~~~l~~~l~ll~d~l~~P~  253 (318)
                      +..|.+.|...|..+......+. +.+.+. -.++++.+-..|...+.||.
T Consensus        13 AqaF~DYl~sqgI~~~i~~~~~~~~~lwl~-de~~~~~a~~el~~Fl~nP~   62 (101)
T PF12122_consen   13 AQAFIDYLASQGIELQIEPEGQGQFALWLH-DEEHLEQAEQELEEFLQNPN   62 (101)
T ss_dssp             HHHHHHHHHHTT--EEEE-SSSE--EEEES--GGGHHHHHHHHHHHHHS-S
T ss_pred             HHHHHHHHHHCCCeEEEEECCCCceEEEEe-CHHHHHHHHHHHHHHHHCCC
Confidence            45777888888977777764443 555555 66899999999999999995


No 54 
>PRK06063 DNA polymerase III subunit epsilon; Provisional
Probab=27.98  E-value=1.3e+02  Score=28.18  Aligned_cols=69  Identities=16%  Similarity=0.141  Sum_probs=50.6

Q ss_pred             HHHhCCCCCCCHHHHHHHHHHhCCeEeEEecCceEEEEEEe---cCCcHHHHHHHHHHhhccCCCCHHHHhhhhhhH
Q psy15801        193 SAAGLSTEEFTHFGIIRHVQQAGGSLYATVDREGIVYTIEA---VRKEIGHVHKFLASVVGKTEFRPWEVSDLTPRL  266 (318)
Q Consensus       193 ~ml~~GT~~~s~~~i~~~l~~lG~~l~~~~~~d~~~~~~~~---l~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i  266 (318)
                      ...|.|+-.++..++.+.+..+||.+..++++...++-.--   .+.-+.+|-+     +-=|..+|++|-+....+
T Consensus       237 ~~v~TG~l~~~R~e~~~~~~~~G~~v~~sVs~~t~~lv~g~~~~~ssK~~kA~~-----~gi~ii~e~~f~~ll~~~  308 (313)
T PRK06063        237 RVALSAEVSRTHEELVERILHAGLAYSDSVDRDTSLVVCNDPAPEQGKGYHARQ-----LGVPVLDEAAFLELLRAV  308 (313)
T ss_pred             EEEEecCCCCCHHHHHHHHHHcCCEecCccccCccEEEECCCCCcccHHHHHHH-----cCCccccHHHHHHHHHhh
Confidence            45678888889999999999999999999988666554421   2234555555     345788899988776544


No 55 
>cd04923 ACT_AK-LysC-DapG-like_2 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, as well as, the second and fourth, of four, ACT domains present in cyanobacteria AK. Also included are the C-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase isoenzyme AKI found in Bacilli (B. subtilis strain 168), Clostridia, and Actinobacteria bacterial species. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=25.55  E-value=2e+02  Score=18.83  Aligned_cols=44  Identities=9%  Similarity=0.067  Sum_probs=33.6

Q ss_pred             HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhh
Q psy15801        205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVV  249 (318)
Q Consensus       205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l  249 (318)
                      ..+...+...|..+...+.. ...+++.+...+.+.++..+.+.+
T Consensus        18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~~~~~~~~~l~~~l   61 (63)
T cd04923          18 AKMFKALAEAGINIEMISTS-EIKISCLVDEDDAEKAVRALHEAF   61 (63)
T ss_pred             HHHHHHHHHCCCCEEEEEcc-CCeEEEEEeHHHHHHHHHHHHHHh
Confidence            35667788888888776643 477888888899999988887665


No 56 
>PF14257 DUF4349:  Domain of unknown function (DUF4349)
Probab=25.16  E-value=2e+02  Score=25.98  Aligned_cols=44  Identities=16%  Similarity=0.157  Sum_probs=35.5

Q ss_pred             HHHHHHHHHhCCeEeEEec--------CceEEEEEEecCCcHHHHHHHHHHh
Q psy15801        205 FGIIRHVQQAGGSLYATVD--------REGIVYTIEAVRKEIGHVHKFLASV  248 (318)
Q Consensus       205 ~~i~~~l~~lG~~l~~~~~--------~d~~~~~~~~l~~~l~~~l~ll~d~  248 (318)
                      .+|.+.+..+||.+.....        .....+++.+|++.|+..++-|..+
T Consensus        66 ~~i~~~~~~~gG~i~~~~~~~~~~~~~~~~~~ltiRVP~~~~~~~l~~l~~~  117 (262)
T PF14257_consen   66 KKIENLVESYGGYIESSSSSSSGGSDDERSASLTIRVPADKFDSFLDELSEL  117 (262)
T ss_pred             HHHHHHHHHcCCEEEEEeeecccCCCCcceEEEEEEECHHHHHHHHHHHhcc
Confidence            4677888999998765544        4578999999999999999988753


No 57 
>PF13840 ACT_7:  ACT domain ; PDB: 3S1T_A 1ZHV_A 3AB4_K 3AB2_O 2DTJ_A 3AAW_A 2RE1_B 3MAH_A 1ZVP_D.
Probab=24.93  E-value=2.2e+02  Score=19.73  Aligned_cols=37  Identities=14%  Similarity=0.213  Sum_probs=27.2

Q ss_pred             HHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHH
Q psy15801        209 RHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLAS  247 (318)
Q Consensus       209 ~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d  247 (318)
                      ..|...|.++..-+  .+...++-+..+++++++..|.+
T Consensus        29 ~~La~~~I~i~~is--S~~~~~ilV~~~~~~~A~~~L~~   65 (65)
T PF13840_consen   29 SALAEAGINIFMIS--SEISISILVKEEDLEKAVEALHE   65 (65)
T ss_dssp             HHHHHTTS-ECEEE--ESSEEEEEEEGGGHHHHHHHHHH
T ss_pred             HHHHHCCCCEEEEE--EeeeEEEEEeHHHHHHHHHHhcC
Confidence            44566777776666  66677788889999999998754


No 58 
>KOG4043|consensus
Probab=24.78  E-value=55  Score=27.77  Aligned_cols=11  Identities=27%  Similarity=0.311  Sum_probs=5.0

Q ss_pred             EEEecCCCCCC
Q psy15801        171 VAFKAGARYET  181 (318)
Q Consensus       171 l~~~~Gs~~e~  181 (318)
                      +-+.+|+..|.
T Consensus       119 FkLh~GPl~Eq  129 (214)
T KOG4043|consen  119 FKLHAGPLDEQ  129 (214)
T ss_pred             eecCCCCchHH
Confidence            33445555443


No 59 
>KOG2573|consensus
Probab=24.77  E-value=24  Score=34.35  Aligned_cols=21  Identities=14%  Similarity=0.181  Sum_probs=10.8

Q ss_pred             ceEEEEEEecCCcHHHHHHHH
Q psy15801        225 EGIVYTIEAVRKEIGHVHKFL  245 (318)
Q Consensus       225 d~~~~~~~~l~~~l~~~l~ll  245 (318)
                      +-|+.....++.++++=|+++
T Consensus       390 ~pts~fGe~Lr~qVEeRL~fy  410 (498)
T KOG2573|consen  390 DPTSVFGEKLREQVEERLEFY  410 (498)
T ss_pred             CCchHHHHHHHHHHHHHHHhh
Confidence            334444455555566655543


No 60 
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=24.72  E-value=4e+02  Score=22.48  Aligned_cols=70  Identities=17%  Similarity=0.181  Sum_probs=46.5

Q ss_pred             hHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHhcccchhHHHHHH----------------HHHHHhCCCCChHHHHH
Q psy15801         53 PQALQASIDVVRGAGIKDADIARGKALLKRHLADSFETLDSTVDSVV----------------RQAVTTGVVKSLPDLLA  116 (318)
Q Consensus        53 ~~~~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~les~~~~a~~lg----------------~~~~~~g~~~~~~~~~~  116 (318)
                      ..++.-++.++.+.+++.+.|+++.+.++..+...-++- --...||                +.+-.|.+...+.++.+
T Consensus        64 ~Kl~~gl~~A~~KRpVs~e~ie~~v~~ie~~Lr~~g~~E-V~S~~IG~~VM~~Lk~lD~VAYvRFASVYr~F~dv~~F~e  142 (156)
T COG1327          64 EKLRRGLIRACEKRPVSSEQIEEAVSHIERQLRSSGERE-VPSKEIGELVMEELKKLDEVAYVRFASVYRSFKDVDDFEE  142 (156)
T ss_pred             HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHHhcchhhhhhhhhHhcccCCHHHHHH
Confidence            556777777888889999999999999998888776542 1112222                13334555566666666


Q ss_pred             HhhccCc
Q psy15801        117 EVEAVSS  123 (318)
Q Consensus       117 ~I~~VT~  123 (318)
                      .|..++.
T Consensus       143 ~i~~l~~  149 (156)
T COG1327         143 EIEELTK  149 (156)
T ss_pred             HHHHHHh
Confidence            6666544


No 61 
>cd04936 ACT_AKii-LysC-BS-like_2 ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related domains. This CD includes the C-terminal of the two ACT domains of the lysine-sensitive, aspartokinase (AK) isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis strain 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive AK isoenzymes. The B. subtilis strain 168 AKII is induced by methionine and repressed and inhibited by lysine. Although C. glutamicum is known to contain a single AK, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In corynebacteria and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regu
Probab=24.50  E-value=2.1e+02  Score=18.72  Aligned_cols=44  Identities=9%  Similarity=0.043  Sum_probs=33.7

Q ss_pred             HHHHHHHHHhCCeEeEEecCceEEEEEEecCCcHHHHHHHHHHhh
Q psy15801        205 FGIIRHVQQAGGSLYATVDREGIVYTIEAVRKEIGHVHKFLASVV  249 (318)
Q Consensus       205 ~~i~~~l~~lG~~l~~~~~~d~~~~~~~~l~~~l~~~l~ll~d~l  249 (318)
                      ..+...+...|..+...+.. ...+++.+...+.+.++..+.+.+
T Consensus        18 ~~i~~~L~~~~i~v~~i~~s-~~~is~~v~~~d~~~~~~~l~~~~   61 (63)
T cd04936          18 AKMFEALAEAGINIEMISTS-EIKISCLIDEDDAEKAVRALHEAF   61 (63)
T ss_pred             HHHHHHHHHCCCcEEEEEcc-CceEEEEEeHHHHHHHHHHHHHHh
Confidence            35667788888888777643 477888899899999988887655


No 62 
>PHA01735 hypothetical protein
Probab=22.38  E-value=1.3e+02  Score=21.87  Aligned_cols=32  Identities=13%  Similarity=0.138  Sum_probs=23.6

Q ss_pred             HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHh
Q psy15801         56 LQASIDVVRGAGIKDADIARGKALLKRHLADS   87 (318)
Q Consensus        56 ~~~~l~~l~~~~~t~~EL~raK~~lk~~~~~~   87 (318)
                      ..+.+.+++.+..|..+|..|..-++.+=+.+
T Consensus        18 t~El~~RiksgeATtaDL~AA~d~Lk~NdItg   49 (76)
T PHA01735         18 TNELLSRIKSGEATTADLRAACDWLKSNDITG   49 (76)
T ss_pred             HHHHHHHHhcCcccHHHHHHHHHHHHHCCCce
Confidence            34558888888899999988887777654433


No 63 
>KOG3089|consensus
Probab=22.31  E-value=68  Score=28.83  Aligned_cols=42  Identities=5%  Similarity=0.059  Sum_probs=26.5

Q ss_pred             CCcHHHHHHHHHHhhccCCCCHHHHhhhhhhHHHHHhhCCCC
Q psy15801        235 RKEIGHVHKFLASVVGKTEFRPWEVSDLTPRLKYDRLTRPSQ  276 (318)
Q Consensus       235 ~~~l~~~l~ll~d~l~~P~f~e~e~~~~k~~i~~ei~~~~~~  276 (318)
                      +++++..+..=.|.+..-.|+.++-..-...+..-.-+.+++
T Consensus         4 ~d~ldD~l~yq~d~vheV~v~~e~~~qPtkkve~f~i~qddD   45 (271)
T KOG3089|consen    4 PDDLDDGLAYQQDTVHEVPVPSEKTKQPTKKVECFLIQQDDD   45 (271)
T ss_pred             cccccchhhhhhccccccccCccccCCCcchhheeecccCCc
Confidence            456777777777777777788877666554444334444444


No 64 
>PF08288 PIGA:  PIGA (GPI anchor biosynthesis);  InterPro: IPR013234 This domain is found on phosphatidylinositol N-acetylglucosaminyltransferase proteins. These proteins are involved in GPI anchor biosynthesis and are associated with the disease paroxysmal nocturnal haemoglobinuria [].; GO: 0006506 GPI anchor biosynthetic process
Probab=21.94  E-value=71  Score=24.39  Aligned_cols=15  Identities=20%  Similarity=-0.035  Sum_probs=12.6

Q ss_pred             EEEcCCCcEEEEEec
Q psy15801        148 SGTSYNKVKVVSVNE  162 (318)
Q Consensus       148 ~~~L~NGlkv~~~~~  162 (318)
                      +.-|.||+||+++|.
T Consensus         9 vRyltngLKVYYlP~   23 (90)
T PF08288_consen    9 VRYLTNGLKVYYLPL   23 (90)
T ss_pred             eEEcCCCeEEEeecc
Confidence            456899999999885


No 65 
>KOG2138|consensus
Probab=21.52  E-value=63  Score=33.73  Aligned_cols=13  Identities=8%  Similarity=0.028  Sum_probs=6.8

Q ss_pred             CCCHHHHHHHHHH
Q psy15801         67 GIKDADIARGKAL   79 (318)
Q Consensus        67 ~~t~~EL~raK~~   79 (318)
                      .+|-.|-+|++-.
T Consensus       488 ~~smsewere~er  500 (883)
T KOG2138|consen  488 DPSMSEWERERER  500 (883)
T ss_pred             CcchhHHHHhhhh
Confidence            3455555565554


No 66 
>PF07499 RuvA_C:  RuvA, C-terminal domain;  InterPro: IPR011114 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure []. In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB. The domain represents the C-terminal domain III of RuvA. This domain plays a significant role in the ATP-dependent branch migration of the hetero-duplex through direct contact with RuvB []. Within the Holliday junction, this domain makes no interaction with the DNA.; GO: 0005524 ATP binding, 0009378 four-way junction helicase activity, 0006281 DNA repair, 0006310 DNA recombination, 0009379 Holliday junction helicase complex; PDB: 1HJP_A 1CUK_A 1C7Y_A 1IXS_A 1IXR_B 1BVS_E 2ZTC_A 2ZTD_B 2H5X_A.
Probab=21.14  E-value=2.1e+02  Score=18.57  Aligned_cols=28  Identities=18%  Similarity=0.317  Sum_probs=22.6

Q ss_pred             HHHHHHHHHHcCCCCHHHHHHHHHHHHH
Q psy15801         55 ALQASIDVVRGAGIKDADIARGKALLKR   82 (318)
Q Consensus        55 ~~~~~l~~l~~~~~t~~EL~raK~~lk~   82 (318)
                      ...+++.+|..-|++..|+.++...+..
T Consensus         2 ~~~d~~~AL~~LGy~~~e~~~av~~~~~   29 (47)
T PF07499_consen    2 ALEDALEALISLGYSKAEAQKAVSKLLE   29 (47)
T ss_dssp             HHHHHHHHHHHTTS-HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHcCCCHHHHHHHHHHhhc
Confidence            4567788888889999999999988765


No 67 
>KOG2573|consensus
Probab=20.27  E-value=1.1e+02  Score=29.85  Aligned_cols=9  Identities=11%  Similarity=0.512  Sum_probs=3.7

Q ss_pred             CCHHHHHHH
Q psy15801         68 IKDADIARG   76 (318)
Q Consensus        68 ~t~~EL~ra   76 (318)
                      +++-++++|
T Consensus       137 L~~~d~~ka  145 (498)
T KOG2573|consen  137 LDPGDLEKA  145 (498)
T ss_pred             CCCccHHHH
Confidence            444444433


No 68 
>KOG0487|consensus
Probab=20.11  E-value=64  Score=30.38  Aligned_cols=50  Identities=12%  Similarity=-0.010  Sum_probs=35.0

Q ss_pred             HHHHHHHHHhhccCCCCH---HHHhhhhhhHHHHHhhCCCChhhHHHHhhHHH
Q psy15801        239 GHVHKFLASVVGKTEFRP---WEVSDLTPRLKYDRLTRPSQGNEKKKKKNKKK  288 (318)
Q Consensus       239 ~~~l~ll~d~l~~P~f~e---~e~~~~k~~i~~ei~~~~~~~~~~~~~~~~~~  288 (318)
                      -+.++|--|.|.|-=...   .||.+.......++..+..+.|||.||-|+..
T Consensus       245 ~QtlELEkEFlfN~YitkeKR~ElSr~lNLTeRQVKIWFQNRRMK~KK~~re~  297 (308)
T KOG0487|consen  245 HQTLELEKEFLFNMYITKEKRLELSRTLNLTERQVKIWFQNRRMKEKKVNREN  297 (308)
T ss_pred             HHHHHHHHHHHHHHHHhHHHHHHHHHhcccchhheeeeehhhhhHHhhhhhhh
Confidence            567777777776544333   36666666667778888889999999887643


No 69 
>COG5137 Histone chaperone involved in gene silencing [Transcription / Chromatin structure and dynamics]
Probab=20.06  E-value=5.4e+02  Score=23.06  Aligned_cols=62  Identities=18%  Similarity=0.171  Sum_probs=40.7

Q ss_pred             CCCCcEEEEcCC--CcEEEEEec---CCCEEEEEEEEecCCCCCCCCCccHHHHHHHHHhCCCCCCCHHHHHHH
Q psy15801        142 FSPDIQSGTSYN--KVKVVSVNE---TSPIARISVAFKAGARYETPDNLGITHVLRSAAGLSTEEFTHFGIIRH  210 (318)
Q Consensus       142 ~~~~~~~~~L~N--Glkv~~~~~---~~p~v~v~l~~~~Gs~~e~~~~~Gla~ll~~ml~~GT~~~s~~~i~~~  210 (318)
                      .||.+....|+.  |++|+++..   ...+|.++.+++.|-       +||..+.....-.-..+.....+.+.
T Consensus        77 DpP~v~liPlse~~gvTvILlsc~Y~g~eFvRvGYyVnney-------~gi~~~e~~d~E~p~~kvd~~~v~r~  143 (279)
T COG5137          77 DPPDVNLIPLSEMFGVTVILLSCRYKGQEFVRVGYYVNNEY-------PGITKLEKSDVEEPSEKVDEEDVERE  143 (279)
T ss_pred             CCCCccccchhhhhceeEEEEEEeecCceeEEEEEEeccCC-------cchhhhhhhhhcCCchhcCHHHHHHH
Confidence            456666665654  999998875   467999999998764       48866665555444555544444443


Done!