Query         psy14093
Match_columns 857
No_of_seqs    357 out of 2673
Neff          6.3 
Searched_HMMs 46136
Date          Fri Aug 16 19:21:39 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14093.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14093hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3658|consensus              100.0 2.7E-97  6E-102  831.3  33.7  526    1-634   129-697 (764)
  2 KOG3607|consensus              100.0   1E-60 2.3E-65  563.8  27.4  375    1-545   118-511 (716)
  3 KOG3607|consensus              100.0 3.2E-45   7E-50  432.5  12.6  225  589-852   268-526 (716)
  4 KOG3658|consensus              100.0 8.1E-42 1.8E-46  384.7  13.7  243  555-827   274-573 (764)
  5 cd04270 ZnMc_TACE_like Zinc-de 100.0 8.5E-41 1.8E-45  354.7  20.0  235   96-424     2-243 (244)
  6 cd04273 ZnMc_ADAMTS_like Zinc- 100.0 2.2E-31 4.8E-36  276.3  12.9  191   95-415     1-196 (207)
  7 cd04269 ZnMc_adamalysin_II_lik 100.0 9.4E-31   2E-35  268.6  15.5  187   95-420     1-194 (194)
  8 PF01421 Reprolysin:  Reprolysi 100.0 3.7E-31 7.9E-36  272.8  11.0  191   95-422     1-199 (199)
  9 cd04272 ZnMc_salivary_gland_MP 100.0 1.9E-28 4.1E-33  256.7  15.5  189   96-415     2-209 (220)
 10 cd04271 ZnMc_ADAM_fungal Zinc- 100.0 1.5E-28 3.2E-33  258.4  14.2  200   96-415     2-215 (228)
 11 cd04267 ZnMc_ADAM_like Zinc-de 100.0 8.6E-28 1.9E-32  246.2  15.2  185   95-415     1-189 (192)
 12 KOG3538|consensus               99.9 2.1E-25 4.5E-30  274.0  22.0  343    2-539   107-467 (845)
 13 PF13688 Reprolysin_5:  Metallo  99.9 7.9E-25 1.7E-29  224.8   9.1  183   94-399     2-196 (196)
 14 smart00050 DISIN Homologues of  99.8 4.8E-20   1E-24  159.5   6.0   75  432-519     1-75  (75)
 15 smart00050 DISIN Homologues of  99.8 2.2E-19 4.7E-24  155.4   4.2   71  717-812     1-75  (75)
 16 PF00200 Disintegrin:  Disinteg  99.8 2.4E-19 5.2E-24  156.4   4.3   76  432-520     1-76  (76)
 17 PF00200 Disintegrin:  Disinteg  99.8 3.5E-19 7.6E-24  155.3   3.3   72  717-813     1-76  (76)
 18 PF13574 Reprolysin_2:  Metallo  99.7 2.9E-19 6.3E-24  180.7   0.3  103  258-415    64-169 (173)
 19 PF13583 Reprolysin_4:  Metallo  99.7 6.7E-16 1.4E-20  160.5  15.0   42   94-135     2-43  (206)
 20 cd04270 ZnMc_TACE_like Zinc-de  99.7   2E-16 4.4E-21  168.5   8.8  145  558-709    52-243 (244)
 21 cd00203 ZnMc Zinc-dependent me  99.6 5.6E-15 1.2E-19  147.5  13.1  106  260-415    51-165 (167)
 22 cd04273 ZnMc_ADAMTS_like Zinc-  99.3   2E-12 4.4E-17  134.5   6.2   94  604-704    94-207 (207)
 23 PF01421 Reprolysin:  Reprolysi  99.3   7E-13 1.5E-17  137.0   2.0  105  588-706    75-198 (199)
 24 cd04269 ZnMc_adamalysin_II_lik  99.3 3.1E-12 6.7E-17  131.6   6.1   92  604-705    87-194 (194)
 25 cd04271 ZnMc_ADAM_fungal Zinc-  99.2 1.2E-11 2.6E-16  130.7   5.3  105  591-704    82-228 (228)
 26 PF13582 Reprolysin_3:  Metallo  99.1 3.3E-11 7.1E-16  114.8   3.9   61  264-365    64-124 (124)
 27 cd04272 ZnMc_salivary_gland_MP  99.1 9.1E-11   2E-15  123.3   5.2   95  603-704    94-220 (220)
 28 cd04267 ZnMc_ADAM_like Zinc-de  98.8 2.5E-09 5.4E-14  109.8   5.1   85  602-695    87-191 (192)
 29 PF13688 Reprolysin_5:  Metallo  98.3 1.3E-07 2.9E-12   97.3   0.7  103  558-672    45-161 (196)
 30 KOG3475|consensus               98.2 5.6E-07 1.2E-11   78.1   2.0   75  779-853     1-75  (92)
 31 PF08516 ADAM_CR:  ADAM cystein  98.2 3.6E-07 7.7E-12   86.9   0.2   39  815-853     1-44  (117)
 32 PTZ00073 60S ribosomal protein  98.2 1.5E-06 3.2E-11   77.0   3.4   75  779-853     1-75  (91)
 33 PF13574 Reprolysin_2:  Metallo  98.1 9.4E-07   2E-11   89.7   2.1   87  601-696    64-172 (173)
 34 cd00203 ZnMc Zinc-dependent me  97.6 4.9E-05 1.1E-09   75.8   4.4   58  609-672    54-115 (167)
 35 PRK13267 archaemetzincin-like   97.5 7.7E-05 1.7E-09   76.2   3.3  100  240-386    47-152 (179)
 36 KOG3538|consensus               97.3 0.00037 7.9E-09   87.2   7.4  112  604-722   270-406 (845)
 37 PF13582 Reprolysin_3:  Metallo  96.8 0.00042   9E-09   65.9   1.2   58  606-670    63-124 (124)
 38 smart00608 ACR ADAM Cysteine-R  96.8  0.0005 1.1E-08   67.4   1.7   30  823-852    14-43  (137)
 39 PF13583 Reprolysin_4:  Metallo  96.8  0.0011 2.3E-08   69.5   4.2   86  606-699    93-202 (206)
 40 PRK04179 rpl37e 50S ribosomal   96.1   0.004 8.6E-08   51.9   2.3   54  779-832     1-56  (62)
 41 cd04279 ZnMc_MMP_like_1 Zinc-d  95.7   0.009 1.9E-07   59.4   3.8   42  348-400   104-145 (156)
 42 PF00413 Peptidase_M10:  Matrix  95.6  0.0028   6E-08   62.4  -0.4   31  347-388   104-134 (154)
 43 smart00608 ACR ADAM Cysteine-R  95.5  0.0094   2E-07   58.5   2.7   23  522-544     1-27  (137)
 44 cd04278 ZnMc_MMP Zinc-dependen  94.6   0.016 3.5E-07   57.7   1.8   40  348-400   107-146 (157)
 45 TIGR02232 myxo_disulf_rpt Myxo  94.6   0.034 7.5E-07   42.4   2.9   34  709-745     2-35  (38)
 46 smart00235 ZnMc Zinc-dependent  94.2   0.021 4.6E-07   55.5   1.6   18  350-367    88-105 (140)
 47 cd04277 ZnMc_serralysin_like Z  94.1   0.043 9.3E-07   56.2   3.5   40  348-387   113-157 (186)
 48 cd04268 ZnMc_MMP_like Zinc-dep  94.1   0.021 4.5E-07   56.9   1.2   40  348-387    94-138 (165)
 49 COG1913 Predicted Zn-dependent  93.9   0.095 2.1E-06   52.9   5.4   27  349-386   125-151 (181)
 50 cd04275 ZnMc_pappalysin_like Z  93.9     0.2 4.3E-06   53.3   8.2   21  348-368   137-157 (225)
 51 COG2126 RPL37A Ribosomal prote  93.6   0.049 1.1E-06   44.9   2.2   53  779-831     1-54  (61)
 52 PF12388 Peptidase_M57:  Dual-a  93.4   0.049 1.1E-06   56.9   2.6   28  347-374   132-161 (211)
 53 PF05548 Peptidase_M11:  Gameto  92.9    0.09 1.9E-06   58.6   3.8   19  349-367   151-169 (314)
 54 PF05572 Peptidase_M43:  Pregna  92.6   0.049 1.1E-06   54.5   1.1   24  348-372    69-92  (154)
 55 cd04276 ZnMc_MMP_like_2 Zinc-d  92.2   0.093   2E-06   54.6   2.6   26  348-373   116-141 (197)
 56 TIGR02232 myxo_disulf_rpt Myxo  92.1    0.16 3.5E-06   38.8   3.0   37  424-473     2-38  (38)
 57 PRK13267 archaemetzincin-like   91.8   0.093   2E-06   53.8   2.1   85  583-679    47-150 (179)
 58 PF02031 Peptidase_M7:  Strepto  91.1   0.076 1.7E-06   50.9   0.5   23  349-372    78-100 (132)
 59 PF01907 Ribosomal_L37e:  Ribos  90.1    0.12 2.5E-06   42.5   0.7   52  780-831     1-53  (55)
 60 PF10462 Peptidase_M66:  Peptid  88.4    0.22 4.9E-06   55.0   1.6   19  348-366   193-211 (305)
 61 cd04327 ZnMc_MMP_like_3 Zinc-d  87.8    0.26 5.7E-06   51.2   1.6   22  349-370    93-114 (198)
 62 KOG1565|consensus               86.5     0.4 8.7E-06   56.3   2.3   30  348-388   211-240 (469)
 63 PF08516 ADAM_CR:  ADAM cystein  86.1    0.34 7.4E-06   46.3   1.2   31  522-552     1-42  (117)
 64 cd04283 ZnMc_hatching_enzyme Z  85.6    0.35 7.6E-06   49.8   1.1   18  350-367    79-96  (182)
 65 PF07998 Peptidase_M54:  Peptid  85.0    0.42 9.2E-06   49.6   1.4   19  348-366   145-163 (194)
 66 cd04281 ZnMc_BMP1_TLD Zinc-dep  84.6     0.4 8.7E-06   50.1   1.0   18  350-367    89-106 (200)
 67 cd04280 ZnMc_astacin_like Zinc  84.2    0.41   9E-06   49.0   0.9   18  350-367    76-93  (180)
 68 PF01400 Astacin:  Astacin (Pep  82.7    0.45 9.7E-06   49.3   0.4   20  349-368    80-99  (191)
 69 COG4783 Putative Zn-dependent   81.4    0.55 1.2E-05   54.3   0.5   18  346-363   128-145 (484)
 70 TIGR03296 M6dom_TIGR03296 M6 f  80.7    0.53 1.2E-05   51.8   0.1   37  349-386   166-203 (286)
 71 cd04282 ZnMc_meprin Zinc-depen  80.5    0.73 1.6E-05   49.2   1.0   18  350-367   122-139 (230)
 72 PF13948 DUF4215:  Domain of un  75.4     1.3 2.8E-05   35.4   0.9   21  708-728    18-38  (47)
 73 KOG3714|consensus               72.6     1.3 2.8E-05   51.3   0.4   18  350-367   161-178 (411)
 74 PF12044 Metallopep:  Putative   70.5     2.7 5.8E-05   48.7   2.3   17  351-367   319-335 (423)
 75 COG3824 Predicted Zn-dependent  70.0     1.7 3.8E-05   41.3   0.5   20  347-366   108-127 (136)
 76 PF06114 DUF955:  Domain of unk  68.2     1.9   4E-05   39.7   0.3   14  347-360    41-54  (122)
 77 PF12725 DUF3810:  Protein of u  66.2       3 6.4E-05   46.8   1.5   26  342-367   190-215 (318)
 78 PF13948 DUF4215:  Domain of un  65.4       3 6.6E-05   33.3   1.0   20  423-444    18-37  (47)
 79 COG2856 Predicted Zn peptidase  60.4     3.5 7.7E-05   43.5   0.7   13  348-360    72-84  (213)
 80 PF09471 Peptidase_M64:  IgA Pe  58.5     4.9 0.00011   43.9   1.4   20  347-366   215-234 (264)
 81 COG3590 PepO Predicted metallo  57.8     4.5 9.8E-05   47.9   1.0   27  334-367   480-509 (654)
 82 KOG4525|consensus               56.3     7.3 0.00016   44.6   2.3   17  351-367   304-320 (614)
 83 PF00413 Peptidase_M10:  Matrix  55.7     3.9 8.5E-05   40.0   0.1   29  649-681   104-132 (154)
 84 PF12115 Salp15:  Salivary prot  55.3     7.5 0.00016   37.2   1.9   21  520-540    93-114 (118)
 85 cd04268 ZnMc_MMP_like Zinc-dep  51.8     7.3 0.00016   38.6   1.3   19  650-671    94-112 (165)
 86 PF01435 Peptidase_M48:  Peptid  50.3     7.2 0.00015   40.5   1.0   18  346-363    87-104 (226)
 87 PRK01345 heat shock protein Ht  50.2     6.2 0.00013   44.2   0.5   15  346-360   122-136 (317)
 88 PF05547 Peptidase_M6:  Immune   50.0     4.4 9.6E-05   49.4  -0.7   17  349-365   222-238 (645)
 89 PF06262 DUF1025:  Possibl zinc  49.1     8.8 0.00019   35.6   1.3   20  346-365    71-90  (97)
 90 COG0501 HtpX Zn-dependent prot  48.9     6.7 0.00015   42.8   0.6   15  346-360   155-169 (302)
 91 TIGR02289 M3_not_pepF oligoend  47.3      14 0.00029   44.6   2.8   14  347-360   336-349 (549)
 92 TIGR00181 pepF oligoendopeptid  45.0      16 0.00034   44.4   2.9   14  347-360   377-390 (591)
 93 PRK03001 M48 family peptidase;  43.7      10 0.00022   41.7   1.0   15  346-360   122-136 (283)
 94 PRK02870 heat shock protein Ht  43.3     9.9 0.00022   43.0   0.8   18  346-365   171-188 (336)
 95 PRK03982 heat shock protein Ht  43.2      11 0.00023   41.7   1.0   15  346-360   123-137 (288)
 96 PRK05457 heat shock protein Ht  42.9      10 0.00022   41.9   0.8   15  346-360   132-146 (284)
 97 PRK01265 heat shock protein Ht  42.0     9.8 0.00021   42.8   0.5   14  346-359   138-151 (324)
 98 PRK03072 heat shock protein Ht  41.5      12 0.00025   41.4   1.0   16  346-361   125-140 (288)
 99 cd04279 ZnMc_MMP_like_1 Zinc-d  39.9      10 0.00022   37.6   0.2   29  649-680   103-131 (156)
100 COG1913 Predicted Zn-dependent  39.3      11 0.00024   38.4   0.4   23  653-679   127-149 (181)
101 PF01863 DUF45:  Protein of unk  39.2      17 0.00037   37.5   1.7   24  344-367   160-183 (205)
102 PRK04897 heat shock protein Ht  39.0      12 0.00027   41.4   0.7   15  346-360   135-149 (298)
103 PF11350 DUF3152:  Protein of u  38.2      16 0.00035   38.2   1.3   20  347-366   138-157 (203)
104 cd04278 ZnMc_MMP Zinc-dependen  38.1      13 0.00029   36.8   0.7   30  649-682   106-135 (157)
105 PF05572 Peptidase_M43:  Pregna  38.1      10 0.00022   38.1  -0.2   18  650-670    69-86  (154)
106 cd04275 ZnMc_pappalysin_like Z  37.7      23  0.0005   37.8   2.5   45  623-670    98-154 (225)
107 PF01431 Peptidase_M13:  Peptid  37.4      13 0.00029   38.4   0.6   44  347-421    35-83  (206)
108 PF04298 Zn_peptidase_2:  Putat  36.8      20 0.00044   38.1   1.8   14  347-360    88-101 (222)
109 TIGR02290 M3_fam_3 oligoendope  36.7      25 0.00055   42.7   2.9   14  347-360   374-387 (587)
110 PRK02391 heat shock protein Ht  34.9      17 0.00037   40.3   1.0   14  346-359   131-144 (296)
111 PTZ00337 surface protease GP63  32.7      44 0.00095   40.5   3.9   19  347-365   228-246 (567)
112 PF04228 Zn_peptidase:  Putativ  31.2      21 0.00046   39.5   1.0   13  347-359   169-181 (292)
113 COG1164 Oligoendopeptidase F [  30.2      19 0.00042   43.8   0.5   16  347-362   379-394 (598)
114 PF12315 DUF3633:  Protein of u  29.9      23  0.0005   37.2   0.9   23  343-365    88-110 (212)
115 KOG2719|consensus               29.4      30 0.00065   40.2   1.8   14  346-359   278-291 (428)
116 PF02031 Peptidase_M7:  Strepto  29.3      15 0.00033   35.7  -0.5   24  653-679    80-104 (132)
117 cd06456 M3A_DCP_Oligopeptidase  29.0      84  0.0018   36.7   5.4   14  347-360   207-220 (422)
118 PF13485 Peptidase_MA_2:  Pepti  28.0      25 0.00055   32.4   0.7   16  348-363    25-40  (128)
119 PF07998 Peptidase_M54:  Peptid  27.6      18  0.0004   37.7  -0.3   89  581-679    60-169 (194)
120 smart00235 ZnMc Zinc-dependent  27.4      16 0.00035   35.3  -0.7   16  652-670    88-103 (140)
121 COG2738 Predicted Zn-dependent  27.0      30 0.00065   36.0   1.1   26  337-365    84-109 (226)
122 cd06459 M3B_Oligoendopeptidase  27.0      26 0.00055   40.3   0.7   14  347-360   221-234 (427)
123 KOG3624|consensus               26.3      35 0.00075   42.3   1.7   14  347-360   517-530 (687)
124 cd06455 M3A_TOP Peptidase M3 T  26.2      27 0.00058   41.3   0.7   14  347-360   262-275 (472)
125 PF01447 Peptidase_M4:  Thermol  26.0      35 0.00075   34.2   1.3   11  349-359   136-146 (150)
126 COG4784 Putative Zn-dependent   25.7      26 0.00057   39.3   0.4   19  346-365   122-140 (479)
127 PF01432 Peptidase_M3:  Peptida  25.6      27 0.00059   40.9   0.6   14  347-360   241-254 (458)
128 KOG1047|consensus               25.5      31 0.00067   41.1   1.0   14  347-360   287-300 (613)
129 PF12388 Peptidase_M57:  Dual-a  24.3      22 0.00048   37.5  -0.4   18  650-670   133-150 (211)
130 KOG2090|consensus               23.8      94   0.002   37.6   4.5   14  347-360   478-491 (704)
131 PF01457 Peptidase_M8:  Leishma  23.7      40 0.00086   40.5   1.5   18  347-364   209-226 (521)
132 PF13699 DUF4157:  Domain of un  23.2      54  0.0012   29.2   1.9   13  348-360    61-73  (79)
133 cd04277 ZnMc_serralysin_like Z  23.2      23 0.00049   36.2  -0.6   17  652-671   115-131 (186)
134 PF14891 Peptidase_M91:  Effect  23.0      30 0.00065   35.2   0.2   21  347-367   102-122 (174)
135 PF08325 WLM:  WLM domain;  Int  22.6      42 0.00091   34.8   1.2   22  346-367    80-101 (186)
136 PF11403 Yeast_MT:  Yeast metal  21.4 1.3E+02  0.0028   22.5   3.2   37  490-528     4-40  (40)
137 PF05548 Peptidase_M11:  Gameto  20.8      40 0.00087   37.8   0.7   17  652-671   152-168 (314)
138 COG5549 Predicted Zn-dependent  20.1      42 0.00092   35.3   0.6   19  349-367   188-207 (236)

No 1  
>KOG3658|consensus
Probab=100.00  E-value=2.7e-97  Score=831.28  Aligned_cols=526  Identities=39%  Similarity=0.636  Sum_probs=421.9

Q ss_pred             CEEEEEcCCeEEEEEeCCCCCCCCCC-ccEEEEEecccccccccC-CCCCCCCCCCCCCcCCCCccccCcCCCCCCCccc
Q psy14093          1 MTASIHTPEETYHIEPSWRHLPHLGN-QSMVAYRASDVKLSWDHA-HDKPASPCGYVKEEFNTTDFDLDMEEDDPDMPHT   78 (857)
Q Consensus         1 l~G~I~~~~~~Y~IEPl~~~~~~~~~-~~HivYr~sd~~~~~~~~-~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~~~~~   78 (857)
                      +.|.|++++++|+|||+|+|+++... +++|||+.+|++...... ....+..||+.+..........+....++.....
T Consensus       129 ~~g~I~t~~~~y~IEPa~~y~~d~~~p~~~viYke~di~~~~~~~~~~~~p~~~g~~k~~~~el~~k~~~~~~p~n~~~~  208 (764)
T KOG3658|consen  129 FEGSIQTEGDTYYIEPAWRYFEDDKLPFHSVIYKESDIKNVHRLVATAGSPKVCGYSKRRRRELLKKGEVDRIPPNELTA  208 (764)
T ss_pred             EEEEEEecCceEEeccccccccCCCCccceEEEehhhhcchhhcccccCCCcccchhhhhhhhcccccccccCChhhhhh
Confidence            57999999999999999999988765 999999999998542211 1123677887663222211111111111111122


Q ss_pred             --ccccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEE
Q psy14093         79 --RSKRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIK  156 (857)
Q Consensus        79 --r~kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~  156 (857)
                        |.||......  .+.+++|.|+||||++||++||..+.+++++||+.+|.+||.||++++|+++   .|+.+|+|+|+
T Consensus       209 ~~r~~R~~d~~~--~~~kntC~LylqADh~fy~~mg~~t~e~~~~~l~~hI~rVn~IY~~T~f~~~---~g~kni~F~Ik  283 (764)
T KOG3658|consen  209 NGRSKRRKDLLS--DPMKNTCSLYLQADHRFYRYMGRGTRETTTNYLINHIDRVNEIYRNTNFDDR---GGFKNIGFVIK  283 (764)
T ss_pred             hhhhhhhccccc--ccccceeeEEEEecHHHHHHHcCCchHHHHHHHHHHHHHHHHHhcCCccccC---CCccceEEEEE
Confidence              5566554321  3679999999999999999999888999999999999999999999999754   45678888888


Q ss_pred             EEEEeccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhcccc
Q psy14093        157 KIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGE  236 (857)
Q Consensus       157 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~  236 (857)
                      +|.|+++++..+.++.||||+.+                                                         
T Consensus       284 rIkI~~~~~~~~~~~~~~N~f~~---------------------------------------------------------  306 (764)
T KOG3658|consen  284 RIKILDSPACVRPGEDHYNMFCE---------------------------------------------------------  306 (764)
T ss_pred             EEEecCchhhcccCcCccccchh---------------------------------------------------------
Confidence            88888777777777777777431                                                         


Q ss_pred             ccccccccccchhhhccccccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCc
Q psy14093        237 AHYNMIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSS  316 (857)
Q Consensus       237 ~~~~~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~  316 (857)
                             ......++++++|. .+||+||||||||++||++|||||||||+|+.+++||||++.+..             
T Consensus       307 -------e~~~v~~fl~l~s~-~~~sdfCLAylfT~rdFe~GtLGLAwVgsp~~~saGGIC~k~~~y-------------  365 (764)
T KOG3658|consen  307 -------EVKDVLEFLSLNSE-EKHSDFCLAYLFTYRDFEGGTLGLAWVGSPRSNSAGGICEKAYTY-------------  365 (764)
T ss_pred             -------hHHHHHHHhhhhhh-ccccchhheeeeeecccccceeeeEEeccCccCCCCcccccceec-------------
Confidence                   11223345555554 478999999999999999999999999999999999999986532             


Q ss_pred             cccccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCC
Q psy14093        317 SQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNK  396 (857)
Q Consensus       317 ~~~~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~  396 (857)
                          .+|..+|+|+||+|++| ||++||++++.+|+|||||||||++||++. +|.|+.+++|+|||++++++|+++||.
T Consensus       366 ----~~G~~~sLNtGi~T~~N-Yg~~Vp~kvs~lt~AHEiGHNfGSpHDp~~-ECsPg~~~~GnyiM~a~AtSGd~~NN~  439 (764)
T KOG3658|consen  366 ----NEGKKRSLNTGISTSVN-YGKRVPTKVSDLTLAHEIGHNFGSPHDPDI-ECSPGESQGGNYIMYAYATSGDKPNNK  439 (764)
T ss_pred             ----cCcceEEeecceeeeee-cCCccCcchhheeehhhhccccCCCCCCCC-ccCCCCCCCCcEEEEEecccCcCccCc
Confidence                23667799999999999 999999999999999999999999999975 999999999999999999999999999


Q ss_pred             CCCCCCCCCCcccHHHHhc-------cccccCCCcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCC-----CC
Q psy14093        397 TFSSHVDGTNFCLAHLFTH-------QSFWSRGSSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHS-----IL  464 (857)
Q Consensus       397 ~FS~~~~~~~~CS~~~i~~-------~Cl~~~~~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~-----~~  464 (857)
                      +||+       ||+++|++       .||.++..++|||+|||+|  ||||||...       .|.|.||++.     ..
T Consensus       440 kFSp-------CS~ksI~~VL~~k~~~CFqE~~~sfCGN~iVE~G--EECDcG~~~-------~~~D~CC~p~~~~~~k~  503 (764)
T KOG3658|consen  440 KFSP-------CSLKSISKVLESKKRNCFQERESSFCGNGIVEPG--EECDCGFST-------DCKDSCCDPQPNLDEKP  503 (764)
T ss_pred             ccCc-------chHHHHHHHHHHhhhhhccCccccccCCccccCC--cccCccccc-------ccCCcccCCCCCCCCCC
Confidence            9999       99999998       5999999999999999999  999999986       6999999931     25


Q ss_pred             CcccCCCccCCCCCCccc-CCcccCCCceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCCCceecCCccchHhhh
Q psy14093        465 SELFPVLLYSDKNSPCCQ-NCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLERGKCRGGKCIPFCET  543 (857)
Q Consensus       465 C~l~~ga~Cs~~~g~CC~-~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~~~~C~~G~C~~~C~~  543 (857)
                      |+|++||||||+|||||. +|+|+++++.||.+++.+|..+++|+|.+++||++.|++|||+|.+.+.|++|+|.++|+.
T Consensus       504 C~lk~gaqCSpsqgpCC~~~Cqf~~~~~kc~~~d~~~C~~~s~CnG~~aeCP~s~~~~d~t~C~~~~~C~~G~C~gs~c~  583 (764)
T KOG3658|consen  504 CTLKPGAQCSPSQGPCCTPNCQFHTSGEKCREADEATCKGESTCNGFSAECPPSPPKPDGTVCNETGVCINGKCIGSCCL  583 (764)
T ss_pred             ceeCCCCccCCCCCCccCCcceeecccCeeeccccccCcCcccccCCccCCcCCCCCCCCCcccccceEeCCcCccHHHH
Confidence            999999999999999995 9999999999998887899999999999999999999999999999999999999986665


Q ss_pred             -cccccccccccC-------------CCCcceee-e-------eEEEcCCCCccC--CCccccCcccccccccccccccc
Q psy14093        544 -QNQQSCMCDVNG-------------YRGMGFVI-K-------KIVVHSDATRVR--QGEAHYNMIRDKWDVRNLLETFS  599 (857)
Q Consensus       544 -~~~~~C~c~~~~-------------~~g~C~pi-~-------~~~il~~GT~C~--~G~c~~n~c~r~W~~~~lL~~~~  599 (857)
                       +|+++|+|+...             ..++|+|+ +       ..++|++|+||+  .|+|+.+..++.-+....|.|+.
T Consensus       584 ~~glesC~c~~~~~~~sC~lcCr~~~~~stC~~~v~~~~~~~~~~~~L~pGsPC~~~~GyCdvf~KCr~vd~dgpl~rl~  663 (764)
T KOG3658|consen  584 MQGLESCFCTETDLDKSCKLCCRKPDDNSTCVPTVEANRKNFDVGIYLRPGSPCNNYKGYCDVFGKCRKVDADGPLARLK  663 (764)
T ss_pred             hhCcceeeeccCCchhhHHHHhcCCCCCCCcccccCccccccccceecCCCCcCcCCcceeccccceeeecCCchHHHHH
Confidence             899999988633             56888884 2       245899999999  89999988888777776666654


Q ss_pred             cCCC-CCCc-ceEEEEEeeeecccCCCcceeeeecCC
Q psy14093        600 SHVD-GTNF-CLAHLFTHQSFWSRGSSILGLAYIASP  634 (857)
Q Consensus       600 ~~~~-hD~~-c~AhLft~~df~~~~~~tvGlA~vgs~  634 (857)
                      ..+. -..+ -+|..++. .||  ....+|+|++-.|
T Consensus       664 ~li~~~~~~~~~~~w~~~-~w~--~v~i~gi~~i~~m  697 (764)
T KOG3658|consen  664 DLIFQLETIETFAEWIVL-NWL--AVNIVGIVLIVLM  697 (764)
T ss_pred             HHHhhhhhhhhhHHHHHh-hhH--HhHhHHHHHHHHH
Confidence            2110 0100 01222222 344  3567777776555


No 2  
>KOG3607|consensus
Probab=100.00  E-value=1e-60  Score=563.80  Aligned_cols=375  Identities=26%  Similarity=0.453  Sum_probs=299.5

Q ss_pred             CEEEEEcCCeEEEEEeCCCCCCCCCCccEEEEEecccccccccCCCCCCCCCCCCCCcCCCCccccCcCCCCCC-Ccccc
Q psy14093          1 MTASIHTPEETYHIEPSWRHLPHLGNQSMVAYRASDVKLSWDHAHDKPASPCGYVKEEFNTTDFDLDMEEDDPD-MPHTR   79 (857)
Q Consensus         1 l~G~I~~~~~~Y~IEPl~~~~~~~~~~~HivYr~sd~~~~~~~~~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~-~~~~r   79 (857)
                      |+|+|...+..|.|||+    ..+..|.|+||+.......      .....||..+.....  ...  ....+. .+..+
T Consensus       118 l~g~~~~~~~~Y~Iepl----~~s~~f~h~iy~~~~~~~~------~~~~~~g~~~~~~~~--~~~--~~~~~~~~~~~~  183 (716)
T KOG3607|consen  118 LRGLFLFENISYSIEPL----EGSDSFEHLIYKLLSLETD------NSALVCGHQEAHIIT--KQM--ELPLRSKRQERR  183 (716)
T ss_pred             cceeEEECceeEEEEEC----CCCCCcceeeecccccccc------ccccccccchhhccc--ccc--cccCcccccccc
Confidence            68999999999999999    4567899999999876632      335678877543332  000  000111 11122


Q ss_pred             cccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEE
Q psy14093         80 SKRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIV  159 (857)
Q Consensus        80 ~kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~  159 (857)
                      .+|.      ++...+|+||+||+|+.+|++++. ++.++.++++.++|+||.||+..              +++|.++ 
T Consensus       184 ~~~~------~~~~~kyvEl~vV~D~~~~~~~~~-~~~~v~~~~~~vvn~~d~~y~~l--------------ni~i~lv-  241 (716)
T KOG3607|consen  184 EKRT------VIEHTKYVELYVVVDNDLYKKYGS-NLSKVRSFAKEVVNLVDSIYKQL--------------NIRIVLV-  241 (716)
T ss_pred             cccc------cccccceEEEEEEEehHHHHHccc-cHHHHHHHHHHhhhHHhhhHhhc--------------cceEEEE-
Confidence            2332      246899999999999999999985 78899999999999999999962              3444443 


Q ss_pred             EeccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccc
Q psy14093        160 VHSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY  239 (857)
Q Consensus       160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~  239 (857)
                                                              ++|+|+++|             +|.+..++..+|.     
T Consensus       242 ----------------------------------------~lE~Wt~~d-------------ki~~~~~~~~tL~-----  263 (716)
T KOG3607|consen  242 ----------------------------------------GLEIWTDGN-------------KIDVSEDLRETLH-----  263 (716)
T ss_pred             ----------------------------------------EEEecCCCC-------------eecccccHHHHHH-----
Confidence                                                    235555555             6777778888887     


Q ss_pred             cccccccchhhhccccccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCcccc
Q psy14093        240 NMIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQY  319 (857)
Q Consensus       240 ~~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~  319 (857)
                        .|..|+...+..    ...||   .|||||+..|.+.++|+||+|        |||++.++  +++.         . 
T Consensus       264 --~F~~wr~~~l~~----r~~hD---~a~L~~~~~~~~~~~G~a~~~--------~mCs~~~s--~gv~---------~-  314 (716)
T KOG3607|consen  264 --NFLKWRKSYLTT----RLPHD---AAHLLSGILFYGKYVGLAYFG--------GMCSPGHS--GGVN---------K-  314 (716)
T ss_pred             --HHHHHHHhhccc----cCCCC---ceEEEEeeeccCceeceeecc--------cccCcccc--ccee---------e-
Confidence              699999976664    35789   999999999999999999999        99999763  3332         0 


Q ss_pred             ccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCC
Q psy14093        320 FKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFS  399 (857)
Q Consensus       320 ~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS  399 (857)
                                         +.... +...|.|+||||||||||.||...+.|.+    .+.||| +..+.  .+.+.+||
T Consensus       315 -------------------~~~~~-~~~~a~v~AhelgH~lGm~hD~~~c~C~~----~~~CiM-~~~~~--~~~~~~FS  367 (716)
T KOG3607|consen  315 -------------------FHSDI-LLAFAVVLAHELGHNLGMIHDESSCSCPS----SGLCIM-PEETG--FYIPKKFS  367 (716)
T ss_pred             -------------------cCccc-chhHHHHHHHHHHhhcCcccccccccCCC----CCcccc-ccccC--cCcccccc
Confidence                               11111 23579999999999999999997777763    367999 55442  36789999


Q ss_pred             CCCCCCCcccHHHHhc-------cccccCC-------CcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCC
Q psy14093        400 SHVDGTNFCLAHLFTH-------QSFWSRG-------SSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILS  465 (857)
Q Consensus       400 ~~~~~~~~CS~~~i~~-------~Cl~~~~-------~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C  465 (857)
                      +       ||++++..       .||.+.|       .++||||+||+|  ||||||+.+       .|.+.||++. ||
T Consensus       368 ~-------CS~~~~~~~~~~~~~~cl~n~p~p~~~~~~~~CGNg~vE~g--EECDCG~~~-------eC~~~cC~~~-tC  430 (716)
T KOG3607|consen  368 N-------CSYQDFESFLLKGGGSCLLNLPSPEKIFSGPVCGNGVVEEG--EECDCGTPE-------ECSNLCCCAS-TC  430 (716)
T ss_pred             c-------chHHHHHHHHhcCCcceEecCCCcccccCCCccCCcccccC--CcCCCCChH-------HcCccccccc-cc
Confidence            9       99999998       4988744       589999999999  999999998       4777799988 99


Q ss_pred             cccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCC-CceecCCccch---Hh
Q psy14093        466 ELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLE-RGKCRGGKCIP---FC  541 (857)
Q Consensus       466 ~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~-~~~C~~G~C~~---~C  541 (857)
                      +|+++|+|+.  |+||.+|+|+|+|++||++. ++||++|||+|+|+.||+|.+++||++|.. .++||+|.|.+   +|
T Consensus       431 kL~pga~Ca~--G~CC~~C~~~p~G~~CR~~~-~eCDlpE~C~G~S~~Cp~d~~~~dG~~C~~~~g~Cy~G~C~t~~~QC  507 (716)
T KOG3607|consen  431 KLKPGAQCAL--GLCCKDCKFLPAGTVCRQAK-NECDLPEYCNGTSSQCPPDLYVQDGIPCQGGQGYCYDGRCHTRDRQC  507 (716)
T ss_pred             cccCCCCCCC--CCchhccccccCcceecccC-CccccccccCCCCCCCCCCceeECCEecCCCcceecCCCCCCHHHHH
Confidence            9999999995  99999999999999999987 899999999999999999999999999998 68999999996   89


Q ss_pred             hhcc
Q psy14093        542 ETQN  545 (857)
Q Consensus       542 ~~~~  545 (857)
                      +++.
T Consensus       508 ~~i~  511 (716)
T KOG3607|consen  508 QKIW  511 (716)
T ss_pred             HHHh
Confidence            9963


No 3  
>KOG3607|consensus
Probab=100.00  E-value=3.2e-45  Score=432.53  Aligned_cols=225  Identities=27%  Similarity=0.469  Sum_probs=196.8

Q ss_pred             ccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceec---cccccccchhhheeeeeeec
Q psy14093        589 WDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIH---ENLKYFDTLHSSHISHTIVK  664 (857)
Q Consensus       589 W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~---~~~~~~~~~~aheigh~l~~  664 (857)
                      |+...+..+++    ||   .||||++..|   .+.++|+||+|+| ++.+++||..   ++...++.++||||||+|  
T Consensus       268 wr~~~l~~r~~----hD---~a~L~~~~~~---~~~~~G~a~~~~mCs~~~s~gv~~~~~~~~~~~a~v~AhelgH~l--  335 (716)
T KOG3607|consen  268 WRKSYLTTRLP----HD---AAHLLSGILF---YGKYVGLAYFGGMCSPGHSGGVNKFHSDILLAFAVVLAHELGHNL--  335 (716)
T ss_pred             HHHhhccccCC----CC---ceEEEEeeec---cCceeceeecccccCcccccceeecCcccchhHHHHHHHHHHhhc--
Confidence            54444444444    89   9999999888   4899999999999 7778999874   356788899999999999  


Q ss_pred             ccccccCCC------CCCcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCC-------CccccCCcccccccccc
Q psy14093        665 RGVQESNHP------FNKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEP-------EESFCGNLRVEGDEECD  723 (857)
Q Consensus       665 ~g~~~d~~~------~gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~-------~~~~CGNgiVE~gEeCD  723 (857)
                       ||.||...      ..|||..        .| |.||+++|..++.++.+.||+|.       ..++||||+||+|||||
T Consensus       336 -Gm~hD~~~c~C~~~~~CiM~~~~~~~~~~~F-S~CS~~~~~~~~~~~~~~cl~n~p~p~~~~~~~~CGNg~vE~gEECD  413 (716)
T KOG3607|consen  336 -GMIHDESSCSCPSSGLCIMPEETGFYIPKKF-SNCSYQDFESFLLKGGGSCLLNLPSPEKIFSGPVCGNGVVEEGEECD  413 (716)
T ss_pred             -CcccccccccCCCCCccccccccCcCccccc-ccchHHHHHHHHhcCCcceEecCCCcccccCCCccCCcccccCCcCC
Confidence             99999733      3588852        45 47999999999999999999982       56889999999999999


Q ss_pred             CCCCCCCCCCCCCC-CCCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCCCc---cc
Q psy14093        724 AGLLGTEDNDSCCD-KVCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHTLC---RR  799 (857)
Q Consensus       724 CG~~~~c~~d~CC~-~tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~~C---e~  799 (857)
                      ||++++|. |.||+ +||||+||     |+||.|+        || ++|+|+|+     |++|   |+  +.++|   ||
T Consensus       414 CG~~~eC~-~~cC~~~tCkL~pg-----a~Ca~G~--------CC-~~C~~~p~-----G~~C---R~--~~~eCDlpE~  468 (716)
T KOG3607|consen  414 CGTPEECS-NLCCCASTCKLKPG-----AQCALGL--------CC-KDCKFLPA-----GTVC---RQ--AKNECDLPEY  468 (716)
T ss_pred             CCChHHcC-ccccccccccccCC-----CCCCCCC--------ch-hccccccC-----ccee---cc--cCCccccccc
Confidence            99999995 55666 79999999     9999999        99 99999999     9999   87  77888   99


Q ss_pred             cCCCcccccccccccCCCCCCCCccc-----CCChhhhhhcccCCcccchhhhHHHhh
Q psy14093        800 CGKSSYHIQKKKCAQCGYPCPRLRHY-----NWSVKAQRRKTTGTGRMRHLKIVRRRF  852 (857)
Q Consensus       800 C~G~s~~Cp~d~~~~~G~~C~~~~~~-----c~~~~~qc~~~~G~~~~~a~~~c~~~~  852 (857)
                      |+|+|++||+|+|++||+||+..++|     |.++++||++|||++|+.|++.||+.+
T Consensus       469 C~G~S~~Cp~d~~~~dG~~C~~~~g~Cy~G~C~t~~~QC~~i~g~~a~~a~~~Cy~~v  526 (716)
T KOG3607|consen  469 CNGTSSQCPPDLYVQDGIPCQGGQGYCYDGRCHTRDRQCQKIWGPGARAAPKYCYEKV  526 (716)
T ss_pred             cCCCCCCCCCCceeECCEecCCCcceecCCCCCCHHHHHHHHhCCCcccCchhhhhhc
Confidence            99999999999999999999987788     679999999999999999999999874


No 4  
>KOG3658|consensus
Probab=100.00  E-value=8.1e-42  Score=384.67  Aligned_cols=243  Identities=36%  Similarity=0.635  Sum_probs=204.7

Q ss_pred             CCCCcceeeeeEEEcCCCCccCCCccccCccc-ccccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecC
Q psy14093        555 GYRGMGFVIKKIVVHSDATRVRQGEAHYNMIR-DKWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIAS  633 (857)
Q Consensus       555 ~~~g~C~pi~~~~il~~GT~C~~G~c~~n~c~-r~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs  633 (857)
                      .++++.|+|++|.|+.+++--..+.-++|+-+ ...++.+||+..+ ..+|+++|+|||||++||   +++||||||||+
T Consensus       274 g~kni~F~IkrIkI~~~~~~~~~~~~~~N~f~~e~~~v~~fl~l~s-~~~~sdfCLAylfT~rdF---e~GtLGLAwVgs  349 (764)
T KOG3658|consen  274 GFKNIGFVIKRIKILDSPACVRPGEDHYNMFCEEVKDVLEFLSLNS-EEKHSDFCLAYLFTYRDF---EGGTLGLAWVGS  349 (764)
T ss_pred             CccceEEEEEEEEecCchhhcccCcCccccchhhHHHHHHHhhhhh-hccccchhheeeeeeccc---ccceeeeEEecc
Confidence            37899999999999988875556778888887 6667777777654 567899999999999999   799999999999


Q ss_pred             CCCCcccceeccc-------cccccc-------------------hhhheeeeeeecccccccCC-CC--------CCcc
Q psy14093        634 PRPYSIGGSIHEN-------LKYFDT-------------------LHSSHISHTIVKRGVQESNH-PF--------NKIK  678 (857)
Q Consensus       634 ~~~~~s~Gv~~~~-------~~~~~~-------------------~~aheigh~l~~~g~~~d~~-~~--------gcim  678 (857)
                      +..+.+|||+...       ..++++                   ++|||||||+   |..||-. .+        .=||
T Consensus       350 p~~~saGGIC~k~~~y~~G~~~sLNtGi~T~~NYg~~Vp~kvs~lt~AHEiGHNf---GSpHDp~~ECsPg~~~~GnyiM  426 (764)
T KOG3658|consen  350 PRSNSAGGICEKAYTYNEGKKRSLNTGISTSVNYGKRVPTKVSDLTLAHEIGHNF---GSPHDPDIECSPGESQGGNYIM  426 (764)
T ss_pred             CccCCCCcccccceeccCcceEEeecceeeeeecCCccCcchhheeehhhhcccc---CCCCCCCCccCCCCCCCCcEEE
Confidence            8667899998532       123332                   4599999999   8788753 11        1256


Q ss_pred             c-----------CCCCCCCCHHHHHHHhhcCCCCcCCCCCccccCCccccccccccCCCCCCCCCCCCCC-------CCC
Q psy14093        679 E-----------VTFKTLGNLRSIRKVLEAKSGKCFSEPEESFCGNLRVEGDEECDAGLLGTEDNDSCCD-------KVC  740 (857)
Q Consensus       679 ~-----------~~fss~CS~~~~~~~l~~~~~~Cl~~~~~~~CGNgiVE~gEeCDCG~~~~c~~d~CC~-------~tC  740 (857)
                      .           .+| |+||+++|+++|..++..||+++..+||||+|||+|||||||...+| .|.||+       ..|
T Consensus       427 ~a~AtSGd~~NN~kF-SpCS~ksI~~VL~~k~~~CFqE~~~sfCGN~iVE~GEECDcG~~~~~-~D~CC~p~~~~~~k~C  504 (764)
T KOG3658|consen  427 YAYATSGDKPNNKKF-SPCSLKSISKVLESKKRNCFQERESSFCGNGIVEPGEECDCGFSTDC-KDSCCDPQPNLDEKPC  504 (764)
T ss_pred             EEecccCcCccCccc-CcchHHHHHHHHHHhhhhhccCccccccCCccccCCcccCccccccc-CCcccCCCCCCCCCCc
Confidence            2           478 58999999999999999999999999999999999999999999998 999999       689


Q ss_pred             ccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCCCc---cccCCCcccccccccccCCC
Q psy14093        741 KLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHTLC---RRCGKSSYHIQKKKCAQCGY  817 (857)
Q Consensus       741 ~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~~C---e~C~G~s~~Cp~d~~~~~G~  817 (857)
                      |||||     |||      ||+|++||+.+|||+++     |..|   |. ....+|   .+|+|.+++||+...+.||+
T Consensus       505 ~lk~g-----aqC------SpsqgpCC~~~Cqf~~~-----~~kc---~~-~d~~~C~~~s~CnG~~aeCP~s~~~~d~t  564 (764)
T KOG3658|consen  505 TLKPG-----AQC------SPSQGPCCTPNCQFHTS-----GEKC---RE-ADEATCKGESTCNGFSAECPPSPPKPDGT  564 (764)
T ss_pred             eeCCC-----Ccc------CCCCCCccCCcceeecc-----cCee---ec-cccccCcCcccccCCccCCcCCCCCCCCC
Confidence            99999     655      89999999779999999     9999   64 245689   89999999999999999999


Q ss_pred             CCCCCcccCC
Q psy14093        818 PCPRLRHYNW  827 (857)
Q Consensus       818 ~C~~~~~~c~  827 (857)
                      +|.. .+.|.
T Consensus       565 ~C~~-~~~C~  573 (764)
T KOG3658|consen  565 VCNE-TGVCI  573 (764)
T ss_pred             cccc-cceEe
Confidence            9984 66654


No 5  
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=100.00  E-value=8.5e-41  Score=354.74  Aligned_cols=235  Identities=51%  Similarity=0.881  Sum_probs=183.6

Q ss_pred             eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093         96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN  175 (857)
Q Consensus        96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~  175 (857)
                      +|.|+||||++||++++++++++++.||+++||+||.||++++|+.+    +..+|+|.|.+|+|+              
T Consensus         2 ~C~l~vVaD~~~~~~~~~~~~~~v~~~~~~~vn~vn~iY~~~n~~~~----~l~~i~i~~~~i~I~--------------   63 (244)
T cd04270           2 TCKLLLVADHRFYKYMGRGEEETTINYLISHIDRVDDIYRNTDWDGG----GFKGIGFQIKRIRIH--------------   63 (244)
T ss_pred             ccEEEEEEcHHHHHHhCCChHHHHHHHHHHHHHHHHHHhcccccCCC----cccceEEEEEEEEEE--------------
Confidence            59999999999999999644788999999999999999999886422    123566666655332              


Q ss_pred             ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093        176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF  255 (857)
Q Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~  255 (857)
                                                   .+.+                    +..++.  ..++..|..|+...+++.+
T Consensus        64 -----------------------------~~~~--------------------~~~~~~--~~~~~~f~~w~~~~~~~~~   92 (244)
T cd04270          64 -----------------------------TTPD--------------------EVDPGN--KFYNKSFPNWGVEKFLVKL   92 (244)
T ss_pred             -----------------------------cCcc--------------------cccCCC--cccccCccchhHHHHhhhh
Confidence                                         1111                    111111  1122358999998888766


Q ss_pred             ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccc
Q psy14093        256 SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSS  335 (857)
Q Consensus       256 s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~  335 (857)
                      +....+|+||+|||||++||.++++||||+|++..+..||||++.++...                 +...++|+||++.
T Consensus        93 ~~~~~~d~~c~A~LlT~~df~~~tlGlA~vg~~~~~~~gGIC~~~~s~~~-----------------~~~~~~n~gl~t~  155 (244)
T cd04270          93 LLEQFSDDVCLAHLFTYRDFDMGTLGLAYVGSPRDNSAGGICEKAYYYSN-----------------GKKKYLNTGLTTT  155 (244)
T ss_pred             hhhcCCccceEEEEEeccccCCCceeeeeecccccCCCCcccCCcccccC-----------------CcceeeecceEee
Confidence            55455666899999999999999999999999888889999999764311                 1224789999988


Q ss_pred             cccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHhc
Q psy14093        336 RNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTH  415 (857)
Q Consensus       336 ~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~~  415 (857)
                      .+ ||.+++++.+++||||||||||||+||++.+.|.|...++++|||++.+++|+.+++++||+       ||+++|++
T Consensus       156 ~~-~~~~~~~~~~a~t~AHElGHnlGm~HD~~~~~C~~~~~~~~~cIM~~~~~~~~~~~~~~FS~-------CS~~~~~~  227 (244)
T cd04270         156 VN-YGKRVPTKESDLVTAHELGHNFGSPHDPDIAECAPGESQGGNYIMYARATSGDKENNKKFSP-------CSKKSISK  227 (244)
T ss_pred             ec-cCCccchhHHHHHHHHHHHHhcCCCCCCCCCCCCCCCCCCCceEecCccccCCCCCCCCCCH-------hHHHHHHH
Confidence            88 89888888889999999999999999997667877655467999999988888889999999       99999987


Q ss_pred             -------cccccCCCc
Q psy14093        416 -------QSFWSRGSS  424 (857)
Q Consensus       416 -------~Cl~~~~~~  424 (857)
                             .||.++..+
T Consensus       228 ~L~~~~~~CL~~~~~~  243 (244)
T cd04270         228 VLEVKSNSCFVERSQS  243 (244)
T ss_pred             HHccCCCCCcCCCCCC
Confidence                   599887544


No 6  
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=99.97  E-value=2.2e-31  Score=276.35  Aligned_cols=191  Identities=24%  Similarity=0.361  Sum_probs=142.4

Q ss_pred             ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093         95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY  174 (857)
Q Consensus        95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~  174 (857)
                      +|+||+||+|+.+|++++.   +.+++|++.+||+||.||+++++.        ++|+|.|+.|+               
T Consensus         1 ~~vEl~vv~D~~~~~~~~~---~~~~~y~~~~~n~v~~~y~~~s~~--------~~i~i~l~~i~---------------   54 (207)
T cd04273           1 RYVETLVVADSKMVEFHHG---EDLEHYILTLMNIVASLYKDPSLG--------NSINIVVVRLI---------------   54 (207)
T ss_pred             CeEEEEEEECHHHHHHcCh---HHHHHHHHHHHHHHHHHhcCcccC--------CCeEEEEEEEE---------------
Confidence            5899999999999998763   458899999999999999986532        45666666653               


Q ss_pred             cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093        175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV  254 (857)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~  254 (857)
                           +|++.++                                   .+.+..++..+|.       .|.+|+.+.+...
T Consensus        55 -----i~~~~~~-----------------------------------~~~~~~~~~~~L~-------~F~~w~~~~~~~~   87 (207)
T cd04273          55 -----VLEDEES-----------------------------------GLLISGNAQKSLK-------SFCRWQKKLNPPN   87 (207)
T ss_pred             -----EecCCCC-----------------------------------cCcccCCHHHHHH-------HHHHHHHHcCCcc
Confidence                 3433221                                   1223456666665       7999998776643


Q ss_pred             cccccCCCccceEEEeeccccC-----CCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecc
Q psy14093        255 FSREYSHKDFCLAHLFTDLKFE-----GGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLN  329 (857)
Q Consensus       255 ~s~~~~~D~~cla~L~T~~df~-----~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n  329 (857)
                      .....++|   +|+|||++||.     ++++|+||+|        |||++.++.  ++.                   .+
T Consensus        88 ~~~~~~~D---~a~llt~~d~~~~~~~~~~~G~A~~g--------giC~~~~s~--~i~-------------------~~  135 (207)
T cd04273          88 DSDPEHHD---HAILLTRQDICRSNGNCDTLGLAPVG--------GMCSPSRSC--SIN-------------------ED  135 (207)
T ss_pred             cccccccc---eEEEEeeecccccCCCCCceEEeccc--------cCCCCCcce--EEE-------------------cC
Confidence            22224677   99999999996     6899999999        999987632  222                   11


Q ss_pred             cCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCccc
Q psy14093        330 SGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCL  409 (857)
Q Consensus       330 ~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS  409 (857)
                      .|+              ..|.|+||||||+|||.||++.+.|++..  +++|||++.....  .++++||+       ||
T Consensus       136 ~~~--------------~~a~~~aHElGH~LG~~HD~~~~~C~~~~--~~~~IM~~~~~~~--~~~~~fS~-------CS  190 (207)
T cd04273         136 TGL--------------SSAFTIAHELGHVLGMPHDGDGNSCGPEG--KDGHIMSPTLGAN--TGPFTWSK-------CS  190 (207)
T ss_pred             CCc--------------eeEEeeeeechhhcCCCCCCCCCCCCCCC--CCceeecccccCC--CCCCCcCH-------HH
Confidence            111              35789999999999999999878897642  2479999987642  36899999       99


Q ss_pred             HHHHhc
Q psy14093        410 AHLFTH  415 (857)
Q Consensus       410 ~~~i~~  415 (857)
                      +++|.+
T Consensus       191 ~~~~~~  196 (207)
T cd04273         191 RRYLTS  196 (207)
T ss_pred             HHHHHH
Confidence            999987


No 7  
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.97  E-value=9.4e-31  Score=268.65  Aligned_cols=187  Identities=28%  Similarity=0.495  Sum_probs=143.2

Q ss_pred             ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093         95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY  174 (857)
Q Consensus        95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~  174 (857)
                      +|+|++||+|+.+|+++++ +++++++|++.++|.||.||++.            +|+|.|..|                
T Consensus         1 ~~iE~~vv~D~~~~~~~~~-n~~~~~~~v~~~~n~vn~~y~~~------------~i~v~l~~i----------------   51 (194)
T cd04269           1 KYVELVVVVDNSLYKKYGS-NLSKVRQRVIEIVNIVDSIYRPL------------NIRVVLVGL----------------   51 (194)
T ss_pred             CeEEEEEEEeHHHHHHhCC-CHHHHHHHHHHHHHHHHHHHcCC------------CeEEEEEEE----------------
Confidence            5899999999999999985 68889999999999999999983            466666665                


Q ss_pred             cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093        175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV  254 (857)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~  254 (857)
                          +||++++++                                    .+..++..+|.       .|.+|+...+.. 
T Consensus        52 ----~i~~~~~~~------------------------------------~~~~~~~~~L~-------~f~~w~~~~~~~-   83 (194)
T cd04269          52 ----EIWTDKDKI------------------------------------SVSGDAGETLN-------RFLDWKRSNLLP-   83 (194)
T ss_pred             ----EEeecCCcc------------------------------------cccCCHHHHHH-------HHHHHHHHhhcc-
Confidence                346555543                                    23345555555       688998876653 


Q ss_pred             cccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCcc
Q psy14093        255 FSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSS  334 (857)
Q Consensus       255 ~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~  334 (857)
                         ...+|   +|+|||+++|.++++|+||+|        |+|++.++ .+.+.                  .       
T Consensus        84 ---~~~~D---~a~Llt~~~~~~~~~G~A~~g--------~iC~~~~~-~~v~~------------------~-------  123 (194)
T cd04269          84 ---RKPHD---NAQLLTGRDFDGNTVGLAYVG--------GMCSPKYS-GGVVQ------------------D-------  123 (194)
T ss_pred             ---ccCCC---ceEEEEeeecCCCceeeeecC--------CccCCCcc-eEEEE------------------e-------
Confidence               24677   999999999988999999999        99997552 22221                  0       


Q ss_pred             ccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHh
Q psy14093        335 SRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFT  414 (857)
Q Consensus       335 ~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~  414 (857)
                          .+..  ....+.|+||||||+|||.||++.+.|+.     ++|||++....    ++.+||+       ||+++|.
T Consensus       124 ----~~~~--~~~~a~~~AHElGH~lG~~HD~~~c~C~~-----~~cIM~~~~~~----~~~~fS~-------CS~~~~~  181 (194)
T cd04269         124 ----HSRN--LLLFAVTMAHELGHNLGMEHDDGGCTCGR-----STCIMAPSPSS----LTDAFSN-------CSYEDYQ  181 (194)
T ss_pred             ----CCcc--hHHHHHHHHHHHHhhcCCCcCCCCCCCCC-----CCeEecCCCCC----CCCCCCh-------hhHHHHH
Confidence                0100  12468999999999999999998777764     57999998653    6899999       9999998


Q ss_pred             c-------ccccc
Q psy14093        415 H-------QSFWS  420 (857)
Q Consensus       415 ~-------~Cl~~  420 (857)
                      +       +||.+
T Consensus       182 ~~l~~~~~~CL~~  194 (194)
T cd04269         182 KFLSRGGGQCLLN  194 (194)
T ss_pred             HHHhcCCCCccCC
Confidence            7       57763


No 8  
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=99.97  E-value=3.7e-31  Score=272.84  Aligned_cols=191  Identities=27%  Similarity=0.464  Sum_probs=136.3

Q ss_pred             ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093         95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY  174 (857)
Q Consensus        95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~  174 (857)
                      ||+||+||+|+.+|++|++ +++.+++|+++++|+|+.+|++.            +|.|.|..+                
T Consensus         1 kyiEl~vvvD~~~~~~~~~-n~~~~~~~~~~i~n~v~~~y~~l------------~i~v~l~~l----------------   51 (199)
T PF01421_consen    1 KYIELLVVVDNSMYQYHGS-NVTKVIQYVLTIVNIVDSIYQQL------------NIRVVLVGL----------------   51 (199)
T ss_dssp             EEEEEEEEE-HHHHHHTTT-GHHHHHHHHHHHHHHHHHHHGGG------------TEEEEEEEE----------------
T ss_pred             CeEEEEEEEeHHHHHHhcc-cHHHHHHHHHHHHHHHhhhcccC------------CeEEEEEEE----------------
Confidence            6899999999999999985 78889999999999999999983            355555554                


Q ss_pred             cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093        175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV  254 (857)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~  254 (857)
                          +||+++|+                                    |.+..++..+|.       .|.+|+...+.+ 
T Consensus        52 ----eiw~~~d~------------------------------------i~~~~~~~~~L~-------~F~~w~~~~~~~-   83 (199)
T PF01421_consen   52 ----EIWTEEDK------------------------------------INISNDADSTLE-------NFCNWQKSELPP-   83 (199)
T ss_dssp             ----EEESSSTS------------------------------------S---SSHHHHHH-------HHHHHHHHTHHH-
T ss_pred             ----EEcccCCc------------------------------------eeeecchHHHHH-------HHHHHHHhhccc-
Confidence                44555553                                    444566777776       699999987663 


Q ss_pred             cccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCcc
Q psy14093        255 FSREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSS  334 (857)
Q Consensus       255 ~s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~  334 (857)
                         ..+||   +|+|||++++..+++|+||+|        |||++.++  +++.         ++               
T Consensus        84 ---~~~~D---~a~Llt~~~~~~~~~G~a~~~--------~~C~~~~s--~~i~---------~~---------------  123 (199)
T PF01421_consen   84 ---RIHHD---HAHLLTGKDFPDSTVGLAYVG--------GMCSPSRS--CGIV---------ED---------------  123 (199)
T ss_dssp             ---HS--S---EEEEEESS-CSCCBSEEE-TT---------TTSTTTS--EEEE---------E----------------
T ss_pred             ---ccccc---eeEEeeecccccceeeeEeCC--------CCCCcCCC--CcEe---------ee---------------
Confidence               34678   999999999987899999999        99999763  2332         10               


Q ss_pred             ccccCCccccccchhhHHhhhhhhhCCCCCCCC-CCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHH
Q psy14093        335 SRNHYGQRVITREADLVTAHEFGHNWGSEHDPD-MPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLF  413 (857)
Q Consensus       335 ~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~-~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i  413 (857)
                          .+  ......|.++||||||+|||.||+. .+.|.+    ..+|||++....   .....||+       ||++++
T Consensus       124 ----~~--~~~~~~a~~~AHelGH~lGm~HD~~~~C~C~~----~~~cIM~~~~~~---~~~~~fS~-------CS~~~~  183 (199)
T PF01421_consen  124 ----HS--RSGLSFAVIIAHELGHNLGMPHDGDNGCKCPR----RDGCIMSPSIGP---SSSDKFSN-------CSRRQF  183 (199)
T ss_dssp             ----SS--SSHHHHHHHHHHHHHHHTT---TTTCTHCSSS----SSSSTTSSSSSS---SSTSSS-H-------HHHHHH
T ss_pred             ----cc--chhHHHHHHHHHHHHHhcCCCCCCCCcCCcCC----CCCccccccccC---CCCCCcCH-------HHHHHH
Confidence                00  0112578999999999999999998 677853    378999996542   34569999       999999


Q ss_pred             hc-------cccccCC
Q psy14093        414 TH-------QSFWSRG  422 (857)
Q Consensus       414 ~~-------~Cl~~~~  422 (857)
                      .+       .||.+.|
T Consensus       184 ~~~l~~~~~~CL~n~P  199 (199)
T PF01421_consen  184 EEFLSSGKSSCLLNKP  199 (199)
T ss_dssp             HHHHHHHTTGGGSS--
T ss_pred             HHHHccCCCCccCCCC
Confidence            98       6999876


No 9  
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=99.96  E-value=1.9e-28  Score=256.69  Aligned_cols=189  Identities=20%  Similarity=0.262  Sum_probs=134.5

Q ss_pred             eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093         96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN  175 (857)
Q Consensus        96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~  175 (857)
                      |+||+||+|+.+|++++  +.++++.|++.+||.||.||+++.         .++|.|.|+.|                 
T Consensus         2 yvEl~iV~D~~~~~~~~--~~~~~~~yv~~~~n~vn~~Y~~~~---------~p~I~i~lv~i-----------------   53 (220)
T cd04272           2 YPELFVVVDYDHQSEFF--SNEQLIRYLAVMVNAANLRYRDLK---------SPRIRLLLVGI-----------------   53 (220)
T ss_pred             cceEEEEecHHHHHhcC--CHHHHHHHHHHHHHHHHHHhhhcC---------CCcEEEEEEEE-----------------
Confidence            79999999999999954  578899999999999999999853         24677777776                 


Q ss_pred             ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093        176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF  255 (857)
Q Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~  255 (857)
                         +||+..+++.+.            .+.+.                 -..++.++|.       .|.+|+.+.+.   
T Consensus        54 ---~i~~~~~~~~~~------------~~~~~-----------------~~~~~~~tL~-------~F~~~~~~~~~---   91 (220)
T cd04272          54 ---TISKDPDFEPYI------------HPINY-----------------GYIDAAETLE-------NFNEYVKKKRD---   91 (220)
T ss_pred             ---EeccCcccceee------------ccCCc-----------------ccccHHHHHH-------HHHHHHhccCC---
Confidence               446655543220            00000                 0123445665       68999886543   


Q ss_pred             ccccCCCccceEEEeecccc--------CCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceee
Q psy14093        256 SREYSHKDFCLAHLFTDLKF--------EGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLY  327 (857)
Q Consensus       256 s~~~~~D~~cla~L~T~~df--------~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  327 (857)
                        ..++|   +|+|||++||        ..+++|+||+|        |||++..  .+.+.                   
T Consensus        92 --~~~~D---~~~LlT~~~~~~~~~g~~~~~~~G~A~~g--------~~C~~~~--~~~~~-------------------  137 (220)
T cd04272          92 --YFNPD---VVFLVTGLDMSTYSGGSLQTGTGGYAYVG--------GACTENR--VAMGE-------------------  137 (220)
T ss_pred             --CCccc---EEEEEeccceeeccCcccccCccceEeec--------CccCCCc--eeEee-------------------
Confidence              23667   9999999998        35789999999        9999542  22221                   


Q ss_pred             cccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCC-----------CCCCCCCCCCceEEEeeccCCCCCCCC
Q psy14093        328 LNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP-----------ECSPSASQGGSYLMYTYSVSGYDVNNK  396 (857)
Q Consensus       328 ~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~-----------~C~~~~~~~g~~IM~~~~~~g~~~n~~  396 (857)
                       +.+.            +...+.|+||||||+|||+||++..           .|++    .++|||++...   ..+++
T Consensus       138 -d~~~------------~~~~~~~~AHElGH~lG~~HD~~~~~~~~~~~~~~~~C~~----~~~~IM~~~~~---~~~~~  197 (220)
T cd04272         138 -DTPG------------SYYGVYTMTHELAHLLGAPHDGSPPPSWVKGHPGSLDCPW----DDGYIMSYVVN---GERQY  197 (220)
T ss_pred             -cCCC------------CcccHHHHHHHHHHHhCCCCCCCCCCccCCCCCccCcCCC----CCCeEEeeccC---CCCCC
Confidence             1110            0134899999999999999998653           2543    35699998643   35788


Q ss_pred             CCCCCCCCCCcccHHHHhc
Q psy14093        397 TFSSHVDGTNFCLAHLFTH  415 (857)
Q Consensus       397 ~FS~~~~~~~~CS~~~i~~  415 (857)
                      +||+       ||+++|..
T Consensus       198 ~fS~-------CS~~~~~~  209 (220)
T cd04272         198 RFSQ-------CSQRQIRN  209 (220)
T ss_pred             ccCc-------chHHHHHH
Confidence            9999       99999987


No 10 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=99.96  E-value=1.5e-28  Score=258.43  Aligned_cols=200  Identities=18%  Similarity=0.294  Sum_probs=136.7

Q ss_pred             eeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhcccccccc
Q psy14093         96 RCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN  175 (857)
Q Consensus        96 ~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~~  175 (857)
                      .+-+.||||+.||++|+  +.++++.||+++||+||.||+++           .+|+|.|++|                 
T Consensus         2 ~~~~~vvaD~~~~~~~~--~~~~~~~~v~~ivN~vn~~Y~~~-----------~nI~v~Lvgl-----------------   51 (228)
T cd04271           2 VALIGVAADCSYTKSFG--SVEEARRNILNNVNSASQLYESS-----------FNISLGLRNL-----------------   51 (228)
T ss_pred             eEEEEEEechHHHHHcC--CHHHHHHHHHHHHHHHHHHhhCc-----------ceEEEEEEEE-----------------
Confidence            46688999999999997  36889999999999999999963           2466776665                 


Q ss_pred             ccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhcccc
Q psy14093        176 MIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEVF  255 (857)
Q Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~~  255 (857)
                         ++|++++++++         +....|+..               +.+..++..+|.       .|.+|+.+.+    
T Consensus        52 ---ei~~~~d~~~~---------~~~~~~~~~---------------~~~~~~~~~tL~-------~F~~Wr~~~~----   93 (228)
T cd04271          52 ---TISDASCPSTA---------VDSAPWNLP---------------CNSRIDIDDRLS-------IFSQWRGQQP----   93 (228)
T ss_pred             ---EEecCCCCCcc---------ccccccccc---------------ccccCCHHHHHH-------HHHHHHhcCC----
Confidence               55777776543         122234321               112345666776       7999987532    


Q ss_pred             ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccc
Q psy14093        256 SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSS  335 (857)
Q Consensus       256 s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~  335 (857)
                          .+| ..+++|+|+.. .++++||||+|        |||++.++ .+ +.         ++       +.+.++.. 
T Consensus        94 ----~~d-~a~~~l~t~~~-~~~~lGlA~vg--------~mC~~~~s-~~-~~---------~~-------~~~~~~~~-  140 (228)
T cd04271          94 ----DDG-NAFWTLMTACP-SGSEVGVAWLG--------QLCRTGAS-DQ-GN---------ET-------VAGTNVVV-  140 (228)
T ss_pred             ----CCC-cEEeEeecccc-CCCceeEeecc--------CCCCCCcC-CC-cc---------cc-------ccccceee-
Confidence                233 12449999986 57999999999        99999763 22 21         11       11111211 


Q ss_pred             cccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCC-----------CC---CCCCceEEEeeccCCCCCCCCCCCCC
Q psy14093        336 RNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSP-----------SA---SQGGSYLMYTYSVSGYDVNNKTFSSH  401 (857)
Q Consensus       336 ~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~-----------~~---~~~g~~IM~~~~~~g~~~n~~~FS~~  401 (857)
                              .+...+.||||||||||||.||++...|.+           +.   .+.++|||++.+.   . ++++||+ 
T Consensus       141 --------~~~~~~~t~AHElGHnLGm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~---~-~~~~FS~-  207 (228)
T cd04271         141 --------RTSNEWQVFAHEIGHTFGAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSS---S-GITEFSP-  207 (228)
T ss_pred             --------cccccceehhhhhhhhcCCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcC---C-CCCccCh-
Confidence                    111235799999999999999987655632           11   1247899999743   2 7899999 


Q ss_pred             CCCCCcccHHHHhc
Q psy14093        402 VDGTNFCLAHLFTH  415 (857)
Q Consensus       402 ~~~~~~CS~~~i~~  415 (857)
                            ||+++|++
T Consensus       208 ------CS~~~~~~  215 (228)
T cd04271         208 ------CTIGNICS  215 (228)
T ss_pred             ------hhHHHHHH
Confidence                  99999986


No 11 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.95  E-value=8.6e-28  Score=246.23  Aligned_cols=185  Identities=30%  Similarity=0.487  Sum_probs=134.4

Q ss_pred             ceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccccc
Q psy14093         95 TRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHY  174 (857)
Q Consensus        95 ~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~~~  174 (857)
                      +||||+||+|+.+|++|++ +.+.++.||+.+||.||.+|++++++        .+|+|.|..|+               
T Consensus         1 ~~vE~~vv~D~~~~~~~~~-~~~~~~~~i~~~in~vn~~y~~~~~~--------~~i~~~l~~i~---------------   56 (192)
T cd04267           1 REIELVVVADHRMVSYFNS-DENILQAYITELINIANSIYRSTNLR--------LGIRISLEGLQ---------------   56 (192)
T ss_pred             CEEEEEEEEcHHHHHHhcC-CHHHHHHHHHHHHHHHHHHHhcCccC--------CCeEEEEEEEE---------------
Confidence            5899999999999999974 67889999999999999999996521        45666666653               


Q ss_pred             cccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhccc
Q psy14093        175 NMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLEV  254 (857)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll~~  254 (857)
                           +|+..+.+..                                   +..++..+|.       .|..|+... .  
T Consensus        57 -----i~~~~~~~~~-----------------------------------~~~~~~~~l~-------~f~~~~~~~-~--   86 (192)
T cd04267          57 -----ILKGEQFAPP-----------------------------------IDSDASNTLN-------SFSFWRAEG-P--   86 (192)
T ss_pred             -----EecCCCccCc-----------------------------------cCccHHHHHH-------HHHHHHhcC-C--
Confidence                 3443332110                                   0123334444       588887754 1  


Q ss_pred             cccccCCCccceEEEeecccc-CCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCc
Q psy14093        255 FSREYSHKDFCLAHLFTDLKF-EGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLS  333 (857)
Q Consensus       255 ~s~~~~~D~~cla~L~T~~df-~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~  333 (857)
                          .++|   +++|||+++| ..+++|+||++        ++|++.++ .+.+.                    ..   
T Consensus        87 ----~~~d---~~~l~t~~~~~~~~~~G~a~~~--------~~C~~~~~-~~v~~--------------------~~---  127 (192)
T cd04267          87 ----IRHD---NAVLLTAQDFIEGDILGLAYVG--------SMCNPYSS-VGVVE--------------------DT---  127 (192)
T ss_pred             ----CCCc---eEEEEeeeccCCCCeeeeeccc--------CCCCCCCC-eEEEe--------------------cC---
Confidence                3678   9999999998 78999999999        99998652 22222                    11   


Q ss_pred             cccccCCccccccchhhHHhhhhhhhCCCCCCCCCC---CCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccH
Q psy14093        334 SSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP---ECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLA  410 (857)
Q Consensus       334 ~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~---~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~  410 (857)
                         + +     +...+.|+||||||+|||.||+..+   .|.+    +++|||++....   .++.+||+       ||+
T Consensus       128 ---~-~-----~~~~~~~~aHElGH~lG~~HD~~~~~~~~c~~----~~~~IM~~~~~~---~~~~~FS~-------CS~  184 (192)
T cd04267         128 ---G-F-----TLLTALTMAHELGHNLGAEHDGGDELAFECDG----GGNYIMAPVDSG---LNSYRFSQ-------CSI  184 (192)
T ss_pred             ---C-c-----ceeehhhhhhhHHhhcCCcCCCCCCccccCCC----CCCeEEcccccC---CCCCccCh-------hhH
Confidence               1 0     1246889999999999999998643   3332    467999997542   57899999       999


Q ss_pred             HHHhc
Q psy14093        411 HLFTH  415 (857)
Q Consensus       411 ~~i~~  415 (857)
                      .+|.+
T Consensus       185 ~~i~~  189 (192)
T cd04267         185 GSIRE  189 (192)
T ss_pred             HHHHH
Confidence            99874


No 12 
>KOG3538|consensus
Probab=99.93  E-value=2.1e-25  Score=274.03  Aligned_cols=343  Identities=24%  Similarity=0.359  Sum_probs=224.1

Q ss_pred             EEEEEcCCeEEEEEeCCCCCCCCCC-ccEEEEEecccccccccCCCCCCCCCCCCCCcCCCCccccCcCCCCCCCccccc
Q psy14093          2 TASIHTPEETYHIEPSWRHLPHLGN-QSMVAYRASDVKLSWDHAHDKPASPCGYVKEEFNTTDFDLDMEEDDPDMPHTRS   80 (857)
Q Consensus         2 ~G~I~~~~~~Y~IEPl~~~~~~~~~-~~HivYr~sd~~~~~~~~~~~~~~~Cg~~~~~~~~~~~~~~~~~~~~~~~~~r~   80 (857)
                      .|+|+++++.|+|||+.+....... ..|++||..++..-     ...+..+..  .....      .........+.|.
T Consensus       107 ~g~~~~~~~~~~iepl~~~~~~~~~~~~h~~~~r~~~~~~-----~~~~~~~~~--~~~~~------~~~~~~~~~~~~~  173 (845)
T KOG3538|consen  107 HGSIQLQGHDYFIEPLPRGPAAPVSGRAHLVYRRLDVELA-----SEVLDGTRD--QKCDP------SPGDGAGRGRCRG  173 (845)
T ss_pred             ccceeccCcceeEeeccCCCcccccCCCcccccchhhccc-----ccCcccccc--ccccc------CChhhcccccccc
Confidence            4899999999999999876544333 58999999333210     000111110  00000      0000011111222


Q ss_pred             ccccCCCCCCCCCcceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhccCCcccccccCCCccceeEEEEEEEE
Q psy14093         81 KRQVDPAYEYKPTKTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVV  160 (857)
Q Consensus        81 kR~~~~~~~~~~~~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v  160 (857)
                      ||.+.       ..+ +|.+||+|..+|.+|+.    .+..|++++||+|..||++.+..        ..|++.|+.+++
T Consensus       174 ~r~~~-------~~~-vetlvvaD~~~~~~~~~----~~~~~~lt~~n~v~~l~~~~sl~--------~~i~i~vvr~~~  233 (845)
T KOG3538|consen  174 RRSVS-------RHR-VETLVVADKCMVAFHGE----SVERYVLTLMNIVASLYKDPSLG--------NHINIVVVRLII  233 (845)
T ss_pred             ccccc-------cce-eeEEEEecHHHHHhcch----hhHHHHHHHHHHHHHHhcCCCCC--------CceEEEEEEEEE
Confidence            34443       233 99999999999999982    37899999999999999997643        567777776644


Q ss_pred             eccchhhccccccccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhcccccccc
Q psy14093        161 HSDATRVRQGEAHYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYN  240 (857)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~  240 (857)
                      ....                                             ++          ...|..++..+|.      
T Consensus       234 l~~~---------------------------------------------~~----------~~~i~~~~~~sl~------  252 (845)
T KOG3538|consen  234 LLKE---------------------------------------------PS----------ALNIEGNAGKSLD------  252 (845)
T ss_pred             EecC---------------------------------------------CC----------CceeccCcccchh------
Confidence            2110                                             01          1334455554554      


Q ss_pred             ccccccchhhhccccccccCCCccceEEEeeccccC------CCceeeeeccCCCCCCcccccCCCccccceEEeccCCC
Q psy14093        241 MIRDKWDVRNLLEVFSREYSHKDFCLAHLFTDLKFE------GGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGL  314 (857)
Q Consensus       241 ~~f~~w~~~~ll~~~s~~~~~D~~cla~L~T~~df~------~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~  314 (857)
                       .|++|+....... . ..++|   +|+|||+.||.      .+++|+|.++        |||.+.++.. .        
T Consensus       253 -~fC~wq~~~n~~~-d-~~~~D---~ailltr~d~~~~~~~~~~~~G~A~v~--------gmC~~~~sC~-i--------  309 (845)
T KOG3538|consen  253 -SFCKWQTSLNSEE-D-TEHHD---HAILLTRQDFCEQQNGPCDTLGVAPVG--------GMCSPSRSCS-I--------  309 (845)
T ss_pred             -hhcccccccCCCc-c-ccccc---eeEEeeeeecccccCCCCccccceeec--------ceeCCcccee-e--------
Confidence             7999998654431 1 13567   99999998875      4789999999        9999988642 1        


Q ss_pred             CccccccccceeecccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCC
Q psy14093        315 SSSQYFKNGYTLYLNSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVN  394 (857)
Q Consensus       315 ~~~~~~~~~~~~~~n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n  394 (857)
                                  ..++||              .+++|+|||+||+|||.||+....|.+.     ..||++....+.  .
T Consensus       310 ------------~e~~gl--------------~~aftiahE~GH~~gm~hd~~~~~C~~~-----~~im~~~~~~~~--~  356 (845)
T KOG3538|consen  310 ------------NEDTGL--------------PSAFTIAHELGHNFGMEHDGRGNSCGPT-----FHVMSSDTCGGN--H  356 (845)
T ss_pred             ------------ecCCCC--------------ccceeeeeccccccCcccCCcCCCCCCc-----ccccCcccccCC--C
Confidence                        123333              2589999999999999999987779864     289999876653  4


Q ss_pred             CCCCCCCCCCCCcccHHHHhc-------cccccCCCcccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcc
Q psy14093        395 NKTFSSHVDGTNFCLAHLFTH-------QSFWSRGSSILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSEL  467 (857)
Q Consensus       395 ~~~FS~~~~~~~~CS~~~i~~-------~Cl~~~~~~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l  467 (857)
                      ...||+       ||+..+..       .||.+.+.   ...+.-+.    .-+|..+              +++.+|++
T Consensus       357 ~~~WS~-------CS~~~~t~fl~~~~~~Cl~d~p~---~~~~~~p~----~lPg~~y--------------~~~~qC~~  408 (845)
T KOG3538|consen  357 ELTWSE-------CSREYLTRFLDDGEGNCLSDQPF---SALPKLPS----QLPGQTY--------------DADEQCEF  408 (845)
T ss_pred             CccCCc-------chHHHHHHHhccCccceeecCCc---cccCCCCc----cCCcccc--------------CCCCcccc
Confidence            555999       99999998       59998774   22222222    2234322              23346777


Q ss_pred             cCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCCC--CCCCCC-CCCCCCcCCC-CceecCCccch
Q psy14093        468 FPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSSS--VCPPSA-PMSDNTGCLE-RGKCRGGKCIP  539 (857)
Q Consensus       468 ~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s~--~Cp~~~-~~~dgt~C~~-~~~C~~G~C~~  539 (857)
                      ..|..        =..|.......+|+..         ||.+...  .|.+.. |+.|||.|+. .++|.+|.|++
T Consensus       409 ~fg~~--------~~~c~~~~~~~~c~~l---------WC~~~~g~~~C~t~~~p~~~gt~Cg~~~~~c~~g~c~~  467 (845)
T KOG3538|consen  409 QFGPK--------SKECPDMRAPDPCREL---------WCHTDGGSSSCSTKHLPAADGTECGEFGKWCRKGRCTP  467 (845)
T ss_pred             ccCCc--------ceeccccCccchhhhh---------cccCCCCCCccccCcCCcccCCCcCCcCCcccCCcccC
Confidence            76542        2235544433567653         7765443  698854 9999999997 99999999996


No 13 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=99.91  E-value=7.9e-25  Score=224.82  Aligned_cols=183  Identities=40%  Similarity=0.671  Sum_probs=98.1

Q ss_pred             cceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHH-HhhccCCcccccccCCCccceeEEEEEEEEeccchhhccccc
Q psy14093         94 KTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRV-HKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEA  172 (857)
Q Consensus        94 ~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V-~~iY~~~~~~~~~~~~~~~~i~~~I~~v~v~~~~~~~~~~~~  172 (857)
                      +++|+|+||+|++|++.|++ +. +++.||+.+|+.| +.+|++            .+|+|+|..|.|++.+        
T Consensus         2 ~~t~~l~v~~D~~f~~~~~~-~~-~~~~~i~~~~~~v~n~~~~~------------~~I~~~i~~i~i~~~~--------   59 (196)
T PF13688_consen    2 KTTCELLVVADYEFYQAFGS-DQ-AVISYIASLFNQVSNLYYRT------------SNIQFQISGITIWTSS--------   59 (196)
T ss_dssp             SEEEEEEEEE-HHHHHHTST--H-HHHHHHHHHHHHHHHHHHT----------------EEEEEEEEE--S---------
T ss_pred             ceEEEEEEEEcHHHHHHhCC-cH-HHHHHHHHHHHHHHHHHhhc------------cCceEEEEEEEEecCC--------
Confidence            56899999999999999986 33 7999999999999 777776            2688999988664322        


Q ss_pred             cccccccccccchhhhhhhhhccccccccccccCCcccccccccceeEeeEEeecCchhhccccccccccccccchhhhc
Q psy14093        173 HYNMIRDKWDVRNLLEISLIDRVHKIYNDTIWHDRSEQDGYRGMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLL  252 (857)
Q Consensus       173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~i~i~~~~~~~l~~~~~~~~~f~~w~~~~ll  252 (857)
                                  ++....           . |                                      -..+....+|
T Consensus        60 ------------~~~~~~-----------~-~--------------------------------------~~~~~~~~~L   77 (196)
T PF13688_consen   60 ------------DPYTNN-----------V-S--------------------------------------NSNIDAEDTL   77 (196)
T ss_dssp             ------------B---TT-------------T--------------------------------------SSS--HHHHH
T ss_pred             ------------CCCccc-----------c-c--------------------------------------ccccCHHHHH
Confidence                        211000           0 0                                      0112222233


Q ss_pred             ccc----ccccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeec
Q psy14093        253 EVF----SREYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYL  328 (857)
Q Consensus       253 ~~~----s~~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  328 (857)
                      +.|    +.....+.+|+++|||++||.++ +|+||+|++..+..+++|+....    .                   + 
T Consensus        78 ~~f~~~~~~~~~~~~~dl~~l~t~~d~~~g-lGlA~~g~~~~~~~~~~~~~~~~----~-------------------~-  132 (196)
T PF13688_consen   78 QDFNNDFSSWRDSSYYDLAHLFTGRDFSGG-LGLAYVGSSCSNTSGGICSSSVS----G-------------------S-  132 (196)
T ss_dssp             HHH--HHHTHHHHTTSSEEEEEE----GGG-EEEESSSSSSTT-SSSSS--E-E----E-------------------E-
T ss_pred             HHHhhhhhhccCCCCCCEEEEEecccCCCC-ceeEeccccCcCCccceeccCcc----c-------------------c-
Confidence            333    22122234469999999999877 99999993333333333332110    0                   0 


Q ss_pred             ccCCccccccCCccccccchhhHHhhhhhhhCCCCCCCCCC----CCCCC---CCCCCceEEEeeccCCCCCCCCCCC
Q psy14093        329 NSGLSSSRNHYGQRVITREADLVTAHEFGHNWGSEHDPDMP----ECSPS---ASQGGSYLMYTYSVSGYDVNNKTFS  399 (857)
Q Consensus       329 n~Gl~~~~n~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~----~C~~~---~~~~g~~IM~~~~~~g~~~n~~~FS  399 (857)
                                ++..+++...++|+||||||+||+.||++..    .|++.   ..++++|||++...    +|+.+||
T Consensus       133 ----------~~~~~~~~~~~~~~AHEiGH~lGa~HD~~~~~~~~~c~~~~~~~~~~~~~IM~y~~~----~n~~~FS  196 (196)
T PF13688_consen  133 ----------YGQVPPTYNGAITFAHEIGHNLGAPHDGDYSSTSSSCPPSSSTCPAGGGYIMSYCAS----PNNNKFS  196 (196)
T ss_dssp             ----------TTEE--HHHHHHHHHHHHHHHTT-----SSS---TTTS-----GGGT---TTSSS------GGGGS--
T ss_pred             ----------cceECCCCceehhhHHhHHHhcCCCCCCCCCCCCCCCCCCccccCCCCCeEEcccCC----CCcccCC
Confidence                      2222334567999999999999999999753    67663   23468999999865    6899998


No 14 
>smart00050 DISIN Homologues of snake disintegrins. Snake disintegrins inhibit the binding of ligands to integrin receptors. They contain a 'RGD' sequence, identical to the recognition site of many adhesion proteins. Molecules containing both disintegrin and metalloprotease domains are known as ADAMs.
Probab=99.80  E-value=4.8e-20  Score=159.45  Aligned_cols=75  Identities=29%  Similarity=0.656  Sum_probs=69.6

Q ss_pred             CCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCC
Q psy14093        432 ASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSS  511 (857)
Q Consensus       432 e~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s  511 (857)
                      |+|  ||||||+..       .+.|+||++. +|+|++||+|+  +|+||.+|+|+++|++||+++ ++||++|||+|.|
T Consensus         1 E~g--EeCDCG~~~-------~~~d~CC~~~-tC~l~~ga~Cs--~G~CC~~C~~~~~g~~CR~~~-~~Cd~~e~C~G~s   67 (75)
T smart00050        1 EEG--EECDCGSPK-------ECTNPCCDPA-TCKLKPGAQCA--SGPCCDNCKFKPAGTLCRPSV-DECDLPEYCNGTS   67 (75)
T ss_pred             CcC--cccCCCChh-------hcCCCCcCCC-cceecCCCeec--CCCCCCcceEccCCCEecCcc-CCCCCccccCCCC
Confidence            678  999999986       4799999997 89999999999  499999999999999999997 6999999999999


Q ss_pred             CCCCCCCC
Q psy14093        512 SVCPPSAP  519 (857)
Q Consensus       512 ~~Cp~~~~  519 (857)
                      ++||+++|
T Consensus        68 ~~CP~~~~   75 (75)
T smart00050       68 ADCPPDVY   75 (75)
T ss_pred             CcCcCCCC
Confidence            99999865


No 15 
>smart00050 DISIN Homologues of snake disintegrins. Snake disintegrins inhibit the binding of ligands to integrin receptors. They contain a 'RGD' sequence, identical to the recognition site of many adhesion proteins. Molecules containing both disintegrin and metalloprotease domains are known as ADAMs.
Probab=99.76  E-value=2.2e-19  Score=155.37  Aligned_cols=71  Identities=34%  Similarity=0.754  Sum_probs=65.5

Q ss_pred             cccccccCCCCCCCCCCCCCCC-CCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCC
Q psy14093        717 EGDEECDAGLLGTEDNDSCCDK-VCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHT  795 (857)
Q Consensus       717 E~gEeCDCG~~~~c~~d~CC~~-tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~  795 (857)
                      |+|||||||+.++| .|+||++ +|+|++|     |+|++|+        || .+|+|+++     |++|   |+  +.+
T Consensus         1 E~gEeCDCG~~~~~-~d~CC~~~tC~l~~g-----a~Cs~G~--------CC-~~C~~~~~-----g~~C---R~--~~~   55 (75)
T smart00050        1 EEGEECDCGSPKEC-TNPCCDPATCKLKPG-----AQCASGP--------CC-DNCKFKPA-----GTLC---RP--SVD   55 (75)
T ss_pred             CcCcccCCCChhhc-CCCCcCCCcceecCC-----CeecCCC--------CC-CcceEccC-----CCEe---cC--ccC
Confidence            78999999999886 8999997 7999999     8998888        99 69999999     9999   87  666


Q ss_pred             Cc---cccCCCccccccccc
Q psy14093        796 LC---RRCGKSSYHIQKKKC  812 (857)
Q Consensus       796 ~C---e~C~G~s~~Cp~d~~  812 (857)
                      +|   |||+|+|+.||+|+|
T Consensus        56 ~Cd~~e~C~G~s~~CP~~~~   75 (75)
T smart00050       56 ECDLPEYCNGTSADCPPDVY   75 (75)
T ss_pred             CCCCccccCCCCCcCcCCCC
Confidence            88   999999999999986


No 16 
>PF00200 Disintegrin:  Disintegrin;  InterPro: IPR001762  Disintegrins are a family of small proteins from viper venoms that function as potent inhibitors of both platelet aggregation and integrin-dependent cell adhesion [, ]. Integrin receptors are involved in cell-cell and cell-extracellular matrix interactions, serving as the final common pathway leading to aggregation via formation of platelet-platelet bridges, which are essential in thrombosis and haemostasis. Disintegrins contain an RGD (Arg-Gly-Asp) or KGD (Lys-Gly-Asp) sequence motif that binds specifically to integrin IIb-IIIa receptors on the platelet surface, thereby blocking the binding of fibrinogen to the receptor-glycoprotein complex of activated platelets. Disintegrins act as receptor antagonists, inhibiting aggregation induced by ADP, thrombin, platelet-activating factor and collagen []. The role of disintegrin in preventing blood coagulation renders it of medical interest, particularly with regard to its use as an anti-coagulant []. Disintegrins from different snake species have been characterised: albolabrin, applagin, barbourin, batroxostatin, bitistatin, obtustatin [], schistatin [], echistatin [], elegantin, eristicophin, flavoridin [], halysin, kistrin, tergeminin, salmosin [] and triflavin. Disintegrin-like proteins are found in various species ranging from slime mold to humans. Some other proteins known to contain a disintegrin domain are:  Some snake venom zinc metalloproteinases [] consist of an N-terminal catalytic domain fused to a disintegrin domain. Such is the case for trimerelysin I (HR1B), atrolysin-e (Ht-e) and trigramin. It has been suggested that these proteinases are able to cleave themselves from the disintegrin domains and that the latter may arise from such a post-translational processing. The beta-subunit of guinea pig sperm surface protein PH30 []. PH30 is a protein involved in sperm-egg fusion. The beta subunit contains a disintegrin at the N-terminal extremity. Mammalian epididymial apical protein 1 (EAP I) []. EAP I is associated with the sperm membrane and may play a role in sperm maturation. Structurally, EAP I consists of an N-terminal domain, followed by a zinc metalloproteinase domain, a disintegrin domain, and a large C-terminal domain that contains a transmembrane region. ; PDB: 1L3X_A 3C05_D 3K7N_A 3K7L_A 2ECH_A 1RO3_A 2PJF_A 2PJG_A 1Q7J_A 1Q7I_A ....
Probab=99.76  E-value=2.4e-19  Score=156.37  Aligned_cols=76  Identities=34%  Similarity=0.670  Sum_probs=67.0

Q ss_pred             CCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCccCCCCCCcccCCcccCCCceeecCCCCCCCCCcccCCCC
Q psy14093        432 ASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLYSDKNSPCCQNCKFMAVGMKCRDAQYATCEQESRCTGSS  511 (857)
Q Consensus       432 e~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~Cs~~~g~CC~~C~~~~~g~~Cr~~~~~~C~~~~~C~G~s  511 (857)
                      |+|  ||||||+...       |.|+||++. +|+|+++|+|+  +|+||.+|+|+++|++||+++ ++||++|||+|.|
T Consensus         1 E~g--EeCDCG~~~~-------C~~~CC~~~-~C~l~~ga~C~--~g~CC~~C~~~~~g~~Cr~~~-~~Cd~~e~C~G~s   67 (76)
T PF00200_consen    1 EEG--EECDCGSPEE-------CNDPCCDAT-TCKLKPGAQCS--NGPCCNNCQFKPSGTVCRPAT-NECDLPEYCNGSS   67 (76)
T ss_dssp             -TT--TS-STSSTTT-------HTCTTEETT-TTEESTTSSTS--SSTTEETTEESSTTEEEEECS-STTSEEEEECSSS
T ss_pred             CcC--ccCCCCChhh-------ccCCCcCcc-ccccccccccc--cCcccCcccccccccccCccc-CcCCCCCcCCCcC
Confidence            678  9999999874       489999986 79999999994  599999999999999999997 7999999999999


Q ss_pred             CCCCCCCCC
Q psy14093        512 SVCPPSAPM  520 (857)
Q Consensus       512 ~~Cp~~~~~  520 (857)
                      +.||+++||
T Consensus        68 ~~CP~d~~k   76 (76)
T PF00200_consen   68 AECPPDVYK   76 (76)
T ss_dssp             SSS-SSTT-
T ss_pred             CcCcCCCCC
Confidence            999999885


No 17 
>PF00200 Disintegrin:  Disintegrin;  InterPro: IPR001762  Disintegrins are a family of small proteins from viper venoms that function as potent inhibitors of both platelet aggregation and integrin-dependent cell adhesion [, ]. Integrin receptors are involved in cell-cell and cell-extracellular matrix interactions, serving as the final common pathway leading to aggregation via formation of platelet-platelet bridges, which are essential in thrombosis and haemostasis. Disintegrins contain an RGD (Arg-Gly-Asp) or KGD (Lys-Gly-Asp) sequence motif that binds specifically to integrin IIb-IIIa receptors on the platelet surface, thereby blocking the binding of fibrinogen to the receptor-glycoprotein complex of activated platelets. Disintegrins act as receptor antagonists, inhibiting aggregation induced by ADP, thrombin, platelet-activating factor and collagen []. The role of disintegrin in preventing blood coagulation renders it of medical interest, particularly with regard to its use as an anti-coagulant []. Disintegrins from different snake species have been characterised: albolabrin, applagin, barbourin, batroxostatin, bitistatin, obtustatin [], schistatin [], echistatin [], elegantin, eristicophin, flavoridin [], halysin, kistrin, tergeminin, salmosin [] and triflavin. Disintegrin-like proteins are found in various species ranging from slime mold to humans. Some other proteins known to contain a disintegrin domain are:  Some snake venom zinc metalloproteinases [] consist of an N-terminal catalytic domain fused to a disintegrin domain. Such is the case for trimerelysin I (HR1B), atrolysin-e (Ht-e) and trigramin. It has been suggested that these proteinases are able to cleave themselves from the disintegrin domains and that the latter may arise from such a post-translational processing. The beta-subunit of guinea pig sperm surface protein PH30 []. PH30 is a protein involved in sperm-egg fusion. The beta subunit contains a disintegrin at the N-terminal extremity. Mammalian epididymial apical protein 1 (EAP I) []. EAP I is associated with the sperm membrane and may play a role in sperm maturation. Structurally, EAP I consists of an N-terminal domain, followed by a zinc metalloproteinase domain, a disintegrin domain, and a large C-terminal domain that contains a transmembrane region. ; PDB: 1L3X_A 3C05_D 3K7N_A 3K7L_A 2ECH_A 1RO3_A 2PJF_A 2PJG_A 1Q7J_A 1Q7I_A ....
Probab=99.75  E-value=3.5e-19  Score=155.35  Aligned_cols=72  Identities=36%  Similarity=0.781  Sum_probs=63.8

Q ss_pred             cccccccCCCCCCCCCCCCCCC-CCccccCCCCCcccCCCCCCCCCCCCccccccCccccccccccCccccccccCccCC
Q psy14093        717 EGDEECDAGLLGTEDNDSCCDK-VCKLRRNEGAGDEECDAGLLGTEDNDSCCDKVCKLRRNEGATKGTSSFGKRRNKTHT  795 (857)
Q Consensus       717 E~gEeCDCG~~~~c~~d~CC~~-tC~Lk~ga~~~~a~C~~G~~~~~~n~~CC~~~C~~~~~~~~~~g~~c~~~r~~~~~~  795 (857)
                      |+|||||||++.+| .|+||++ +|+|++|     |+|+.|+        || .+|+|+++     |++|   |+  +.+
T Consensus         1 E~gEeCDCG~~~~C-~~~CC~~~~C~l~~g-----a~C~~g~--------CC-~~C~~~~~-----g~~C---r~--~~~   55 (76)
T PF00200_consen    1 EEGEECDCGSPEEC-NDPCCDATTCKLKPG-----AQCSNGP--------CC-NNCQFKPS-----GTVC---RP--ATN   55 (76)
T ss_dssp             -TTTS-STSSTTTH-TCTTEETTTTEESTT-----SSTSSST--------TE-ETTEESST-----TEEE---EE--CSS
T ss_pred             CcCccCCCCChhhc-cCCCcCccccccccc-----cccccCc--------cc-Cccccccc-----cccc---Cc--ccC
Confidence            78999999999999 8999996 9999999     8998888        99 69999999     9999   88  667


Q ss_pred             Cc---cccCCCcccccccccc
Q psy14093        796 LC---RRCGKSSYHIQKKKCA  813 (857)
Q Consensus       796 ~C---e~C~G~s~~Cp~d~~~  813 (857)
                      +|   |||+|.|+.||+|+|+
T Consensus        56 ~Cd~~e~C~G~s~~CP~d~~k   76 (76)
T PF00200_consen   56 ECDLPEYCNGSSAECPPDVYK   76 (76)
T ss_dssp             TTSEEEEECSSSSSS-SSTT-
T ss_pred             cCCCCCcCCCcCCcCcCCCCC
Confidence            88   9999999999999985


No 18 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=99.73  E-value=2.9e-19  Score=180.72  Aligned_cols=103  Identities=40%  Similarity=0.663  Sum_probs=51.1

Q ss_pred             ccCCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccc
Q psy14093        258 EYSHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRN  337 (857)
Q Consensus       258 ~~~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n  337 (857)
                      ...+++||++|||  .++..+.+||||+|        ++|++.+...                          |++.  +
T Consensus        64 ~~~~~~~Di~~l~--~~~~~g~~Gla~~g--------~~c~~~~~~~--------------------------g~~~--~  105 (173)
T PF13574_consen   64 WIGSSNYDIGHLF--GTFSGGGGGLAYVG--------GVCSKGYKAT--------------------------GLSS--P  105 (173)
T ss_dssp             ----TCSS-CCEE--EEESSGCCSSEEE---------CCCCTCCCE---------------------------CCCS---
T ss_pred             cCCCCCCCEEEEe--eccCCCCccEEEEe--------EeCcCCCEEE--------------------------EEEc--C
Confidence            4456678899999  44467899999999        9999865321                          1111  1


Q ss_pred             cCCccccccchhhHHhhhhhhhCCCCCCCCCCCCCC---CCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHh
Q psy14093        338 HYGQRVITREADLVTAHEFGHNWGSEHDPDMPECSP---SASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFT  414 (857)
Q Consensus       338 ~~G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~---~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~  414 (857)
                       +|..    ..+.|+||||||+||++||.+. .|.+   .....+.|||++.   + .+++.+||+       ||+++|+
T Consensus       106 -~~~~----~~~~~~aHElGH~lGa~Hd~~~-~~~~~~~~~~~sg~~IMsy~---~-~~~~~~fS~-------cS~~~i~  168 (173)
T PF13574_consen  106 -YGDF----FGIDTFAHELGHQLGAPHDFDN-GCNPGSPTCPGSGGYIMSYA---G-ICNNDKFSP-------CSIRQIR  168 (173)
T ss_dssp             --TTS----HHHHHHHHHHHHHHT---SSSS-----SS--T-STTT-TT-SS-----GGGG-------------------
T ss_pred             -CCce----eeeeeehhhhHhhcCCCCCCCC-CCCCCCCcCCCCCCEEEeeC---C-CCCCccccc-------ccccccc
Confidence             2221    2578999999999999999975 3222   2234589999976   2 478999999       9999997


Q ss_pred             c
Q psy14093        415 H  415 (857)
Q Consensus       415 ~  415 (857)
                      .
T Consensus       169 ~  169 (173)
T PF13574_consen  169 A  169 (173)
T ss_dssp             -
T ss_pred             c
Confidence            5


No 19 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=99.67  E-value=6.7e-16  Score=160.49  Aligned_cols=42  Identities=24%  Similarity=0.423  Sum_probs=37.5

Q ss_pred             cceeeEEEEechHHHHHhCCCchHHHHHHHHHHHHHHHhhcc
Q psy14093         94 KTRCPLLLVADYRFYQEMGGSNTKTTINYLISLIDRVHKIYN  135 (857)
Q Consensus        94 ~~~~el~vvaD~~~~~~~g~~~~~~~~~yi~~l~n~V~~iY~  135 (857)
                      .+.++|+|++|+.+++.|++.+++.+..+|..+++.+|.+|.
T Consensus         2 ~~~~~l~v~~d~~~~~~~~~~s~~~v~~~i~~~v~~~N~~Y~   43 (206)
T PF13583_consen    2 LRVYRLAVVYDYSFYSWFFGGSIEDVKAFIASMVNFANEAYN   43 (206)
T ss_pred             ceEEEEEEEECHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHh
Confidence            467999999999999999434688899999999999999994


No 20 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=99.65  E-value=2e-16  Score=168.47  Aligned_cols=145  Identities=35%  Similarity=0.566  Sum_probs=103.6

Q ss_pred             CcceeeeeEEEcCCCCccCCCccccCcccccccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCCCCC
Q psy14093        558 GMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPY  637 (857)
Q Consensus       558 g~C~pi~~~~il~~GT~C~~G~c~~n~c~r~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~~~~  637 (857)
                      ++-+.+.++.|..++.+-....-.++..-..|+...+++.+.....||.+|+|||||+++|   .++++||||+|+++..
T Consensus        52 ~i~i~~~~i~I~~~~~~~~~~~~~~~~~f~~w~~~~~~~~~~~~~~~d~~c~A~LlT~~df---~~~tlGlA~vg~~~~~  128 (244)
T cd04270          52 GIGFQIKRIRIHTTPDEVDPGNKFYNKSFPNWGVEKFLVKLLLEQFSDDVCLAHLFTYRDF---DMGTLGLAYVGSPRDN  128 (244)
T ss_pred             ceEEEEEEEEEEcCcccccCCCcccccCccchhHHHHhhhhhhhcCCccceEEEEEecccc---CCCceeeeeecccccC
Confidence            4556667777777665432222222234567888777754332223555799999999999   5789999999998655


Q ss_pred             cccceeccccc--------------------------cccchhhheeeeeeecccccccCC--CC--------CCcccC-
Q psy14093        638 SIGGSIHENLK--------------------------YFDTLHSSHISHTIVKRGVQESNH--PF--------NKIKEV-  680 (857)
Q Consensus       638 ~s~Gv~~~~~~--------------------------~~~~~~aheigh~l~~~g~~~d~~--~~--------gcim~~-  680 (857)
                      .++|||.+...                          .++.+|||||||+|   ||.||..  .+        +|||+. 
T Consensus       129 ~~gGIC~~~~s~~~~~~~~~n~gl~t~~~~~~~~~~~~~a~t~AHElGHnl---Gm~HD~~~~~C~~~~~~~~~cIM~~~  205 (244)
T cd04270         129 SAGGICEKAYYYSNGKKKYLNTGLTTTVNYGKRVPTKESDLVTAHELGHNF---GSPHDPDIAECAPGESQGGNYIMYAR  205 (244)
T ss_pred             CCCcccCCcccccCCcceeeecceEeeeccCCccchhHHHHHHHHHHHHhc---CCCCCCCCCCCCCCCCCCCceEecCc
Confidence            67888754221                          13468999999999   9999976  22        489962 


Q ss_pred             ----------CCCCCCCHHHHHHHhhcCCCCcCCCCCcc
Q psy14093        681 ----------TFKTLGNLRSIRKVLEAKSGKCFSEPEES  709 (857)
Q Consensus       681 ----------~fss~CS~~~~~~~l~~~~~~Cl~~~~~~  709 (857)
                                .| |.||+++|.++|..+.+.||.+...+
T Consensus       206 ~~~~~~~~~~~F-S~CS~~~~~~~L~~~~~~CL~~~~~~  243 (244)
T cd04270         206 ATSGDKENNKKF-SPCSKKSISKVLEVKSNSCFVERSQS  243 (244)
T ss_pred             cccCCCCCCCCC-CHhHHHHHHHHHccCCCCCcCCCCCC
Confidence                      36 47999999999999999999987654


No 21 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.60  E-value=5.6e-15  Score=147.48  Aligned_cols=106  Identities=29%  Similarity=0.359  Sum_probs=74.1

Q ss_pred             CCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccccC
Q psy14093        260 SHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHY  339 (857)
Q Consensus       260 ~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n~~  339 (857)
                      .+|   +++++++.++.++.+|+||+|        ++|.+...  .++.                    .     ... .
T Consensus        51 ~~d---i~~~~~~~~~~~~~~g~a~~~--------~~c~~~~~--~~~~--------------------~-----~~~-~   91 (167)
T cd00203          51 KAD---IAILVTRQDFDGGTGGWAYLG--------RVCDSLRG--VGVL--------------------Q-----DNQ-S   91 (167)
T ss_pred             cCc---EEEEeccccCCCCceEEEecC--------CccCCCCC--cEEE--------------------e-----cCC-c
Confidence            457   999999999888899999999        89976541  1211                    0     000 0


Q ss_pred             CccccccchhhHHhhhhhhhCCCCCCCCCCCCCC---------CCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccH
Q psy14093        340 GQRVITREADLVTAHEFGHNWGSEHDPDMPECSP---------SASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLA  410 (857)
Q Consensus       340 G~~v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~---------~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~  410 (857)
                      +    ....+.|+||||||+||+.||.+.+.+..         .......+||++....+..+...+||+       ||+
T Consensus        92 ~----~~~~~~~~~HElGH~LGl~H~~~~~~~~~~~~~~~~~~~~~~~~~siM~y~~~~~~~~~~~~fS~-------~d~  160 (167)
T cd00203          92 G----TKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSFSDGQRKDFSQ-------CDI  160 (167)
T ss_pred             c----cccchhhHHHHHHHHhCCCccCcCCCCCCCccccccccCCCCCCCeEeccCccccCcccCCCcCH-------HHH
Confidence            0    12457899999999999999987543220         011246899999876543357899999       999


Q ss_pred             HHHhc
Q psy14093        411 HLFTH  415 (857)
Q Consensus       411 ~~i~~  415 (857)
                      ++|.+
T Consensus       161 ~~i~~  165 (167)
T cd00203         161 DQINK  165 (167)
T ss_pred             HHHHh
Confidence            98864


No 22 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=99.30  E-value=2e-12  Score=134.50  Aligned_cols=94  Identities=18%  Similarity=0.214  Sum_probs=77.6

Q ss_pred             CCCcceEEEEEeeeecc--cCCCcceeeeecCC-CCCcccceeccccccccchhhheeeeeeecccccccCCC-------
Q psy14093        604 GTNFCLAHLFTHQSFWS--RGSSILGLAYIASP-RPYSIGGSIHENLKYFDTLHSSHISHTIVKRGVQESNHP-------  673 (857)
Q Consensus       604 hD~~c~AhLft~~df~~--~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~~~~~~~aheigh~l~~~g~~~d~~~-------  673 (857)
                      ||   +|+|||+++|..  ..++++|+||+|++ ++..+++|..+..+..+.++||||||+|   ||.||+..       
T Consensus        94 ~D---~a~llt~~d~~~~~~~~~~~G~A~~ggiC~~~~s~~i~~~~~~~~a~~~aHElGH~L---G~~HD~~~~~C~~~~  167 (207)
T cd04273          94 HD---HAILLTRQDICRSNGNCDTLGLAPVGGMCSPSRSCSINEDTGLSSAFTIAHELGHVL---GMPHDGDGNSCGPEG  167 (207)
T ss_pred             cc---eEEEEeeecccccCCCCCceEEeccccCCCCCcceEEEcCCCceeEEeeeeechhhc---CCCCCCCCCCCCCCC
Confidence            56   999999999910  01589999999999 5678888887777778889999999999   99999751       


Q ss_pred             -CCCcccC---------CCCCCCCHHHHHHHhhcCCCCcCC
Q psy14093        674 -FNKIKEV---------TFKTLGNLRSIRKVLEAKSGKCFS  704 (857)
Q Consensus       674 -~gcim~~---------~fss~CS~~~~~~~l~~~~~~Cl~  704 (857)
                       -+|||+.         .| |.||+++|.++|..+...||+
T Consensus       168 ~~~~IM~~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~  207 (207)
T cd04273         168 KDGHIMSPTLGANTGPFTW-SKCSRRYLTSFLDTGDGNCLL  207 (207)
T ss_pred             CCceeecccccCCCCCCCc-CHHHHHHHHHHHccCCCcccC
Confidence             1699962         36 589999999999998899985


No 23 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=99.29  E-value=7e-13  Score=136.98  Aligned_cols=105  Identities=21%  Similarity=0.386  Sum_probs=75.8

Q ss_pred             cccccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeee
Q psy14093        588 KWDVRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIV  663 (857)
Q Consensus       588 ~W~~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~  663 (857)
                      .|+...+..+..    ||   .|+|||++++   ...++|+||+|+| ++..++||..+.   ...++.++||||||+| 
T Consensus        75 ~w~~~~~~~~~~----~D---~a~Llt~~~~---~~~~~G~a~~~~~C~~~~s~~i~~~~~~~~~~~a~~~AHelGH~l-  143 (199)
T PF01421_consen   75 NWQKSELPPRIH----HD---HAHLLTGKDF---PDSTVGLAYVGGMCSPSRSCGIVEDHSRSGLSFAVIIAHELGHNL-  143 (199)
T ss_dssp             HHHHHTHHHHS------S---EEEEEESS-C---SCCBSEEE-TT-TTSTTTSEEEEE-SSSSHHHHHHHHHHHHHHHT-
T ss_pred             HHHHhhcccccc----cc---eeEEeeeccc---ccceeeeEeCCCCCCcCCCCcEeeeccchhHHHHHHHHHHHHHhc-
Confidence            355444443333    78   9999999999   3559999999999 666888888654   6678889999999999 


Q ss_pred             cccccccCC-C--C----CCcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCC
Q psy14093        664 KRGVQESNH-P--F----NKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEP  706 (857)
Q Consensus       664 ~~g~~~d~~-~--~----gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~  706 (857)
                        ||.||+. .  +    +|||+.        .| |.||..++.+++.++.+.||+|.
T Consensus       144 --Gm~HD~~~~C~C~~~~~cIM~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~n~  198 (199)
T PF01421_consen  144 --GMPHDGDNGCKCPRRDGCIMSPSIGPSSSDKF-SNCSRRQFEEFLSSGKSSCLLNK  198 (199)
T ss_dssp             --T---TTTCTHCSSSSSSSTTSSSSSSSSTSSS--HHHHHHHHHHHHHHTTGGGSS-
T ss_pred             --CCCCCCCCcCCcCCCCCccccccccCCCCCCc-CHHHHHHHHHHHccCCCCccCCC
Confidence              9999987 2  2    789974        35 57999999999999889999874


No 24 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=99.28  E-value=3.1e-12  Score=131.57  Aligned_cols=92  Identities=23%  Similarity=0.447  Sum_probs=76.9

Q ss_pred             CCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeeecccccccCCCC-----
Q psy14093        604 GTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQESNHPF-----  674 (857)
Q Consensus       604 hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d~~~~-----  674 (857)
                      ||   +|||||+++|   .++++|+||+|++ ++..+++|....   ....+.++||||||+|   ||.||+..+     
T Consensus        87 ~D---~a~Llt~~~~---~~~~~G~A~~g~iC~~~~~~~v~~~~~~~~~~~a~~~AHElGH~l---G~~HD~~~c~C~~~  157 (194)
T cd04269          87 HD---NAQLLTGRDF---DGNTVGLAYVGGMCSPKYSGGVVQDHSRNLLLFAVTMAHELGHNL---GMEHDDGGCTCGRS  157 (194)
T ss_pred             CC---ceEEEEeeec---CCCceeeeecCCccCCCcceEEEEeCCcchHHHHHHHHHHHHhhc---CCCcCCCCCCCCCC
Confidence            67   9999999998   5799999999999 447778887543   4566789999999999   999998543     


Q ss_pred             CCcccC-------CCCCCCCHHHHHHHhhcCCCCcCCC
Q psy14093        675 NKIKEV-------TFKTLGNLRSIRKVLEAKSGKCFSE  705 (857)
Q Consensus       675 gcim~~-------~fss~CS~~~~~~~l~~~~~~Cl~~  705 (857)
                      +|||..       .| |.||.+++.++|..+...||++
T Consensus       158 ~cIM~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~~  194 (194)
T cd04269         158 TCIMAPSPSSLTDAF-SNCSYEDYQKFLSRGGGQCLLN  194 (194)
T ss_pred             CeEecCCCCCCCCCC-ChhhHHHHHHHHhcCCCCccCC
Confidence            689963       45 5799999999999988999974


No 25 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=99.20  E-value=1.2e-11  Score=130.67  Aligned_cols=105  Identities=17%  Similarity=0.196  Sum_probs=73.7

Q ss_pred             ccccccccccCCCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceecccc---cccc-----chhhheeeee
Q psy14093        591 VRNLLETFSSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENL---KYFD-----TLHSSHISHT  661 (857)
Q Consensus       591 ~~~lL~~~~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~---~~~~-----~~~aheigh~  661 (857)
                      +..|.+|......||. .++||+++..+    ++++||||+|+| ++..++|+..+..   ...+     .++||||||+
T Consensus        82 L~~F~~Wr~~~~~~d~-a~~~l~t~~~~----~~~lGlA~vg~mC~~~~s~~~~~~~~~~~~~~~~~~~~~t~AHElGHn  156 (228)
T cd04271          82 LSIFSQWRGQQPDDGN-AFWTLMTACPS----GSEVGVAWLGQLCRTGASDQGNETVAGTNVVVRTSNEWQVFAHEIGHT  156 (228)
T ss_pred             HHHHHHHHhcCCCCCc-EEeEeeccccC----CCceeEeeccCCCCCCcCCCccccccccceeecccccceehhhhhhhh
Confidence            4455555443334551 14499999865    899999999999 5656666653221   1111     3899999999


Q ss_pred             eecccccccCCC--C----------------------CCcccC-------CCCCCCCHHHHHHHhhcCC--CCcCC
Q psy14093        662 IVKRGVQESNHP--F----------------------NKIKEV-------TFKTLGNLRSIRKVLEAKS--GKCFS  704 (857)
Q Consensus       662 l~~~g~~~d~~~--~----------------------gcim~~-------~fss~CS~~~~~~~l~~~~--~~Cl~  704 (857)
                      |   ||.||+..  +                      +|||+.       .| |.||+.+|.++|..+.  +.||.
T Consensus       157 L---Gm~HD~~~~~C~~~~~~~~~~Cp~~~~~C~~~~~~IM~~~~~~~~~~F-S~CS~~~~~~~l~~~~~~~~CL~  228 (228)
T cd04271         157 F---GAVHDCTSGTCSDGSVGSQQCCPLSTSTCDANGQYIMNPSSSSGITEF-SPCTIGNICSLLGRNPVRTSCLS  228 (228)
T ss_pred             c---CCCCCCCCCccccccccccccCCCcccccCCCCCEEeecCcCCCCCcc-ChhhHHHHHHHHhcCCCCCcccC
Confidence            9   99999642  1                      469973       46 4799999999998865  89984


No 26 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=99.12  E-value=3.3e-11  Score=114.81  Aligned_cols=61  Identities=41%  Similarity=0.672  Sum_probs=34.0

Q ss_pred             cceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCccccccccceeecccCCccccccCCccc
Q psy14093        264 FCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSSQYFKNGYTLYLNSGLSSSRNHYGQRV  343 (857)
Q Consensus       264 ~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~Gl~~~~n~~G~~v  343 (857)
                      +++++||+..++ .+..|+||+|        ++|...... +...                    ..      ..++   
T Consensus        64 aD~v~~~~~~~~-~~~~G~A~~~--------~~~~~~~~~-~~~~--------------------~~------~~~~---  104 (124)
T PF13582_consen   64 ADLVHLFTGNDL-NGAGGIAYIG--------GVCSNNSGY-GVNY--------------------DS------SPYG---  104 (124)
T ss_dssp             -TTSEEEEEESS-GGB-EEEECT-----------TTT--T-T-SE--------------------EE------EE-----
T ss_pred             CCEEEEeccCCC-CCCceEEEeC--------CccccCCce-EEEc--------------------cc------CCCC---
Confidence            339999999986 6788999999        999986531 1110                    00      0011   


Q ss_pred             cccchhhHHhhhhhhhCCCCCC
Q psy14093        344 ITREADLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       344 ~~~~~~~t~AHElGHnlG~~HD  365 (857)
                        .....|+||||||+||+.||
T Consensus       105 --~~~~~~~~HEiGH~lGl~Hd  124 (124)
T PF13582_consen  105 --DSGVDTFAHEIGHNLGLNHD  124 (124)
T ss_dssp             --TTSTTHHHHHHHHHTT----
T ss_pred             --cccceEeeehhhHhcCCCCC
Confidence              12358999999999999998


No 27 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=99.08  E-value=9.1e-11  Score=123.31  Aligned_cols=95  Identities=17%  Similarity=0.206  Sum_probs=73.4

Q ss_pred             CCCCcceEEEEEeeee----cc-cCCCcceeeeecCCCCCcccceecccc--ccccchhhheeeeeeecccccccCCC--
Q psy14093        603 DGTNFCLAHLFTHQSF----WS-RGSSILGLAYIASPRPYSIGGSIHENL--KYFDTLHSSHISHTIVKRGVQESNHP--  673 (857)
Q Consensus       603 ~hD~~c~AhLft~~df----~~-~~~~tvGlA~vgs~~~~~s~Gv~~~~~--~~~~~~~aheigh~l~~~g~~~d~~~--  673 (857)
                      .||   +|+|||+++|    .. ...+++|+||+|+++...++|++.++.  ...+.++||||||+|   ||.||+..  
T Consensus        94 ~~D---~~~LlT~~~~~~~~~g~~~~~~~G~A~~g~~C~~~~~~~~~d~~~~~~~~~~~AHElGH~l---G~~HD~~~~~  167 (220)
T cd04272          94 NPD---VVFLVTGLDMSTYSGGSLQTGTGGYAYVGGACTENRVAMGEDTPGSYYGVYTMTHELAHLL---GAPHDGSPPP  167 (220)
T ss_pred             ccc---EEEEEeccceeeccCcccccCccceEeecCccCCCceeEeecCCCCcccHHHHHHHHHHHh---CCCCCCCCCC
Confidence            357   9999999987    10 014678999999995456778776653  355789999999999   99998532  


Q ss_pred             ---------------CCCcccC--------CCCCCCCHHHHHHHhhcCCCCcCC
Q psy14093        674 ---------------FNKIKEV--------TFKTLGNLRSIRKVLEAKSGKCFS  704 (857)
Q Consensus       674 ---------------~gcim~~--------~fss~CS~~~~~~~l~~~~~~Cl~  704 (857)
                                     .+|||+.        .| |.||..+|..+|..+...||.
T Consensus       168 ~~~~~~~~~~~C~~~~~~IM~~~~~~~~~~~f-S~CS~~~~~~~l~~~~~~CL~  220 (220)
T cd04272         168 SWVKGHPGSLDCPWDDGYIMSYVVNGERQYRF-SQCSQRQIRNVFRRLGASCLH  220 (220)
T ss_pred             ccCCCCCccCcCCCCCCeEEeeccCCCCCCcc-CcchHHHHHHHHcCCCccccC
Confidence                           2589973        36 579999999999998899984


No 28 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=98.84  E-value=2.5e-09  Score=109.79  Aligned_cols=85  Identities=29%  Similarity=0.410  Sum_probs=67.2

Q ss_pred             CCCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc--cccccchhhheeeeeeecccccccCCC-----
Q psy14093        602 VDGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN--LKYFDTLHSSHISHTIVKRGVQESNHP-----  673 (857)
Q Consensus       602 ~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~--~~~~~~~~aheigh~l~~~g~~~d~~~-----  673 (857)
                      ..||   +++|||+.+|  ...+++|+||++++ .+..++||+.+.  ....+.++||||||+|   ||.||+..     
T Consensus        87 ~~~d---~~~l~t~~~~--~~~~~~G~a~~~~~C~~~~~~~v~~~~~~~~~~~~~~aHElGH~l---G~~HD~~~~~~~~  158 (192)
T cd04267          87 IRHD---NAVLLTAQDF--IEGDILGLAYVGSMCNPYSSVGVVEDTGFTLLTALTMAHELGHNL---GAEHDGGDELAFE  158 (192)
T ss_pred             CCCc---eEEEEeeecc--CCCCeeeeecccCCCCCCCCeEEEecCCcceeehhhhhhhHHhhc---CCcCCCCCCcccc
Confidence            3577   9999999997  23688999999999 566778887543  3566779999999999   99999752     


Q ss_pred             ----CCCcccC--------CCCCCCCHHHHHHHh
Q psy14093        674 ----FNKIKEV--------TFKTLGNLRSIRKVL  695 (857)
Q Consensus       674 ----~gcim~~--------~fss~CS~~~~~~~l  695 (857)
                          -+|||+.        .| |.||+.+|.++|
T Consensus       159 c~~~~~~IM~~~~~~~~~~~F-S~CS~~~i~~~l  191 (192)
T cd04267         159 CDGGGNYIMAPVDSGLNSYRF-SQCSIGSIREFL  191 (192)
T ss_pred             CCCCCCeEEcccccCCCCCcc-ChhhHHHHHHHh
Confidence                1579962        46 579999999886


No 29 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=98.32  E-value=1.3e-07  Score=97.28  Aligned_cols=103  Identities=26%  Similarity=0.422  Sum_probs=53.4

Q ss_pred             CcceeeeeEEEcCCCCccCCCccccCccccccccccccccc----ccCCCCCCcceEEEEEeeeecccCCCcceeeeecC
Q psy14093        558 GMGFVIKKIVVHSDATRVRQGEAHYNMIRDKWDVRNLLETF----SSHVDGTNFCLAHLFTHQSFWSRGSSILGLAYIAS  633 (857)
Q Consensus       558 g~C~pi~~~~il~~GT~C~~G~c~~n~c~r~W~~~~lL~~~----~~~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs  633 (857)
                      +.-+.+..+.+....++=...     .....++...+|+.|    +.....+..|++||||+++|   .++ +|+||+|+
T Consensus        45 ~I~~~i~~i~i~~~~~~~~~~-----~~~~~~~~~~~L~~f~~~~~~~~~~~~~dl~~l~t~~d~---~~g-lGlA~~g~  115 (196)
T PF13688_consen   45 NIQFQISGITIWTSSDPYTNN-----VSNSNIDAEDTLQDFNNDFSSWRDSSYYDLAHLFTGRDF---SGG-LGLAYVGS  115 (196)
T ss_dssp             --EEEEEEEEE--S-B---TT------TSSS--HHHHHHHH--HHHTHHHHTTSSEEEEEE-------GGG-EEEESSSS
T ss_pred             CceEEEEEEEEecCCCCCccc-----ccccccCHHHHHHHHhhhhhhccCCCCCCEEEEEecccC---CCC-ceeEeccc
Confidence            556778888876655431111     134456666777666    22111233469999999999   344 99999999


Q ss_pred             CCCCcccceeccc----------cccccchhhheeeeeeecccccccCC
Q psy14093        634 PRPYSIGGSIHEN----------LKYFDTLHSSHISHTIVKRGVQESNH  672 (857)
Q Consensus       634 ~~~~~s~Gv~~~~----------~~~~~~~~aheigh~l~~~g~~~d~~  672 (857)
                      ++....+|++...          ......++||||||+|   |..||+.
T Consensus       116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AHEiGH~l---Ga~HD~~  161 (196)
T PF13688_consen  116 SCSNTSGGICSSSVSGSYGQVPPTYNGAITFAHEIGHNL---GAPHDGD  161 (196)
T ss_dssp             SSTT-SSSSS--E-EEETTEE--HHHHHHHHHHHHHHHT---T-----S
T ss_pred             cCcCCccceeccCcccccceECCCCceehhhHHhHHHhc---CCCCCCC
Confidence            9655566666433          1233458899999999   9999864


No 30 
>KOG3475|consensus
Probab=98.21  E-value=5.6e-07  Score=78.10  Aligned_cols=75  Identities=73%  Similarity=1.287  Sum_probs=72.8

Q ss_pred             cccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCcccCCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093        779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQRRKTTGTGRMRHLKIVRRRFS  853 (857)
Q Consensus       779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~~  853 (857)
                      |+|||...|+|.++++.+|..|.-.|.+-+...+.++|+|=+.++.|.|+.+...+...|+|.|..+++.|+||+
T Consensus         1 MtKGT~sFGKr~nkshtlC~RCG~~syH~QKstC~~CGYpaak~R~ynws~KAkRR~TtGTGrMr~lk~v~rrfr   75 (92)
T KOG3475|consen    1 MTKGTSSFGKRHNKSHTLCRRCGRRSYHIQKSTCSSCGYPAAKKRSYNWSAKAKRRRTTGTGRMRYLKIVYRRFR   75 (92)
T ss_pred             CCCcccchhhccccchHHHHHhCchhhhhhcccccccCCcchhccCCCcchhhhhccccccchhHHHHHHHHHHh
Confidence            678999999999999999999999999999999999999999999999999999999999999999999999997


No 31 
>PF08516 ADAM_CR:  ADAM cysteine-rich;  InterPro: IPR006586 An ADAM is a transmembrane protein that contains a disintegrin and metalloprotease domain (MEROPS peptidase family M12B). All members of the ADAM family display a common domain organisation - a pro-domain, the metalloprotease, disintigrin, cysteine-rich, epidermal-growth factor like, and transmembrane domains and a C-terminal cytoplasmic tail. They possess four potential functions: proteolysis, cell adhesion, cell fusion, and cell signalling. ADAMs are membrane-anchored proteases that proteolytically modify cell surface and extracellular matrix (ECM) in order to alter cell behaviour. They are responsible for the proteolytic cleavage of transmembrane proteins and release of their extracellular domain [, ]. The ADAM cysteine-rich domain is not found in plant, archaeal, bacterial or viral proteins. The cysteine-rich domain complements the binding capacity of the disintegrin domain, and perhaps imparts specificity to disintegrin domain-mediated interactions. It has been shown that the cysteine-rich domain of ADAM13 regulates the protein's metalloprotease activity []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2DW2_A 2DW0_B 2DW1_A 3G5C_B 2ERQ_B 2ERO_A 2ERP_B 3K7L_A 2E3X_A 3K7N_A.
Probab=98.18  E-value=3.6e-07  Score=86.92  Aligned_cols=39  Identities=18%  Similarity=0.192  Sum_probs=30.6

Q ss_pred             CCCCCCCCccc-----CCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093        815 CGYPCPRLRHY-----NWSVKAQRRKTTGTGRMRHLKIVRRRFS  853 (857)
Q Consensus       815 ~G~~C~~~~~~-----c~~~~~qc~~~~G~~~~~a~~~c~~~~~  853 (857)
                      ||+||.+.++|     |++++.||+.|||++|++|+++||+.+-
T Consensus         1 nG~pC~~~~~yCy~G~C~t~~~QC~~lwG~~a~~A~~~Cy~~~N   44 (117)
T PF08516_consen    1 NGTPCSNGQGYCYNGRCPTRDQQCQELWGPGARVAPDSCYEEVN   44 (117)
T ss_dssp             TT-EETTTTEEEBTTBEE-HHHHHHHHH-TTEEE--GGGGGGGG
T ss_pred             CCcccCCCCeEEECCcCCCHhHHHHHHhCCCccccchhHHHHHh
Confidence            79999987788     6799999999999999999999997753


No 32 
>PTZ00073 60S ribosomal protein L37; Provisional
Probab=98.15  E-value=1.5e-06  Score=76.98  Aligned_cols=75  Identities=71%  Similarity=1.245  Sum_probs=72.4

Q ss_pred             cccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCcccCCChhhhhhcccCCcccchhhhHHHhhc
Q psy14093        779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLRHYNWSVKAQRRKTTGTGRMRHLKIVRRRFS  853 (857)
Q Consensus       779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~~  853 (857)
                      |++||..+|+|.++++..|..|.-.|.+-+...+..+|+|.+.++.|-|+.+...+...|+|.|..++..++||+
T Consensus         1 MtKGT~SfGKr~~ktHtlCrRCG~~syH~qK~~CasCGypsak~R~YnWs~Ka~rr~ttGtGrmr~lk~v~rrf~   75 (91)
T PTZ00073          1 MTKGTGSFGKRHGKTHTLCRRCGKRSFHVQKKRCASCGYPSAKMRRYNWSVKAKRRRTTGTGRCRYLKTVARRAK   75 (91)
T ss_pred             CCCCCcccccccCcCcchhcccCccccccccccchhcCCchhhccccchhhhhccccCCccchhHHHHHHHHHHh
Confidence            678999999999999999999999999999999999999999999999999999999999999999999999996


No 33 
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=98.13  E-value=9.4e-07  Score=89.73  Aligned_cols=87  Identities=22%  Similarity=0.282  Sum_probs=38.8

Q ss_pred             CCCCCCcceEEEEEeeeecccCCCcceeeeecCCCCCc--ccceecccc-ccccchhhheeeeeeecccccccCCCC---
Q psy14093        601 HVDGTNFCLAHLFTHQSFWSRGSSILGLAYIASPRPYS--IGGSIHENL-KYFDTLHSSHISHTIVKRGVQESNHPF---  674 (857)
Q Consensus       601 ~~~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~~~~~--s~Gv~~~~~-~~~~~~~aheigh~l~~~g~~~d~~~~---  674 (857)
                      ...++++|++|||  .++   .++..||||+|+++...  ..|+..+.. .+..-++||||||+|   |+.||...-   
T Consensus        64 ~~~~~~~Di~~l~--~~~---~~g~~Gla~~g~~c~~~~~~~g~~~~~~~~~~~~~~aHElGH~l---Ga~Hd~~~~~~~  135 (173)
T PF13574_consen   64 WIGSSNYDIGHLF--GTF---SGGGGGLAYVGGVCSKGYKATGLSSPYGDFFGIDTFAHELGHQL---GAPHDFDNGCNP  135 (173)
T ss_dssp             ----TCSS-CCEE--EEE---SSGCCSSEEE-CCCCTCCCE-CCCS--TTSHHHHHHHHHHHHHH---T---SSSS----
T ss_pred             cCCCCCCCEEEEe--ecc---CCCCccEEEEeEeCcCCCEEEEEEcCCCceeeeeeehhhhHhhc---CCCCCCCCCCCC
Confidence            3446778899999  444   37899999999994322  234432211 123346999999999   999987431   


Q ss_pred             ---------CCccc-------CCCCCCCCHHHHHHHhh
Q psy14093        675 ---------NKIKE-------VTFKTLGNLRSIRKVLE  696 (857)
Q Consensus       675 ---------gcim~-------~~fss~CS~~~~~~~l~  696 (857)
                               +-||.       ..| |+||+++|..+|.
T Consensus       136 ~~~~~~~sg~~IMsy~~~~~~~~f-S~cS~~~i~~~l~  172 (173)
T PF13574_consen  136 GSPTCPGSGGYIMSYAGICNNDKF-SPCSIRQIRAVLG  172 (173)
T ss_dssp             SS--T-STTT-TT-SS-GGGG-----------------
T ss_pred             CCCcCCCCCCEEEeeCCCCCCccc-ccccccccccccC
Confidence                     24775       267 5899999998875


No 34 
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=97.62  E-value=4.9e-05  Score=75.84  Aligned_cols=58  Identities=22%  Similarity=0.307  Sum_probs=45.1

Q ss_pred             eEEEEEeeeecccCCCcceeeeecCC-CCCcccceecccc---ccccchhhheeeeeeecccccccCC
Q psy14093        609 LAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENL---KYFDTLHSSHISHTIVKRGVQESNH  672 (857)
Q Consensus       609 ~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~---~~~~~~~aheigh~l~~~g~~~d~~  672 (857)
                      ++++++..++   .+++.|+||++++ .+....++.....   .....+++|||||.|   ||.|+..
T Consensus        54 i~~~~~~~~~---~~~~~g~a~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~HElGH~L---Gl~H~~~  115 (167)
T cd00203          54 IAILVTRQDF---DGGTGGWAYLGRVCDSLRGVGVLQDNQSGTKEGAQTIAHELGHAL---GFYHDHD  115 (167)
T ss_pred             EEEEeccccC---CCCceEEEecCCccCCCCCcEEEecCCcccccchhhHHHHHHHHh---CCCccCc
Confidence            9999999987   4688999999998 5555555554332   356779999999999   9988754


No 35 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=97.46  E-value=7.7e-05  Score=76.15  Aligned_cols=100  Identities=24%  Similarity=0.379  Sum_probs=56.5

Q ss_pred             cccccccchhhhcccccccc--CCCccceEEEeeccccCCCceeeeeccCCCCCCcccccCCCccccceEEeccCCCCcc
Q psy14093        240 NMIRDKWDVRNLLEVFSREY--SHKDFCLAHLFTDLKFEGGILGLAYVGSPRRNSVGGICTPEYFKNGYTLYLNSGLSSS  317 (857)
Q Consensus       240 ~~~f~~w~~~~ll~~~s~~~--~~D~~cla~L~T~~df~~g~lGlA~vg~~~~~~~ggiCs~~~~~~~~~~~~~~~~~~~  317 (857)
                      +....++....+++......  ..+   .+.++|..|+..  -|++||.        |+|.+...  .+++.       .
T Consensus        47 ~~~r~Q~~a~~iL~~l~~~~~~~~~---~vl~vt~~DLy~--~~~nfvf--------G~a~~~~~--~aVvS-------~  104 (179)
T PRK13267         47 DWERGQYRAEKFLPLLSRIGRFNGD---KNIGITDCDLYY--RGLNFVF--------GLAYPNLR--GAVIS-------T  104 (179)
T ss_pred             CCCCceecHHHHHHHHHhhCCCCCc---eEEEEEccccCC--CCCCcee--------EEeecCCC--EEEEE-------c
Confidence            33445566666665544322  334   788899999753  3677877        88887542  23330       0


Q ss_pred             ccccccceeecccCCccccccCCcc----ccccchhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEee
Q psy14093        318 QYFKNGYTLYLNSGLSSSRNHYGQR----VITREADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTY  386 (857)
Q Consensus       318 ~~~~~~~~~~~n~Gl~~~~n~~G~~----v~~~~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~  386 (857)
                      .        -++..+      ||..    +...-...+++|||||+||+.|      |.     ...|+|+.+
T Consensus       105 ~--------Rl~~~~------~~~~~~~~l~~~R~~k~~~HElGH~lGL~H------C~-----~~~CvM~~s  152 (179)
T PRK13267        105 Y--------RLRPEF------YGNKPDSELFEERVRKEVTHELGHTLGLEH------CD-----NPRCVMNFS  152 (179)
T ss_pred             c--------ccChhh------ccccchHHHHHHHHHHHHHHHHHHHcCCcc------CC-----CCCccCCCC
Confidence            0        011110      2211    1111234458999999999999      43     246899854


No 36 
>KOG3538|consensus
Probab=97.31  E-value=0.00037  Score=87.23  Aligned_cols=112  Identities=17%  Similarity=0.242  Sum_probs=84.3

Q ss_pred             CCCcceEEEEEeeeecc---cCCCcceeeeecCC-CCCcccceeccccccccchhhheeeeeeecccccccCCCCC----
Q psy14093        604 GTNFCLAHLFTHQSFWS---RGSSILGLAYIASP-RPYSIGGSIHENLKYFDTLHSSHISHTIVKRGVQESNHPFN----  675 (857)
Q Consensus       604 hD~~c~AhLft~~df~~---~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~~~~~~~aheigh~l~~~g~~~d~~~~g----  675 (857)
                      ||   .|.|||..||-.   -...++|+|.+++| .+.++.-|+.+..+..+.++|||+||++   ||.||+....    
T Consensus       270 ~D---~ailltr~d~~~~~~~~~~~~G~A~v~gmC~~~~sC~i~e~~gl~~aftiahE~GH~~---gm~hd~~~~~C~~~  343 (845)
T KOG3538|consen  270 HD---HAILLTRQDFCEQQNGPCDTLGVAPVGGMCSPSRSCSINEDTGLPSAFTIAHELGHNF---GMEHDGRGNSCGPT  343 (845)
T ss_pred             cc---eeEEeeeeecccccCCCCccccceeecceeCCccceeeecCCCCccceeeeecccccc---CcccCCcCCCCCCc
Confidence            57   999999888721   23478999999999 7888888887777778889999999999   9999974322    


Q ss_pred             -CcccC--------CCCCCCCHHHHHHHhhcCCCCcCCCCC--------ccccCCccccccccc
Q psy14093        676 -KIKEV--------TFKTLGNLRSIRKVLEAKSGKCFSEPE--------ESFCGNLRVEGDEEC  722 (857)
Q Consensus       676 -cim~~--------~fss~CS~~~~~~~l~~~~~~Cl~~~~--------~~~CGNgiVE~gEeC  722 (857)
                       .||..        ...|.||...+..||......||....        ....| ......|||
T Consensus       344 ~~im~~~~~~~~~~~~WS~CS~~~~t~fl~~~~~~Cl~d~p~~~~~~~p~~lPg-~~y~~~~qC  406 (845)
T KOG3538|consen  344 FHVMSSDTCGGNHELTWSECSREYLTRFLDDGEGNCLSDQPFSALPKLPSQLPG-QTYDADEQC  406 (845)
T ss_pred             ccccCcccccCCCCccCCcchHHHHHHHhccCccceeecCCccccCCCCccCCc-cccCCCCcc
Confidence             36752        224689999999999988789998542        12344 345556666


No 37 
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=96.83  E-value=0.00042  Score=65.92  Aligned_cols=58  Identities=24%  Similarity=0.256  Sum_probs=34.5

Q ss_pred             CcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccc---cccccchhhheeeeeeeccccccc
Q psy14093        606 NFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQES  670 (857)
Q Consensus       606 ~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d  670 (857)
                      ++.+++||+..++    .+..|+||++++ .+....++....   ......+++|||||+|   |+.||
T Consensus        63 ~aD~v~~~~~~~~----~~~~G~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HEiGH~l---Gl~Hd  124 (124)
T PF13582_consen   63 GADLVHLFTGNDL----NGAGGIAYIGGVCSNNSGYGVNYDSSPYGDSGVDTFAHEIGHNL---GLNHD  124 (124)
T ss_dssp             T-TTSEEEEEESS----GGB-EEEECT---TTT--TT-SEEEEE--TTSTTHHHHHHHHHT---T----
T ss_pred             CCCEEEEeccCCC----CCCceEEEeCCccccCCceEEEcccCCCCcccceEeeehhhHhc---CCCCC
Confidence            3449999999887    688999999988 444333443221   2234578999999999   98886


No 38 
>smart00608 ACR ADAM Cysteine-Rich Domain.
Probab=96.83  E-value=0.0005  Score=67.37  Aligned_cols=30  Identities=10%  Similarity=-0.064  Sum_probs=27.5

Q ss_pred             cccCCChhhhhhcccCCcccchhhhHHHhh
Q psy14093        823 RHYNWSVKAQRRKTTGTGRMRHLKIVRRRF  852 (857)
Q Consensus       823 ~~~c~~~~~qc~~~~G~~~~~a~~~c~~~~  852 (857)
                      +|.|.+++.||+.|||++|+.|++.||+.+
T Consensus        14 ~G~C~t~~~QC~~lfG~~a~~A~~~Cy~~~   43 (137)
T smart00608       14 NGRCPTRDNQCQALFGPGAKVAPDSCYEEL   43 (137)
T ss_pred             CCCcCCcchhhHhhhCCCCcccChhHhHHH
Confidence            356999999999999999999999999865


No 39 
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=96.82  E-value=0.0011  Score=69.46  Aligned_cols=86  Identities=22%  Similarity=0.281  Sum_probs=57.0

Q ss_pred             CcceEEEEEeeeecccCCCcceeeeecCCC-CCcccceeccc---cccccchhhheeeeeeecccccccCCCC-------
Q psy14093        606 NFCLAHLFTHQSFWSRGSSILGLAYIASPR-PYSIGGSIHEN---LKYFDTLHSSHISHTIVKRGVQESNHPF-------  674 (857)
Q Consensus       606 ~~c~AhLft~~df~~~~~~tvGlA~vgs~~-~~~s~Gv~~~~---~~~~~~~~aheigh~l~~~g~~~d~~~~-------  674 (857)
                      ++++++||+....   ..+..|+||+++.. .+..+++....   ...+ -+++|||||+|   |+.|+...-       
T Consensus        93 ~~D~v~l~~~~~~---~~~~~GlA~~~~~~~~~~~~~~~~~~~~~~~~~-~~~aHEiGH~l---Gl~H~~~~~~~~~~~~  165 (206)
T PF13583_consen   93 NADLVILLTDTNK---DSGTCGLAYVGSVCWDNAKAGAYSDPDLSANGY-QTFAHEIGHNL---GLRHDFDYSGDPYSYA  165 (206)
T ss_pred             CCCEEEEEecccc---CCCCCCeeeeccccccccceEEEEecccccccc-hHHHHHHHHHh---cCCCCcccCCCccccc
Confidence            3449999998754   37899999999984 44444444321   1122 35999999999   988876321       


Q ss_pred             ------CCcccC-------CCCCCCCHHHHHHHhhcCC
Q psy14093        675 ------NKIKEV-------TFKTLGNLRSIRKVLEAKS  699 (857)
Q Consensus       675 ------gcim~~-------~fss~CS~~~~~~~l~~~~  699 (857)
                            .-||..       .| |+|++.++..++..+.
T Consensus       166 ~~~~~~~tIMsY~~~~~~~~F-S~~~i~~i~~~~~~~~  202 (206)
T PF13583_consen  166 YECDSGQTIMSYASRPSRPYF-SPPSIEYIREVVICGD  202 (206)
T ss_pred             ccCCCCCEEeCCCCCCCCCCC-CchhHHHHhCCCcCcc
Confidence                  247752       35 4788888876654443


No 40 
>PRK04179 rpl37e 50S ribosomal protein L37e; Reviewed
Probab=96.05  E-value=0.004  Score=51.92  Aligned_cols=54  Identities=50%  Similarity=1.046  Sum_probs=49.4

Q ss_pred             cccCccccccccCc-cCCCccccCCCcccccccccccCCC-CCCCCcccCCChhhh
Q psy14093        779 ATKGTSSFGKRRNK-THTLCRRCGKSSYHIQKKKCAQCGY-PCPRLRHYNWSVKAQ  832 (857)
Q Consensus       779 ~~~g~~c~~~r~~~-~~~~Ce~C~G~s~~Cp~d~~~~~G~-~C~~~~~~c~~~~~q  832 (857)
                      |++||..+|+|.++ ++..|..|.-.|.+-+...++.+|+ |...++.|-|+....
T Consensus         1 MtKGT~SfGKr~~k~tHt~CrRCG~~syh~qK~~CasCGygps~k~R~YnWs~Ka~   56 (62)
T PRK04179          1 MSKGTPSMGKRNKGKTHIRCRRCGRHSYNVRKKYCAACGFGRSKRIRRYSWQNKKV   56 (62)
T ss_pred             CCCCCcccccccCCcccchhcccCcccccccccchhhcCCCcccccccccHHHHhh
Confidence            67899999999865 8999999999999999999999999 999999999987653


No 41 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=95.73  E-value=0.009  Score=59.44  Aligned_cols=42  Identities=31%  Similarity=0.384  Sum_probs=29.4

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSS  400 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~  400 (857)
                      ...|++|||||.||+.|....          ..-||++....+. ......+.
T Consensus       104 ~~~~~~HEiGHaLGL~H~~~~----------~~siM~p~~~~~~-~~~~~l~~  145 (156)
T cd04279         104 LQAIALHELGHALGLWHHSDR----------PEDAMYPSQGQGP-DGNPTLSA  145 (156)
T ss_pred             HHHHHHHHhhhhhcCCCCCCC----------ccceeeeecccCC-cCcCCCCH
Confidence            467999999999999999752          1369999865431 12335555


No 42 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=95.62  E-value=0.0028  Score=62.39  Aligned_cols=31  Identities=35%  Similarity=0.629  Sum_probs=24.8

Q ss_pred             chhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeecc
Q psy14093        347 EADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSV  388 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~  388 (857)
                      ....|++|||||.||+.|..+           ..-||++...
T Consensus       104 ~~~~v~~HEiGHaLGL~H~~~-----------~~svM~~~~~  134 (154)
T PF00413_consen  104 DLQSVAIHEIGHALGLDHSND-----------PNSVMYPYYR  134 (154)
T ss_dssp             EHHHHHHHHHHHHTTBESSSS-----------TTSTTSSSCT
T ss_pred             hhhhhhhhccccccCcCcCCC-----------cccceeeecc
Confidence            356899999999999999864           2358999754


No 43 
>smart00608 ACR ADAM Cysteine-Rich Domain.
Probab=95.47  E-value=0.0094  Score=58.47  Aligned_cols=23  Identities=35%  Similarity=0.810  Sum_probs=20.8

Q ss_pred             CCCcCCCC-ceecCCccch---Hhhhc
Q psy14093        522 DNTGCLER-GKCRGGKCIP---FCETQ  544 (857)
Q Consensus       522 dgt~C~~~-~~C~~G~C~~---~C~~~  544 (857)
                      ||+||.++ ++||+|+|.+   ||+.+
T Consensus         1 dG~pC~~~~~yCy~G~C~t~~~QC~~l   27 (137)
T smart00608        1 DGTPCDNGQGYCYNGRCPTRDNQCQAL   27 (137)
T ss_pred             CCcCcCCCCEEeECCCcCCcchhhHhh
Confidence            79999986 8999999996   89885


No 44 
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=94.63  E-value=0.016  Score=57.71  Aligned_cols=40  Identities=35%  Similarity=0.466  Sum_probs=28.2

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSS  400 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~  400 (857)
                      ...|+.|||||.||+.|-.+           ..-||++.....  .+...+|+
T Consensus       107 ~~~~~~HEiGHaLGL~H~~~-----------~~~vM~~~~~~~--~~~~~l~~  146 (157)
T cd04278         107 LFSVAAHEIGHALGLGHSSD-----------PDSIMYPYYQGP--VPKFKLSQ  146 (157)
T ss_pred             HHHHHHHHhccccccCCCCC-----------CcCeecccccCC--CcCCCCCH
Confidence            45799999999999999764           235999874321  13455665


No 45 
>TIGR02232 myxo_disulf_rpt Myxococcus cysteine-rich repeat. This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Probab=94.57  E-value=0.034  Score=42.36  Aligned_cols=34  Identities=35%  Similarity=0.776  Sum_probs=26.6

Q ss_pred             cccCCccccccccccCCCCCCCCCCCCCCCCCccccC
Q psy14093        709 SFCGNLRVEGDEECDAGLLGTEDNDSCCDKVCKLRRN  745 (857)
Q Consensus       709 ~~CGNgiVE~gEeCDCG~~~~c~~d~CC~~tC~Lk~g  745 (857)
                      +.||||+|..+||||=|-...  .|- |.+.|++-++
T Consensus         2 ~~CGdG~~~~~EeCDDGN~~~--~DG-Cs~~C~iE~~   35 (38)
T TIGR02232         2 PTCGDGIIEPGEECDDGNTTS--GDG-CSATCRLEEG   35 (38)
T ss_pred             CcCCCCcCcCCccccCCCCcC--CCC-cCcccccCCC
Confidence            679999999999999996532  233 4678998776


No 46 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=94.24  E-value=0.021  Score=55.53  Aligned_cols=18  Identities=44%  Similarity=0.667  Sum_probs=16.5

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      .|++|||||.||+.|...
T Consensus        88 ~~~~HEigHaLGl~H~~~  105 (140)
T smart00235       88 GVAAHELGHALGLYHEQS  105 (140)
T ss_pred             ccHHHHHHHHhcCCcCCC
Confidence            389999999999999975


No 47 
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=94.09  E-value=0.043  Score=56.21  Aligned_cols=40  Identities=28%  Similarity=0.282  Sum_probs=27.7

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCCC-----CCCCCceEEEeec
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSPS-----ASQGGSYLMYTYS  387 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~~-----~~~~g~~IM~~~~  387 (857)
                      .-.|+.|||||.||+.|..+...-.+.     ......-||++..
T Consensus       113 ~~~t~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SVMSY~~  157 (186)
T cd04277         113 GYQTIIHEIGHALGLEHPGDYNGGDPVPPTYALDSREYTVMSYNS  157 (186)
T ss_pred             hHHHHHHHHHHHhcCCCCCcCCCCCCCCccccccCcceEEEeecC
Confidence            467999999999999999865332221     1123456999864


No 48 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=94.07  E-value=0.021  Score=56.89  Aligned_cols=40  Identities=20%  Similarity=0.228  Sum_probs=26.1

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCCC-----CCCCCceEEEeec
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSPS-----ASQGGSYLMYTYS  387 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~~-----~~~~g~~IM~~~~  387 (857)
                      ...|++|||||.||+.|-.....-.+.     ....-.-||++..
T Consensus        94 ~~~~~~HEiGHaLGL~H~~~~~~~~~~~~~~~~~~~~~SvM~y~~  138 (165)
T cd04268          94 LRNTAEHELGHALGLRHNFAASDRDDNVDLLAEKGDTSSVMDYAP  138 (165)
T ss_pred             HHHHHHHHHHHHhcccccCcCCccCCcchhhccCCCCcccCCCCc
Confidence            467999999999999999864322210     0001356888864


No 49 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=93.93  E-value=0.095  Score=52.88  Aligned_cols=27  Identities=30%  Similarity=0.347  Sum_probs=21.3

Q ss_pred             hhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEee
Q psy14093        349 DLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTY  386 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~  386 (857)
                      .--+.|||||.||+.|=++           ..|+|+.+
T Consensus       125 ~KEv~HElGH~~GL~HC~N-----------~~CVM~FS  151 (181)
T COG1913         125 VKEVLHELGHLLGLSHCPN-----------PRCVMNFS  151 (181)
T ss_pred             HHHHHHHhhhhcCcccCCC-----------CCcEEeCC
Confidence            3457899999999999643           46899865


No 50 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=93.90  E-value=0.2  Score=53.27  Aligned_cols=21  Identities=29%  Similarity=0.248  Sum_probs=18.3

Q ss_pred             hhhHHhhhhhhhCCCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDM  368 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~  368 (857)
                      ...|++|||||-||+.|--..
T Consensus       137 ~g~t~~HEvGH~lGL~HtF~~  157 (225)
T cd04275         137 LGDTATHEVGHWLGLYHTFQG  157 (225)
T ss_pred             ccceeEEeccceeeeeeeecC
Confidence            357999999999999999764


No 51 
>COG2126 RPL37A Ribosomal protein L37E [Translation, ribosomal structure and biogenesis]
Probab=93.59  E-value=0.049  Score=44.89  Aligned_cols=53  Identities=58%  Similarity=1.195  Sum_probs=47.4

Q ss_pred             cccCccccccccCccCCCccccCCCcccccccccccCCCC-CCCCcccCCChhh
Q psy14093        779 ATKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYP-CPRLRHYNWSVKA  831 (857)
Q Consensus       779 ~~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~-C~~~~~~c~~~~~  831 (857)
                      |++||..+|+|.++++..|..|.-.|.+.....+..+|+| -..++.|-|..+.
T Consensus         1 MskGT~S~GKrnk~tH~~CRRCGr~syhv~k~~CaaCGfgrs~r~R~y~W~~K~   54 (61)
T COG2126           1 MSKGTPSFGKRNKKTHIRCRRCGRRSYHVRKKYCAACGFGRSARMRSYNWQAKK   54 (61)
T ss_pred             CCCCCccccccCCcceehhhhccchheeeccceecccCCCCccccccchhhhhh
Confidence            6789999999998889999999999999999999999999 7778888776553


No 52 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=93.44  E-value=0.049  Score=56.93  Aligned_cols=28  Identities=29%  Similarity=0.470  Sum_probs=21.3

Q ss_pred             chhhHHhhhhhhhCCCCCCCC--CCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHDPD--MPECSPS  374 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~~--~~~C~~~  374 (857)
                      ..+.||+|||||.+|+.|-+-  ...|...
T Consensus       132 ~~~hvi~HEiGH~IGfRHTD~~~R~SCG~~  161 (211)
T PF12388_consen  132 VIEHVITHEIGHCIGFRHTDYFNRSSCGSG  161 (211)
T ss_pred             HHHHHHHHHhhhhccccccCcCCccccccc
Confidence            357899999999999999762  1267543


No 53 
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=92.93  E-value=0.09  Score=58.59  Aligned_cols=19  Identities=37%  Similarity=0.416  Sum_probs=16.6

Q ss_pred             hhHHhhhhhhhCCCCCCCC
Q psy14093        349 DLVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~  367 (857)
                      ..|+.||||||||+.|=..
T Consensus       151 ~~~~~HElgHN~GL~Ha~~  169 (314)
T PF05548_consen  151 WATIMHELGHNLGLWHAGR  169 (314)
T ss_pred             HHHHHHHhhhhccccccCC
Confidence            3489999999999999863


No 54 
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=92.61  E-value=0.049  Score=54.49  Aligned_cols=24  Identities=29%  Similarity=0.346  Sum_probs=15.9

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECS  372 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~  372 (857)
                      .+.|++||+||-||+.|--.. .|.
T Consensus        69 ~g~TltHEvGH~LGL~HtF~~-~C~   92 (154)
T PF05572_consen   69 FGKTLTHEVGHWLGLYHTFGG-GCD   92 (154)
T ss_dssp             SSHHHHHHHHHHTT---TT----TS
T ss_pred             cccchhhhhhhhhcccccccC-CCC
Confidence            478999999999999999753 375


No 55 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=92.23  E-value=0.093  Score=54.65  Aligned_cols=26  Identities=27%  Similarity=0.396  Sum_probs=20.6

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSP  373 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~  373 (857)
                      ...+++|||||.||+.|.-......|
T Consensus       116 ~~~~~~he~gh~lGl~hn~~~s~~~~  141 (197)
T cd04276         116 LRYLLAHEVGHTLGLRHNFKASSDGS  141 (197)
T ss_pred             HHHHHHHHHHHHhcCcccccccccCc
Confidence            45799999999999999865544544


No 56 
>TIGR02232 myxo_disulf_rpt Myxococcus cysteine-rich repeat. This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Probab=92.05  E-value=0.16  Score=38.78  Aligned_cols=37  Identities=14%  Similarity=0.079  Sum_probs=26.8

Q ss_pred             cccCccccCCCCCCccCCCcccccccCCCCCCCCCCCCCCCCcccCCCcc
Q psy14093        424 SILGLAYIASPRPYSIGGGILGLAYVGSPRRNSKCCYHSILSELFPVLLY  473 (857)
Q Consensus       424 ~~CGN~ive~g~~EeCDcG~~~~~~~~~~~~~d~CC~~~~~C~l~~ga~C  473 (857)
                      +.|||+++..+  ||||-|-..        -.|-| ...  |++-++..|
T Consensus         2 ~~CGdG~~~~~--EeCDDGN~~--------~~DGC-s~~--C~iE~~~~C   38 (38)
T TIGR02232         2 PTCGDGIIEPG--EECDDGNTT--------SGDGC-SAT--CRLEEGFAC   38 (38)
T ss_pred             CcCCCCcCcCC--ccccCCCCc--------CCCCc-Ccc--cccCCCCCC
Confidence            68999999999  999999653        23444 333  777776655


No 57 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=91.84  E-value=0.093  Score=53.83  Aligned_cols=85  Identities=15%  Similarity=0.207  Sum_probs=55.7

Q ss_pred             CcccccccccccccccccCC--CCCCcceEEEEEeeeecccCCCcceeeeecCC-CCCcccceeccccc-----------
Q psy14093        583 NMIRDKWDVRNLLETFSSHV--DGTNFCLAHLFTHQSFWSRGSSILGLAYIASP-RPYSIGGSIHENLK-----------  648 (857)
Q Consensus       583 n~c~r~W~~~~lL~~~~~~~--~hD~~c~AhLft~~df~~~~~~tvGlA~vgs~-~~~~s~Gv~~~~~~-----------  648 (857)
                      +.-+..++...+|+++....  +.+   .+.++|..|.+.     -|+.|+.+. .+...+||++...+           
T Consensus        47 ~~~r~Q~~a~~iL~~l~~~~~~~~~---~vl~vt~~DLy~-----~~~nfvfG~a~~~~~~aVvS~~Rl~~~~~~~~~~~  118 (179)
T PRK13267         47 DWERGQYRAEKFLPLLSRIGRFNGD---KNIGITDCDLYY-----RGLNFVFGLAYPNLRGAVISTYRLRPEFYGNKPDS  118 (179)
T ss_pred             CCCCceecHHHHHHHHHhhCCCCCc---eEEEEEccccCC-----CCCCceeEEeecCCCEEEEEccccChhhccccchH
Confidence            44445566667776665432  233   678889988642     367888888 66667777742111           


Q ss_pred             -----cccchhhheeeeeeecccccccCCCCCCccc
Q psy14093        649 -----YFDTLHSSHISHTIVKRGVQESNHPFNKIKE  679 (857)
Q Consensus       649 -----~~~~~~aheigh~l~~~g~~~d~~~~gcim~  679 (857)
                           .+...++||+||++   |+.|-. +..|+|+
T Consensus       119 ~l~~~R~~k~~~HElGH~l---GL~HC~-~~~CvM~  150 (179)
T PRK13267        119 ELFEERVRKEVTHELGHTL---GLEHCD-NPRCVMN  150 (179)
T ss_pred             HHHHHHHHHHHHHHHHHHc---CCccCC-CCCccCC
Confidence                 11224799999999   999974 5689997


No 58 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=91.11  E-value=0.076  Score=50.95  Aligned_cols=23  Identities=35%  Similarity=0.512  Sum_probs=14.4

Q ss_pred             hhHHhhhhhhhCCCCCCCCCCCCC
Q psy14093        349 DLVTAHEFGHNWGSEHDPDMPECS  372 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~~~~C~  372 (857)
                      -.|+||||||.||++-.-.+ -|+
T Consensus        78 ~RIaaHE~GHiLGLPD~y~G-pCS  100 (132)
T PF02031_consen   78 TRIAAHELGHILGLPDHYPG-PCS  100 (132)
T ss_dssp             HHHHHHHHHHHHT----TTS--TT
T ss_pred             ceeeeehhccccCCCCCCCC-ccH
Confidence            46999999999999866543 465


No 59 
>PF01907 Ribosomal_L37e:  Ribosomal protein L37e;  InterPro: IPR001569 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. A number of eukaryotic and archaeal ribosomal proteins can be grouped on the basis of sequence similarities. One of these families consists of proteins of 56 to 96 amino-acid residues that share a highly conserved region located in the N-terminal part.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A19_A 4A1D_A 4A18_A 4A1B_A 1S1I_Y 3O5H_d 3IZS_l 3O58_d 2ZKR_2 3IZR_l ....
Probab=90.12  E-value=0.12  Score=42.46  Aligned_cols=52  Identities=79%  Similarity=1.421  Sum_probs=41.2

Q ss_pred             ccCccccccccCccCCCccccCCCcccccccccccCCCCCCCCc-ccCCChhh
Q psy14093        780 TKGTSSFGKRRNKTHTLCRRCGKSSYHIQKKKCAQCGYPCPRLR-HYNWSVKA  831 (857)
Q Consensus       780 ~~g~~c~~~r~~~~~~~Ce~C~G~s~~Cp~d~~~~~G~~C~~~~-~~c~~~~~  831 (857)
                      ++||..+|+|.++++..|..|.-.|.+-....+..+|+|=..++ .|-|+.+.
T Consensus         1 tKGT~S~Gkr~~ktH~~CrRCG~~syH~qK~~CasCGyp~~kkrr~ynWs~Ka   53 (55)
T PF01907_consen    1 TKGTSSFGKRHNKTHTLCRRCGRRSYHIQKKTCASCGYPAAKKRRKYNWSAKA   53 (55)
T ss_dssp             -TTHHHHTTS-S-SEEE-TTTSSEEEETTTTEETTTBTTTSSS----SSSSHH
T ss_pred             CCCCCCccccCCccEeeecccCCeeeecCCCcccccCCCcccccccccchhhh
Confidence            36999999999889999999999999999999999999998877 88887654


No 60 
>PF10462 Peptidase_M66:  Peptidase M66;  InterPro: IPR019503 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of metallopeptidases belonging to MEROPS peptidase family M66 (StcE peptidase, clan MA). The StcE peptidase is a virulence factor found in Shiga toxigenic Escherichia coli strains. StcE peptidase cleaves C1 esterase inhibitor []. The SwissProt proteins in this entry have been name "Dictomallein". ; GO: 0004222 metalloendopeptidase activity; PDB: 3UJZ_A.
Probab=88.39  E-value=0.22  Score=55.03  Aligned_cols=19  Identities=42%  Similarity=0.667  Sum_probs=14.5

Q ss_pred             hhhHHhhhhhhhCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDP  366 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~  366 (857)
                      ...+++|||||+||+.|=+
T Consensus       193 ~~~~f~HE~GH~~GL~H~p  211 (305)
T PF10462_consen  193 YGNEFSHELGHNFGLGHYP  211 (305)
T ss_dssp             SHHHHHHHHHHTTT--SST
T ss_pred             ccceeehhhhhhcCCCCCC
Confidence            4689999999999999943


No 61 
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=87.84  E-value=0.26  Score=51.17  Aligned_cols=22  Identities=41%  Similarity=0.675  Sum_probs=18.2

Q ss_pred             hhHHhhhhhhhCCCCCCCCCCC
Q psy14093        349 DLVTAHEFGHNWGSEHDPDMPE  370 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~~~~  370 (857)
                      ..|+.|||||.||+.|....++
T Consensus        93 ~~~i~HElgHaLG~~HEh~rpd  114 (198)
T cd04327          93 SRVVLHEFGHALGFIHEHQSPA  114 (198)
T ss_pred             HHHHHHHHHHHhcCcccccCCC
Confidence            3589999999999999976543


No 62 
>KOG1565|consensus
Probab=86.47  E-value=0.4  Score=56.26  Aligned_cols=30  Identities=40%  Similarity=0.618  Sum_probs=25.3

Q ss_pred             hhhHHhhhhhhhCCCCCCCCCCCCCCCCCCCCceEEEeecc
Q psy14093        348 ADLVTAHEFGHNWGSEHDPDMPECSPSASQGGSYLMYTYSV  388 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~~~~~C~~~~~~~g~~IM~~~~~  388 (857)
                      +.+|+||||||.||+.|-.+           ..-||+|...
T Consensus       211 l~~Va~HEiGH~LGL~HS~~-----------~~aiM~P~y~  240 (469)
T KOG1565|consen  211 LFLVAAHEIGHALGLGHSSD-----------PDAIMYPFYQ  240 (469)
T ss_pred             hHHHhhhhcccccccCCCCC-----------cccccccccc
Confidence            57899999999999999854           3579999865


No 63 
>PF08516 ADAM_CR:  ADAM cysteine-rich;  InterPro: IPR006586 An ADAM is a transmembrane protein that contains a disintegrin and metalloprotease domain (MEROPS peptidase family M12B). All members of the ADAM family display a common domain organisation - a pro-domain, the metalloprotease, disintigrin, cysteine-rich, epidermal-growth factor like, and transmembrane domains and a C-terminal cytoplasmic tail. They possess four potential functions: proteolysis, cell adhesion, cell fusion, and cell signalling. ADAMs are membrane-anchored proteases that proteolytically modify cell surface and extracellular matrix (ECM) in order to alter cell behaviour. They are responsible for the proteolytic cleavage of transmembrane proteins and release of their extracellular domain [, ]. The ADAM cysteine-rich domain is not found in plant, archaeal, bacterial or viral proteins. The cysteine-rich domain complements the binding capacity of the disintegrin domain, and perhaps imparts specificity to disintegrin domain-mediated interactions. It has been shown that the cysteine-rich domain of ADAM13 regulates the protein's metalloprotease activity []. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2DW2_A 2DW0_B 2DW1_A 3G5C_B 2ERQ_B 2ERO_A 2ERP_B 3K7L_A 2E3X_A 3K7N_A.
Probab=86.06  E-value=0.34  Score=46.28  Aligned_cols=31  Identities=29%  Similarity=0.733  Sum_probs=22.8

Q ss_pred             CCCcCCCC-ceecCCccch---Hhhhc-------cccccccc
Q psy14093        522 DNTGCLER-GKCRGGKCIP---FCETQ-------NQQSCMCD  552 (857)
Q Consensus       522 dgt~C~~~-~~C~~G~C~~---~C~~~-------~~~~C~c~  552 (857)
                      ||+||.++ ++||+|+|.+   ||+.+       +.+.|+=.
T Consensus         1 nG~pC~~~~~yCy~G~C~t~~~QC~~lwG~~a~~A~~~Cy~~   42 (117)
T PF08516_consen    1 NGTPCSNGQGYCYNGRCPTRDQQCQELWGPGARVAPDSCYEE   42 (117)
T ss_dssp             TT-EETTTTEEEBTTBEE-HHHHHHHHH-TTEEE--GGGGGG
T ss_pred             CCcccCCCCeEEECCcCCCHhHHHHHHhCCCccccchhHHHH
Confidence            79999986 8999999996   89996       23667643


No 64 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=85.55  E-value=0.35  Score=49.76  Aligned_cols=18  Identities=28%  Similarity=0.366  Sum_probs=16.3

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      -|++|||||.||+.|.-.
T Consensus        79 G~i~HEl~HaLG~~HEhs   96 (182)
T cd04283          79 GIIQHELLHALGFYHEQT   96 (182)
T ss_pred             chHHHHHHHHhCCccccc
Confidence            489999999999999974


No 65 
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=85.02  E-value=0.42  Score=49.59  Aligned_cols=19  Identities=32%  Similarity=0.490  Sum_probs=13.1

Q ss_pred             hhhHHhhhhhhhCCCCCCC
Q psy14093        348 ADLVTAHEFGHNWGSEHDP  366 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~HD~  366 (857)
                      +.-++.||+||.||+.|=.
T Consensus       145 ~~Kea~HElGH~~GL~HC~  163 (194)
T PF07998_consen  145 VCKEAVHELGHLFGLDHCE  163 (194)
T ss_dssp             HHHHHHHHHHHHTT----S
T ss_pred             HHHHHHHHHHHHcCCcCCC
Confidence            4678999999999999964


No 66 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=84.61  E-value=0.4  Score=50.10  Aligned_cols=18  Identities=33%  Similarity=0.484  Sum_probs=16.2

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      -|++|||||.||+.|.-.
T Consensus        89 Gti~HEl~HaLGf~HEhs  106 (200)
T cd04281          89 GIVVHELGHVIGFWHEHT  106 (200)
T ss_pred             chHHHHHHHHhcCcchhc
Confidence            489999999999999964


No 67 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=84.19  E-value=0.41  Score=49.04  Aligned_cols=18  Identities=28%  Similarity=0.340  Sum_probs=16.4

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      .|+.|||||.||+.|...
T Consensus        76 g~v~HE~~HalG~~HEh~   93 (180)
T cd04280          76 GTIVHELMHALGFYHEQS   93 (180)
T ss_pred             chhHHHHHHHhcCcchhc
Confidence            589999999999999974


No 68 
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=82.68  E-value=0.45  Score=49.26  Aligned_cols=20  Identities=30%  Similarity=0.342  Sum_probs=15.4

Q ss_pred             hhHHhhhhhhhCCCCCCCCC
Q psy14093        349 DLVTAHEFGHNWGSEHDPDM  368 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~~  368 (857)
                      .-|++|||||.||+.|.-..
T Consensus        80 ~~~i~HEl~HaLG~~HEh~R   99 (191)
T PF01400_consen   80 VGTILHELGHALGFWHEHQR   99 (191)
T ss_dssp             HHHHHHHHHHHHTB--GGGS
T ss_pred             ccchHHHHHHHHhhhhhhhc
Confidence            45899999999999999753


No 69 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=81.38  E-value=0.55  Score=54.35  Aligned_cols=18  Identities=39%  Similarity=0.359  Sum_probs=15.0

Q ss_pred             cchhhHHhhhhhhhCCCC
Q psy14093        346 READLVTAHEFGHNWGSE  363 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~  363 (857)
                      .+.+-|+||||||..|=.
T Consensus       128 sElagViAHEigHv~qrH  145 (484)
T COG4783         128 SELAGVIAHEIGHVAQRH  145 (484)
T ss_pred             HHHHHHHHHHHHHHhhhh
Confidence            367999999999988743


No 70 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=80.66  E-value=0.53  Score=51.82  Aligned_cols=37  Identities=30%  Similarity=0.228  Sum_probs=23.4

Q ss_pred             hhHHhhhhhhhCCCCCCCCCC-CCCCCCCCCCceEEEee
Q psy14093        349 DLVTAHEFGHNWGSEHDPDMP-ECSPSASQGGSYLMYTY  386 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD~~~~-~C~~~~~~~g~~IM~~~  386 (857)
                      ..|+|||+||.||+++-=+.. .-.. ..-+.-.||+..
T Consensus       166 igv~~HE~gH~lGLPDlY~~~~~~~~-~~vG~w~lM~~G  203 (286)
T TIGR03296       166 VGVIAHELGHDLGLPDLYDTSYDGGG-EPVGYWSLMSSG  203 (286)
T ss_pred             eeeeehhhhcccCCCccccCcCCCCC-CCccccccccCC
Confidence            569999999999998764321 1110 012455699873


No 71 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=80.46  E-value=0.73  Score=49.20  Aligned_cols=18  Identities=33%  Similarity=0.484  Sum_probs=16.2

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      -|++|||||.||+.|.-.
T Consensus       122 Gti~HEl~HalGf~HEqs  139 (230)
T cd04282         122 ATVEHEFLHALGFYHEQS  139 (230)
T ss_pred             chHHHHHHHHhCCccccc
Confidence            489999999999999974


No 72 
>PF13948 DUF4215:  Domain of unknown function (DUF4215)
Probab=75.37  E-value=1.3  Score=35.39  Aligned_cols=21  Identities=43%  Similarity=0.781  Sum_probs=18.4

Q ss_pred             ccccCCccccccccccCCCCC
Q psy14093        708 ESFCGNLRVEGDEECDAGLLG  728 (857)
Q Consensus       708 ~~~CGNgiVE~gEeCDCG~~~  728 (857)
                      .++||++||...||||=|-..
T Consensus        18 ~~~CGDgii~~~E~CDD~N~~   38 (47)
T PF13948_consen   18 EPICGDGIIVGDEQCDDGNSI   38 (47)
T ss_pred             ccccCCCeEECCcccCCCCcc
Confidence            588999999999999998543


No 73 
>KOG3714|consensus
Probab=72.64  E-value=1.3  Score=51.33  Aligned_cols=18  Identities=28%  Similarity=0.294  Sum_probs=16.1

Q ss_pred             hHHhhhhhhhCCCCCCCC
Q psy14093        350 LVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       350 ~t~AHElGHnlG~~HD~~  367 (857)
                      -|++|||+|.||..|...
T Consensus       161 G~i~HEl~HaLGf~Hehs  178 (411)
T KOG3714|consen  161 GTIVHELMHALGFWHEHS  178 (411)
T ss_pred             chhHHHHHHHhhhhhccC
Confidence            489999999999999864


No 74 
>PF12044 Metallopep:  Putative peptidase family;  InterPro: IPR021917  This family of proteins is functionally uncharacterised. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length. 
Probab=70.55  E-value=2.7  Score=48.72  Aligned_cols=17  Identities=41%  Similarity=0.741  Sum_probs=15.3

Q ss_pred             HHhhhhhhhCCCCCCCC
Q psy14093        351 VTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       351 t~AHElGHnlG~~HD~~  367 (857)
                      -+-|||||.||++|-.+
T Consensus       319 A~lHEiGH~fg~pH~~~  335 (423)
T PF12044_consen  319 AFLHEIGHLFGCPHQED  335 (423)
T ss_pred             HHHHHHHHhcCCCCCCC
Confidence            47899999999999985


No 75 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=69.96  E-value=1.7  Score=41.31  Aligned_cols=20  Identities=35%  Similarity=0.639  Sum_probs=17.3

Q ss_pred             chhhHHhhhhhhhCCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHDP  366 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~  366 (857)
                      ....|+-|||||-||+.-|.
T Consensus       108 ~vthvliHEIgHhFGLsDdd  127 (136)
T COG3824         108 QVTHVLIHEIGHHFGLSDDD  127 (136)
T ss_pred             HhhhhhhhhhhhhcCCChhH
Confidence            45789999999999998775


No 76 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=68.19  E-value=1.9  Score=39.74  Aligned_cols=14  Identities=36%  Similarity=0.565  Sum_probs=11.7

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      .-.+++||||||.+
T Consensus        41 ~~~f~laHELgH~~   54 (122)
T PF06114_consen   41 RQRFTLAHELGHIL   54 (122)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHH
Confidence            34789999999975


No 77 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=66.23  E-value=3  Score=46.77  Aligned_cols=26  Identities=15%  Similarity=0.307  Sum_probs=21.4

Q ss_pred             cccccchhhHHhhhhhhhCCCCCCCC
Q psy14093        342 RVITREADLVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       342 ~v~~~~~~~t~AHElGHnlG~~HD~~  367 (857)
                      .+|......|+||||+|.+|.-..++
T Consensus       190 ~~p~~~~P~T~~HElAHq~G~a~E~E  215 (318)
T PF12725_consen  190 DLPPYSLPFTICHELAHQLGFASEDE  215 (318)
T ss_pred             CCCcccccHHHHHHHHHHhCCCCHHH
Confidence            34555678999999999999998875


No 78 
>PF13948 DUF4215:  Domain of unknown function (DUF4215)
Probab=65.38  E-value=3  Score=33.31  Aligned_cols=20  Identities=20%  Similarity=0.129  Sum_probs=17.8

Q ss_pred             CcccCccccCCCCCCccCCCcc
Q psy14093        423 SSILGLAYIASPRPYSIGGGIL  444 (857)
Q Consensus       423 ~~~CGN~ive~g~~EeCDcG~~  444 (857)
                      .++||++||...  ||||-|-.
T Consensus        18 ~~~CGDgii~~~--E~CDD~N~   37 (47)
T PF13948_consen   18 EPICGDGIIVGD--EQCDDGNS   37 (47)
T ss_pred             ccccCCCeEECC--cccCCCCc
Confidence            579999999999  99999854


No 79 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=60.37  E-value=3.5  Score=43.52  Aligned_cols=13  Identities=38%  Similarity=0.657  Sum_probs=11.4

Q ss_pred             hhhHHhhhhhhhC
Q psy14093        348 ADLVTAHEFGHNW  360 (857)
Q Consensus       348 ~~~t~AHElGHnl  360 (857)
                      --+|+||||||.|
T Consensus        72 ~rFtlAHELGH~l   84 (213)
T COG2856          72 KRFTLAHELGHAL   84 (213)
T ss_pred             HHHHHHHHHhHHH
Confidence            3689999999987


No 80 
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=58.53  E-value=4.9  Score=43.88  Aligned_cols=20  Identities=40%  Similarity=0.694  Sum_probs=13.4

Q ss_pred             chhhHHhhhhhhhCCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHDP  366 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~  366 (857)
                      .+..|+.||+||.||--=|.
T Consensus       215 ~~~~v~vHE~GHsf~~LaDE  234 (264)
T PF09471_consen  215 SFKQVVVHEFGHSFGGLADE  234 (264)
T ss_dssp             THHHHHHHHHHHHTT-----
T ss_pred             cccceeeeeccccccccccc
Confidence            35789999999999998886


No 81 
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=57.82  E-value=4.5  Score=47.88  Aligned_cols=27  Identities=44%  Similarity=0.806  Sum_probs=19.5

Q ss_pred             cccccCCccccccchhhHHhhhhhhhC---CCCCCCC
Q psy14093        334 SSRNHYGQRVITREADLVTAHEFGHNW---GSEHDPD  367 (857)
Q Consensus       334 ~~~n~~G~~v~~~~~~~t~AHElGHnl---G~~HD~~  367 (857)
                      +..| ||      -...||+|||||-|   |+..|++
T Consensus       480 ~a~N-YG------gIGaVIgHEI~HgFDdqGakfD~~  509 (654)
T COG3590         480 SAAN-YG------GIGAVIGHEIGHGFDDQGAKFDGD  509 (654)
T ss_pred             hhhc-cc------CccceehhhhcccccCCccccCCC
Confidence            3446 77      45789999999987   5556654


No 82 
>KOG4525|consensus
Probab=56.26  E-value=7.3  Score=44.60  Aligned_cols=17  Identities=35%  Similarity=0.755  Sum_probs=15.3

Q ss_pred             HHhhhhhhhCCCCCCCC
Q psy14093        351 VTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       351 t~AHElGHnlG~~HD~~  367 (857)
                      .|-|||||.||.+|.++
T Consensus       304 A~~HElGH~lgcpHq~~  320 (614)
T KOG4525|consen  304 AVCHELGHCLGCPHQSE  320 (614)
T ss_pred             HHHHHhhhccCCCCCCC
Confidence            46799999999999976


No 83 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=55.72  E-value=3.9  Score=39.99  Aligned_cols=29  Identities=17%  Similarity=0.261  Sum_probs=21.9

Q ss_pred             cccchhhheeeeeeecccccccCCCCCCcccCC
Q psy14093        649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEVT  681 (857)
Q Consensus       649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~~  681 (857)
                      .+..++.|||||.|   |+.|. +.-..+|...
T Consensus       104 ~~~~v~~HEiGHaL---GL~H~-~~~~svM~~~  132 (154)
T PF00413_consen  104 DLQSVAIHEIGHAL---GLDHS-NDPNSVMYPY  132 (154)
T ss_dssp             EHHHHHHHHHHHHT---TBESS-SSTTSTTSSS
T ss_pred             hhhhhhhhcccccc---CcCcC-CCcccceeee
Confidence            44568899999999   99876 3457788743


No 84 
>PF12115 Salp15:  Salivary protein of 15kDa inhibits CD4+ T cell activation;  InterPro: IPR021971  This is a family of 15kDa salivary proteins from Acari Arachnids that is induced on feeding and assists the parasite to remain attached to its arthropod host. By repressing calcium fluxes triggered by TCR engagement, Salp15 inhibits CD4+ T cell activation. Salp15 shows weak similarity to Inhibin A, a member of the TGF-beta superfamily that inhibits the production of cytokines and the proliferation of T cells. 
Probab=55.26  E-value=7.5  Score=37.20  Aligned_cols=21  Identities=33%  Similarity=0.835  Sum_probs=18.6

Q ss_pred             CCCCCcCC-CCceecCCccchH
Q psy14093        520 MSDNTGCL-ERGKCRGGKCIPF  540 (857)
Q Consensus       520 ~~dgt~C~-~~~~C~~G~C~~~  540 (857)
                      ++|||||+ .+.+|.+|.|+..
T Consensus        93 LpdgtPCg~~~~~C~~G~Cv~~  114 (118)
T PF12115_consen   93 LPDGTPCGPNGQTCKNGKCVGH  114 (118)
T ss_pred             CCCCCccCCCcCEECCCccCCC
Confidence            89999999 5789999999863


No 85 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=51.80  E-value=7.3  Score=38.57  Aligned_cols=19  Identities=11%  Similarity=0.373  Sum_probs=15.2

Q ss_pred             ccchhhheeeeeeecccccccC
Q psy14093        650 FDTLHSSHISHTIVKRGVQESN  671 (857)
Q Consensus       650 ~~~~~aheigh~l~~~g~~~d~  671 (857)
                      +-.+++|||||.|   |+.|-.
T Consensus        94 ~~~~~~HEiGHaL---GL~H~~  112 (165)
T cd04268          94 LRNTAEHELGHAL---GLRHNF  112 (165)
T ss_pred             HHHHHHHHHHHHh---cccccC
Confidence            4467899999999   987653


No 86 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=50.33  E-value=7.2  Score=40.51  Aligned_cols=18  Identities=39%  Similarity=0.405  Sum_probs=14.8

Q ss_pred             cchhhHHhhhhhhhCCCC
Q psy14093        346 READLVTAHEFGHNWGSE  363 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~  363 (857)
                      .+.+.|+||||||...=.
T Consensus        87 ~el~aVlaHElgH~~~~h  104 (226)
T PF01435_consen   87 DELAAVLAHELGHIKHRH  104 (226)
T ss_dssp             HHHHHHHHHHHHHHHTTH
T ss_pred             HHHHHHHHHHHHHHHcCC
Confidence            478999999999997433


No 87 
>PRK01345 heat shock protein HtpX; Provisional
Probab=50.25  E-value=6.2  Score=44.22  Aligned_cols=15  Identities=40%  Similarity=0.379  Sum_probs=13.4

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+.+-|+||||||.-
T Consensus       122 dEL~aVlAHElgHi~  136 (317)
T PRK01345        122 EEVAGVMAHELAHVK  136 (317)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            478999999999987


No 88 
>PF05547 Peptidase_M6:  Immune inhibitor A peptidase M6;  InterPro: IPR008757 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M6 (immune inhibitor A family, clan MA(M)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.  InhA of Bacillus thuringiensis (an entomopathogenic bacterium) specifically cleaves antibacterial peptides produced by insect hosts []. B. thuringiensis is highly resistant to the insect immune system due to its production of two factors, inhibitor A (InhA or InA) and inhibitor B (InhB or InB), which selectively block the humoral defence system developed by insects against Escherichia coli and Bacillus cereus []. B. thuringiensis is especially resistant to cecropins and attacins, which are the main classes of inducible antibacterial peptides in various lepidopterans and dipterans [], []. InhA has been shown to specifically hydrolyze cecropins and attacins in the immune hemolymph of Hyalophora cecropia (Cecropia moth) in vitro []. However, it has been suggested that the role of InhA in resistance to the humoral defence system is not consistent with the time course of InhA production []. B. thuringiensis has two proteins belonging to this group (InhA and InhA2), and it has been shown that InhA2 has a vital role in virulence when the host is infected via the oral route []. The B. cereus member has been found as an exosporium component from endospores []. B. thuringiensis InhA is induced at the onset of sporulation and is regulated by Spo0A and AbrB []. Vibrio cholerae PrtV is thought to be encoded in the pathogenicity island []. However, PrtV mutants did not exhibit a reduced virulence phenotype, and thus PrtV is not an indispensable virulence factor []. Annotation note: due to the presence of PKD repeats in some of the members of this group (e.g., V. cholerae VCA0223), spurious similarity hits may appear (involving unrelated proteins), which may lead to the erroneous transfer of functional annotations and protein names. Also, please note that related Bacillus subtilis Bacillopeptidase F (Bpr or Bpf) contains two different protease domains: N-terminal IPR000209 from INTERPRO (peptidase S8, subtilase, a subtilisin-like serine protease) and this C-terminal domain (peptidase M6), which may also complicate annotation.; GO: 0008233 peptidase activity, 0006508 proteolysis
Probab=49.97  E-value=4.4  Score=49.41  Aligned_cols=17  Identities=47%  Similarity=0.520  Sum_probs=14.8

Q ss_pred             hhHHhhhhhhhCCCCCC
Q psy14093        349 DLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       349 ~~t~AHElGHnlG~~HD  365 (857)
                      .-|+|||+||.||++--
T Consensus       222 iGVfaHEfGH~LGLPDl  238 (645)
T PF05547_consen  222 IGVFAHEFGHDLGLPDL  238 (645)
T ss_pred             eEEEEeeccccCCCCCc
Confidence            45999999999999854


No 89 
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=49.08  E-value=8.8  Score=35.57  Aligned_cols=20  Identities=20%  Similarity=0.356  Sum_probs=14.9

Q ss_pred             cchhhHHhhhhhhhCCCCCC
Q psy14093        346 READLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~HD  365 (857)
                      .....|+-||+||.||+.-|
T Consensus        71 ~~I~~tlvhEiah~fG~~~e   90 (97)
T PF06262_consen   71 ELIRDTLVHEIAHHFGISDE   90 (97)
T ss_dssp             HHHHHHHHHHHHHHTT--HH
T ss_pred             HHHHHHHHHHHHHHcCCCHH
Confidence            35678999999999998754


No 90 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=48.90  E-value=6.7  Score=42.83  Aligned_cols=15  Identities=47%  Similarity=0.579  Sum_probs=13.1

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+...|+|||+||..
T Consensus       155 dEl~aVlaHElgHi~  169 (302)
T COG0501         155 DELEAVLAHELGHIK  169 (302)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            578999999999974


No 91 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=47.33  E-value=14  Score=44.62  Aligned_cols=14  Identities=36%  Similarity=0.406  Sum_probs=12.1

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      -.+.|+|||+||.+
T Consensus       336 ~dv~TL~HElGHa~  349 (549)
T TIGR02289       336 GDIDVLTHEAGHAF  349 (549)
T ss_pred             hHHHHHHHHhhHHH
Confidence            35789999999987


No 92 
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=45.00  E-value=16  Score=44.41  Aligned_cols=14  Identities=36%  Similarity=0.627  Sum_probs=12.0

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      -.+.|+|||+||.+
T Consensus       377 ~dv~TLaHElGHa~  390 (591)
T TIGR00181       377 NSVFTLAHELGHSM  390 (591)
T ss_pred             chHHHHHHHhhhHH
Confidence            35889999999985


No 93 
>PRK03001 M48 family peptidase; Provisional
Probab=43.72  E-value=10  Score=41.65  Aligned_cols=15  Identities=47%  Similarity=0.454  Sum_probs=13.2

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+.+.|+||||||.-
T Consensus       122 ~El~aVlAHElgHi~  136 (283)
T PRK03001        122 REIRGVMAHELAHVK  136 (283)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            578999999999975


No 94 
>PRK02870 heat shock protein HtpX; Provisional
Probab=43.35  E-value=9.9  Score=42.97  Aligned_cols=18  Identities=39%  Similarity=0.477  Sum_probs=14.5

Q ss_pred             cchhhHHhhhhhhhCCCCCC
Q psy14093        346 READLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~HD  365 (857)
                      .+.+-|+||||||.  ..+|
T Consensus       171 dEL~aVlAHELgHi--k~~d  188 (336)
T PRK02870        171 DELQAVMAHELSHI--RHGD  188 (336)
T ss_pred             HHHHHHHHHHHHHH--Hccc
Confidence            57899999999998  4444


No 95 
>PRK03982 heat shock protein HtpX; Provisional
Probab=43.22  E-value=11  Score=41.67  Aligned_cols=15  Identities=40%  Similarity=0.353  Sum_probs=13.1

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+.+.|+||||||.-
T Consensus       123 ~El~AVlAHElgHi~  137 (288)
T PRK03982        123 DELEGVIAHELTHIK  137 (288)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            478999999999975


No 96 
>PRK05457 heat shock protein HtpX; Provisional
Probab=42.92  E-value=10  Score=41.86  Aligned_cols=15  Identities=40%  Similarity=0.446  Sum_probs=12.9

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+...|+||||||.-
T Consensus       132 ~El~aVlAHElgHi~  146 (284)
T PRK05457        132 DEVEAVLAHEISHIA  146 (284)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            578999999999963


No 97 
>PRK01265 heat shock protein HtpX; Provisional
Probab=41.96  E-value=9.8  Score=42.80  Aligned_cols=14  Identities=43%  Similarity=0.655  Sum_probs=12.5

Q ss_pred             cchhhHHhhhhhhh
Q psy14093        346 READLVTAHEFGHN  359 (857)
Q Consensus       346 ~~~~~t~AHElGHn  359 (857)
                      .+...|+||||||.
T Consensus       138 ~El~aVlAHElgHi  151 (324)
T PRK01265        138 DEIKAVAGHELGHL  151 (324)
T ss_pred             HHHHHHHHHHHHHH
Confidence            57889999999995


No 98 
>PRK03072 heat shock protein HtpX; Provisional
Probab=41.49  E-value=12  Score=41.43  Aligned_cols=16  Identities=38%  Similarity=0.528  Sum_probs=13.5

Q ss_pred             cchhhHHhhhhhhhCC
Q psy14093        346 READLVTAHEFGHNWG  361 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG  361 (857)
                      .+...|+||||||.-.
T Consensus       125 ~El~aVlAHElgHi~~  140 (288)
T PRK03072        125 RELRGVLGHELSHVYN  140 (288)
T ss_pred             HHHHHHHHHHHHHHhc
Confidence            5789999999999753


No 99 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=39.91  E-value=10  Score=37.59  Aligned_cols=29  Identities=7%  Similarity=0.140  Sum_probs=20.4

Q ss_pred             cccchhhheeeeeeecccccccCCCCCCcccC
Q psy14093        649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEV  680 (857)
Q Consensus       649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~  680 (857)
                      .+..++.|||||.|   |+.|....-..+|..
T Consensus       103 ~~~~~~~HEiGHaL---GL~H~~~~~~siM~p  131 (156)
T cd04279         103 NLQAIALHELGHAL---GLWHHSDRPEDAMYP  131 (156)
T ss_pred             HHHHHHHHHhhhhh---cCCCCCCCccceeee
Confidence            34568899999999   987763322377763


No 100
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=39.25  E-value=11  Score=38.42  Aligned_cols=23  Identities=9%  Similarity=0.109  Sum_probs=19.0

Q ss_pred             hhhheeeeeeecccccccCCCCCCccc
Q psy14093        653 LHSSHISHTIVKRGVQESNHPFNKIKE  679 (857)
Q Consensus       653 ~~aheigh~l~~~g~~~d~~~~gcim~  679 (857)
                      -+.||+||.+   |..|- .|-.|+|+
T Consensus       127 Ev~HElGH~~---GL~HC-~N~~CVM~  149 (181)
T COG1913         127 EVLHELGHLL---GLSHC-PNPRCVMN  149 (181)
T ss_pred             HHHHHhhhhc---CcccC-CCCCcEEe
Confidence            4589999999   98887 35689997


No 101
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=39.20  E-value=17  Score=37.45  Aligned_cols=24  Identities=25%  Similarity=0.278  Sum_probs=20.3

Q ss_pred             cccchhhHHhhhhhhhCCCCCCCC
Q psy14093        344 ITREADLVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       344 ~~~~~~~t~AHElGHnlG~~HD~~  367 (857)
                      |......|++|||.|..-++|...
T Consensus       160 P~~~idYVvvHEL~Hl~~~nHs~~  183 (205)
T PF01863_consen  160 PPEVIDYVVVHELCHLRHPNHSKR  183 (205)
T ss_pred             CccHHHHHHHHHHHHhccCCCCHH
Confidence            445678999999999999999863


No 102
>PRK04897 heat shock protein HtpX; Provisional
Probab=38.99  E-value=12  Score=41.43  Aligned_cols=15  Identities=33%  Similarity=0.379  Sum_probs=13.2

Q ss_pred             cchhhHHhhhhhhhC
Q psy14093        346 READLVTAHEFGHNW  360 (857)
Q Consensus       346 ~~~~~t~AHElGHnl  360 (857)
                      .+...|+||||||.-
T Consensus       135 ~El~aVlAHElgHi~  149 (298)
T PRK04897        135 EELEGVIGHEISHIR  149 (298)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            578899999999975


No 103
>PF11350 DUF3152:  Protein of unknown function (DUF3152);  InterPro: IPR022603 This entry represents Actinobacteria proteins of unknown function. Some are annotated as membrane proteins, however this cannot be confirmed.
Probab=38.17  E-value=16  Score=38.17  Aligned_cols=20  Identities=40%  Similarity=0.722  Sum_probs=17.2

Q ss_pred             chhhHHhhhhhhhCCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHDP  366 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~  366 (857)
                      .-..+|=||+||-||-.|.+
T Consensus       138 YRqYvINHEVGH~LGh~H~~  157 (203)
T PF11350_consen  138 YRQYVINHEVGHALGHGHEP  157 (203)
T ss_pred             HHHHhhhhhhhhhcccCCCc
Confidence            45789999999999988875


No 104
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=38.11  E-value=13  Score=36.78  Aligned_cols=30  Identities=17%  Similarity=0.237  Sum_probs=21.7

Q ss_pred             cccchhhheeeeeeecccccccCCCCCCcccCCC
Q psy14093        649 YFDTLHSSHISHTIVKRGVQESNHPFNKIKEVTF  682 (857)
Q Consensus       649 ~~~~~~aheigh~l~~~g~~~d~~~~gcim~~~f  682 (857)
                      .+-.++.|||||.|   |+.|- +..+.||...+
T Consensus       106 ~~~~~~~HEiGHaL---GL~H~-~~~~~vM~~~~  135 (157)
T cd04278         106 DLFSVAAHEIGHAL---GLGHS-SDPDSIMYPYY  135 (157)
T ss_pred             hHHHHHHHHhcccc---ccCCC-CCCcCeecccc
Confidence            34457899999999   98764 34578886433


No 105
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=38.11  E-value=10  Score=38.08  Aligned_cols=18  Identities=17%  Similarity=0.429  Sum_probs=12.7

Q ss_pred             ccchhhheeeeeeeccccccc
Q psy14093        650 FDTLHSSHISHTIVKRGVQES  670 (857)
Q Consensus       650 ~~~~~aheigh~l~~~g~~~d  670 (857)
                      .+.+++||+||-|   |+.|.
T Consensus        69 ~g~TltHEvGH~L---GL~Ht   86 (154)
T PF05572_consen   69 FGKTLTHEVGHWL---GLYHT   86 (154)
T ss_dssp             SSHHHHHHHHHHT---T---T
T ss_pred             cccchhhhhhhhh---ccccc
Confidence            4568999999999   87775


No 106
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=37.75  E-value=23  Score=37.76  Aligned_cols=45  Identities=13%  Similarity=0.276  Sum_probs=27.9

Q ss_pred             CCcceeeeecCC-CC--Ccccceec-c--------ccccccchhhheeeeeeeccccccc
Q psy14093        623 SSILGLAYIASP-RP--YSIGGSIH-E--------NLKYFDTLHSSHISHTIVKRGVQES  670 (857)
Q Consensus       623 ~~tvGlA~vgs~-~~--~~s~Gv~~-~--------~~~~~~~~~aheigh~l~~~g~~~d  670 (857)
                      .+.+|.|+.=.. ..  ....||+- .        ....+..+++||+||-|   |+.|.
T Consensus        98 ~~~lG~a~fP~~~~~~~~~~dGvvi~~~~~~~~~~~~~n~g~t~~HEvGH~l---GL~Ht  154 (225)
T cd04275          98 GGLLGYATFPDSLVSLAFITDGVVINPSSLPGGSAAPYNLGDTATHEVGHWL---GLYHT  154 (225)
T ss_pred             CCcCEEEECCCcccCCccccceEEEeccccCCCCcccccccceeEEecccee---eeeee
Confidence            577898875443 21  23345541 1        12234568999999999   88776


No 107
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=37.37  E-value=13  Score=38.42  Aligned_cols=44  Identities=25%  Similarity=0.326  Sum_probs=27.0

Q ss_pred             chhhHHhhhhhhhC---CCCCCCCCCCCCCCCCCCCceEEEeeccCCCCCCCCCCCCCCCCCCcccHHHHhc--cccccC
Q psy14093        347 EADLVTAHEFGHNW---GSEHDPDMPECSPSASQGGSYLMYTYSVSGYDVNNKTFSSHVDGTNFCLAHLFTH--QSFWSR  421 (857)
Q Consensus       347 ~~~~t~AHElGHnl---G~~HD~~~~~C~~~~~~~g~~IM~~~~~~g~~~n~~~FS~~~~~~~~CS~~~i~~--~Cl~~~  421 (857)
                      .++.+|||||+|.|   |...|.+                      |  .....||+       -+...+.+  +||.+.
T Consensus        35 ~lG~ilahel~hafd~~g~~~D~~----------------------g--~~~~wWs~-------~~~~~~~~~~~Cl~~q   83 (206)
T PF01431_consen   35 GLGFILAHELMHAFDPEGINYDED----------------------G--NLRNWWSE-------ESRSNFKERAQCLRDQ   83 (206)
T ss_dssp             THHHHHHHHHHHCTSTTGGGB-TT----------------------S---BS--S-H-------HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHhHhhcCcc----------------------c--ccccccch-------hhHHHHHHHHHHHHHH
Confidence            56899999999987   3343432                      1  12356888       77777776  688753


No 108
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=36.80  E-value=20  Score=38.08  Aligned_cols=14  Identities=50%  Similarity=0.593  Sum_probs=11.7

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      .+.-|.|||.||.+
T Consensus        88 aAvaVAAHEvGHAi  101 (222)
T PF04298_consen   88 AAVAVAAHEVGHAI  101 (222)
T ss_pred             HHHHHHHHHHhHHH
Confidence            46779999999975


No 109
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=36.75  E-value=25  Score=42.65  Aligned_cols=14  Identities=36%  Similarity=0.577  Sum_probs=12.0

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|+|||+||.+
T Consensus       374 ~~v~TL~HE~GHa~  387 (587)
T TIGR02290       374 RDVSTLAHELGHAY  387 (587)
T ss_pred             hhHHHHHHHhhHHH
Confidence            35789999999987


No 110
>PRK02391 heat shock protein HtpX; Provisional
Probab=34.89  E-value=17  Score=40.33  Aligned_cols=14  Identities=43%  Similarity=0.570  Sum_probs=12.4

Q ss_pred             cchhhHHhhhhhhh
Q psy14093        346 READLVTAHEFGHN  359 (857)
Q Consensus       346 ~~~~~t~AHElGHn  359 (857)
                      .+...|+||||||.
T Consensus       131 ~El~aVlaHElgHi  144 (296)
T PRK02391        131 DELEAVLAHELSHV  144 (296)
T ss_pred             HHHHHHHHHHHHHH
Confidence            47889999999995


No 111
>PTZ00337 surface protease GP63; Provisional
Probab=32.68  E-value=44  Score=40.52  Aligned_cols=19  Identities=42%  Similarity=0.538  Sum_probs=15.7

Q ss_pred             chhhHHhhhhhhhCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD  365 (857)
                      ....|++|||.|.||...+
T Consensus       228 ~~~~v~~HEi~HALGFs~~  246 (567)
T PTZ00337        228 GDVRVAAHELGHALGFVRD  246 (567)
T ss_pred             HHHHHHHHHHHHHHccCHH
Confidence            3467999999999999654


No 112
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=31.22  E-value=21  Score=39.54  Aligned_cols=13  Identities=46%  Similarity=0.833  Sum_probs=10.6

Q ss_pred             chhhHHhhhhhhh
Q psy14093        347 EADLVTAHEFGHN  359 (857)
Q Consensus       347 ~~~~t~AHElGHn  359 (857)
                      ..+.|||||.||-
T Consensus       169 a~ayVlAHEyGHH  181 (292)
T PF04228_consen  169 AQAYVLAHEYGHH  181 (292)
T ss_pred             HHHHHHHHHHHHH
Confidence            4577999999983


No 113
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=30.25  E-value=19  Score=43.84  Aligned_cols=16  Identities=38%  Similarity=0.493  Sum_probs=13.0

Q ss_pred             chhhHHhhhhhhhCCC
Q psy14093        347 EADLVTAHEFGHNWGS  362 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~  362 (857)
                      -.+.|+|||+||.+=+
T Consensus       379 ~dV~TLaHElGHs~Hs  394 (598)
T COG1164         379 RDVFTLAHELGHSVHS  394 (598)
T ss_pred             hHHHHHHHHccHHHHH
Confidence            4689999999998643


No 114
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=29.92  E-value=23  Score=37.21  Aligned_cols=23  Identities=22%  Similarity=0.273  Sum_probs=16.9

Q ss_pred             ccccchhhHHhhhhhhhCCCCCC
Q psy14093        343 VITREADLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       343 v~~~~~~~t~AHElGHnlG~~HD  365 (857)
                      +|-..+..++|||++|.|=.-..
T Consensus        88 LPrll~gsiLAHE~mHa~Lrl~g  110 (212)
T PF12315_consen   88 LPRLLTGSILAHELMHAWLRLNG  110 (212)
T ss_pred             CCHHHHhhHHHHHHHHHHhcccC
Confidence            34445688999999999855544


No 115
>KOG2719|consensus
Probab=29.45  E-value=30  Score=40.16  Aligned_cols=14  Identities=50%  Similarity=0.567  Sum_probs=11.8

Q ss_pred             cchhhHHhhhhhhh
Q psy14093        346 READLVTAHEFGHN  359 (857)
Q Consensus       346 ~~~~~t~AHElGHn  359 (857)
                      .+.+.|+|||+||-
T Consensus       278 eel~AVl~HELGHW  291 (428)
T KOG2719|consen  278 EELVAVLAHELGHW  291 (428)
T ss_pred             HHHHHHHHHHhhHH
Confidence            36788999999994


No 116
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=29.29  E-value=15  Score=35.66  Aligned_cols=24  Identities=13%  Similarity=0.461  Sum_probs=13.5

Q ss_pred             hhhheeeeeeecccccccC-CCCCCccc
Q psy14093        653 LHSSHISHTIVKRGVQESN-HPFNKIKE  679 (857)
Q Consensus       653 ~~aheigh~l~~~g~~~d~-~~~gcim~  679 (857)
                      +.||||||+|   |+.+.- ..++-+|+
T Consensus        80 IaaHE~GHiL---GLPD~y~GpCS~LMS  104 (132)
T PF02031_consen   80 IAAHELGHIL---GLPDHYPGPCSELMS  104 (132)
T ss_dssp             HHHHHHHHHH---T----TTS-TT-GGG
T ss_pred             eeeehhcccc---CCCCCCCCccHHhhc
Confidence            7899999999   876542 23455664


No 117
>cd06456 M3A_DCP_Oligopeptidase_A Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called  peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala.
Probab=28.97  E-value=84  Score=36.70  Aligned_cols=14  Identities=36%  Similarity=0.551  Sum_probs=11.9

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|++||+||.+
T Consensus       207 ~~v~tLfHEfGHal  220 (422)
T cd06456         207 DEVTTLFHEFGHAL  220 (422)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999976


No 118
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=27.99  E-value=25  Score=32.36  Aligned_cols=16  Identities=44%  Similarity=0.727  Sum_probs=12.5

Q ss_pred             hhhHHhhhhhhhCCCC
Q psy14093        348 ADLVTAHEFGHNWGSE  363 (857)
Q Consensus       348 ~~~t~AHElGHnlG~~  363 (857)
                      ...|++||++|.+=..
T Consensus        25 ~~~~l~HE~~H~~~~~   40 (128)
T PF13485_consen   25 LDRVLAHELAHQWFGN   40 (128)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4589999999997333


No 119
>PF07998 Peptidase_M54:  Peptidase family M54;  InterPro: IPR012962 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry represents zinc-dependent peptidases belonging to the MEROPS peptidase family M54, more commonly known as the archaemetzincins. The family has a wide taxonomic distribution, being found in archaea, bacteria and eukaryotes. Two human homologues have been characterised []. ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 3LMC_A 2XHQ_A 2X7M_A.
Probab=27.60  E-value=18  Score=37.71  Aligned_cols=89  Identities=18%  Similarity=0.258  Sum_probs=42.1

Q ss_pred             ccCcccccccccccccccccC---CCCCCc----c-eEEEEEeeeecccC-CCcceeeeecCCCCCcccceecc--cc--
Q psy14093        581 HYNMIRDKWDVRNLLETFSSH---VDGTNF----C-LAHLFTHQSFWSRG-SSILGLAYIASPRPYSIGGSIHE--NL--  647 (857)
Q Consensus       581 ~~n~c~r~W~~~~lL~~~~~~---~~hD~~----c-~AhLft~~df~~~~-~~tvGlA~vgs~~~~~s~Gv~~~--~~--  647 (857)
                      .++.-...+....++.++...   ...|.+    . .+..+|..|.+.+. .-+.|+|..+     ...+|++.  ..  
T Consensus        60 a~~~~r~Qy~a~~ll~~L~~~~~~~~~d~~~k~~gi~~l~it~~DlY~~~~nfVFG~A~~~-----~~~aVvS~~~~~fy  134 (194)
T PF07998_consen   60 AYNWYRKQYDAEALLDRLKSLREEYPDDKVTKIFGITVLGITDRDLYSPGLNFVFGLARPG-----GGVAVVSTSRNEFY  134 (194)
T ss_dssp             GEETTTTEEEHHHHHHHHHT--S----SEE------SEEEEESS-EEETTESEBSEEEECC-----SSEEEEEGGCGGGG
T ss_pred             cCCcccceeeHHHHHHHHHHHHHhhhhhhhhhccccceEEEeccccCCCCCceEEEEeecC-----CCeEEEEEeccccc
Confidence            344445566666767665432   122311    0 12556777753211 2345555542     23334321  10  


Q ss_pred             --------ccccchhhheeeeeeecccccccCCCCCCccc
Q psy14093        648 --------KYFDTLHSSHISHTIVKRGVQESNHPFNKIKE  679 (857)
Q Consensus       648 --------~~~~~~~aheigh~l~~~g~~~d~~~~gcim~  679 (857)
                              .-+...+.||+||.+   |+.|-..  .|+|+
T Consensus       135 ~~~~~l~~~R~~Kea~HElGH~~---GL~HC~~--~CvM~  169 (194)
T PF07998_consen  135 GEDEELFLERVCKEAVHELGHLF---GLDHCEN--RCVMN  169 (194)
T ss_dssp             TSSHHHHHHHHHHHHHHHHHHHT---T----SS--TSTTS
T ss_pred             cccHHHHHHHHHHHHHHHHHHHc---CCcCCCC--CCccC
Confidence                    112346789999999   9999854  99997


No 120
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=27.44  E-value=16  Score=35.26  Aligned_cols=16  Identities=13%  Similarity=0.441  Sum_probs=13.8

Q ss_pred             chhhheeeeeeeccccccc
Q psy14093        652 TLHSSHISHTIVKRGVQES  670 (857)
Q Consensus       652 ~~~aheigh~l~~~g~~~d  670 (857)
                      .+++|||||.|   |+.|.
T Consensus        88 ~~~~HEigHaL---Gl~H~  103 (140)
T smart00235       88 GVAAHELGHAL---GLYHE  103 (140)
T ss_pred             ccHHHHHHHHh---cCCcC
Confidence            47899999999   98765


No 121
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=27.04  E-value=30  Score=35.98  Aligned_cols=26  Identities=42%  Similarity=0.629  Sum_probs=18.2

Q ss_pred             ccCCccccccchhhHHhhhhhhhCCCCCC
Q psy14093        337 NHYGQRVITREADLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       337 n~~G~~v~~~~~~~t~AHElGHnlG~~HD  365 (857)
                      |++|..|   ....|.|||+||.+==.||
T Consensus        84 ~y~g~Si---a~~aVAAHEVGHAiQd~~~  109 (226)
T COG2738          84 NYYGPSI---AAIAVAAHEVGHAIQDQED  109 (226)
T ss_pred             ccCCccH---HHHHHHHHHhhHHHhhhcc
Confidence            4455444   3567999999998865555


No 122
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=26.96  E-value=26  Score=40.35  Aligned_cols=14  Identities=36%  Similarity=0.717  Sum_probs=11.5

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ....|++||+||.+
T Consensus       221 ~~v~tl~HE~GHa~  234 (427)
T cd06459         221 DDVFTLAHELGHAF  234 (427)
T ss_pred             hhHHHHHHHhhHHH
Confidence            35789999999964


No 123
>KOG3624|consensus
Probab=26.32  E-value=35  Score=42.31  Aligned_cols=14  Identities=29%  Similarity=0.579  Sum_probs=12.4

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|+||||+|.|
T Consensus       517 ~iG~vigHEl~H~F  530 (687)
T KOG3624|consen  517 GIGFVIGHELTHGF  530 (687)
T ss_pred             HHHHHHHHHHhhcc
Confidence            57899999999966


No 124
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=26.19  E-value=27  Score=41.29  Aligned_cols=14  Identities=36%  Similarity=0.422  Sum_probs=12.0

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|++||+||.+
T Consensus       262 ~~V~TLfHEfGHal  275 (472)
T cd06455         262 DEVETFFHEFGHVI  275 (472)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45889999999976


No 125
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=26.04  E-value=35  Score=34.16  Aligned_cols=11  Identities=45%  Similarity=0.446  Sum_probs=9.5

Q ss_pred             hhHHhhhhhhh
Q psy14093        349 DLVTAHEFGHN  359 (857)
Q Consensus       349 ~~t~AHElGHn  359 (857)
                      .-|+||||+|-
T Consensus       136 lDVvaHEltHG  146 (150)
T PF01447_consen  136 LDVVAHELTHG  146 (150)
T ss_dssp             HHHHHHHHHHH
T ss_pred             cceeeeccccc
Confidence            56999999995


No 126
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=25.69  E-value=26  Score=39.30  Aligned_cols=19  Identities=42%  Similarity=0.574  Sum_probs=14.5

Q ss_pred             cchhhHHhhhhhhhCCCCCC
Q psy14093        346 READLVTAHEFGHNWGSEHD  365 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~HD  365 (857)
                      .+.|.|+||||||.- ++|-
T Consensus       122 sEvAAVl~HEmgHVt-AnHg  140 (479)
T COG4784         122 SEVAAVLAHEMGHVT-ANHG  140 (479)
T ss_pred             HHHHHHHHhhhhhee-cchh
Confidence            367999999999964 4443


No 127
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=25.62  E-value=27  Score=40.86  Aligned_cols=14  Identities=36%  Similarity=0.501  Sum_probs=11.7

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|++||+||.+
T Consensus       241 ~~v~tLfHE~GHa~  254 (458)
T PF01432_consen  241 DDVETLFHEFGHAM  254 (458)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHhHHH
Confidence            35789999999965


No 128
>KOG1047|consensus
Probab=25.46  E-value=31  Score=41.11  Aligned_cols=14  Identities=43%  Similarity=0.703  Sum_probs=12.0

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+.|+||||+|+|
T Consensus       287 sl~~vIaHEIAHSW  300 (613)
T KOG1047|consen  287 SLVDVIAHEIAHSW  300 (613)
T ss_pred             chhhHHHHHhhhhh
Confidence            34789999999987


No 129
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=24.31  E-value=22  Score=37.50  Aligned_cols=18  Identities=22%  Similarity=0.532  Sum_probs=14.7

Q ss_pred             ccchhhheeeeeeeccccccc
Q psy14093        650 FDTLHSSHISHTIVKRGVQES  670 (857)
Q Consensus       650 ~~~~~aheigh~l~~~g~~~d  670 (857)
                      ...+|+|||||.+   |+.|-
T Consensus       133 ~~hvi~HEiGH~I---GfRHT  150 (211)
T PF12388_consen  133 IEHVITHEIGHCI---GFRHT  150 (211)
T ss_pred             HHHHHHHHhhhhc---ccccc
Confidence            4458999999999   88764


No 130
>KOG2090|consensus
Probab=23.83  E-value=94  Score=37.60  Aligned_cols=14  Identities=29%  Similarity=0.321  Sum_probs=11.5

Q ss_pred             chhhHHhhhhhhhC
Q psy14093        347 EADLVTAHEFGHNW  360 (857)
Q Consensus       347 ~~~~t~AHElGHnl  360 (857)
                      ..+-|+-|||||.+
T Consensus       478 ~~vetLFHEmGHAM  491 (704)
T KOG2090|consen  478 SEVETLFHEMGHAM  491 (704)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999964


No 131
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=23.74  E-value=40  Score=40.46  Aligned_cols=18  Identities=28%  Similarity=0.255  Sum_probs=14.6

Q ss_pred             chhhHHhhhhhhhCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEH  364 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~H  364 (857)
                      ....|++|||+|.||..-
T Consensus       209 ~~~~~~~HEi~HaLGFs~  226 (521)
T PF01457_consen  209 EFFRTVIHEIAHALGFSS  226 (521)
T ss_dssp             HHHHHHHHHHHHHTT-SH
T ss_pred             cccceeeeeeeeeeeecc
Confidence            456799999999999876


No 132
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=23.23  E-value=54  Score=29.17  Aligned_cols=13  Identities=31%  Similarity=0.391  Sum_probs=10.7

Q ss_pred             hhhHHhhhhhhhC
Q psy14093        348 ADLVTAHEFGHNW  360 (857)
Q Consensus       348 ~~~t~AHElGHnl  360 (857)
                      ...++||||.|.+
T Consensus        61 ~~~llaHEl~Hv~   73 (79)
T PF13699_consen   61 GRALLAHELAHVV   73 (79)
T ss_pred             cchhHhHHHHHHH
Confidence            3568999999975


No 133
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=23.18  E-value=23  Score=36.17  Aligned_cols=17  Identities=18%  Similarity=0.393  Sum_probs=14.1

Q ss_pred             chhhheeeeeeecccccccC
Q psy14093        652 TLHSSHISHTIVKRGVQESN  671 (857)
Q Consensus       652 ~~~aheigh~l~~~g~~~d~  671 (857)
                      .++.|||||.|   |+.|-.
T Consensus       115 ~t~~HEiGHaL---GL~H~~  131 (186)
T cd04277         115 QTIIHEIGHAL---GLEHPG  131 (186)
T ss_pred             HHHHHHHHHHh---cCCCCC
Confidence            47899999999   987754


No 134
>PF14891 Peptidase_M91:  Effector protein
Probab=22.97  E-value=30  Score=35.23  Aligned_cols=21  Identities=19%  Similarity=0.137  Sum_probs=17.6

Q ss_pred             chhhHHhhhhhhhCCCCCCCC
Q psy14093        347 EADLVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       347 ~~~~t~AHElGHnlG~~HD~~  367 (857)
                      ..++++||||+|.+-+.++..
T Consensus       102 ~p~v~L~HEL~HA~~~~~Gt~  122 (174)
T PF14891_consen  102 PPFVVLYHELIHAYDYMNGTM  122 (174)
T ss_pred             HHHHHHHHHHHHHHHHHCCCC
Confidence            457899999999998888764


No 135
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=22.63  E-value=42  Score=34.80  Aligned_cols=22  Identities=27%  Similarity=0.268  Sum_probs=19.2

Q ss_pred             cchhhHHhhhhhhhCCCCCCCC
Q psy14093        346 READLVTAHEFGHNWGSEHDPD  367 (857)
Q Consensus       346 ~~~~~t~AHElGHnlG~~HD~~  367 (857)
                      ..+..||-|||.|+.=++||..
T Consensus        80 ~~i~~t~lHELaH~~~~~H~~~  101 (186)
T PF08325_consen   80 ETILGTMLHELAHNVHGPHDDK  101 (186)
T ss_pred             HHHHHHHHHHHHhcccCCccHH
Confidence            4567899999999999999974


No 136
>PF11403 Yeast_MT:  Yeast metallothionein;  InterPro: IPR022710  Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=21.39  E-value=1.3e+02  Score=22.46  Aligned_cols=37  Identities=24%  Similarity=0.659  Sum_probs=17.6

Q ss_pred             CceeecCCCCCCCCCcccCCCCCCCCCCCCCCCCCcCCC
Q psy14093        490 GMKCRDAQYATCEQESRCTGSSSVCPPSAPMSDNTGCLE  528 (857)
Q Consensus       490 g~~Cr~~~~~~C~~~~~C~G~s~~Cp~~~~~~dgt~C~~  528 (857)
                      |+.|+-.- +.|...|.|-- |-.||+.-...|--||++
T Consensus         4 ghecqcqc-gscknneqcqk-scscptgcnsddkcpcgn   40 (40)
T PF11403_consen    4 GHECQCQC-GSCKNNEQCQK-SCSCPTGCNSDDKCPCGN   40 (40)
T ss_dssp             SS----SS-STTTT-TTSTT-S-SS-TTTTSSTT--TT-
T ss_pred             Cceeeeec-CCccChHHHhh-cCCCCCCCCCCCcCCCCC
Confidence            44444332 45666666654 567888777888778863


No 137
>PF05548 Peptidase_M11:  Gametolysin peptidase M11;  InterPro: IPR008752 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M11 (gametolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The type example is gametolysin from the unicellular biflagellated alga, Chlamydomonas reinhardtii Gametolysin is a zinc-containing metallo-protease, which is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox [, ].
Probab=20.79  E-value=40  Score=37.80  Aligned_cols=17  Identities=12%  Similarity=0.315  Sum_probs=14.8

Q ss_pred             chhhheeeeeeecccccccC
Q psy14093        652 TLHSSHISHTIVKRGVQESN  671 (857)
Q Consensus       652 ~~~aheigh~l~~~g~~~d~  671 (857)
                      .++.||||||+   |+.|-+
T Consensus       152 ~~~~HElgHN~---GL~Ha~  168 (314)
T PF05548_consen  152 ATIMHELGHNL---GLWHAG  168 (314)
T ss_pred             HHHHHHhhhhc---cccccC
Confidence            37899999999   998875


No 138
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=20.11  E-value=42  Score=35.28  Aligned_cols=19  Identities=32%  Similarity=0.447  Sum_probs=16.0

Q ss_pred             hhHHhhhhhhhCCCC-CCCC
Q psy14093        349 DLVTAHEFGHNWGSE-HDPD  367 (857)
Q Consensus       349 ~~t~AHElGHnlG~~-HD~~  367 (857)
                      .-|.-|||||.||.. |-+-
T Consensus       188 ~~tarhElGhaLgi~ghsd~  207 (236)
T COG5549         188 NPTARHELGHALGIWGHSDL  207 (236)
T ss_pred             hHHHHHhhcchheecccccc
Confidence            568889999999999 8753


Done!