Query         gi|254780384|ref|YP_003064797.1| serralysin [Candidatus Liberibacter asiaticus str. psy62]
Match_columns 665
No_of_seqs    305 out of 556
Neff          6.2 
Searched_HMMs 39220
Date          Sun May 29 20:42:06 2011
Command       /home/congqian_1/programs/hhpred/hhsearch -i 254780384.hhm -d /home/congqian_1/database/cdd/Cdd.hhm 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd04277 ZnMc_serralysin_like Z 100.0 1.4E-40 4.2E-45  262.3  12.2  154   39-196    23-186 (186)
  2 pfam08548 Peptidase_M10_C Pept 100.0 2.1E-32 5.4E-37  213.0  12.9  127  197-326     1-137 (220)
  3 pfam00413 Peptidase_M10 Matrix  99.8 3.7E-20 9.3E-25  139.1   9.6  138   23-196     2-158 (158)
  4 cd04278 ZnMc_MMP Zinc-dependen  99.8 4.1E-19   1E-23  132.8   9.6  138   23-196     2-157 (157)
  5 cd04279 ZnMc_MMP_like_1 Zinc-d  99.7 2.5E-16 6.4E-21  115.9   8.7  125   48-196    19-156 (156)
  6 cd00203 ZnMc Zinc-dependent me  99.7 6.4E-16 1.6E-20  113.5   9.8  142   43-195    15-167 (167)
  7 KOG1565 consensus               99.6 3.6E-15 9.3E-20  109.0   6.1  138   35-200   106-265 (469)
  8 cd04268 ZnMc_MMP_like Zinc-dep  99.5 7.4E-14 1.9E-18  101.1   6.2  151   39-195     3-165 (165)
  9 cd04327 ZnMc_MMP_like_3 Zinc-d  99.4 8.1E-12 2.1E-16   88.8  11.5  157   37-195     6-197 (198)
 10 smart00235 ZnMc Zinc-dependent  99.3 2.5E-11 6.5E-16   85.7   8.1   91   38-142     8-108 (140)
 11 pfam12388 Peptidase_M57 Dual-a  99.1 8.4E-10 2.1E-14   76.6  10.4  172    7-195     8-211 (211)
 12 cd04281 ZnMc_BMP1_TLD Zinc-dep  98.9 2.2E-08 5.6E-13   68.0  11.5   97   35-142    10-109 (200)
 13 pfam01400 Astacin Astacin (Pep  98.9 3.4E-08 8.8E-13   66.9  11.9   95   37-143     5-103 (192)
 14 cd04283 ZnMc_hatching_enzyme Z  98.9 3.4E-08 8.7E-13   66.9  10.4   85   45-141    12-98  (182)
 15 cd04280 ZnMc_astacin_like Zinc  98.8 1.9E-07 4.9E-12   62.4  11.5   89   42-142     6-96  (180)
 16 KOG3714 consensus               98.7 1.4E-07 3.7E-12   63.1  10.3  152   33-199    80-271 (411)
 17 cd04282 ZnMc_meprin Zinc-depen  98.7 2.5E-07 6.3E-12   61.7   9.7   91   39-141    49-141 (230)
 18 cd04276 ZnMc_MMP_like_2 Zinc-d  98.5 3.6E-08 9.1E-13   66.8   1.4  157   36-196     6-196 (197)
 19 pfam08548 Peptidase_M10_C Pept  98.4 2.9E-06 7.4E-11   55.2  10.2   55  259-313    81-135 (220)
 20 COG2931 RTX toxins and related  98.4 1.9E-06 4.8E-11   56.4   8.3   60  266-325   155-217 (510)
 21 COG2931 RTX toxins and related  98.2 7.3E-06 1.9E-10   52.8   7.1  127  268-398   275-406 (510)
 22 pfam05572 Peptidase_M43 Pregna  97.7 1.8E-05 4.7E-10   50.4   2.3   55   85-141    34-90  (152)
 23 cd04275 ZnMc_pappalysin_like Z  97.4 7.8E-05   2E-09   46.6   1.8   55   85-141   100-158 (225)
 24 pfam02031 Peptidase_M7 Strepto  97.0   0.001 2.6E-08   39.8   4.7   89   39-136     3-93  (132)
 25 COG5549 Predicted Zn-dependent  97.0 0.00048 1.2E-08   41.8   2.5   92   46-137    98-205 (236)
 26 COG1913 Predicted Zn-dependent  96.1  0.0019 4.9E-08   38.2   0.8   17  121-137   125-141 (181)
 27 cd04272 ZnMc_salivary_gland_MP  96.0  0.0072 1.8E-07   34.7   3.5   22  120-141   145-166 (220)
 28 cd04267 ZnMc_ADAM_like Zinc-de  95.9  0.0073 1.9E-07   34.7   3.2   38  120-166   133-170 (192)
 29 pfam05547 Peptidase_M6 Immune   95.6   0.016 4.1E-07   32.6   4.0   73  115-198   217-289 (646)
 30 PRK13267 archaemetzincin-like   95.2  0.0064 1.6E-07   35.1   0.9   19  122-140   125-143 (177)
 31 pfam10462 Peptidase_M66 Peptid  95.2  0.0053 1.3E-07   35.6   0.3   16  122-137   194-209 (304)
 32 pfam07998 Peptidase_M54 Peptid  95.1   0.008   2E-07   34.5   0.9   21  120-140   126-146 (176)
 33 cd04271 ZnMc_ADAM_fungal Zinc-  94.8  0.0088 2.2E-07   34.2   0.6   21  117-137   142-162 (228)
 34 cd04269 ZnMc_adamalysin_II_lik  94.8   0.012   3E-07   33.5   1.1   19  122-140   133-151 (194)
 35 pfam11350 DUF3152 Protein of u  94.7   0.022 5.7E-07   31.8   2.4   87   49-137    51-155 (189)
 36 cd04273 ZnMc_ADAMTS_like Zinc-  94.5   0.015 3.8E-07   32.8   1.2   20  121-140   141-160 (207)
 37 pfam01421 Reprolysin Reprolysi  94.3   0.016   4E-07   32.7   0.9   19  121-139   132-150 (198)
 38 pfam05548 Peptidase_M11 Gameto  94.0   0.021 5.3E-07   32.0   1.1   32  108-139   124-155 (303)
 39 pfam12044 Metallopep Putative   93.9   0.021 5.4E-07   31.9   0.9   21  119-139   312-332 (419)
 40 TIGR03296 M6dom_TIGR03296 M6 f  92.6   0.078   2E-06   28.5   2.2   41  118-166   163-203 (286)
 41 pfam11996 DUF3491 Protein of u  90.7    0.93 2.4E-05   22.0   6.1   32  219-250   347-380 (934)
 42 TIGR00181 pepF oligoendopeptid  90.3   0.075 1.9E-06   28.6   0.2   67   44-132   331-401 (611)
 43 TIGR02289 M3_not_pepF oligoend  89.8   0.099 2.5E-06   27.9   0.4   73   51-134   277-354 (553)
 44 cd04270 ZnMc_TACE_like Zinc-de  89.7    0.22 5.6E-06   25.8   2.1   20  121-140   168-187 (244)
 45 COG3824 Predicted Zn-dependent  89.5    0.11 2.8E-06   27.6   0.5   24  113-136   102-125 (136)
 46 KOG4525 consensus               86.0    0.33 8.5E-06   24.7   1.2   80  119-208   300-383 (614)
 47 KOG3607 consensus               86.0    0.25 6.5E-06   25.4   0.6   19  120-138   323-341 (716)
 48 pfam00353 HemolysinCabind Hemo  85.3    0.42 1.1E-05   24.1   1.4   20  290-309     1-20  (26)
 49 pfam06262 DUF1025 Domain of un  82.5    0.47 1.2E-05   23.8   0.7   21  116-136    68-88  (96)
 50 cd06459 M3B_Oligoendopeptidase  81.5    0.46 1.2E-05   23.8   0.4   14  119-132   221-234 (427)
 51 pfam03272 Enhancin Viral enhan  81.0    0.92 2.3E-05   22.0   1.8   26  354-379   531-556 (775)
 52 pfam01457 Peptidase_M8 Leishma  81.0    0.58 1.5E-05   23.3   0.7   20  119-138   239-258 (569)
 53 PTZ00257 Glycoprotein GP63 (le  80.7     0.6 1.5E-05   23.1   0.7   17  121-137   257-273 (615)
 54 pfam09471 Peptidase_M64 IgA Pe  80.7    0.39   1E-05   24.3  -0.2   26  117-144   206-232 (257)
 55 TIGR02290 M3_fam_3 oligoendope  78.3    0.67 1.7E-05   22.8   0.3   13  120-132   385-397 (600)
 56 pfam01432 Peptidase_M3 Peptida  77.5    0.75 1.9E-05   22.6   0.4   19   57-75    141-159 (448)
 57 PTZ00337 surface protease GP63  76.8    0.82 2.1E-05   22.3   0.4   22  117-138   226-247 (567)
 58 pfam02382 RTX RTX N-terminal d  74.2     4.9 0.00012   17.7   4.8   20  291-310   622-641 (653)
 59 cd06258 Peptidase_M3_like The   71.8     1.1 2.9E-05   21.5   0.1   17  119-135   153-170 (365)
 60 KOG3538 consensus               70.9     1.4 3.6E-05   20.9   0.5   22  120-141   317-338 (845)
 61 COG1164 Oligoendopeptidase F [  70.8     1.2 3.1E-05   21.3   0.1   10  122-131   382-391 (598)
 62 KOG3658 consensus               70.7     1.8 4.6E-05   20.2   1.0   20  346-365   365-384 (764)
 63 pfam06594 HCBP_related Haemoly  69.7     5.8 0.00015   17.2   3.4   14  338-351    24-37  (43)
 64 cd06162 S2P-M50_PDZ_SREBP Ster  69.6     1.5 3.9E-05   20.7   0.4   13  122-134   137-149 (277)
 65 COG2856 Predicted Zn peptidase  66.9     1.9 4.8E-05   20.1   0.4   18  119-136    71-88  (213)
 66 cd06455 M3A_TOP Peptidase M3 T  66.3     1.8 4.5E-05   20.3   0.2   36  190-231   168-204 (472)
 67 cd06164 S2P-M50_SpoIVFB_CBS Sp  65.1     2.4 6.1E-05   19.5   0.7   15  122-136    55-69  (227)
 68 cd06457 M3A_MIP Peptidase M3 m  64.7     2.1 5.5E-05   19.8   0.4   18   57-74    138-155 (458)
 69 cd06161 S2P-M50_SpoIVFB SpoIVF  64.6     2.5 6.3E-05   19.4   0.7   16  122-137    40-55  (208)
 70 pfam05279 Asp-B-Hydro_N Aspart  63.6     2.8 7.1E-05   19.1   0.8   22   52-73     15-36  (232)
 71 PRK10280 dipeptidyl carboxypep  62.1     2.3   6E-05   19.6   0.2   33  190-228   373-406 (681)
 72 pfam01434 Peptidase_M41 Peptid  60.9     2.9 7.3E-05   19.0   0.4   17  121-137     9-26  (192)
 73 LOAD_S2Pmetalloprt consensus    60.7     3.2 8.2E-05   18.8   0.7   18  122-139     9-26  (148)
 74 KOG2921 consensus               60.1     3.2 8.3E-05   18.7   0.6   13  123-135   134-146 (484)
 75 PRK10911 oligopeptidase A; Pro  59.1     3.1 7.9E-05   18.8   0.4   16  123-138   264-279 (680)
 76 pfam02163 Peptidase_M50 Peptid  56.7     4.1 0.00011   18.1   0.7   17  123-139    10-26  (205)
 77 COG0339 Dcp Zn-dependent oligo  56.6     3.6 9.1E-05   18.5   0.3   17  123-139   267-283 (683)
 78 cd05709 S2P-M50 Site-2 proteas  54.5     4.7 0.00012   17.7   0.7   18  122-139    10-27  (180)
 79 cd06460 M32_Taq Peptidase fami  54.0     4.5 0.00012   17.8   0.5   17  120-136   159-178 (396)
 80 cd06160 S2P-M50_like_2 Unchara  52.6     5.3 0.00013   17.5   0.7   19  120-138    41-59  (183)
 81 KOG2090 consensus               46.9     6.6 0.00017   16.9   0.4   26  539-566   641-669 (704)
 82 TIGR00054 TIGR00054 membrane-a  45.4     7.4 0.00019   16.5   0.5   11  121-131    16-26  (463)
 83 cd02641 R3H_Smubp-2_like R3H d  43.8      10 0.00026   15.7   1.0   15  123-137    32-46  (60)
 84 pfam08463 EcoEI_R_C EcoEI R pr  42.5      11 0.00028   15.5   1.0   61  518-602   130-190 (232)
 85 pfam10263 SprT-like SprT-like   42.4      12 0.00032   15.2   1.3   21  119-139    57-83  (153)
 86 pfam02074 Peptidase_M32 Carbox  42.3     9.1 0.00023   16.0   0.6   12  121-132   260-271 (494)
 87 PRK10733 hflB ATP-dependent me  41.4     9.1 0.00023   16.0   0.4   11  606-616   583-593 (644)
 88 cd06158 S2P-M50_like_1 Unchara  39.7      11 0.00028   15.5   0.7   14  124-137    13-26  (181)
 89 cd06163 S2P-M50_PDZ_RseP-like   39.0      11 0.00029   15.4   0.6   14  122-135    11-24  (182)
 90 TIGR02604 Piru_Ver_Nterm putat  37.0      17 0.00043   14.4   1.3   25  115-139    58-83  (459)
 91 COG2317 Zn-dependent carboxype  36.5      12 0.00031   15.2   0.5   10  123-132   262-271 (497)
 92 cd02639 R3H_RRM R3H domain of   36.4      15 0.00038   14.7   0.9   17  123-139    32-48  (60)
 93 pfam06114 DUF955 Domain of unk  36.2      12 0.00031   15.2   0.4   14  120-133    41-54  (121)
 94 smart00731 SprT SprT homologue  35.1      18 0.00047   14.2   1.2   20  119-138    58-81  (146)
 95 PRK10779 zinc metallopeptidase  34.5      14 0.00035   14.9   0.5   17  120-136    15-31  (449)
 96 KOG3854 consensus               34.2      13 0.00034   15.0   0.4   12  522-533   376-387 (505)
 97 pfam06189 5-nucleotidase 5'-nu  33.8      24 0.00061   13.5   1.6   13  410-422   120-132 (263)
 98 COG4227 Antirestriction protei  29.8      15 0.00038   14.7   0.0   23  508-530   212-235 (316)
 99 TIGR01241 FtsH_fam ATP-depende  29.8      18 0.00046   14.2   0.4   26  121-150   317-343 (505)
100 cd06159 S2P-M50_PDZ_Arch Uncha  28.6      21 0.00055   13.8   0.6   12  121-132   119-130 (263)
101 pfam01742 Peptidase_M27 Clostr  27.2      19 0.00049   14.1   0.2   13  415-427   210-222 (408)
102 cd02640 R3H_NRF R3H domain of   26.9      28  0.0007   13.1   0.9   15  123-137    32-46  (60)
103 TIGR01338 phycocy_alpha phycoc  25.6      24 0.00062   13.4   0.5   70  522-599    83-157 (161)
104 COG2738 Predicted Zn-dependent  25.3      24 0.00062   13.4   0.4   83   50-136    19-108 (226)
105 COG3793 TerB Tellurite resista  24.9      32 0.00082   12.7   1.0   10  125-134   128-137 (144)
106 pfam07108 PipA PipA protein. T  24.6      28 0.00072   13.0   0.7   14  586-599   175-188 (228)
107 pfam02102 Peptidase_M35 Deuter  23.8      30 0.00077   12.9   0.7   28  556-586   320-347 (352)
108 TIGR00962 atpA ATP synthase F1  23.2      38 0.00098   12.2   1.3   30  571-600   402-433 (520)
109 PRK13262 ureE urease accessory  23.0      31 0.00079   12.8   0.6   16  124-139   105-120 (231)
110 COG4783 Putative Zn-dependent   21.7      31 0.00079   12.8   0.4   12  122-133   132-143 (484)
111 pfam09954 DUF2188 Uncharacteri  21.4      40   0.001   12.1   0.9   22  635-656    87-113 (131)
112 COG4744 Uncharacterized conser  21.0      18 0.00046   14.2  -0.9   41  158-198    40-80  (121)

No 1  
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=100.00  E-value=1.4e-40  Score=262.31  Aligned_cols=154  Identities=40%  Similarity=0.538  Sum_probs=124.7

Q ss_pred             EEEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC--CCEEEECCC------CEEEEECCCCCC--CCCCCCCCCCC
Q ss_conf             02688750799999999999998865747568982478--716896158------535663146775--31224433544
Q gi|254780384|r   39 RLSFDITELNPNAQEVARWALGEWSKVVDLTFEETSIH--SDIKFISSN------NGYVCTPRYDSR--YFMEINFDKAD  108 (665)
Q Consensus        39 ~~t~~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~--~d~~f~~~~------~g~a~~p~~~~~--~~~~~~~~~~~  108 (665)
                      ....++.+|+++||+++|.||+.|++|++|+|+||+..  ++++|...+      .|+++.|.....  ...++|++...
T Consensus        23 ~~~~~~~~~s~~q~~~ir~a~~~w~~v~~i~F~ev~~~~~adI~~~~~~~~~~~~~g~a~~p~~~~~~~~~gdi~~~~~~  102 (186)
T cd04277          23 EDTTNTAALSAAQQAAARDALEAWEDVADIDFVEVSDNSGADIRFGNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSSY  102 (186)
T ss_pred             CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCCCEEEECCCC
T ss_conf             57677416899999999999999872139449993688863289996058888825999658987776756448965876


Q ss_pred             CCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHH
Q ss_conf             33366666734663003234655187888876667666677754101420244675235677666655200021122148
Q gi|254780384|r  109 VWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDI  188 (665)
Q Consensus       109 ~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI  188 (665)
                      .+. ....|+|.++|+||||||||||+|||+++++.+.   ..++.+|+.++|||||+.+........+.+|.|||++||
T Consensus       103 ~~~-~~~~g~~~~~t~lHEIGHALGL~Hp~d~~~~~~~---~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI  178 (186)
T cd04277         103 DTN-SDSPGSYGYQTIIHEIGHALGLEHPGDYNGGDPV---PPTYALDSREYTVMSYNSGYGNGASAGGGYPQTPMLLDI  178 (186)
T ss_pred             CCC-CCCCCCCCEEEEEEEECHHCCCCCCCCCCCCCCC---CCCHHHCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHH
T ss_conf             557-7787754247888881013267888767777777---775335084735886557888886646646578758999


Q ss_pred             HHHHHHCC
Q ss_conf             99998608
Q gi|254780384|r  189 VAVQKIYG  196 (665)
Q Consensus       189 ~Alq~lYG  196 (665)
                      +|||+|||
T Consensus       179 aAlQ~lYG  186 (186)
T cd04277         179 AALQYLYG  186 (186)
T ss_pred             HHHHHHHC
T ss_conf             99999629


No 2  
>pfam08548 Peptidase_M10_C Peptidase M10 serralysin C terminal. Serralysins are peptidases related to mammalian matrix metallopeptidases (MMPs). The peptidase unit is found at the N terminal while this domain at the C terminal forms a corkscrew and is thought to be important for secretion of the protein through the bacterial cell wall. This domain contains the calcium ion binding domain pfam00353.
Probab=100.00  E-value=2.1e-32  Score=212.98  Aligned_cols=127  Identities=29%  Similarity=0.506  Sum_probs=111.3

Q ss_pred             CC-CCCCCCCEEECCCCCC---------CCCEEEEEECCCCCEEEEEEECCCCCCEEECCCCCCCEECCCCCCEEEECCC
Q ss_conf             76-6666773463257678---------7635898722897325998504787424744587310103555754773482
Q gi|254780384|r  197 AP-KGSLGNNIYKIEVQQK---------DSPFIQTIYDSGGDDLLDLTLIGGVKNYIDLNTESWSSIGGYEKNMTIAQHT  266 (665)
Q Consensus       197 an-~~~~gdt~y~~~~~~~---------~~~~~~tI~D~gG~DTid~s~~~~~~~~IdL~~g~~S~~~g~~~n~~Ia~gt  266 (665)
                      || ++|+|||||+|++++.         ..++.+||||+||+||||+|+++..+ +|||+++.||++++.++|++||+++
T Consensus         1 an~~Tr~GdtvYgfnsntg~~~~~~~~~~~~~~~tiwD~gG~DTiD~Sg~~~~~-~IdL~~g~~S~vgg~~gN~~Ia~g~   79 (220)
T pfam08548         1 ANMTTRTGDTVYGFNSNTDRDFYSATSSSDKLIFSVWDAGGNDTFDFSGFSQNQ-RINLNEGSFSDVGGLKGNVSIAAGV   79 (220)
T ss_pred             CCCCCCCCCCEECCCCCCCCCEEECCCCCCCEEEEEECCCCCEEECCCCCCCCE-EEECCCCCEECCCCCCCCEEECCCC
T ss_conf             986224788737268889850461367889769999938987848546985460-8966688242145653745432773


Q ss_pred             EEEEEECCCCCCEEEECCCCCEEECCCCCCEEECCCCCEEEEEECCCCCEEEEEECCCEE
Q ss_conf             574466177651688266674686678634588888733899823665137872058259
Q gi|254780384|r  267 VIESVVGSSGNDVVIGNSANNVFFESSGNDVFDGSSGLDTFFYYAPSYLYKIYRFENAII  326 (665)
Q Consensus       267 ~IEna~g~~g~D~i~gn~~~n~l~gg~g~d~~~g~~g~Dt~~~~~~~~~~~~~~~~~~~~  326 (665)
                      +|||++|++|+|+|+||.++|+|.||.|+|+|+|+.|.|++.  +..+.+.+....+...
T Consensus        80 ~IEn~~GgsG~D~l~Gn~~~n~L~Gg~G~D~l~Gg~G~D~l~--GG~G~D~f~~~~g~Ds  137 (220)
T pfam08548        80 TIENAIGGSGNDLLIGNDAANILKGGAGNDIIYGGGGADQLW--GGAGKDTFVYGAASDS  137 (220)
T ss_pred             EEEEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEEE--CCCCCEEEEECCCCCC
T ss_conf             166886667676898788885578789988687778871898--8998349984076657


No 3  
>pfam00413 Peptidase_M10 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis.
Probab=99.83  E-value=3.7e-20  Score=139.12  Aligned_cols=138  Identities=30%  Similarity=0.386  Sum_probs=92.8

Q ss_pred             ECCCEEEEECCCCCCCEEEEEEEECC-HHHHHHHHHHHHHHHHHHCCEEEEECC-CCCEEEECCC---------------
Q ss_conf             40434675057656750268875079-999999999999886574756898247-8716896158---------------
Q gi|254780384|r   23 WSNYFIRYKPDVLTTHRLSFDITELN-PNAQEVARWALGEWSKVVDLTFEETSI-HSDIKFISSN---------------   85 (665)
Q Consensus        23 ~~~~~~~~~~d~~~~~~~t~~it~l~-~~~~~~~R~Al~~ws~va~ItF~Ev~~-~~d~~f~~~~---------------   85 (665)
                      |+...++++...++.        .++ ..++.++|.||+.|++|++|+|+||+. .+||++.+..               
T Consensus         2 W~k~~lTy~i~~~~~--------~l~~~~vr~~i~~Af~~Ws~v~~l~F~ev~~~~adI~i~F~~~~h~d~~~fDg~gg~   73 (158)
T pfam00413         2 WKKKNLTYRIVNYTP--------DLPRDEVRRAIRRAFKVWSEVTPLTFTEVPEGTADIMIGFGRGEHGDGYPFDGPGGV   73 (158)
T ss_pred             CCCCCEEEEEECCCC--------CCCHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCEEEEECCCCCCCCCCCCCCCCE
T ss_conf             988815799835799--------989999999999999999863685479777898778999446767998886798986


Q ss_pred             CEEEEECCCCCCCCCCCCCCCCCCCCCC--CCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE
Q ss_conf             5356631467753122443354433366--66673466300323465518788887666766667775410142024467
Q gi|254780384|r   86 NGYVCTPRYDSRYFMEINFDKADVWRYG--IGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVM  163 (665)
Q Consensus        86 ~g~a~~p~~~~~~~~~~~~~~~~~~~~~--~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvM  163 (665)
                      -++++.|...   .+++.++....|..+  ...+...+++++|||||||||+|.-.                   .-+||
T Consensus        74 laha~~P~~~---~G~ihfd~~e~w~~~~~~~~g~~l~~va~HEIGHaLGL~Hs~~-------------------~~siM  131 (158)
T pfam00413        74 LAHAFFPGPI---GGDIHFDDDEQWTVGNESPEGTNLFLVAAHEIGHALGLGHSSD-------------------PDAIM  131 (158)
T ss_pred             EEECCCCCCC---CCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCC-------------------CCCEE
T ss_conf             7751269865---7338965786530589876651155665454020026688999-------------------86677


Q ss_pred             EECCCCCCCCCCCCCEECCHHHHHHHHHHHHCC
Q ss_conf             523567766665520002112214899998608
Q gi|254780384|r  164 SYFMPQDSMMDASFGYCATPMVSDIVAVQKIYG  196 (665)
Q Consensus       164 SY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYG  196 (665)
                      .......      .....++-..||.+||.|||
T Consensus       132 ~p~y~~~------~~~~~~L~~dDi~gIq~LYG  158 (158)
T pfam00413       132 YPYYSPY------KDPVFRLDQDDIKGIQSLYG  158 (158)
T ss_pred             CCCCCCC------CCCCCCCCHHHHHHHHHHCC
T ss_conf             7602378------78878799899999998659


No 4  
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=99.80  E-value=4.1e-19  Score=132.82  Aligned_cols=138  Identities=29%  Similarity=0.371  Sum_probs=89.5

Q ss_pred             ECCCEEEEECCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC--CCEEEECC---------------C
Q ss_conf             404346750576567502688750799999999999998865747568982478--71689615---------------8
Q gi|254780384|r   23 WSNYFIRYKPDVLTTHRLSFDITELNPNAQEVARWALGEWSKVVDLTFEETSIH--SDIKFISS---------------N   85 (665)
Q Consensus        23 ~~~~~~~~~~d~~~~~~~t~~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~--~d~~f~~~---------------~   85 (665)
                      |+...++++...++..       --...+|+++|.||+.|++|++|+|+||+..  +||++.+.               .
T Consensus         2 W~k~~lty~i~~~~~~-------l~~~~~r~~i~~Af~~Ws~v~~l~F~ev~~~~~AdI~i~f~~~~h~d~~~fDG~gg~   74 (157)
T cd04278           2 WSKTNLTYRILNYPPD-------LPRDDVRRAIARAFRVWSDVTPLTFREVTSGQEADIRISFARGNHGDGYPFDGPGGT   74 (157)
T ss_pred             CCCCCEEEEEECCCCC-------CCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCEEEEEECCCCCCCCCCCCCCCE
T ss_conf             9887058998427999-------899999999999999986647855598158887878999866766898885788886


Q ss_pred             CEEEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE
Q ss_conf             53566314677531224433544333666-66734663003234655187888876667666677754101420244675
Q gi|254780384|r   86 NGYVCTPRYDSRYFMEINFDKADVWRYGI-GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMS  164 (665)
Q Consensus        86 ~g~a~~p~~~~~~~~~~~~~~~~~~~~~~-~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMS  164 (665)
                      -++++.|...   .+.+.++....|.... ..+...+.+++|||||||||.|.-+                   .-+||.
T Consensus        75 lahA~~p~~~---~G~ihfd~~e~w~~~~~~~~~~l~~va~HEIGHaLGL~Hs~~-------------------~~siM~  132 (157)
T cd04278          75 LAHAFFPGGI---GGDIHFDDDEQWTLGSDSGGTDLFSVAAHEIGHALGLGHSSD-------------------PDSIMY  132 (157)
T ss_pred             EEEEECCCCC---CCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC-------------------CCCEEC
T ss_conf             8987568886---641887777662126788760189999987565406888899-------------------975333


Q ss_pred             ECCCCCCCCCCCCCEECCHHHHHHHHHHHHCC
Q ss_conf             23567766665520002112214899998608
Q gi|254780384|r  165 YFMPQDSMMDASFGYCATPMVSDIVAVQKIYG  196 (665)
Q Consensus       165 Y~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYG  196 (665)
                      -.....       .....+-..||.+||.|||
T Consensus       133 p~y~~~-------~~~~~L~~dDi~giq~LYG  157 (157)
T cd04278         133 PYYQGP-------VPKFKLSQDDIRGIQALYG  157 (157)
T ss_pred             CCCCCC-------CCCCCCCHHHHHHHHHHCC
T ss_conf             503588-------8789989899999998759


No 5  
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.67  E-value=2.5e-16  Score=115.95  Aligned_cols=125  Identities=22%  Similarity=0.199  Sum_probs=76.4

Q ss_pred             CHHHHHHHHHHHHHHHHHHCCEEEEECC---CCCEEEECCC----------CEEEEECCCCCCCCCCCCCCCCCCCCCCC
Q ss_conf             9999999999999886574756898247---8716896158----------53566314677531224433544333666
Q gi|254780384|r   48 NPNAQEVARWALGEWSKVVDLTFEETSI---HSDIKFISSN----------NGYVCTPRYDSRYFMEINFDKADVWRYGI  114 (665)
Q Consensus        48 ~~~~~~~~R~Al~~ws~va~ItF~Ev~~---~~d~~f~~~~----------~g~a~~p~~~~~~~~~~~~~~~~~~~~~~  114 (665)
                      -...++++|.||+.|++|++|+|+|+..   .+||++...+          -+.++.|......................
T Consensus        19 ~~~~~~Ai~~A~~~Ws~v~~L~Fve~~~~~~~adI~I~f~~~~~~~g~gg~lA~a~~p~~~~~~~~~~~~~~~~~~~~~~   98 (156)
T cd04279          19 AQSWLQAVKQAAAEWENVGPLKFVYNPEEDNDADIVIFFDRPPPVGGAGGGLARAGFPLISDGNRKLFNRTDINLGPGQP   98 (156)
T ss_pred             HHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEEEEECCCCC
T ss_conf             99999999999999856389369982477777876999658999889898686716898887776655011442011688


Q ss_pred             CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHH
Q ss_conf             66734663003234655187888876667666677754101420244675235677666655200021122148999986
Q gi|254780384|r  115 GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKI  194 (665)
Q Consensus       115 ~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~l  194 (665)
                      ..+...+.+++|||||+|||.|-.+.                  .-.||.-.....      ..+...+-..||.+||.|
T Consensus        99 ~~~~~l~~vA~HEiGHaLGL~Hss~~------------------~~aiM~P~y~~~------~~~~~~L~~dDi~gIq~l  154 (156)
T cd04279          99 RGAENLQAIALHELGHALGLWHHSDR------------------PEDAMYPSQGQG------PDGNPTLSARDVATLKRL  154 (156)
T ss_pred             CCCHHHHHHHHHHHCCCCCCCCCCCC------------------CCCEEEEEECCC------CCCCCCCCHHHHHHHHHH
T ss_conf             66020999999986010144667899------------------666663252157------777898998999999997


Q ss_pred             CC
Q ss_conf             08
Q gi|254780384|r  195 YG  196 (665)
Q Consensus       195 YG  196 (665)
                      ||
T Consensus       155 Yg  156 (156)
T cd04279         155 YG  156 (156)
T ss_pred             CC
T ss_conf             59


No 6  
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=99.66  E-value=6.4e-16  Score=113.51  Aligned_cols=142  Identities=21%  Similarity=0.215  Sum_probs=94.0

Q ss_pred             EEEECCHHHHHHHHHHHHHHHHHHCCEEEEECCC---CCEEEE-------CCCCEEEEECCCCCCCCCCCCCCCCCCCCC
Q ss_conf             8750799999999999998865747568982478---716896-------158535663146775312244335443336
Q gi|254780384|r   43 DITELNPNAQEVARWALGEWSKVVDLTFEETSIH---SDIKFI-------SSNNGYVCTPRYDSRYFMEINFDKADVWRY  112 (665)
Q Consensus        43 ~it~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~---~d~~f~-------~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~  112 (665)
                      +-..++..+|++++.|++.|++++.|+|++++..   .++.+.       .+..++++.+......-...+.+....   
T Consensus        15 ~~~~~~~~~~~~I~~Am~~~~~~tcIrF~~~~~e~~~~~i~~~~~~~~~~~~~gc~S~vG~~~~~~~q~~~l~~~~~---   91 (167)
T cd00203          15 EEENLSAQIQSLILIAMQIWRDYLNIRFVLVGVEIDKADIAILVTRQDFDGGTGGWAYLGRVCDSLRGVGVLQDNQS---   91 (167)
T ss_pred             HHCCCCHHHHHHHHHHHHHHHHCCCCCCEECCCCCCCCCEEEEEEEECCCCCCEEEECCCCCCCCCCCEEEECCCCC---
T ss_conf             01158999999999999999857674737788887765367764330358998448518988679985389658974---


Q ss_pred             CCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCC-CCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHH
Q ss_conf             6666734663003234655187888876667666-677754101420244675235677666655200021122148999
Q gi|254780384|r  113 GIGKGTILSHNALHEIGHALGLMHPGSYNGGYPV-YGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAV  191 (665)
Q Consensus       113 ~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~-~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Al  191 (665)
                         .+.+...|++|||||||||.|++.....+.. .-....+..+....|||+|.....     +.+-...+-..||..|
T Consensus        92 ---~~~~~~gti~HElgHaLGf~He~sr~dRD~yv~i~~~~~~~~ydy~SVMhY~~~~f-----s~gqr~~~S~~Di~~i  163 (167)
T cd00203          92 ---GTKEGAQTIAHELGHALGFYHDHDRKDRDDYPTIDDTLNAEDDDYYSVMSYTKGSF-----SDGQRKDFSQCDIDQI  163 (167)
T ss_pred             ---CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEECCCCCCC-----CCCCCCCCCHHHHHHH
T ss_conf             ---33255424899999984887555666667876871366789776661612489886-----8766567899999999


Q ss_pred             HHHC
Q ss_conf             9860
Q gi|254780384|r  192 QKIY  195 (665)
Q Consensus       192 q~lY  195 (665)
                      +.||
T Consensus       164 n~lY  167 (167)
T cd00203         164 NKLY  167 (167)
T ss_pred             HHHC
T ss_conf             9639


No 7  
>KOG1565 consensus
Probab=99.57  E-value=3.6e-15  Score=108.96  Aligned_cols=138  Identities=29%  Similarity=0.425  Sum_probs=93.5

Q ss_pred             CCCCEEEEEEEEC-----CHHHHHHHHHHHHHHHHHHCCEEEEECC--CCCEEEECC---------------CCEEEEEC
Q ss_conf             5675026887507-----9999999999999886574756898247--871689615---------------85356631
Q gi|254780384|r   35 LTTHRLSFDITEL-----NPNAQEVARWALGEWSKVVDLTFEETSI--HSDIKFISS---------------NNGYVCTP   92 (665)
Q Consensus        35 ~~~~~~t~~it~l-----~~~~~~~~R~Al~~ws~va~ItF~Ev~~--~~d~~f~~~---------------~~g~a~~p   92 (665)
                      +...++|+-|...     ..+.+.+++.|++.|++|+.++|+||..  .+||.+.+.               .-++++.|
T Consensus       106 W~k~~lT~ri~n~~~~l~~~~v~~~~~~Af~~Ws~vtpl~f~e~~~~~~aDi~i~F~~~~h~d~~PFDG~~g~laHAf~P  185 (469)
T KOG1565         106 WNKEHLTYRIKNYTPYLPQAEVRCAKSEAFKLWSDVTPLTFQEVKEEGEADIRISFFPGDHGDGFPFDGPGGVLAHAFFP  185 (469)
T ss_pred             CCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCEECCCCC
T ss_conf             64231321113538888878888899999853266888866467887888525533116789988625889732045478


Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCC
Q ss_conf             46775312244335443336666673466300323465518788887666766667775410142024467523567766
Q gi|254780384|r   93 RYDSRYFMEINFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSM  172 (665)
Q Consensus        93 ~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~  172 (665)
                      ...  .-++..+..++.|..+...|.-.+++++|||||+|||+|--+                   ...||......   
T Consensus       186 g~~--~~G~~hfD~dE~Wt~~~~~g~~l~~Va~HEiGH~LGL~HS~~-------------------~~aiM~P~y~~---  241 (469)
T KOG1565         186 GPG--IGGDLHFDKDETWTYGDSNGVDLFLVAAHEIGHALGLGHSSD-------------------PDAIMYPFYQP---  241 (469)
T ss_pred             CCC--CCCCCCCCCCCCEECCCCCCCCHHHHHHHHCCCCCCCCCCCC-------------------CCCCCCCCCCC---
T ss_conf             888--877654675425121677550067775430110026677778-------------------45013652326---


Q ss_pred             CCCCCCEECCHHHHHHHHHHHHCCCCCC
Q ss_conf             6655200021122148999986087666
Q gi|254780384|r  173 MDASFGYCATPMVSDIVAVQKIYGAPKG  200 (665)
Q Consensus       173 ~~~~~~~~~tpm~~DI~Alq~lYGan~~  200 (665)
                        ....+.  +-..||.+||+|||.+..
T Consensus       242 --~~~~~~--L~~DDv~giq~lYG~~~~  265 (469)
T KOG1565         242 --DSGNFD--LSQDDVRGIQHLYGGPPR  265 (469)
T ss_pred             --CCCCCC--CCHHHHHHHHHHHCCCCC
T ss_conf             --888851--673235666987099844


No 8  
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=99.47  E-value=7.4e-14  Score=101.05  Aligned_cols=151  Identities=21%  Similarity=0.237  Sum_probs=93.0

Q ss_pred             EEEEEEEE-CCHHHHHHHHHHHHHHHHHHCCEEEEECC--CCCEEEECCC------CEEEEECCCCC---CCCCCCCCCC
Q ss_conf             02688750-79999999999999886574756898247--8716896158------53566314677---5312244335
Q gi|254780384|r   39 RLSFDITE-LNPNAQEVARWALGEWSKVVDLTFEETSI--HSDIKFISSN------NGYVCTPRYDS---RYFMEINFDK  106 (665)
Q Consensus        39 ~~t~~it~-l~~~~~~~~R~Al~~ws~va~ItF~Ev~~--~~d~~f~~~~------~g~a~~p~~~~---~~~~~~~~~~  106 (665)
                      .|+|=|.. +...-|+++|.+++.|.....+.|.++..  .+|+++..-+      ...++.|+...   +.+....+..
T Consensus         3 pI~~yid~~~p~~~r~ai~~gi~~WN~af~~gFk~a~~~~~~dir~~~~~~~~~~~~~~~ygps~~dp~tgeI~~a~v~~   82 (165)
T cd04268           3 PITYYIDDSVPDKLRAAILDAIEAWNKAFAIGFKNANDVDPADIRYSVIRWIPYNDGTWSYGPSQVDPLTGEILLARVYL   82 (165)
T ss_pred             CEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE
T ss_conf             87999379999899999999999999998720565122475566321599632689866617854599984188557876


Q ss_pred             CCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHH
Q ss_conf             44333666667346630032346551878888766676666777541014202446752356776666552000211221
Q gi|254780384|r  107 ADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVS  186 (665)
Q Consensus       107 ~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~  186 (665)
                      .... . .-.+.+..+++.||+||+|||.|.+....   .+.............|||.|...... ......++.+++.+
T Consensus        83 ~~~~-~-~~~~~~~~~v~~HEvGHtLGL~Hn~~~S~---~~~~~~~~~~~~~~~SVMdY~~~n~~-~~~~~~~~~~ig~y  156 (165)
T cd04268          83 YSSF-V-EYSGARLRNTAEHELGHALGLRHNFAASD---RDDNVDLLAEKGDTSSVMDYAPSNFS-IQLGDGQKYTIGPY  156 (165)
T ss_pred             ECHH-H-HHHHHHHHHHHHHHHHHHHCCCCCCCCCC---CCCCHHHHCCCCCCCEEECCCCCCCC-CCCCCCCCCCCCHH
T ss_conf             2039-9-99999999999998763505754453201---46844654158998614268775557-68545568998568


Q ss_pred             HHHHHHHHC
Q ss_conf             489999860
Q gi|254780384|r  187 DIVAVQKIY  195 (665)
Q Consensus       187 DI~Alq~lY  195 (665)
                      ||.||+++|
T Consensus       157 D~~aI~~~Y  165 (165)
T cd04268         157 DIAAIKKLY  165 (165)
T ss_pred             HHHHHHHCC
T ss_conf             999998539


No 9  
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=99.38  E-value=8.1e-12  Score=88.76  Aligned_cols=157  Identities=18%  Similarity=0.174  Sum_probs=84.4

Q ss_pred             CCEEEEEE-EECCHHHHHHHHHHHHHHHHHHCCEEEEECC-CCCEEEECCC--CEEEEECCCCCCC-CCCCCCCCCCCCC
Q ss_conf             75026887-5079999999999999886574756898247-8716896158--5356631467753-1224433544333
Q gi|254780384|r   37 THRLSFDI-TELNPNAQEVARWALGEWSKVVDLTFEETSI-HSDIKFISSN--NGYVCTPRYDSRY-FMEINFDKADVWR  111 (665)
Q Consensus        37 ~~~~t~~i-t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~-~~d~~f~~~~--~g~a~~p~~~~~~-~~~~~~~~~~~~~  111 (665)
                      +.+|.+-| .+.+..+|+.+|.|++.|+..++|+|+|++. .+++++.+..  ..+.+.-...-.. -.....+.+  |.
T Consensus         6 G~~l~~~f~~~~~~~~r~~V~~a~~~W~~~tnirF~~~~~~~adIRI~F~~~~G~wS~VGtd~~~~~~~~~Tmnlg--~~   83 (198)
T cd04327           6 GTVLRIAFLGGPDAFLKDKVRAAAREWLPYANLKFKFVTDADADIRISFTPGDGYWSYVGTDALLIGADAPTMNLG--WF   83 (198)
T ss_pred             CCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEECC--CC
T ss_conf             9889999878999999999999999996267567798888888689995549975376686523578888714546--56


Q ss_pred             CCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCC----------CCCCCC-------------C------CCCCEEEE
Q ss_conf             666667346630032346551878888766676666----------777541-------------0------14202446
Q gi|254780384|r  112 YGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVY----------GVDNDY-------------E------NDSFLTSV  162 (665)
Q Consensus       112 ~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~----------~~~~~~-------------~------~d~~~~Tv  162 (665)
                      .....+....-+++||||||||+.|-+......-..          .....+             .      ..--..||
T Consensus        84 ~~~~~~~~~~~~~~he~~h~~g~~he~~~~~~~i~Wd~~~v~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~s~yD~~SI  163 (198)
T cd04327          84 TDDTPDPEFSRVVLHEFGHALGFIHEHQSPAANIPWDKEAVYAYFSGPPNWDRETVINHNVFAKLDDGDVAYSPYDPDSI  163 (198)
T ss_pred             CCCCCCCCCCEEHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCE
T ss_conf             78775533001005776766426002238767877578999999853789867786654434305623345886996505


Q ss_pred             EEECCCCCC-CCCCCCCEECCHHHHHHHHHHHHC
Q ss_conf             752356776-666552000211221489999860
Q gi|254780384|r  163 MSYFMPQDS-MMDASFGYCATPMVSDIVAVQKIY  195 (665)
Q Consensus       163 MSY~~~~~~-~~~~~~~~~~tpm~~DI~Alq~lY  195 (665)
                      |-|..|..- ........+.++--.|++-|..+|
T Consensus       164 MhY~~p~~~t~~~~~~~~n~~LS~~Dk~f~~~~Y  197 (198)
T cd04327         164 MHYPFPGSLTLDGEEVPPNRTLSDKDKAFMRLLY  197 (198)
T ss_pred             EECCCCHHHCCCCEECCCCCCCCHHHHHHHHHHC
T ss_conf             5357687562798775788858988999999858


No 10 
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=99.26  E-value=2.5e-11  Score=85.75  Aligned_cols=91  Identities=34%  Similarity=0.553  Sum_probs=57.0

Q ss_pred             CEEEEEE--EECCHH-HHHHHHHHHHHHHHHHCCEEEEECCCCC--EEEECCCC-----EEEEECCCCCCCCCCCCCCCC
Q ss_conf             5026887--507999-9999999999886574756898247871--68961585-----356631467753122443354
Q gi|254780384|r   38 HRLSFDI--TELNPN-AQEVARWALGEWSKVVDLTFEETSIHSD--IKFISSNN-----GYVCTPRYDSRYFMEINFDKA  107 (665)
Q Consensus        38 ~~~t~~i--t~l~~~-~~~~~R~Al~~ws~va~ItF~Ev~~~~d--~~f~~~~~-----g~a~~p~~~~~~~~~~~~~~~  107 (665)
                      .++++.|  ..++.. .|+++|.||+.|++++.|+|+|++...+  +.|...+.     +++..|  .    +...++  
T Consensus         8 ~~v~Y~i~~~~~~~~~~r~~i~~A~~~w~~~Tci~F~~~~~~~~~~i~f~~~~~~gc~~s~~~~~--~----g~q~~~--   79 (140)
T smart00235        8 GTVPYVIDSSSLSPEEVREAIARAFAEWSDVTCLRFVERTSTADIYISFGKGDGSGCTLSHAGRP--G----GDQHFS--   79 (140)
T ss_pred             CEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEECCCCCCCEEEEECC--C----CCEEEE--
T ss_conf             97889988899997999999999999998547863896567887189998688899745763044--8----823345--


Q ss_pred             CCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCC
Q ss_conf             43336666673466300323465518788887666
Q gi|254780384|r  108 DVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNG  142 (665)
Q Consensus       108 ~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~  142 (665)
                            .+.+.....|++|||||||||.|.+....
T Consensus        80 ------~~~~c~~~g~i~HEigHaLGl~He~~r~D  108 (140)
T smart00235       80 ------LGNGCINTGVAAHELGHALGLYHEQSRSD  108 (140)
T ss_pred             ------CCCCCCCCCHHHHHHHHHCCCCCCCCCCC
T ss_conf             ------38881666624553453426655457763


No 11 
>pfam12388 Peptidase_M57 Dual-action HEIGH metallo-peptidase. The catalytic triad for this family of proteases is HE-H-H, which in many members is in the sequence motif HEIGH.
Probab=99.12  E-value=8.4e-10  Score=76.58  Aligned_cols=172  Identities=20%  Similarity=0.240  Sum_probs=95.9

Q ss_pred             CCCHHHHHHHHHCCEEECCCEEEEECCCCCCCEEEEEE-----EECCHHHHHHHHHHHHHHHHHHC--CEEEEE----CC
Q ss_conf             55799999987358354043467505765675026887-----50799999999999998865747--568982----47
Q gi|254780384|r    7 VYTTERIADHLLRDQYWSNYFIRYKPDVLTTHRLSFDI-----TELNPNAQEVARWALGEWSKVVD--LTFEET----SI   75 (665)
Q Consensus         7 ~~t~d~~ad~l~~~~~~~~~~~~~~~d~~~~~~~t~~i-----t~l~~~~~~~~R~Al~~ws~va~--ItF~Ev----~~   75 (665)
                      ..|.+|+.+.+..+.--..-+.+...-....++|++=.     +.|+..++++.++|++.|..+ +  |+|+-+    ..
T Consensus         8 ~~s~~~l~~~~~~~~~~~eqYrT~nlV~~~~r~I~v~~~tg~~~~l~~~~~tal~~AI~~yN~l-~l~l~F~~t~gtn~~   86 (211)
T pfam12388         8 FLTEKELNKLELSEQNDQKQYRTNNLVTGSPRTITIIGYTGGSQALTAKGQTALDDAVNNYNNL-GLDISFRLTFGTNYQ   86 (211)
T ss_pred             EECHHHHHHHHCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEEEEECCCCC
T ss_conf             7719997565504787600520111645898589999827865103178999999999987640-760799998435777


Q ss_pred             CCCEEEECC-------CCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCC-C------
Q ss_conf             871689615-------85356631467753122443354433366666734663003234655187888876-6------
Q gi|254780384|r   76 HSDIKFISS-------NNGYVCTPRYDSRYFMEINFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSY-N------  141 (665)
Q Consensus        76 ~~d~~f~~~-------~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~-~------  141 (665)
                      ..|+.....       ..+.|.+|...+.-+..+.+  ....+.+.   ....+++.|||||.+||-|. || +      
T Consensus        87 ~~di~v~~~~~~~~~g~ggsAGFP~s~G~P~~~V~i--~~~~~~~~---~~~~~vi~HEiGHciGfRHT-DyfnRsSCG~  160 (211)
T pfam12388        87 NADMVVYDNSVNNPSGSGGSAGFPDSNGDPAKFVQI--YDLENGST---NVNEHVITHEIGHSIGFRHT-DYFDRSSCGQ  160 (211)
T ss_pred             CCCEEEEECCCCCCCCCEEECCCCCCCCCCCCEEEE--ECCCCCCC---CHHHHHHHHHHHCEECCCCC-CCCCCCCCCC
T ss_conf             776799954556788851041288778998866998--23578882---03345452320222222113-5557444466


Q ss_pred             -C--CC----CCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHHC
Q ss_conf             -6--76----666777541014202446752356776666552000211221489999860
Q gi|254780384|r  142 -G--GY----PVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKIY  195 (665)
Q Consensus       142 -~--g~----~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lY  195 (665)
                       +  |.    .+.-+..|...|.  .|||---        ++......+-..||.||++||
T Consensus       161 ~~nEG~agvGAihIpGTPtg~D~--~SiM~aC--------~~~~~~g~F~~~DitAL~~LY  211 (211)
T pfam12388       161 GGNEGTGGVGAVYIPGTPTGRDN--TSIMQAC--------FSGGEDGEFNSNDITALLAMY  211 (211)
T ss_pred             CCCCCCCCCCCEECCCCCCCCCC--CHHHHHH--------CCCCCCCCCCCCCHHHHHHHC
T ss_conf             77778776564677899989987--5487600--------048878886742098898639


No 12 
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=98.94  E-value=2.2e-08  Score=68.03  Aligned_cols=97  Identities=20%  Similarity=0.314  Sum_probs=61.5

Q ss_pred             CCCCEEEEEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCE-EEEECCCCCCCCCCCCCCCCCCCC
Q ss_conf             56750268875-079999999999999886574756898247871-689615853-566314677531224433544333
Q gi|254780384|r   35 LTTHRLSFDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNG-YVCTPRYDSRYFMEINFDKADVWR  111 (665)
Q Consensus        35 ~~~~~~t~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~~~  111 (665)
                      ++..++-+.|. .|+..++.+++.|++.|++.+-|+|++-+...+ +.|.....| +++.-..+.+         .+.  
T Consensus        10 Wpn~~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~e~~yi~f~~~~~GC~S~vG~~g~g---------~Q~--   78 (200)
T cd04281          10 WPGGVIPYVIDGNFTGSQRAMFKQAMRHWENFTCVTFVERTPEENYIVFTYRPCGCCSYVGRRGNG---------PQA--   78 (200)
T ss_pred             CCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCCEEECCCCCCCC---------CEE--
T ss_conf             989889899899799999999999999998679765588899987799972899777328830787---------467--


Q ss_pred             CCCCCCCEEEEEHHHHHHHHHCCCCCCCCCC
Q ss_conf             6666673466300323465518788887666
Q gi|254780384|r  112 YGIGKGTILSHNALHEIGHALGLMHPGSYNG  142 (665)
Q Consensus       112 ~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~  142 (665)
                      ..++.+-...-|++|||+||||+-|-+.-..
T Consensus        79 isl~~~C~~~G~i~HEl~HalGf~HEqsR~D  109 (200)
T cd04281          79 ISIGKNCDKFGIVVHELGHVIGFWHEHTRPD  109 (200)
T ss_pred             EECCCCCCCCCCHHHHHHHHHCCCCCCCCCC
T ss_conf             6637876756302999988754655434667


No 13 
>pfam01400 Astacin Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila.
Probab=98.92  E-value=3.4e-08  Score=66.85  Aligned_cols=95  Identities=22%  Similarity=0.354  Sum_probs=60.6

Q ss_pred             CCEEEEEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC---EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCC
Q ss_conf             750268875-079999999999999886574756898247871---6896158535663146775312244335443336
Q gi|254780384|r   37 THRLSFDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD---IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRY  112 (665)
Q Consensus        37 ~~~~t~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d---~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~  112 (665)
                      ..++-+.|. .+++.++++++.|++.|++.+-|+|++-+...+   +.|.....-+++.-..++.          +.  .
T Consensus         5 ~~~VpY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~~~~~~~i~~~~~~gC~S~vG~~gg~----------Q~--i   72 (192)
T pfam01400         5 NGPIPYVIDSSFTGLARALIRQAMRHWEQKTCIRFVPRTSAPDNNYLFFFKGDGCYSYVGRNGGA----------QP--V   72 (192)
T ss_pred             CCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCEEEEEECCCCEEEECCEECCE----------EE--E
T ss_conf             98897998997999999999999999985887537858889887699994399453624700880----------25--8


Q ss_pred             CCCCCCEEEEEHHHHHHHHHCCCCCCCCCCC
Q ss_conf             6666734663003234655187888876667
Q gi|254780384|r  113 GIGKGTILSHNALHEIGHALGLMHPGSYNGG  143 (665)
Q Consensus       113 ~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g  143 (665)
                      .++.+-...-+++|||+||||+-|-+.-...
T Consensus        73 slg~gC~~~g~i~HEl~HaLGf~HEhsR~DR  103 (192)
T pfam01400        73 SLGNGCDKFGIIVHELGHALGFWHEQSRPDR  103 (192)
T ss_pred             EECCCCCCCCCCHHHHHHHHCCCCCCCCCCC
T ss_conf             8688858677017888888455554467774


No 14 
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=98.87  E-value=3.4e-08  Score=66.86  Aligned_cols=85  Identities=20%  Similarity=0.312  Sum_probs=56.6

Q ss_pred             EECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCC-CCCEEEE
Q ss_conf             5079999999999999886574756898247871-6896158535663146775312244335443336666-6734663
Q gi|254780384|r   45 TELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIG-KGTILSH  122 (665)
Q Consensus        45 t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~-~g~y~~~  122 (665)
                      ..|++.++.+++.|++.|++.+-|+|++-+...+ ++|.....-+++.-..++.    ..+        .++ .+-....
T Consensus        12 ~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~e~~yi~i~~~~GC~S~vGr~gg~----Q~i--------sl~~~gC~~~g   79 (182)
T cd04283          12 PQYSENERAVIEKAMQEFETLTCVRFVPRTTERDYLNIESRSGCWSYIGRQGGR----QTV--------SLQKQGCMYKG   79 (182)
T ss_pred             CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCEEECCCEECCE----EEE--------EECCCCCCCCC
T ss_conf             989999999999999999858967788578987579995299785405710981----248--------86899978767


Q ss_pred             EHHHHHHHHHCCCCCCCCC
Q ss_conf             0032346551878888766
Q gi|254780384|r  123 NALHEIGHALGLMHPGSYN  141 (665)
Q Consensus       123 t~iHEIGHALGL~Hp~~~~  141 (665)
                      |++|||+|||||-|-+.-.
T Consensus        80 ~i~HEl~HalGf~HEqsR~   98 (182)
T cd04283          80 IIQHELLHALGFYHEQTRS   98 (182)
T ss_pred             HHHHHHHHHHCCCCCCCCC
T ss_conf             2078888884555544666


No 15 
>cd04280 ZnMc_astacin_like Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site. Members of this family may have an amino terminal propeptide, which is cleaved to yield the active protease domain, which is consequently always found at the N-terminus in multi-domain architectures. This family includes: astacin, a digestive enzyme from Crayfish; meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain, proteins involved in (bone) morphogenesis, tolloid from drosophila, and the sea urchin SPAN protein, which may also play a role in development.
Probab=98.76  E-value=1.9e-07  Score=62.36  Aligned_cols=89  Identities=20%  Similarity=0.331  Sum_probs=58.9

Q ss_pred             EEEE-ECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCE
Q ss_conf             8875-079999999999999886574756898247871-68961585356631467753122443354433366666734
Q gi|254780384|r   42 FDIT-ELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIGKGTI  119 (665)
Q Consensus        42 ~~it-~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y  119 (665)
                      +-|. .+++..|++++.|++.|++.+-|.|++-+...+ ++|.....-+++.-..++    .      +..  .++.+-.
T Consensus         6 Y~i~~~~~~~~~~~I~~Am~~~~~~TCirF~pr~~~~~yi~f~~~~gC~S~vG~~~g----~------q~i--sl~~~C~   73 (180)
T cd04280           6 YVIDGSFDESDRSLILRAMREIESNTCIRFVPRTTEKDYIRIVKGSGCWSYVGRVGG----R------QVV--SLGSGCF   73 (180)
T ss_pred             EEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCEEECCCCCCC----E------EEE--EECCCCC
T ss_conf             998898999999999999999986796535888998758999659971361570188----2------368--8478868


Q ss_pred             EEEEHHHHHHHHHCCCCCCCCCC
Q ss_conf             66300323465518788887666
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPGSYNG  142 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~~~~~  142 (665)
                      ..-+++|||+|||||-|-+.-..
T Consensus        74 ~~g~i~HEl~HalGf~HE~~R~D   96 (180)
T cd04280          74 SLGTIVHELMHALGFYHEQSRPD   96 (180)
T ss_pred             CCCHHHHHHHHHHCCCCCCCCCC
T ss_conf             67633999998745655434667


No 16 
>KOG3714 consensus
Probab=98.74  E-value=1.4e-07  Score=63.09  Aligned_cols=152  Identities=20%  Similarity=0.257  Sum_probs=87.2

Q ss_pred             CCCCCCEEEEEEEE-CCHHHHHHHHHHHHHHHHHHCCEEEEECCC-CCEEEECCCCE-EEEECCCCCCCCCCCCCCCCCC
Q ss_conf             76567502688750-799999999999998865747568982478-71689615853-5663146775312244335443
Q gi|254780384|r   33 DVLTTHRLSFDITE-LNPNAQEVARWALGEWSKVVDLTFEETSIH-SDIKFISSNNG-YVCTPRYDSRYFMEINFDKADV  109 (665)
Q Consensus        33 d~~~~~~~t~~it~-l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~-~d~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~  109 (665)
                      ..++...+-+-|.+ ++..+|.++|.|++.|+..+=|+|+|.+.. -+..+.....| +.++...++..           
T Consensus        80 ~~Wp~~~ipY~i~~~~~~~~r~~i~~A~~~~~~~TCi~F~~~~~~~~~~~~~~~~~gC~S~VGr~gg~~-----------  148 (411)
T KOG3714          80 RRWPNGVIPYYIDGSFTSSQRALIRQAMREIENHTCIRFVERTTPDKDYLIVFTGGGCYSYVGRRGGGQ-----------  148 (411)
T ss_pred             CCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEECCCCCCCCCCCCCCCCC-----------
T ss_conf             258898122498888898999999999999973956535557887776178727997313127778865-----------


Q ss_pred             CCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCC---------------C---------CCCCCCCCCCCEEEEEEE
Q ss_conf             3366666734663003234655187888876667666---------------6---------777541014202446752
Q gi|254780384|r  110 WRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPV---------------Y---------GVDNDYENDSFLTSVMSY  165 (665)
Q Consensus       110 ~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~---------------~---------~~~~~~~~d~~~~TvMSY  165 (665)
                      -...++.+-..+-|++|||+||||+-|-|.-...+..               +         ....+|+    --|||-|
T Consensus       149 q~~sl~~~C~~~G~i~HEl~HaLGf~HehsR~DRD~yV~I~~~ni~~~~~~nF~k~~~~~~~~~~~pYD----ygSvMHY  224 (411)
T KOG3714         149 QLLSLGDGCDRFGTIVHELMHALGFWHEHSRPDRDNYVSINWDNIDPGQEYNFEKYSPDEVTTYGVPYD----YGSVMHY  224 (411)
T ss_pred             CCCCCCCCCCCCCEEHHHHHHHHHHHCCCCCCCCCCEEEEEHHHCCHHHHHHHHHCCHHHHHCCCCCCC----CCCCCCC
T ss_conf             321117874767610566776641310256545436279841308866655566468645421598566----7752464


Q ss_pred             CCCCC---CCC----CC--C----CCEECCHHHHHHHHHHHHCCCCC
Q ss_conf             35677---666----65--5----20002112214899998608766
Q gi|254780384|r  166 FMPQD---SMM----DA--S----FGYCATPMVSDIVAVQKIYGAPK  199 (665)
Q Consensus       166 ~~~~~---~~~----~~--~----~~~~~tpm~~DI~Alq~lYGan~  199 (665)
                      .....   ...    ..  .    -+-...+-..||.-|-.||.-..
T Consensus       225 ~~~afs~~~~~~ti~~~~~~~~~~mGqr~~~S~~Di~~iN~~Y~C~~  271 (411)
T KOG3714         225 APYAFSKNGSLPTIVPKDNGFQNTMGQRERLSFYDIRKINKLYCCPE  271 (411)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCC
T ss_conf             88666658877602036665224557647687878999876208987


No 17 
>cd04282 ZnMc_meprin Zinc-dependent metalloprotease, meprin_like subfamily. Meprins are membrane-bound or secreted extracellular proteases, which cleave a variety of targets, including peptides such as parathyroid hormone, gastrin, and cholecystokinin, cytokines such as osteopontin, and proteins such as collagen IV, fibronectin, casein and gelatin. Meprins may also be able to release proteins from the cell surface. Closely related meprin alpha- and beta-subunits form homo- and hetero-oligomers; these complexes are found on epithelial cells of the intestine, for example, and are also expressed in certain cancer cells.
Probab=98.67  E-value=2.5e-07  Score=61.70  Aligned_cols=91  Identities=22%  Similarity=0.254  Sum_probs=59.8

Q ss_pred             EEEEEE-EECCHHHHHHHHHHHHHHHHHHCCEEEEECCCCC-EEEECCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCC
Q ss_conf             026887-5079999999999999886574756898247871-68961585356631467753122443354433366666
Q gi|254780384|r   39 RLSFDI-TELNPNAQEVARWALGEWSKVVDLTFEETSIHSD-IKFISSNNGYVCTPRYDSRYFMEINFDKADVWRYGIGK  116 (665)
Q Consensus        39 ~~t~~i-t~l~~~~~~~~R~Al~~ws~va~ItF~Ev~~~~d-~~f~~~~~g~a~~p~~~~~~~~~~~~~~~~~~~~~~~~  116 (665)
                      ++-+.| ..++...++.++.|++.|++.+-|+|++-+...+ +.|.....-+++.-..++.          +.  ..++.
T Consensus        49 ~VPY~i~~~~~~~~~~~I~~Am~~~~~~TCirF~~r~~e~~yi~~~~~~GC~S~VG~~gg~----------Q~--vslg~  116 (230)
T cd04282          49 PIPYILDDSLDLNAKGVILKAFEMYRLKSCVDFKPYEGESNYIFFFKGSGCWSMVGDQQGG----------QN--LSIGA  116 (230)
T ss_pred             CEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCEEEEECCCCCCCCCCCCCCE----------EE--EEECC
T ss_conf             1669988879999999999999999858954148678987559995699511422653880----------56--65078


Q ss_pred             CCEEEEEHHHHHHHHHCCCCCCCCC
Q ss_conf             7346630032346551878888766
Q gi|254780384|r  117 GTILSHNALHEIGHALGLMHPGSYN  141 (665)
Q Consensus       117 g~y~~~t~iHEIGHALGL~Hp~~~~  141 (665)
                      |-....|++|||+||||+-|-+.-.
T Consensus       117 gC~~~G~i~HEl~HaLGf~HEqsR~  141 (230)
T cd04282         117 GCDYKATVEHEFLHALGFYHEQSRS  141 (230)
T ss_pred             CCCCCCHHHHHHHHHHCCCCCCCCC
T ss_conf             8787766799999885566542366


No 18 
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=98.49  E-value=3.6e-08  Score=66.76  Aligned_cols=157  Identities=20%  Similarity=0.231  Sum_probs=82.4

Q ss_pred             CCCEEEEEEEECC-HHHHHHHHHHHHHHHHHHC-CEE-----EEEC----CCCCEEE-----ECC-CCEEEEECCCCC--
Q ss_conf             6750268875079-9999999999998865747-568-----9824----7871689-----615-853566314677--
Q gi|254780384|r   36 TTHRLSFDITELN-PNAQEVARWALGEWSKVVD-LTF-----EETS----IHSDIKF-----ISS-NNGYVCTPRYDS--   96 (665)
Q Consensus        36 ~~~~~t~~it~l~-~~~~~~~R~Al~~ws~va~-ItF-----~Ev~----~~~d~~f-----~~~-~~g~a~~p~~~~--   96 (665)
                      +.+-|.|=|.... ..-++++|+....|....- +=|     +++.    +.+|++.     +.+ +.+.++.|+...  
T Consensus         6 p~~PIvfyid~~~P~~~r~ai~~Gv~~WN~AFE~aGfknAi~vk~~P~d~d~~DiRyn~Irw~~s~~~~~ayGps~~dPr   85 (197)
T cd04276           6 PKEPIVYYLDNTFPEKYRDAIREGVLYWNKAFEKAGFKNAIIVKVLPDDADPGDIRYNVIRWIHSPNGGWAYGPSVVDPR   85 (197)
T ss_pred             CCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCC
T ss_conf             87887999459998899999999999999999872775416975089997977742668999637987666688757999


Q ss_pred             -CCCCCCCC-------CCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCC-CC----CCCCCEEEEE
Q ss_conf             -53122443-------35443336666673466300323465518788887666766667775-41----0142024467
Q gi|254780384|r   97 -RYFMEINF-------DKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDN-DY----ENDSFLTSVM  163 (665)
Q Consensus        97 -~~~~~~~~-------~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~-~~----~~d~~~~TvM  163 (665)
                       +-+....+       .....|.. .-.+..-++++.||+||+|||.|=+-   ++..+..+. ..    ......-|||
T Consensus        86 TGEIl~a~I~~~~~~~~~~~~~~~-~~~~~~lr~~~aHEVGHtLGL~HNf~---aS~~~~~~~l~~~~~t~~~g~~~SVM  161 (197)
T cd04276          86 TGEILKADVILYSGFLRQDQLWYE-DLLAASLRYLLAHEVGHTLGLRHNFK---ASSDGSNEELEDPLGTKEKGATSSVM  161 (197)
T ss_pred             CCCEEEEEEEEEHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCCCC---HHCCCCHHHCCCHHHHHCCCCCCEEC
T ss_conf             866776578874177654799999-89999999999987640316533010---00258845402887772178754632


Q ss_pred             EECCCCCC--CCCCCCCEECCHHHHHHHHHHHHCC
Q ss_conf             52356776--6665520002112214899998608
Q gi|254780384|r  164 SYFMPQDS--MMDASFGYCATPMVSDIVAVQKIYG  196 (665)
Q Consensus       164 SY~~~~~~--~~~~~~~~~~tpm~~DI~Alq~lYG  196 (665)
                      -|....-.  ..+....++.+++.||..||++-|.
T Consensus       162 DY~~~N~~~~~~d~~~~~~~~~G~YD~~AI~~gY~  196 (197)
T cd04276         162 DYPPPNVAAQGEDQGDYYPPTIGPYDKWAIEYGYT  196 (197)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCC
T ss_conf             57432213476546531579985042667301357


No 19 
>pfam08548 Peptidase_M10_C Peptidase M10 serralysin C terminal. Serralysins are peptidases related to mammalian matrix metallopeptidases (MMPs). The peptidase unit is found at the N terminal while this domain at the C terminal forms a corkscrew and is thought to be important for secretion of the protein through the bacterial cell wall. This domain contains the calcium ion binding domain pfam00353.
Probab=98.45  E-value=2.9e-06  Score=55.23  Aligned_cols=55  Identities=24%  Similarity=0.434  Sum_probs=43.8

Q ss_pred             CEEEECCCEEEEEECCCCCCEEEECCCCCEEECCCCCCEEECCCCCEEEEEECCC
Q ss_conf             5477348257446617765168826667468667863458888873389982366
Q gi|254780384|r  259 NMTIAQHTVIESVVGSSGNDVVIGNSANNVFFESSGNDVFDGSSGLDTFFYYAPS  313 (665)
Q Consensus       259 n~~Ia~gt~IEna~g~~g~D~i~gn~~~n~l~gg~g~d~~~g~~g~Dt~~~~~~~  313 (665)
                      --.+-.++..+.++|..++|+|.|+.++++|.|+.|+|+|.||.|.|++++....
T Consensus        81 IEn~~GgsG~D~l~Gn~~~n~L~Gg~G~D~l~Gg~G~D~l~GG~G~D~f~~~~g~  135 (220)
T pfam08548        81 IENAIGGSGNDLLIGNDAANILKGGAGNDIIYGGGGADQLWGGAGKDTFVYGAAS  135 (220)
T ss_pred             EEEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEEECCCCCEEEEECCCC
T ss_conf             6688666767689878888557878998868777887189889983499840766


No 20 
>COG2931 RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=98.40  E-value=1.9e-06  Score=56.36  Aligned_cols=60  Identities=27%  Similarity=0.302  Sum_probs=35.1

Q ss_pred             CEEEEEECCCCCCEEEECCCCC--EEECCCCCCEEECC-CCCEEEEEECCCCCEEEEEECCCE
Q ss_conf             2574466177651688266674--68667863458888-873389982366513787205825
Q gi|254780384|r  266 TVIESVVGSSGNDVVIGNSANN--VFFESSGNDVFDGS-SGLDTFFYYAPSYLYKIYRFENAI  325 (665)
Q Consensus       266 t~IEna~g~~g~D~i~gn~~~n--~l~gg~g~d~~~g~-~g~Dt~~~~~~~~~~~~~~~~~~~  325 (665)
                      ...++++++.++|+|+++...+  +|.++.|+|++.++ .+.|+++-........+....+..
T Consensus       155 ~~~~~l~g~~g~d~l~~~~~~~~~~~~~~~G~d~~~~~~~~~d~~~~~~~~~~~~~~~~~g~d  217 (510)
T COG2931         155 NGNDILIGGSGNDTLTGNSGADDRVINGGAGDDTILGGGNGDDTVFGGGSGGNATLDGGNGGD  217 (510)
T ss_pred             CCCCEECCCCCCCEEECCCCCCCEEEECCCCCCEEECCCCCCCEEEECCCCCCCCCCCCCCCC
T ss_conf             223232157766358146666510575243640330456677416712346750012677877


No 21 
>COG2931 RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=98.18  E-value=7.3e-06  Score=52.82  Aligned_cols=127  Identities=20%  Similarity=0.201  Sum_probs=77.2

Q ss_pred             EEEEECCCCCCEEEECCCCCEE-ECCCCCCEEECCCCCEEEEEECCCCCEEEEEECCCEEEEECCCCCCEEEEEEE--EE
Q ss_conf             7446617765168826667468-66786345888887338998236651378720582599956888725675467--99
Q gi|254780384|r  268 IESVVGSSGNDVVIGNSANNVF-FESSGNDVFDGSSGLDTFFYYAPSYLYKIYRFENAIIVYDANEHKQDFLSDVE--RL  344 (665)
Q Consensus       268 IEna~g~~g~D~i~gn~~~n~l-~gg~g~d~~~g~~g~Dt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~d~~~~iE--~~  344 (665)
                      .+.+.|..++|++.|..+++.| .++.|+|+|.|+.|+|++++....  ..++...+...+..+  .+.|++....  ..
T Consensus       275 ~d~l~G~~g~d~~~g~~g~d~~~~gg~g~d~l~gg~gnd~~~~~~~~--d~i~~~~~~~~~~gg--~g~d~~~~~~~~~~  350 (510)
T COG2931         275 NDTLKGGAGNDTLLGGAGNDTLTIGGGGNDTLDGGAGNDTLDFSGGD--DTIYGGAGNDTLDGG--AGNDTLAGNAGALA  350 (510)
T ss_pred             CCCCCCCCCCCEEECCCCCCEEECCCCCCEECCCCCCCCEEECCCCC--CEEEECCCCCCEECC--CCCCEEEECCCCCE
T ss_conf             45334677864474456754121167662332466777679513776--337703576535325--67662550334304


Q ss_pred             EECCCE--ECCCCCCCCCEEECCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCC
Q ss_conf             963840--012444442024245787516764034201154445641434586313
Q gi|254780384|r  345 RFADQC--IDSVNVKQRSILEYTASYEDLIQVIGQDVFESMKHFYEFGLSEGRDIT  398 (665)
Q Consensus       345 ~f~d~t--~~~~~~~~~d~l~~~a~~ddl~~~~G~d~~~~~~hy~~~G~~EGrd~~  398 (665)
                      .|..+.  ....+...++.+...+++|.|.+..|.|++.+..+.+.+-++-|.|.-
T Consensus       351 ~~~gg~g~d~i~g~~~~~~l~Gg~gnd~l~Gg~G~D~l~gg~G~D~l~GG~G~D~f  406 (510)
T COG2931         351 LLNGGDGNDTISGNDGNDTLIGGAGNDTLSGGAGSDTLVGGGGNDTLTGGAGADTF  406 (510)
T ss_pred             ECCCCCCCCEEECCCCCCEEECCCCCCEEECCCCCCEECCCCCCCEEECCCCCCEE
T ss_conf             42588654243047764435324566556536666144168775568768765457


No 22 
>pfam05572 Peptidase_M43 Pregnancy-associated plasma protein-A. Pregnancy-associated plasma protein A (PAPP-A) is a metallo-protease belonging to Merops family M43. It cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the human ovary and the cardiovascular system.
Probab=97.70  E-value=1.8e-05  Score=50.39  Aligned_cols=55  Identities=25%  Similarity=0.207  Sum_probs=30.0

Q ss_pred             CCEEEEECCCCCCCCCC--CCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCCCC
Q ss_conf             85356631467753122--4433544333666667346630032346551878888766
Q gi|254780384|r   85 NNGYVCTPRYDSRYFME--INFDKADVWRYGIGKGTILSHNALHEIGHALGLMHPGSYN  141 (665)
Q Consensus        85 ~~g~a~~p~~~~~~~~~--~~~~~~~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~~~  141 (665)
                      ..|++.+|.........  +.++....... ....+. ..|++|||||-|||-|+|..+
T Consensus        34 ~lGyA~~P~~~~~~d~~~~v~~~~~~~G~~-~~~~~~-g~TltHE~GH~lgL~Htf~~~   90 (152)
T pfam05572        34 NSGVAWYPDSGMSADGVARVTFNGAYLGAD-STSTNF-SSTLTHEFGHFLGLRHTFEGG   90 (152)
T ss_pred             CCEEEECCCCCCCCCCCEEEEEECCCCCCC-CCCCCC-CCCEEEECHHHHCCCCCCCCC
T ss_conf             286874899988888732899805425677-776666-650245312241743312689


No 23 
>cd04275 ZnMc_pappalysin_like Zinc-dependent metalloprotease, pappalysin_like subfamily. The pregnancy-associated plasma protein A (PAPP-A or pappalysin-1) cleaves insulin-like growth factor-binding proteins 4 and 5, thereby promoting cell growth by releasing bound growth factor. This model includes pappalysins and related metalloprotease domains from all three kingdoms of life. The three-dimensional structure of an archaeal representative, ulilysin, has been solved.
Probab=97.37  E-value=7.8e-05  Score=46.60  Aligned_cols=55  Identities=24%  Similarity=0.271  Sum_probs=30.3

Q ss_pred             CCEEEEECCCCCCC--CC-CCCCCCCCCCCCCCCCCCEE-EEEHHHHHHHHHCCCCCCCCC
Q ss_conf             85356631467753--12-24433544333666667346-630032346551878888766
Q gi|254780384|r   85 NNGYVCTPRYDSRY--FM-EINFDKADVWRYGIGKGTIL-SHNALHEIGHALGLMHPGSYN  141 (665)
Q Consensus        85 ~~g~a~~p~~~~~~--~~-~~~~~~~~~~~~~~~~g~y~-~~t~iHEIGHALGL~Hp~~~~  141 (665)
                      ..|++.+|......  .. .+.++....  .+.....|. -.|++|||||-|||-|.|...
T Consensus       100 ~lG~a~fP~~~~~~~~~~dGvvi~~~~~--~~~~~~~~~~g~TltHEvGH~lGL~Htf~~~  158 (225)
T cd04275         100 LLGYATFPDSLVSLAFITDGVVINPSSL--PGGSAAPYNLGDTATHEVGHWLGLYHTFQGG  158 (225)
T ss_pred             EEEEEECCCCCCCCCCCCCEEEEECCEE--CCCCCCCCCCCCEEEEECCHHCCCEEECCCC
T ss_conf             0189868988889885378799916362--7987777665431466622010631001589


No 24 
>pfam02031 Peptidase_M7 Streptomyces extracellular neutral proteinase (M7) family.
Probab=97.04  E-value=0.001  Score=39.83  Aligned_cols=89  Identities=19%  Similarity=0.278  Sum_probs=45.0

Q ss_pred             EEEEEEEECCHHHHHHHHHHHHHH-HHHHCCEEEEECCCCCEEEECCCCE-EEEECCCCCCCCCCCCCCCCCCCCCCCCC
Q ss_conf             026887507999999999999988-6574756898247871689615853-56631467753122443354433366666
Q gi|254780384|r   39 RLSFDITELNPNAQEVARWALGEW-SKVVDLTFEETSIHSDIKFISSNNG-YVCTPRYDSRYFMEINFDKADVWRYGIGK  116 (665)
Q Consensus        39 ~~t~~it~l~~~~~~~~R~Al~~w-s~va~ItF~Ev~~~~d~~f~~~~~g-~a~~p~~~~~~~~~~~~~~~~~~~~~~~~  116 (665)
                      +++++.+.--.-..+.+ .+.+.| +.|.|++++| ...+++++...+.. ..+.+..+.+ -+.++.....      ..
T Consensus         3 Tv~YdaS~A~~f~s~ia-~~aaiWN~sV~NV~L~~-g~~a~~~~~~~~~~~gs~a~t~g~G-~g~i~~~~~~------~q   73 (132)
T pfam02031         3 TVTYDASNAPSFRSQIA-RSAQIWNSSVSNVRLQE-GSNADFTYYEGNDSRGSYASTDGHG-RGYIFLDYRQ------NQ   73 (132)
T ss_pred             EEEECCCCCCHHHHHHH-HHHHHHHCCCCCEEEEC-CCCCCEEEEECCCCCCCCCCCCCCC-CEEEEEEHHH------HC
T ss_conf             99974777831799999-99988745577467504-8887779995788887656437998-4789724388------65


Q ss_pred             CCEEEEEHHHHHHHHHCCCC
Q ss_conf             73466300323465518788
Q gi|254780384|r  117 GTILSHNALHEIGHALGLMH  136 (665)
Q Consensus       117 g~y~~~t~iHEIGHALGL~H  136 (665)
                      +-..--..-|||||.|||--
T Consensus        74 gy~~~RI~aHE~GH~LGLPD   93 (132)
T pfam02031        74 QYDSTRVTAHETGHVLGLPD   93 (132)
T ss_pred             CCCCHHHHHHHHCCCCCCCC
T ss_conf             88723110013222247998


No 25 
>COG5549 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=96.97  E-value=0.00048  Score=41.84  Aligned_cols=92  Identities=24%  Similarity=0.255  Sum_probs=53.4

Q ss_pred             ECCHHH--HHHHHHHHHHHHHHHCCEEEEECCCCCEEEECCCC---EEEEECCCCCCCCCCC--CCCC--CC-----CCC
Q ss_conf             079999--99999999988657475689824787168961585---3566314677531224--4335--44-----333
Q gi|254780384|r   46 ELNPNA--QEVARWALGEWSKVVDLTFEETSIHSDIKFISSNN---GYVCTPRYDSRYFMEI--NFDK--AD-----VWR  111 (665)
Q Consensus        46 ~l~~~~--~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~~~~---g~a~~p~~~~~~~~~~--~~~~--~~-----~~~  111 (665)
                      ....++  ++++.+|.|.|.+..++.-+|-++-+||++.-.+.   |....+.........-  +.+.  ..     ..+
T Consensus        98 n~~~ap~wq~a~~tava~wa~~fpl~ive~~eeaDItie~~n~pgtg~~~~grArtal~~~efl~~~~~~~~lnhr~~i~  177 (236)
T COG5549          98 NVEGAPRWQGAYLTAVAGWAKTFPLIIVERFEEADITIEVGNPPGTGWRQYGRARTALIAYEFLGHALGLGHLNHRGDIM  177 (236)
T ss_pred             CCCCCHHHHHHHHHHHHHHHHHCCCEEEECCEEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC
T ss_conf             32116068889999999898738805640120200799941789876223236778888777865036752113430025


Q ss_pred             CCC-CCCCEEEEEHHHHHHHHHCCC-CC
Q ss_conf             666-667346630032346551878-88
Q gi|254780384|r  112 YGI-GKGTILSHNALHEIGHALGLM-HP  137 (665)
Q Consensus       112 ~~~-~~g~y~~~t~iHEIGHALGL~-Hp  137 (665)
                      ..+ -...+--.|+.||||||||+- |.
T Consensus       178 ~~pg~~~e~L~~tarhElGhaLgi~ghs  205 (236)
T COG5549         178 YPPGELRENLNPTARHELGHALGIWGHS  205 (236)
T ss_pred             CCCCCCHHHHHHHHHHHHCCHHEECCCC
T ss_conf             7864016664478887632021111334


No 26 
>COG1913 Predicted Zn-dependent proteases [General function prediction only]
Probab=96.06  E-value=0.0019  Score=38.19  Aligned_cols=17  Identities=53%  Similarity=0.872  Sum_probs=14.2

Q ss_pred             EEEHHHHHHHHHCCCCC
Q ss_conf             63003234655187888
Q gi|254780384|r  121 SHNALHEIGHALGLMHP  137 (665)
Q Consensus       121 ~~t~iHEIGHALGL~Hp  137 (665)
                      +--.+|||||.|||+|=
T Consensus       125 ~KEv~HElGH~~GL~HC  141 (181)
T COG1913         125 VKEVLHELGHLLGLSHC  141 (181)
T ss_pred             HHHHHHHHHHHCCCCCC
T ss_conf             99998876534174658


No 27 
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=96.01  E-value=0.0072  Score=34.74  Aligned_cols=22  Identities=36%  Similarity=0.390  Sum_probs=18.0

Q ss_pred             EEEEHHHHHHHHHCCCCCCCCC
Q ss_conf             6630032346551878888766
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPGSYN  141 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~~~~  141 (665)
                      +..|+.|||||.||+.|-++..
T Consensus       145 ~~~t~AHElGH~lG~~HD~~~~  166 (220)
T cd04272         145 GVYTMTHELAHLLGAPHDGSPP  166 (220)
T ss_pred             HHHHHHHHHHHHCCCCCCCCCC
T ss_conf             0334567667642886677888


No 28 
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=95.92  E-value=0.0073  Score=34.70  Aligned_cols=38  Identities=29%  Similarity=0.366  Sum_probs=24.5

Q ss_pred             EEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
Q ss_conf             66300323465518788887666766667775410142024467523
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYF  166 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~  166 (665)
                      .-.|+.|||||.||+.|-.+...-.         .......-|||+.
T Consensus       133 ~a~t~AHElGH~lGm~HD~~~~~~~---------~~~~~~~~IMs~~  170 (192)
T cd04267         133 TALTMAHELGHNLGAEHDGGDELAF---------ECDGGGNYIMAPV  170 (192)
T ss_pred             EEEEHHHHHHHHCCCCCCCCCCCCC---------CCCCCCCEECCCC
T ss_conf             1212256666344886789987467---------7899998871687


No 29 
>pfam05547 Peptidase_M6 Immune inhibitor A peptidase M6. The insect pathogenic Gram-positive Bacillus thuringiensis secretes immune inhibitor A, a metallopeptidase, which specifically cleaves host antibacterial proteins. A homologue of immune inhibitor A, PrtV, has been identified in the Gram-negative human pathogen Vibrio cholerae.
Probab=95.61  E-value=0.016  Score=32.60  Aligned_cols=73  Identities=22%  Similarity=0.095  Sum_probs=39.9

Q ss_pred             CCCCEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHH
Q ss_conf             66734663003234655187888876667666677754101420244675235677666655200021122148999986
Q gi|254780384|r  115 GKGTILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKI  194 (665)
Q Consensus       115 ~~g~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~l  194 (665)
                      .+.+-+.-++.||.||.|||  |=+|+-...      ...+....|++||...-.....   ..-|..+-.+.-+=+|..
T Consensus       217 qPed~~~GVfaHEfGHdLGL--PD~YDT~~~------g~g~~vg~WslMssGSw~G~~~---gt~p~~fs~~~k~~lq~~  285 (646)
T pfam05547       217 QPEDGAAGVFAHEYGHDLGL--PDEYDTQYT------GGGEPVGYWSLMSSGSWAGKIA---GTEPTSFSPQNKEFLQKQ  285 (646)
T ss_pred             CCCCCCEEEEEEECCCCCCC--CCCCCCCCC------CCCCCCCCEEEECCCCCCCCCC---CCCCCCCCHHHHHHHHHH
T ss_conf             56677347898413444689--750136777------8998744366522675378668---988887687889999886


Q ss_pred             CCCC
Q ss_conf             0876
Q gi|254780384|r  195 YGAP  198 (665)
Q Consensus       195 YGan  198 (665)
                      ||-+
T Consensus       286 ~Gg~  289 (646)
T pfam05547       286 IGGN  289 (646)
T ss_pred             HCCC
T ss_conf             4589


No 30 
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=95.22  E-value=0.0064  Score=35.06  Aligned_cols=19  Identities=42%  Similarity=0.690  Sum_probs=15.6

Q ss_pred             EEHHHHHHHHHCCCCCCCC
Q ss_conf             3003234655187888876
Q gi|254780384|r  122 HNALHEIGHALGLMHPGSY  140 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp~~~  140 (665)
                      -..+|||||.+||+|=-++
T Consensus       125 Ke~~HE~GH~~GL~HC~~~  143 (177)
T PRK13267        125 KEVTHELGHTLGLEHCDNP  143 (177)
T ss_pred             HHHHHHHHHHCCCCCCCCC
T ss_conf             8874355766376568999


No 31 
>pfam10462 Peptidase_M66 Peptidase M66. This family of metallopeptidases contains StcE, a virulence factor found in Shiga toxigenic Escherichia coli organisms. StcE peptidase cleaves C1 esterase inhibitor.
Probab=95.17  E-value=0.0053  Score=35.55  Aligned_cols=16  Identities=44%  Similarity=0.578  Sum_probs=14.1

Q ss_pred             EEHHHHHHHHHCCCCC
Q ss_conf             3003234655187888
Q gi|254780384|r  122 HNALHEIGHALGLMHP  137 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp  137 (665)
                      -.++||+||+.||.|=
T Consensus       194 nefsHE~GHa~GL~Hy  209 (304)
T pfam10462       194 NEFSHEQGHNYGLGHY  209 (304)
T ss_pred             CEEEECCHHHCCCCCC
T ss_conf             7547202101467867


No 32 
>pfam07998 Peptidase_M54 Peptidase family M54. This is a family of metallopeptidases. Two human proteins have been reported to degrade synthetic substrates and peptides.
Probab=95.05  E-value=0.008  Score=34.47  Aligned_cols=21  Identities=33%  Similarity=0.607  Sum_probs=16.7

Q ss_pred             EEEEHHHHHHHHHCCCCCCCC
Q ss_conf             663003234655187888876
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPGSY  140 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~~~  140 (665)
                      ..-..+|||||.+||+|=-++
T Consensus       126 ~~Ke~~HElGH~fGL~HC~~~  146 (176)
T pfam07998       126 VVKEVTHELGHTYGLSHCNNT  146 (176)
T ss_pred             HHHHHHHHHHHHCCCCCCCCC
T ss_conf             999999998886277568999


No 33 
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=94.78  E-value=0.0088  Score=34.22  Aligned_cols=21  Identities=33%  Similarity=0.447  Sum_probs=18.6

Q ss_pred             CCEEEEEHHHHHHHHHCCCCC
Q ss_conf             734663003234655187888
Q gi|254780384|r  117 GTILSHNALHEIGHALGLMHP  137 (665)
Q Consensus       117 g~y~~~t~iHEIGHALGL~Hp  137 (665)
                      -++.++++-|||||.||.-|-
T Consensus       142 t~~~w~v~AHEiGHnfGa~HD  162 (228)
T cd04271         142 TSNEWQVFAHEIGHTFGAVHD  162 (228)
T ss_pred             ECCCEEEEEEHHHCCCCCCCC
T ss_conf             536405633013122578667


No 34 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=94.76  E-value=0.012  Score=33.47  Aligned_cols=19  Identities=37%  Similarity=0.606  Sum_probs=15.6

Q ss_pred             EEHHHHHHHHHCCCCCCCC
Q ss_conf             3003234655187888876
Q gi|254780384|r  122 HNALHEIGHALGLMHPGSY  140 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp~~~  140 (665)
                      .++.|||||.||+.|-.++
T Consensus       133 ~~~AHElGH~lG~~HD~~~  151 (194)
T cd04269         133 VTMAHELGHNLGMEHDDGG  151 (194)
T ss_pred             HHHHHHHHHHCCCCCCCCC
T ss_conf             9998998864488668998


No 35 
>pfam11350 DUF3152 Protein of unknown function (DUF3152). Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function.
Probab=94.66  E-value=0.022  Score=31.79  Aligned_cols=87  Identities=20%  Similarity=0.244  Sum_probs=50.7

Q ss_pred             HHHHHHHHHHH---HHHHHHHCCEEEEECCC-CCEEEECCCCEE---EEECC----------CCCCCCCCC-CCCCCCCC
Q ss_conf             99999999999---98865747568982478-716896158535---66314----------677531224-43354433
Q gi|254780384|r   49 PNAQEVARWAL---GEWSKVVDLTFEETSIH-SDIKFISSNNGY---VCTPR----------YDSRYFMEI-NFDKADVW  110 (665)
Q Consensus        49 ~~~~~~~R~Al---~~ws~va~ItF~Ev~~~-~d~~f~~~~~g~---a~~p~----------~~~~~~~~~-~~~~~~~~  110 (665)
                      +.-.+++...|   ..|...-.+.|+.|... .|+++...+-..   ...+.          .....++.. |......+
T Consensus        51 ~~fA~~V~~tL~DpRgW~~~g~~~F~rV~~g~~Df~i~Lasp~Tt~~lC~gl~t~~e~SC~~~~~VviN~~RW~~ga~~f  130 (189)
T pfam11350        51 DAFAAMVDATLADPRGWTHDGTFAFRRVDSGAPDFRISLASPGTTRELCAGLDTGGEVSCYNGDRVVINEARWVRGAPPF  130 (189)
T ss_pred             HHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCEEEEECCCCCHHHCCCCCCCCCEEEECCCCEEEEEHHHHHCCCCCC
T ss_conf             99999999985699876668861589867899988999778660743255767687787469997998789965368653


Q ss_pred             CCCCCCCCEEEEEHHHHHHHHHCCCCC
Q ss_conf             366666734663003234655187888
Q gi|254780384|r  111 RYGIGKGTILSHNALHEIGHALGLMHP  137 (665)
Q Consensus       111 ~~~~~~g~y~~~t~iHEIGHALGL~Hp  137 (665)
                        +-...+|--+.+-||+||+||-.|-
T Consensus       131 --~gdl~~YR~y~INHEVGH~LG~~H~  155 (189)
T pfam11350       131 --EGDLGSYRQYVINHEVGHAIGYGHE  155 (189)
T ss_pred             --CCCHHHHHHHHHHHHHHHHCCCCCC
T ss_conf             --6589999999870323201255776


No 36 
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=94.52  E-value=0.015  Score=32.81  Aligned_cols=20  Identities=40%  Similarity=0.733  Sum_probs=16.3

Q ss_pred             EEEHHHHHHHHHCCCCCCCC
Q ss_conf             63003234655187888876
Q gi|254780384|r  121 SHNALHEIGHALGLMHPGSY  140 (665)
Q Consensus       121 ~~t~iHEIGHALGL~Hp~~~  140 (665)
                      -.|+.|||||.||+.|-.+.
T Consensus       141 a~t~AHElGHnLGm~HD~~~  160 (207)
T cd04273         141 AFTIAHELGHVLGMPHDGDG  160 (207)
T ss_pred             HHHHHHHHHHHCCCCCCCCC
T ss_conf             99999987764298668999


No 37 
>pfam01421 Reprolysin Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins, e.g. fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
Probab=94.28  E-value=0.016  Score=32.70  Aligned_cols=19  Identities=37%  Similarity=0.681  Sum_probs=15.5

Q ss_pred             EEEHHHHHHHHHCCCCCCC
Q ss_conf             6300323465518788887
Q gi|254780384|r  121 SHNALHEIGHALGLMHPGS  139 (665)
Q Consensus       121 ~~t~iHEIGHALGL~Hp~~  139 (665)
                      -.|+.|||||.||+.|-.+
T Consensus       132 a~t~AHElGH~LG~~HD~~  150 (198)
T pfam01421       132 AVTMAHELGHNLGMTHDDI  150 (198)
T ss_pred             EEEHHHHHHHHCCCCCCCC
T ss_conf             1001233451157757899


No 38 
>pfam05548 Peptidase_M11 Gametolysin peptidase M11. In the unicellular biflagellated alga, Chlamydomonas reinhardtii, gametolysin, a zinc-containing metallo-protease, is responsible for the degradation of the cell wall. Homologues of gametolysin have also been reported in the simple multicellular organism, Volvox.
Probab=94.01  E-value=0.021  Score=31.97  Aligned_cols=32  Identities=25%  Similarity=0.321  Sum_probs=23.4

Q ss_pred             CCCCCCCCCCCEEEEEHHHHHHHHHCCCCCCC
Q ss_conf             43336666673466300323465518788887
Q gi|254780384|r  108 DVWRYGIGKGTILSHNALHEIGHALGLMHPGS  139 (665)
Q Consensus       108 ~~~~~~~~~g~y~~~t~iHEIGHALGL~Hp~~  139 (665)
                      ..|....+.+...+.|++||.+|-+||.|.+.
T Consensus       124 ~~w~~~~~~~~~~~~t~~hE~~HN~GL~Ha~~  155 (303)
T pfam05548       124 QTWLQTSGYGVQRWATIMQEAIHNYGLWHSWR  155 (303)
T ss_pred             CEEEEECCCCCCCCHHHHHHHHHHHHHHCCCC
T ss_conf             07999656555661056664533323110446


No 39 
>pfam12044 Metallopep Putative peptidase family. This family of proteins is functionally uncharacterized. However, it does contain an HEXXH motif characteristic of metallopeptidases. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 625 to 773 amino acids in length.
Probab=93.85  E-value=0.021  Score=31.89  Aligned_cols=21  Identities=48%  Similarity=0.661  Sum_probs=15.3

Q ss_pred             EEEEEHHHHHHHHHCCCCCCC
Q ss_conf             466300323465518788887
Q gi|254780384|r  119 ILSHNALHEIGHALGLMHPGS  139 (665)
Q Consensus       119 y~~~t~iHEIGHALGL~Hp~~  139 (665)
                      .+.-.++|||||++||.|.-+
T Consensus       312 i~lGA~lHEiGH~fg~pH~~~  332 (419)
T pfam12044       312 ITIGAFLHEIGHLFGLPHQED  332 (419)
T ss_pred             HHHHHHHHHHHHHCCCCCCCC
T ss_conf             312088775464248999988


No 40 
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than pfam05547.
Probab=92.61  E-value=0.078  Score=28.49  Aligned_cols=41  Identities=24%  Similarity=-0.009  Sum_probs=25.5

Q ss_pred             CEEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC
Q ss_conf             3466300323465518788887666766667775410142024467523
Q gi|254780384|r  118 TILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYF  166 (665)
Q Consensus       118 ~y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~  166 (665)
                      .-+.-|+.||.||.|||-.-=++.....        ..--..|.+|+..
T Consensus       163 ~~~igv~~HE~gH~lGlPDlYd~~~~~~--------~~gvG~wdlM~~G  203 (286)
T TIGR03296       163 DGGVGVIAHELGHDLGLPDLYDTSYDGG--------GEPVGYWSLMSSG  203 (286)
T ss_pred             CCCEEEEEHHHHCCCCCCCCCCCCCCCC--------CCCCCCEEECCCC
T ss_conf             8735567522214038975355567888--------8888761121488


No 41 
>pfam11996 DUF3491 Protein of unknown function (DUF3491). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 286 to 3225 amino acids in length. This protein is found associated with pfam04488. This protein is found associated with pfam04488.
Probab=90.66  E-value=0.93  Score=21.99  Aligned_cols=32  Identities=16%  Similarity=0.014  Sum_probs=12.5

Q ss_pred             EEEEECCCCCEEEEE--EECCCCCCEEECCCCCC
Q ss_conf             898722897325998--50478742474458731
Q gi|254780384|r  219 IQTIYDSGGDDLLDL--TLIGGVKNYIDLNTESW  250 (665)
Q Consensus       219 ~~tI~D~gG~DTid~--s~~~~~~~~IdL~~g~~  250 (665)
                      .++-.|..+.-||+.  ...+....+|||+.--.
T Consensus       347 i~~~s~~~~~~~i~I~LadD~~~pqtiDls~ivP  380 (934)
T pfam11996       347 IFSSSPTVQTKTIDIILADDNDHPQTIDLSSIVP  380 (934)
T ss_pred             EEECCCCCCCEEEEEEEECCCCCCCEEEHHHCCC
T ss_conf             9804876783479999925899853678536050


No 42 
>TIGR00181 pepF oligoendopeptidase F; InterPro: IPR004438   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), the type example being oligoendopeptidase F from Lactococcus lactis. The enzyme hydrolyses peptides of 7 and 17 amino acids with fairly broad specificity. Differences in substrate specificity should be expected in other species. The gene is duplicated in L. lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis.
Probab=90.33  E-value=0.075  Score=28.59  Aligned_cols=67  Identities=19%  Similarity=0.376  Sum_probs=33.9

Q ss_pred             EEECCHHHHHHHHHHH-HH-HHHHHCCEEEEECCCCCEEEECCCCEEEEECCCC--CCCCCCCCCCCCCCCCCCCCCCCE
Q ss_conf             7507999999999999-98-8657475689824787168961585356631467--753122443354433366666734
Q gi|254780384|r   44 ITELNPNAQEVARWAL-GE-WSKVVDLTFEETSIHSDIKFISSNNGYVCTPRYD--SRYFMEINFDKADVWRYGIGKGTI  119 (665)
Q Consensus        44 it~l~~~~~~~~R~Al-~~-ws~va~ItF~Ev~~~~d~~f~~~~~g~a~~p~~~--~~~~~~~~~~~~~~~~~~~~~g~y  119 (665)
                      +..|.++-...+|.|| .. |=|++     |..  |-      .+|+.....+.  ...++.-+|....         --
T Consensus       331 L~~LG~EY~~~~~~a~~~~RWvD~~-----en~--gK------r~GAYS~g~y~Gl~~pYILMNw~~~~---------~~  388 (611)
T TIGR00181       331 LEPLGEEYVKILKRAFSEERWVDYY-----ENK--GK------RSGAYSIGGYKGLVKPYILMNWDGTV---------LN  388 (611)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCEEEC-----CCC--CC------CCCCEECCCCCCCCCCEEEECCCCCC---------HH
T ss_conf             7650388899999843799717535-----667--78------55511237889866543671446750---------22


Q ss_pred             EEEEHHHHHHHHH
Q ss_conf             6630032346551
Q gi|254780384|r  120 LSHNALHEIGHAL  132 (665)
Q Consensus       120 ~~~t~iHEIGHAL  132 (665)
                      ...||+||+||++
T Consensus       389 ~~~TLaHE~GHS~  401 (611)
T TIGR00181       389 SVFTLAHELGHSM  401 (611)
T ss_pred             HHHHHHHHHHHHH
T ss_conf             2579999855679


No 43 
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family; InterPro: IPR011976   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). The family is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011977 from INTERPRO. Likely substrates are small peptides and not whole proteins, as with PepF, but members are not characterised and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family..
Probab=89.80  E-value=0.099  Score=27.87  Aligned_cols=73  Identities=25%  Similarity=0.250  Sum_probs=35.1

Q ss_pred             HHHHHHHHHHHHHHHHCCEEEEECCCCCEEEEC--CC--CEEE-EECCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHH
Q ss_conf             999999999988657475689824787168961--58--5356-631467753122443354433366666734663003
Q gi|254780384|r   51 AQEVARWALGEWSKVVDLTFEETSIHSDIKFIS--SN--NGYV-CTPRYDSRYFMEINFDKADVWRYGIGKGTILSHNAL  125 (665)
Q Consensus        51 ~~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~--~~--~g~a-~~p~~~~~~~~~~~~~~~~~~~~~~~~g~y~~~t~i  125 (665)
                      ..+.+..++...+.-+.--|.-.-...=+.+..  +-  .|+- ++|.+.. .++-+|+|..          .+.--|++
T Consensus       277 ~~~~~~~my~~l~~~f~~~F~~m~~~nlLDL~~r~~Ka~GGyC~~Lp~~~~-pFiFsNfngt----------~~Di~~l~  345 (553)
T TIGR02289       277 LLEKAEKMYKELSLEFKEFFNFMLEKNLLDLESRKGKAAGGYCTYLPKYKA-PFIFSNFNGT----------SGDIDVLT  345 (553)
T ss_pred             HHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC-CEEEECCCCC----------CCHHHHHH
T ss_conf             999999999840878999999887528834313789877743153556678-6078647888----------31367888


Q ss_pred             HHHHHHHCC
Q ss_conf             234655187
Q gi|254780384|r  126 HEIGHALGL  134 (665)
Q Consensus       126 HEIGHALGL  134 (665)
                      ||+|||+=.
T Consensus       346 HE~GHafh~  354 (553)
T TIGR02289       346 HEAGHAFHV  354 (553)
T ss_pred             HHHHHHHHH
T ss_conf             865688976


No 44 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=89.72  E-value=0.22  Score=25.77  Aligned_cols=20  Identities=30%  Similarity=0.469  Sum_probs=16.3

Q ss_pred             EEEHHHHHHHHHCCCCCCCC
Q ss_conf             63003234655187888876
Q gi|254780384|r  121 SHNALHEIGHALGLMHPGSY  140 (665)
Q Consensus       121 ~~t~iHEIGHALGL~Hp~~~  140 (665)
                      -.|+.|||||.||..|-.+.
T Consensus       168 ~~~~~he~gh~~~~~h~~~~  187 (244)
T cd04270         168 DLVTAHELGHNFGSPHDPDI  187 (244)
T ss_pred             EEEEEHHHHCCCCCCCCCCC
T ss_conf             25521112202689888998


No 45 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=89.52  E-value=0.11  Score=27.56  Aligned_cols=24  Identities=38%  Similarity=0.607  Sum_probs=19.1

Q ss_pred             CCCCCCEEEEEHHHHHHHHHCCCC
Q ss_conf             666673466300323465518788
Q gi|254780384|r  113 GIGKGTILSHNALHEIGHALGLMH  136 (665)
Q Consensus       113 ~~~~g~y~~~t~iHEIGHALGL~H  136 (665)
                      .-..|..-.+++||||||-.||+-
T Consensus       102 eetlgd~vthvliHEIgHhFGLsD  125 (136)
T COG3824         102 EETLGDQVTHVLIHEIGHHFGLSD  125 (136)
T ss_pred             HHHHHHHHHHHHHHHHHHHCCCCH
T ss_conf             366765744302353221238777


No 46 
>KOG4525 consensus
Probab=86.04  E-value=0.33  Score=24.70  Aligned_cols=80  Identities=20%  Similarity=0.089  Sum_probs=36.8

Q ss_pred             EEEEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEE---CCHHHHHHH-HHHHH
Q ss_conf             46630032346551878888766676666777541014202446752356776666552000---211221489-99986
Q gi|254780384|r  119 ILSHNALHEIGHALGLMHPGSYNGGYPVYGVDNDYENDSFLTSVMSYFMPQDSMMDASFGYC---ATPMVSDIV-AVQKI  194 (665)
Q Consensus       119 y~~~t~iHEIGHALGL~Hp~~~~~g~~~~~~~~~~~~d~~~~TvMSY~~~~~~~~~~~~~~~---~tpm~~DI~-Alq~l  194 (665)
                      .+.-.++|||||.||+.|.-+.     ++...-++  -++.+|+--.-.   .-....+..|   ..-...-|. ++..+
T Consensus       300 itlGA~~HElGH~lgcpHq~~G-----vMlrgy~~--lNRsftt~e~~~---vrt~s~g~aPv~~keec~w~~lD~~Rf~  369 (614)
T KOG4525         300 ITLGAVCHELGHCLGCPHQSEG-----VMLRGYDY--LNRSFTTDEFEC---VRTGSKGLAPVLAKEECPWRILDARRFS  369 (614)
T ss_pred             HHHHHHHHHHHHCCCCCCCCCC-----EEECCCHH--HCCEEEECCEEE---EECCCCCCCCCCCCCCCCHHHHHHHHHC
T ss_conf             5788999875530489988773-----57437311--120134020025---5227767687553022746666445420


Q ss_pred             CCCCCCCCCCCEEE
Q ss_conf             08766666773463
Q gi|254780384|r  195 YGAPKGSLGNNIYK  208 (665)
Q Consensus       195 YGan~~~~gdt~y~  208 (665)
                      |-...+-.+|-+|.
T Consensus       370 yhp~f~lp~dptyp  383 (614)
T KOG4525         370 YHPTFSLPGDPTYP  383 (614)
T ss_pred             CCCCCCCCCCCCCC
T ss_conf             14454689999887


No 47 
>KOG3607 consensus
Probab=85.98  E-value=0.25  Score=25.41  Aligned_cols=19  Identities=37%  Similarity=0.730  Sum_probs=15.2

Q ss_pred             EEEEHHHHHHHHHCCCCCC
Q ss_conf             6630032346551878888
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPG  138 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~  138 (665)
                      ...++-|||||.||+.|-=
T Consensus       323 ~a~v~AhelgH~lGm~hD~  341 (716)
T KOG3607         323 FAVVLAHELGHNLGMIHDE  341 (716)
T ss_pred             HHHHHHHHHHHHCCCCCCC
T ss_conf             7899999878652850066


No 48 
>pfam00353 HemolysinCabind Hemolysin-type calcium-binding repeat (2 copies).
Probab=85.27  E-value=0.42  Score=24.07  Aligned_cols=20  Identities=35%  Similarity=0.683  Sum_probs=14.0

Q ss_pred             ECCCCCCEEECCCCCEEEEE
Q ss_conf             66786345888887338998
Q gi|254780384|r  290 FESSGNDVFDGSSGLDTFFY  309 (665)
Q Consensus       290 ~gg~g~d~~~g~~g~Dt~~~  309 (665)
                      .|+.|+|+|+|+.|+|++..
T Consensus         1 ~Gg~GnD~L~Gg~GnD~i~~   20 (26)
T pfam00353         1 YGGAGNDTLYGGAGNDTLXX   20 (26)
T ss_pred             CCCCCCCCEECCCCCCCEEE
T ss_conf             98766662064668761310


No 49 
>pfam06262 DUF1025 Domain of unknown function (DUF1025). Family of bacterial protein with undetermined function.
Probab=82.51  E-value=0.47  Score=23.80  Aligned_cols=21  Identities=43%  Similarity=0.833  Sum_probs=16.4

Q ss_pred             CCCEEEEEHHHHHHHHHCCCC
Q ss_conf             673466300323465518788
Q gi|254780384|r  116 KGTILSHNALHEIGHALGLMH  136 (665)
Q Consensus       116 ~g~y~~~t~iHEIGHALGL~H  136 (665)
                      ...--..|++|||||=+|++.
T Consensus        68 L~~~V~~tviHEiaH~fG~dd   88 (96)
T pfam06262        68 LGELVRHVVIHEIGHHFGLSD   88 (96)
T ss_pred             HHHHHHHHHHHHHHHHCCCCH
T ss_conf             999999999999887708988


No 50 
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=81.50  E-value=0.46  Score=23.85  Aligned_cols=14  Identities=36%  Similarity=0.444  Sum_probs=11.2

Q ss_pred             EEEEEHHHHHHHHH
Q ss_conf             46630032346551
Q gi|254780384|r  119 ILSHNALHEIGHAL  132 (665)
Q Consensus       119 y~~~t~iHEIGHAL  132 (665)
                      ....|+.||+|||+
T Consensus       221 ~dv~TLaHElGHa~  234 (427)
T cd06459         221 DDVFTLAHELGHAF  234 (427)
T ss_pred             CCHHHHHHHHHHHH
T ss_conf             31568898631899


No 51 
>pfam03272 Enhancin Viral enhancin protein.
Probab=81.02  E-value=0.92  Score=22.02  Aligned_cols=26  Identities=8%  Similarity=-0.128  Sum_probs=16.6

Q ss_pred             CCCCCCCEEECCCCCCCEEEEECCCC
Q ss_conf             44444202424578751676403420
Q gi|254780384|r  354 VNVKQRSILEYTASYEDLIQVIGQDV  379 (665)
Q Consensus       354 ~~~~~~d~l~~~a~~ddl~~~~G~d~  379 (665)
                      ...+.++.+....-.|.+.+.+-.|.
T Consensus       531 ~S~l~~e~~~~lGl~D~~~A~~~Vd~  556 (775)
T pfam03272       531 KSPLLNETGHLLGLNDRLAATFYVDY  556 (775)
T ss_pred             CCCCCEEEEEEECCCCCEEEEEEEEC
T ss_conf             77010007999846871389999954


No 52 
>pfam01457 Peptidase_M8 Leishmanolysin.
Probab=80.96  E-value=0.58  Score=23.25  Aligned_cols=20  Identities=35%  Similarity=0.499  Sum_probs=15.6

Q ss_pred             EEEEEHHHHHHHHHCCCCCC
Q ss_conf             46630032346551878888
Q gi|254780384|r  119 ILSHNALHEIGHALGLMHPG  138 (665)
Q Consensus       119 y~~~t~iHEIGHALGL~Hp~  138 (665)
                      ...-+..|||.||||....+
T Consensus       239 ~~vrv~aHEiaHALGFs~~~  258 (569)
T pfam01457       239 LVTRVVTHEIAHALGFSVTF  258 (569)
T ss_pred             EEEEHHHHHHHHHHCCCHHH
T ss_conf             14238988888886358989


No 53 
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=80.74  E-value=0.6  Score=23.14  Aligned_cols=17  Identities=35%  Similarity=0.632  Sum_probs=14.0

Q ss_pred             EEEHHHHHHHHHCCCCC
Q ss_conf             63003234655187888
Q gi|254780384|r  121 SHNALHEIGHALGLMHP  137 (665)
Q Consensus       121 ~~t~iHEIGHALGL~Hp  137 (665)
                      --++.|||.||||....
T Consensus       257 vRv~aHEiAHALGFs~~  273 (615)
T PTZ00257        257 TRTVTHEVAHALGFSSV  273 (615)
T ss_pred             HHHHHHHHHHHHCCCHH
T ss_conf             78999999998757998


No 54 
>pfam09471 Peptidase_M64 IgA Peptidase M64. This is a family of highly selective metallo-endopeptidases. The primary structure of the Clostridium ramosum IgA proteinase shows no significant overall similarity to any other known metallo-endopeptidase.
Probab=80.68  E-value=0.39  Score=24.27  Aligned_cols=26  Identities=38%  Similarity=0.562  Sum_probs=17.8

Q ss_pred             CCEEEEEHHHHHHHHH-CCCCCCCCCCCC
Q ss_conf             7346630032346551-878888766676
Q gi|254780384|r  117 GTILSHNALHEIGHAL-GLMHPGSYNGGY  144 (665)
Q Consensus       117 g~y~~~t~iHEIGHAL-GL~Hp~~~~~g~  144 (665)
                      ..+.-.+++||+||.+ ||.-  ||.++.
T Consensus       206 n~~~~~v~vHE~GHsf~gLaD--EY~y~~  232 (257)
T pfam09471       206 NSQASYIAVHELGHSFAGLAD--EYYYGG  232 (257)
T ss_pred             CCCCCCEEEEECCCCCCCCCH--HHCCCC
T ss_conf             756663348843532344230--111686


No 55 
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family; InterPro: IPR011977   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). It is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011976 from INTERPRO. Members are not characterised with respect to their substrates and the activity profile may differ..
Probab=78.27  E-value=0.67  Score=22.85  Aligned_cols=13  Identities=38%  Similarity=0.521  Sum_probs=11.1

Q ss_pred             EEEEHHHHHHHHH
Q ss_conf             6630032346551
Q gi|254780384|r  120 LSHNALHEIGHAL  132 (665)
Q Consensus       120 ~~~t~iHEIGHAL  132 (665)
                      .-.|+-||+|||+
T Consensus       385 dV~TLAHELGHA~  397 (600)
T TIGR02290       385 DVSTLAHELGHAY  397 (600)
T ss_pred             HHHHHHHHHHHHH
T ss_conf             5899989865589


No 56 
>pfam01432 Peptidase_M3 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but functions within the mitochondria to remove the leader sequence.
Probab=77.46  E-value=0.75  Score=22.57  Aligned_cols=19  Identities=16%  Similarity=0.349  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHCCEEEEECC
Q ss_conf             9999886574756898247
Q gi|254780384|r   57 WALGEWSKVVDLTFEETSI   75 (665)
Q Consensus        57 ~Al~~ws~va~ItF~Ev~~   75 (665)
                      -.+..+..+-+|.|+++..
T Consensus       141 gl~~l~~~lfg~~~~~~~~  159 (448)
T pfam01432       141 GLFGLFERLFGITFVLEPL  159 (448)
T ss_pred             HHHHHHHHHHHHHHHCCCC
T ss_conf             9999987653234320566


No 57 
>PTZ00337 surface protease GP63; Provisional
Probab=76.79  E-value=0.82  Score=22.31  Aligned_cols=22  Identities=36%  Similarity=0.481  Sum_probs=16.3

Q ss_pred             CCEEEEEHHHHHHHHHCCCCCC
Q ss_conf             7346630032346551878888
Q gi|254780384|r  117 GTILSHNALHEIGHALGLMHPG  138 (665)
Q Consensus       117 g~y~~~t~iHEIGHALGL~Hp~  138 (665)
                      .....-+..|||.||||..+..
T Consensus       226 ~~~~vRvaaHEiAHALGFs~~~  247 (567)
T PTZ00337        226 TNGYVRVAAHELGHALGFARDQ  247 (567)
T ss_pred             CCHHHHHHHHHHHHHHCCCHHH
T ss_conf             4015789999998886368889


No 58 
>pfam02382 RTX RTX N-terminal domain. The RTX family of bacterial toxins are a group of cytolysins and cytotoxins. This Pfam family represents the N-terminal domain which is found in association with a glycine-rich repeat domain and hemolysinCabind pfam00353.
Probab=74.19  E-value=4.9  Score=17.67  Aligned_cols=20  Identities=30%  Similarity=0.531  Sum_probs=7.9

Q ss_pred             CCCCCCEEECCCCCEEEEEE
Q ss_conf             67863458888873389982
Q gi|254780384|r  291 ESSGNDVFDGSSGLDTFFYY  310 (665)
Q Consensus       291 gg~g~d~~~g~~g~Dt~~~~  310 (665)
                      -+.|.-.++||.|.|+++|.
T Consensus       622 ~gsGs~~idgG~G~D~V~Ys  641 (653)
T pfam02382       622 LGSGSTEVDAGEGHDVVYYS  641 (653)
T ss_pred             ECCCEEEEECCCCCEEEEEC
T ss_conf             53770688568871368853


No 59 
>cd06258 Peptidase_M3_like The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases.  The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called olig
Probab=71.80  E-value=1.1  Score=21.48  Aligned_cols=17  Identities=35%  Similarity=0.430  Sum_probs=11.8

Q ss_pred             EEEEEHHHHHHHHH-CCC
Q ss_conf             46630032346551-878
Q gi|254780384|r  119 ILSHNALHEIGHAL-GLM  135 (665)
Q Consensus       119 y~~~t~iHEIGHAL-GL~  135 (665)
                      .-..|+.||.|||| .|-
T Consensus       153 ~ev~TlfHEfGHalh~ll  170 (365)
T cd06258         153 DDINTLFHEFGHAVHFLL  170 (365)
T ss_pred             HHHHHHHHHHHHHHHHHC
T ss_conf             999999999878888743


No 60 
>KOG3538 consensus
Probab=70.87  E-value=1.4  Score=20.89  Aligned_cols=22  Identities=36%  Similarity=0.652  Sum_probs=19.3

Q ss_pred             EEEEHHHHHHHHHCCCCCCCCC
Q ss_conf             6630032346551878888766
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPGSYN  141 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~~~~  141 (665)
                      .-.|+.||+||-+|+.|-+..+
T Consensus       317 ~aftiahE~gH~~gm~hd~~~~  338 (845)
T KOG3538         317 SAFTIAHELGHNFGMEHDGRGN  338 (845)
T ss_pred             CCCCEECCCCCCCCCCCCCCCC
T ss_conf             6554000025688865677578


No 61 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=70.85  E-value=1.2  Score=21.27  Aligned_cols=10  Identities=40%  Similarity=0.803  Sum_probs=3.6

Q ss_pred             EEHHHHHHHH
Q ss_conf             3003234655
Q gi|254780384|r  122 HNALHEIGHA  131 (665)
Q Consensus       122 ~t~iHEIGHA  131 (665)
                      .|+.||+||+
T Consensus       382 ~TLaHElGHs  391 (598)
T COG1164         382 FTLAHELGHS  391 (598)
T ss_pred             HHHHHHHHHH
T ss_conf             6874650189


No 62 
>KOG3658 consensus
Probab=70.71  E-value=1.8  Score=20.24  Aligned_cols=20  Identities=15%  Similarity=0.252  Sum_probs=8.2

Q ss_pred             ECCCEECCCCCCCCCEEECC
Q ss_conf             63840012444442024245
Q gi|254780384|r  346 FADQCIDSVNVKQRSILEYT  365 (665)
Q Consensus       346 f~d~t~~~~~~~~~d~l~~~  365 (665)
                      +..|.....+.+.-.+.+|.
T Consensus       365 y~~G~~~sLNtGi~T~~NYg  384 (764)
T KOG3658         365 YNEGKKRSLNTGISTSVNYG  384 (764)
T ss_pred             CCCCEEEEEECCEEEEEECC
T ss_conf             26760578513355421058


No 63 
>pfam06594 HCBP_related Haemolysin-type calcium binding protein related domain. This family consists of a number of bacteria specific domains which are found in haemolysin-type calcium binding proteins. This family is found in conjunction with pfam00353 and is often found in multiple copies.
Probab=69.67  E-value=5.8  Score=17.21  Aligned_cols=14  Identities=29%  Similarity=0.498  Sum_probs=12.6

Q ss_pred             EEEEEEEEECCCEE
Q ss_conf             75467999638400
Q gi|254780384|r  338 LSDVERLRFADQCI  351 (665)
Q Consensus       338 ~~~iE~~~f~d~t~  351 (665)
                      ..+||.++|+|||+
T Consensus        24 ~~~iE~i~FADGT~   37 (43)
T pfam06594        24 GYGIEQIEFADGTV   37 (43)
T ss_pred             CCEEEEEEECCCCE
T ss_conf             86376899879975


No 64 
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=69.58  E-value=1.5  Score=20.66  Aligned_cols=13  Identities=31%  Similarity=0.728  Sum_probs=10.0

Q ss_pred             EEHHHHHHHHHCC
Q ss_conf             3003234655187
Q gi|254780384|r  122 HNALHEIGHALGL  134 (665)
Q Consensus       122 ~t~iHEIGHALGL  134 (665)
                      -..+||+|||+--
T Consensus       137 s~v~HElGHA~aA  149 (277)
T cd06162         137 SGVVHEMGHGVAA  149 (277)
T ss_pred             HHHHHHHHHHHHH
T ss_conf             9999999899999


No 65 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=66.88  E-value=1.9  Score=20.14  Aligned_cols=18  Identities=33%  Similarity=0.338  Sum_probs=13.3

Q ss_pred             EEEEEHHHHHHHHHCCCC
Q ss_conf             466300323465518788
Q gi|254780384|r  119 ILSHNALHEIGHALGLMH  136 (665)
Q Consensus       119 y~~~t~iHEIGHALGL~H  136 (665)
                      --..|+.||+||+|=-.+
T Consensus        71 r~rFtlAHELGH~llH~~   88 (213)
T COG2856          71 RKRFTLAHELGHALLHTD   88 (213)
T ss_pred             HHHHHHHHHHHHHHHCCC
T ss_conf             888789998758996265


No 66 
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=66.33  E-value=1.8  Score=20.32  Aligned_cols=36  Identities=11%  Similarity=0.266  Sum_probs=14.9

Q ss_pred             HHHHHCCCCCC-CCCCCEEECCCCCCCCCEEEEEECCCCCEEE
Q ss_conf             99986087666-6677346325767876358987228973259
Q gi|254780384|r  190 AVQKIYGAPKG-SLGNNIYKIEVQQKDSPFIQTIYDSGGDDLL  231 (665)
Q Consensus       190 Alq~lYGan~~-~~gdt~y~~~~~~~~~~~~~tI~D~gG~DTi  231 (665)
                      ..+.|||..-. ...-.+  |.+..    -.+.|+|..+...|
T Consensus       168 l~~~LfGi~f~~~~~~~~--WH~dV----r~~~V~d~~~g~~l  204 (472)
T cd06455         168 IYQRLFGLRFEEVPDASV--WHEDV----RLYSVWDADTGEFL  204 (472)
T ss_pred             HHHHHHCEEEEECCCCCC--CCCCC----EEEEEEECCCCCEE
T ss_conf             999984845587787887--78765----69999988999478


No 67 
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=65.11  E-value=2.4  Score=19.53  Aligned_cols=15  Identities=40%  Similarity=0.554  Sum_probs=11.4

Q ss_pred             EEHHHHHHHHHCCCC
Q ss_conf             300323465518788
Q gi|254780384|r  122 HNALHEIGHALGLMH  136 (665)
Q Consensus       122 ~t~iHEIGHALGL~H  136 (665)
                      ..++||+||++=-++
T Consensus        55 sVllHElgHal~Ar~   69 (227)
T cd06164          55 SVLLHELGHSLVARR   69 (227)
T ss_pred             HHHHHHHHHHHHHHH
T ss_conf             999999999999999


No 68 
>cd06457 M3A_MIP Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein.  MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbo
Probab=64.66  E-value=2.1  Score=19.81  Aligned_cols=18  Identities=6%  Similarity=0.017  Sum_probs=7.7

Q ss_pred             HHHHHHHHHHCCEEEEEC
Q ss_conf             999988657475689824
Q gi|254780384|r   57 WALGEWSKVVDLTFEETS   74 (665)
Q Consensus        57 ~Al~~ws~va~ItF~Ev~   74 (665)
                      -.+.....+-+|+|+|+.
T Consensus       138 Glf~i~~~Lfgi~~~~~~  155 (458)
T cd06457         138 GLSRLFSRLYGIRLVPVP  155 (458)
T ss_pred             HHHHHHHHHHCEEEEECC
T ss_conf             999999998574568746


No 69 
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=64.57  E-value=2.5  Score=19.43  Aligned_cols=16  Identities=44%  Similarity=0.482  Sum_probs=12.1

Q ss_pred             EEHHHHHHHHHCCCCC
Q ss_conf             3003234655187888
Q gi|254780384|r  122 HNALHEIGHALGLMHP  137 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp  137 (665)
                      ..++||+||++--++-
T Consensus        40 sVl~HElgH~l~A~~~   55 (208)
T cd06161          40 SVLLHELGHALVARRY   55 (208)
T ss_pred             HHHHHHHHHHHHHHHC
T ss_conf             9999999999999994


No 70 
>pfam05279 Asp-B-Hydro_N Aspartyl beta-hydroxylase N-terminal region. This family includes the N-terminal regions of the junctin, junctate and aspartyl beta-hydroxylase proteins. Junctate is an integral ER/SR membrane calcium binding protein, which comes from an alternatively spliced form of the same gene that generates aspartyl beta-hydroxylase and junctin. Aspartyl beta-hydroxylase catalyses the post-translational hydroxylation of aspartic acid or asparagine residues contained within epidermal growth factor (EGF) domains of proteins.
Probab=63.60  E-value=2.8  Score=19.13  Aligned_cols=22  Identities=23%  Similarity=0.277  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHCCEEEEE
Q ss_conf             9999999998865747568982
Q gi|254780384|r   52 QEVARWALGEWSKVVDLTFEET   73 (665)
Q Consensus        52 ~~~~R~Al~~ws~va~ItF~Ev   73 (665)
                      =-++-.-|..|.+||=+=|-=|
T Consensus        15 WfmViaLLGvWtSVaVVwFdlV   36 (232)
T pfam05279        15 WFMVIALLGVWTSVAVVWFDLV   36 (232)
T ss_pred             HHHHHHHHHHHHHHHEEEEEEC
T ss_conf             9999999887761304554400


No 71 
>PRK10280 dipeptidyl carboxypeptidase II; Provisional
Probab=62.10  E-value=2.3  Score=19.57  Aligned_cols=33  Identities=12%  Similarity=0.257  Sum_probs=14.9

Q ss_pred             HHHHHCCCCC-CCCCCCEEECCCCCCCCCEEEEEECCCCC
Q ss_conf             9998608766-66677346325767876358987228973
Q gi|254780384|r  190 AVQKIYGAPK-GSLGNNIYKIEVQQKDSPFIQTIYDSGGD  228 (665)
Q Consensus       190 Alq~lYGan~-~~~gdt~y~~~~~~~~~~~~~tI~D~gG~  228 (665)
                      ..+.|||..- -...-.+  |.+..    -.+.|+|..|.
T Consensus       373 ~~~~Lfgi~f~e~~~~~v--wh~dV----~~~~V~d~~g~  406 (681)
T PRK10280        373 TANQLFGIKFVERFDIPV--YHPDV----RVWEIFDHNGV  406 (681)
T ss_pred             HHHHHHCEEEEECCCCCC--CCCCE----EEEEEECCCCC
T ss_conf             999983868786587876--57874----79999858997


No 72 
>pfam01434 Peptidase_M41 Peptidase family M41.
Probab=60.89  E-value=2.9  Score=19.04  Aligned_cols=17  Identities=59%  Similarity=0.882  Sum_probs=12.3

Q ss_pred             EEEHHHHHHHHH-CCCCC
Q ss_conf             630032346551-87888
Q gi|254780384|r  121 SHNALHEIGHAL-GLMHP  137 (665)
Q Consensus       121 ~~t~iHEIGHAL-GL~Hp  137 (665)
                      ..+.+||.|||| ++--|
T Consensus         9 ~~vA~HEaGHAlva~~l~   26 (192)
T pfam01434         9 RLVAYHEAGHALVGLLLP   26 (192)
T ss_pred             HHHHHHHHHHHHHHHHCC
T ss_conf             999999999999999846


No 73 
>LOAD_S2Pmetalloprt consensus
Probab=60.71  E-value=3.2  Score=18.75  Aligned_cols=18  Identities=39%  Similarity=0.503  Sum_probs=13.0

Q ss_pred             EEHHHHHHHHHCCCCCCC
Q ss_conf             300323465518788887
Q gi|254780384|r  122 HNALHEIGHALGLMHPGS  139 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp~~  139 (665)
                      ...+||+||++=-++-|.
T Consensus         9 ~v~vHElgH~~~A~~~G~   26 (148)
T LOAD_S2Pmetall    9 SVVVHELGHALVARRFGV   26 (148)
T ss_pred             HHHHHHHHHHHHHHHCCC
T ss_conf             999999999999999699


No 74 
>KOG2921 consensus
Probab=60.08  E-value=3.2  Score=18.73  Aligned_cols=13  Identities=46%  Similarity=0.761  Sum_probs=9.7

Q ss_pred             EHHHHHHHHHCCC
Q ss_conf             0032346551878
Q gi|254780384|r  123 NALHEIGHALGLM  135 (665)
Q Consensus       123 t~iHEIGHALGL~  135 (665)
                      ..+|||||||---
T Consensus       134 ~vvHElGHalAA~  146 (484)
T KOG2921         134 VVVHELGHALAAA  146 (484)
T ss_pred             HHHHHHHHHHHHH
T ss_conf             8788765799987


No 75 
>PRK10911 oligopeptidase A; Provisional
Probab=59.12  E-value=3.1  Score=18.84  Aligned_cols=16  Identities=25%  Similarity=0.385  Sum_probs=8.8

Q ss_pred             EHHHHHHHHHCCCCCC
Q ss_conf             0032346551878888
Q gi|254780384|r  123 NALHEIGHALGLMHPG  138 (665)
Q Consensus       123 t~iHEIGHALGL~Hp~  138 (665)
                      .+=||+..-||-++=-
T Consensus       264 ~LR~E~A~LLGy~s~A  279 (680)
T PRK10911        264 ALRHELAQLLGFENYA  279 (680)
T ss_pred             HHHHHHHHHCCCCCHH
T ss_conf             9999999983998499


No 76 
>pfam02163 Peptidase_M50 Peptidase family M50.
Probab=56.69  E-value=4.1  Score=18.09  Aligned_cols=17  Identities=41%  Similarity=0.536  Sum_probs=11.6

Q ss_pred             EHHHHHHHHHCCCCCCC
Q ss_conf             00323465518788887
Q gi|254780384|r  123 NALHEIGHALGLMHPGS  139 (665)
Q Consensus       123 t~iHEIGHALGL~Hp~~  139 (665)
                      ..+||+||++=-++-++
T Consensus        10 i~~HE~gH~~~Ar~~G~   26 (205)
T pfam02163        10 VVVHELGHALVARRFGV   26 (205)
T ss_pred             HHHHHHHHHHHHHHCCC
T ss_conf             99999999999999299


No 77 
>COG0339 Dcp Zn-dependent oligopeptidases [Amino acid transport and metabolism]
Probab=56.59  E-value=3.6  Score=18.47  Aligned_cols=17  Identities=18%  Similarity=0.255  Sum_probs=8.9

Q ss_pred             EHHHHHHHHHCCCCCCC
Q ss_conf             00323465518788887
Q gi|254780384|r  123 NALHEIGHALGLMHPGS  139 (665)
Q Consensus       123 t~iHEIGHALGL~Hp~~  139 (665)
                      .+=+|+-+-||-.+=-+
T Consensus       267 ~LR~E~A~LLGf~~yA~  283 (683)
T COG0339         267 KLRAELAKLLGFSNYAE  283 (683)
T ss_pred             HHHHHHHHHCCCCCHHH
T ss_conf             99999998749641888


No 78 
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=54.52  E-value=4.7  Score=17.75  Aligned_cols=18  Identities=39%  Similarity=0.540  Sum_probs=11.9

Q ss_pred             EEHHHHHHHHHCCCCCCC
Q ss_conf             300323465518788887
Q gi|254780384|r  122 HNALHEIGHALGLMHPGS  139 (665)
Q Consensus       122 ~t~iHEIGHALGL~Hp~~  139 (665)
                      ...+||+||++=-+.-++
T Consensus        10 ~i~iHE~gH~~~A~~~G~   27 (180)
T cd05709          10 SVTVHELGHALVARRLGV   27 (180)
T ss_pred             HHHHHHHHHHHHHHHCCC
T ss_conf             999999999999999699


No 79 
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=54.02  E-value=4.5  Score=17.84  Aligned_cols=17  Identities=47%  Similarity=0.706  Sum_probs=12.0

Q ss_pred             EEEEHHHHHHHHH---CCCC
Q ss_conf             6630032346551---8788
Q gi|254780384|r  120 LSHNALHEIGHAL---GLMH  136 (665)
Q Consensus       120 ~~~t~iHEIGHAL---GL~H  136 (665)
                      +....|||.||||   ||.-
T Consensus       159 ~l~~~iHE~GHalYEqgl~~  178 (396)
T cd06460         159 ALFSTIHETGHALYEQGLPP  178 (396)
T ss_pred             HHHHHHHHHHHHHHHCCCCH
T ss_conf             99999987516788619897


No 80 
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=52.63  E-value=5.3  Score=17.45  Aligned_cols=19  Identities=37%  Similarity=0.587  Sum_probs=13.9

Q ss_pred             EEEEHHHHHHHHHCCCCCC
Q ss_conf             6630032346551878888
Q gi|254780384|r  120 LSHNALHEIGHALGLMHPG  138 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~Hp~  138 (665)
                      ....++||+||.+--++-+
T Consensus        41 ~~il~~HElGH~l~a~~~g   59 (183)
T cd06160          41 LAILGIHEMGHYLAARRHG   59 (183)
T ss_pred             HHHHHHHHHHHHHHHHHCC
T ss_conf             9999999999999999939


No 81 
>KOG2090 consensus
Probab=46.94  E-value=6.6  Score=16.87  Aligned_cols=26  Identities=23%  Similarity=0.322  Sum_probs=15.8

Q ss_pred             HHCCCCHHHHHHHH---HHHHCCCCCCCCCC
Q ss_conf             51675002477888---89975404688631
Q gi|254780384|r  539 AFADAPDLVQAGKD---HFSNFGYEEGRSIT  566 (665)
Q Consensus       539 a~G~d~~~~~aa~~---HYi~~G~~EGR~~~  566 (665)
                      -|-.||....+|-+   |...  |.++|.|-
T Consensus       641 ~Fe~dPfsR~aGek~r~eil~--hGG~~~P~  669 (704)
T KOG2090         641 LFENDPFSRKAGEKFRKEILK--HGGGRDPA  669 (704)
T ss_pred             HHHCCCCCHHHHHHHHHHHHH--HCCCCCHH
T ss_conf             981596322446899999998--45999869


No 82 
>TIGR00054 TIGR00054 membrane-associated zinc metalloprotease, putative; InterPro: IPR004387   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This family contains putative zinc metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). The N-terminal region of contains a perfectly conserved motif HEXGH, where the Glu is the active site and the His residues coordinate the metal cation. The family of bacterial and plant proteins also includes a region that hits the PDZ domain (IPR001478 from INTERPRO), found in a number of proteins targeted to the membrane by binding to a peptide ligand . The family includes EcfE, which is a homolog of human site-2 protease (S2P), a membrane-bound zinc metalloprotease involved in regulated intramembrane proteolysis. In Escherichia coli EcfE activates the sigma(E) pathway of stress response through a site-2 cleavage of anti-sigma(E), RseA.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016021 integral to membrane.
Probab=45.41  E-value=7.4  Score=16.54  Aligned_cols=11  Identities=36%  Similarity=0.501  Sum_probs=7.1

Q ss_pred             EEEHHHHHHHH
Q ss_conf             63003234655
Q gi|254780384|r  121 SHNALHEIGHA  131 (665)
Q Consensus       121 ~~t~iHEIGHA  131 (665)
                      ...++||+||=
T Consensus        16 ~Li~vHElGHF   26 (463)
T TIGR00054        16 VLIFVHELGHF   26 (463)
T ss_pred             HHHHHHHHHHH
T ss_conf             99999988899


No 83 
>cd02641 R3H_Smubp-2_like R3H domain of Smubp-2_like proteins.  Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.
Probab=43.80  E-value=10  Score=15.68  Aligned_cols=15  Identities=40%  Similarity=0.651  Sum_probs=13.4

Q ss_pred             EHHHHHHHHHCCCCC
Q ss_conf             003234655187888
Q gi|254780384|r  123 NALHEIGHALGLMHP  137 (665)
Q Consensus       123 t~iHEIGHALGL~Hp  137 (665)
                      ..||+|.|-+||.|-
T Consensus        32 ~~vH~lA~~~GL~h~   46 (60)
T cd02641          32 LLVHELAEELGLRHE   46 (60)
T ss_pred             HHHHHHHHHCCCCCC
T ss_conf             999999998299764


No 84 
>pfam08463 EcoEI_R_C EcoEI R protein C-terminal. The restriction enzyme EcoEI recognizes 5'-GAGN(7)ATGC-3' and is composed of the three proteins R, M, and S. The domain described here is found at the C-terminus of the R protein (HsdR) which is required for both nuclease and ATPase activity.
Probab=42.48  E-value=11  Score=15.48  Aligned_cols=61  Identities=23%  Similarity=0.337  Sum_probs=30.9

Q ss_pred             CCCCCCCCCHHHHHCCHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCHHHHHHCCHHHHHHHHHHH
Q ss_conf             68886544011120232166651675002477888899754046886311768876738107777606513478868873
Q gi|254780384|r  518 GRNPDAFKALEYIASHEDLINAFADAPDLVQAGKDHFSNFGYEEGRSITFNPDLYLTSYDDLWNVFGSDRSAAAQHYITH  597 (665)
Q Consensus       518 GR~~~~Fda~~Ylasn~DL~~a~G~d~~~~~aa~~HYi~~G~~EGR~~~Fda~~YLa~y~Dl~~~fg~d~~~a~~Hyi~~  597 (665)
                      +++.+.||.++|+|=+        ..|++...=..+             +--..|++.|.+-+..|   ....+.+|+..
T Consensus       130 ~~d~D~FDvL~hiAf~--------~pplTR~ERa~~-------------vkk~~~f~~y~~~ar~v---Le~lLdkY~~~  185 (232)
T pfam08463       130 GKDSDPFDLLCHIAFD--------QPPLTRKERAEQ-------------VKKRNYFTKYSEKAREV---LEALLDKYADE  185 (232)
T ss_pred             CCCCCHHHHHHHHHCC--------CCCCCHHHHHHH-------------HHHHHHHHHCCHHHHHH---HHHHHHHHHHH
T ss_conf             7537689999999816--------998778999999-------------87666887639999999---99999999972


Q ss_pred             HHHHC
Q ss_conf             56407
Q gi|254780384|r  598 GMKEG  602 (665)
Q Consensus       598 G~~EG  602 (665)
                      |..|-
T Consensus       186 Gi~el  190 (232)
T pfam08463       186 GIQEL  190 (232)
T ss_pred             CCCCC
T ss_conf             90030


No 85 
>pfam10263 SprT-like SprT-like family. This family represents a domain found in eukaryotes and prokaryotes. The domain contains a characteristic motif of the zinc metallopeptidases. This family includes the bacterial SprT protein.
Probab=42.40  E-value=12  Score=15.20  Aligned_cols=21  Identities=38%  Similarity=0.469  Sum_probs=15.3

Q ss_pred             EEEEEHHHHHHHHH------CCCCCCC
Q ss_conf             46630032346551------8788887
Q gi|254780384|r  119 ILSHNALHEIGHAL------GLMHPGS  139 (665)
Q Consensus       119 y~~~t~iHEIGHAL------GL~Hp~~  139 (665)
                      ....|++||+-|++      +-.+||.
T Consensus        57 ~l~~tl~HEm~H~~~~~~~~~~~~~HG   83 (153)
T pfam10263        57 FLIETLLHEMCHAALFVLFGGRGYPHG   83 (153)
T ss_pred             HHHHHHHHHHHHHHHHHHHCCCCCCCC
T ss_conf             999888999999999976178889988


No 86 
>pfam02074 Peptidase_M32 Carboxypeptidase Taq (M32) metallopeptidase.
Probab=42.31  E-value=9.1  Score=16.02  Aligned_cols=12  Identities=50%  Similarity=0.875  Sum_probs=7.3

Q ss_pred             EEEHHHHHHHHH
Q ss_conf             630032346551
Q gi|254780384|r  121 SHNALHEIGHAL  132 (665)
Q Consensus       121 ~~t~iHEIGHAL  132 (665)
                      ..-.|||.||||
T Consensus       260 l~~~iHE~GHal  271 (494)
T pfam02074       260 IFGTIHEFGHAL  271 (494)
T ss_pred             HHHHHHHHHHHH
T ss_conf             999998762889


No 87 
>PRK10733 hflB ATP-dependent metalloprotease; Reviewed
Probab=41.39  E-value=9.1  Score=16.02  Aligned_cols=11  Identities=27%  Similarity=0.374  Sum_probs=5.0

Q ss_pred             CCCCHHHHHHH
Q ss_conf             66704455552
Q gi|254780384|r  606 GISGMQFKSVM  616 (665)
Q Consensus       606 ~l~~~qf~~~~  616 (665)
                      +|+..++..-.
T Consensus       583 TL~geEi~~Il  593 (644)
T PRK10733        583 TIDAPQIDDLM  593 (644)
T ss_pred             CCCHHHHHHHH
T ss_conf             85899999996


No 88 
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=39.74  E-value=11  Score=15.51  Aligned_cols=14  Identities=36%  Similarity=0.422  Sum_probs=6.7

Q ss_pred             HHHHHHHHHCCCCC
Q ss_conf             03234655187888
Q gi|254780384|r  124 ALHEIGHALGLMHP  137 (665)
Q Consensus       124 ~iHEIGHALGL~Hp  137 (665)
                      .+||++||+=-..-
T Consensus        13 ~~HE~aHa~~A~~~   26 (181)
T cd06158          13 TLHEFAHAYVAYRL   26 (181)
T ss_pred             HHHHHHHHHHHHHC
T ss_conf             99999999999984


No 89 
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=38.99  E-value=11  Score=15.41  Aligned_cols=14  Identities=36%  Similarity=0.373  Sum_probs=9.6

Q ss_pred             EEHHHHHHHHHCCC
Q ss_conf             30032346551878
Q gi|254780384|r  122 HNALHEIGHALGLM  135 (665)
Q Consensus       122 ~t~iHEIGHALGL~  135 (665)
                      ..++||.||-+=-+
T Consensus        11 lV~vHElGH~~~Ar   24 (182)
T cd06163          11 LIFVHELGHFLVAK   24 (182)
T ss_pred             HHHHHHHHHHHHHH
T ss_conf             99999899899999


No 90 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain; InterPro: IPR013428    Proteins containing this domain also contain several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (IPR013427 from INTERPRO) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) (Q44091 from SWISSPROT) shows some homolgy to this domain but lacks additional sequence regions found in this entry and belongs to a different clade of the larger family of homologues. .
Probab=36.98  E-value=17  Score=14.39  Aligned_cols=25  Identities=24%  Similarity=0.235  Sum_probs=15.0

Q ss_pred             CCCCE-EEEEHHHHHHHHHCCCCCCC
Q ss_conf             66734-66300323465518788887
Q gi|254780384|r  115 GKGTI-LSHNALHEIGHALGLMHPGS  139 (665)
Q Consensus       115 ~~g~y-~~~t~iHEIGHALGL~Hp~~  139 (665)
                      ++|.| ...|++-|+.|..||.-...
T Consensus        58 gDGkyd~~~VF~~~~~~~~~L~~a~~   83 (459)
T TIGR02604        58 GDGKYDKSKVFAEELSLVTGLAVAVG   83 (459)
T ss_pred             CCCCCCEEEECCCCCCCCCCCCCCCC
T ss_conf             88850146542468532356100258


No 91 
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=36.47  E-value=12  Score=15.22  Aligned_cols=10  Identities=60%  Similarity=1.059  Sum_probs=5.2

Q ss_pred             EHHHHHHHHH
Q ss_conf             0032346551
Q gi|254780384|r  123 NALHEIGHAL  132 (665)
Q Consensus       123 t~iHEIGHAL  132 (665)
                      ..|||-||||
T Consensus       262 g~iHE~GHAl  271 (497)
T COG2317         262 GTIHETGHAL  271 (497)
T ss_pred             HHHHHHHHHH
T ss_conf             9998531577


No 92 
>cd02639 R3H_RRM R3H domain of mainly fungal proteins which are associated with a RNA recognition motif (RRM) domain. Present in this group is the RNA-binding post-transcriptional regulator Cip2 (Csx1-interacting protein 2) involved in counteracting Csx1 function. Csx1 plays a central role in controlling gene expression during oxidative stress. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=36.39  E-value=15  Score=14.74  Aligned_cols=17  Identities=29%  Similarity=0.481  Sum_probs=13.7

Q ss_pred             EHHHHHHHHHCCCCCCC
Q ss_conf             00323465518788887
Q gi|254780384|r  123 NALHEIGHALGLMHPGS  139 (665)
Q Consensus       123 t~iHEIGHALGL~Hp~~  139 (665)
                      -++|.+.|-|||.|--.
T Consensus        32 rivh~La~~lGL~h~s~   48 (60)
T cd02639          32 RIVHLLASRLGLNHVSD   48 (60)
T ss_pred             HHHHHHHHHCCCCEEEE
T ss_conf             99999998819920440


No 93 
>pfam06114 DUF955 Domain of unknown function (DUF955). Family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to pfam01435.
Probab=36.18  E-value=12  Score=15.22  Aligned_cols=14  Identities=36%  Similarity=0.468  Sum_probs=11.1

Q ss_pred             EEEEHHHHHHHHHC
Q ss_conf             66300323465518
Q gi|254780384|r  120 LSHNALHEIGHALG  133 (665)
Q Consensus       120 ~~~t~iHEIGHALG  133 (665)
                      -..++.|||||.+-
T Consensus        41 ~~f~~aHElgH~~l   54 (121)
T pfam06114        41 QRFTLAHELGHLLL   54 (121)
T ss_pred             HHHHHHHHHHHHHH
T ss_conf             59999999999993


No 94 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=35.08  E-value=18  Score=14.19  Aligned_cols=20  Identities=35%  Similarity=0.308  Sum_probs=14.1

Q ss_pred             EEEEEHHHHHHHHH----CCCCCC
Q ss_conf             46630032346551----878888
Q gi|254780384|r  119 ILSHNALHEIGHAL----GLMHPG  138 (665)
Q Consensus       119 y~~~t~iHEIGHAL----GL~Hp~  138 (665)
                      ....|++||+-|+.    |-.+||
T Consensus        58 ~l~~tl~HElcH~~~~~~~~~~~H   81 (146)
T smart00731       58 RLRETLLHELCHAALYLFGRGYGH   81 (146)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCCC
T ss_conf             999878999999999997798987


No 95 
>PRK10779 zinc metallopeptidase; Provisional
Probab=34.52  E-value=14  Score=14.93  Aligned_cols=17  Identities=24%  Similarity=0.137  Sum_probs=11.0

Q ss_pred             EEEEHHHHHHHHHCCCC
Q ss_conf             66300323465518788
Q gi|254780384|r  120 LSHNALHEIGHALGLMH  136 (665)
Q Consensus       120 ~~~t~iHEIGHALGL~H  136 (665)
                      +...++||.||=|=-|-
T Consensus        15 ~~~V~iHElGHfl~Ak~   31 (449)
T PRK10779         15 GVLITVHEFGHFWVARR   31 (449)
T ss_pred             HHHHHHHHHHHHHHHHH
T ss_conf             99999885787999998


No 96 
>KOG3854 consensus
Probab=34.17  E-value=13  Score=14.99  Aligned_cols=12  Identities=33%  Similarity=0.481  Sum_probs=6.4

Q ss_pred             CCCCCHHHHHCC
Q ss_conf             654401112023
Q gi|254780384|r  522 DAFKALEYIASH  533 (665)
Q Consensus       522 ~~Fda~~Ylasn  533 (665)
                      ..|..|.|-+..
T Consensus       376 d~wKrWa~ra~~  387 (505)
T KOG3854         376 DNWKRWAYRANM  387 (505)
T ss_pred             HHHHHHHHHHHH
T ss_conf             389999999863


No 97 
>pfam06189 5-nucleotidase 5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5).
Probab=33.80  E-value=24  Score=13.47  Aligned_cols=13  Identities=31%  Similarity=0.302  Sum_probs=6.1

Q ss_pred             CEEECCCCCCCCC
Q ss_conf             1010014554221
Q gi|254780384|r  410 DLRNVFGSDRGAA  422 (665)
Q Consensus       410 Dl~~~~g~d~~~a  422 (665)
                      .|+-+|..|.+.+
T Consensus       120 qlRIAFDgDaVLF  132 (263)
T pfam06189       120 QLRIAFDGDAVLF  132 (263)
T ss_pred             CEEEEECCCEEEE
T ss_conf             4589736874786


No 98 
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=29.81  E-value=15  Score=14.71  Aligned_cols=23  Identities=30%  Similarity=0.517  Sum_probs=14.1

Q ss_pred             HHHHCCCHHCCCCCC-CCCCHHHH
Q ss_conf             211004500268886-54401112
Q gi|254780384|r  508 RHYITHGVKEGRNPD-AFKALEYI  530 (665)
Q Consensus       508 ~Hy~~~G~~EGR~~~-~Fda~~Yl  530 (665)
                      -||.-.+.+-.|++. .|+...|-
T Consensus       212 ghwtgh~~rl~rdLs~~~~sr~yA  235 (316)
T COG4227         212 GHWTGHEARLDRDLSRAFGSREYA  235 (316)
T ss_pred             CCCCCCHHHHHHHHCCCCCHHHHH
T ss_conf             365576034423322432025789


No 99 
>TIGR01241 FtsH_fam ATP-dependent metallopeptidase HflB; InterPro: IPR005936   Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases .   Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins.   Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule.    This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH.    FtsH is a membrane-anchored ATP-dependent protease that degrades misfolded or misassembled membrane proteins as well as a subset of cytoplasmic regulatory proteins. FtsH is a 647-residue protein of 70 kDa, with two putative transmembrane segments towards its N terminus which anchor the protein to the membrane, giving rise to a periplasmic domain of 70 residues and a cytoplasmic segment of 520 residues containing the ATPase and protease domains . ; GO: 0004222 metalloendopeptidase activity, 0030163 protein catabolic process, 0016020 membrane.
Probab=29.77  E-value=18  Score=14.23  Aligned_cols=26  Identities=46%  Similarity=0.594  Sum_probs=17.1

Q ss_pred             EEEHHHHHHHHH-CCCCCCCCCCCCCCCCCC
Q ss_conf             630032346551-878888766676666777
Q gi|254780384|r  121 SHNALHEIGHAL-GLMHPGSYNGGYPVYGVD  150 (665)
Q Consensus       121 ~~t~iHEIGHAL-GL~Hp~~~~~g~~~~~~~  150 (665)
                      ..|.-||=|||| |+--|+.    +|+..++
T Consensus       317 ~~vAYHEaGHAl~G~~~~~~----DpV~KvT  343 (505)
T TIGR01241       317 KIVAYHEAGHALVGLLLKDA----DPVHKVT  343 (505)
T ss_pred             CEEEEECHHHHHHHHHCCCC----CCCCCEE
T ss_conf             22201115789999735344----7523256


No 100
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=28.60  E-value=21  Score=13.77  Aligned_cols=12  Identities=25%  Similarity=0.540  Sum_probs=9.3

Q ss_pred             EEEHHHHHHHHH
Q ss_conf             630032346551
Q gi|254780384|r  121 SHNALHEIGHAL  132 (665)
Q Consensus       121 ~~t~iHEIGHAL  132 (665)
                      -...+||.+|++
T Consensus       119 v~~vvHE~~Hgi  130 (263)
T cd06159         119 VGVVVHELSHGI  130 (263)
T ss_pred             HHHHHHHHHHHH
T ss_conf             999999888899


No 101
>pfam01742 Peptidase_M27 Clostridial neurotoxin zinc protease. These toxins are zinc proteases that block neurotransmitter release by proteolytic cleavage of synaptic proteins such as synaptobrevins, syntaxin and SNAP-25.
Probab=27.23  E-value=19  Score=14.08  Aligned_cols=13  Identities=23%  Similarity=0.215  Sum_probs=5.2

Q ss_pred             CCCCCCCCHHHHH
Q ss_conf             1455422114467
Q gi|254780384|r  415 FGSDRGAAARHYI  427 (665)
Q Consensus       415 ~g~d~~~al~HYi  427 (665)
                      |-.|....|.|=+
T Consensus       210 F~~DPAl~LmheL  222 (408)
T pfam01742       210 FYVDPALELMHEL  222 (408)
T ss_pred             EECCHHHHHHHHH
T ss_conf             2238899999999


No 102
>cd02640 R3H_NRF R3H domain of the NF-kappaB-repression factor (NRF). NRF is a nuclear inhibitor of NF-kappaB proteins that can silence the IFNbeta promoter via binding to a negative regulatory element (NRE). Beside R3H NRF also contains a G-patch domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=26.85  E-value=28  Score=13.11  Aligned_cols=15  Identities=27%  Similarity=0.499  Sum_probs=13.3

Q ss_pred             EHHHHHHHHHCCCCC
Q ss_conf             003234655187888
Q gi|254780384|r  123 NALHEIGHALGLMHP  137 (665)
Q Consensus       123 t~iHEIGHALGL~Hp  137 (665)
                      .+||+|..-|||+|-
T Consensus        32 ~~iH~~a~~~gLksk   46 (60)
T cd02640          32 ALIHQIAQKYGLKSR   46 (60)
T ss_pred             HHHHHHHHHCCCCCC
T ss_conf             999999999199221


No 103
>TIGR01338 phycocy_alpha phycocyanin, alpha subunit; InterPro: IPR006246   These sequences in this family represent the phycocyanin alpha subunit. Other, homologous phyobiliproteins of the phycobilisome include phycocyanin alpha chain and the allophycocyanin and phycoerythrin alpha and beta chains. Not included are the closely related phycoerythrocyanin alpha subunit sequences.; GO: 0006118 electron transport, 0015979 photosynthesis, 0030089 phycobilisome.
Probab=25.64  E-value=24  Score=13.45  Aligned_cols=70  Identities=19%  Similarity=0.267  Sum_probs=45.7

Q ss_pred             CCCCCHHHHHCCHH--HHHHHCCCCHHHHHHHHHHHHCCCCC-CCCCCCCHHHHHHHCCCHHHHHHCCHHHH--HHHHHH
Q ss_conf             65440111202321--66651675002477888899754046-88631176887673810777760651347--886887
Q gi|254780384|r  522 DAFKALEYIASHED--LINAFADAPDLVQAGKDHFSNFGYEE-GRSITFNPDLYLTSYDDLWNVFGSDRSAA--AQHYIT  596 (665)
Q Consensus       522 ~~Fda~~Ylasn~D--L~~a~G~d~~~~~aa~~HYi~~G~~E-GR~~~Fda~~YLa~y~Dl~~~fg~d~~~a--~~Hyi~  596 (665)
                      ..-|.-+||.--.=  +.+  +      +..+|.|+-.|..| .|++...|+||.++-.=+++..|-.+..+  ..-|+.
T Consensus        83 C~rdiGyylr~~ty~~~~G--~------tGPld~yl~~G~~e~nr~f~lsPsWy~eal~yik~~hGlsG~~a~ean~y~d  154 (161)
T TIGR01338        83 CVRDIGYYLRMVTYALVVG--G------TGPLDEYLVAGLEEINRAFDLSPSWYVEALKYIKANHGLSGQAAVEANSYLD  154 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHC--C------CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH
T ss_conf             5677789999999998826--8------8744789998799986653267268999999998614888612456778999


Q ss_pred             HHH
Q ss_conf             356
Q gi|254780384|r  597 HGM  599 (665)
Q Consensus       597 ~G~  599 (665)
                      +-+
T Consensus       155 y~i  157 (161)
T TIGR01338       155 YAI  157 (161)
T ss_pred             HHH
T ss_conf             998


No 104
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=25.35  E-value=24  Score=13.43  Aligned_cols=83  Identities=22%  Similarity=0.181  Sum_probs=39.6

Q ss_pred             HHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEECCCCEEEEECCCCCCCCCCCCCCC-------CCCCCCCCCCCCEEEE
Q ss_conf             999999999998865747568982478716896158535663146775312244335-------4433366666734663
Q gi|254780384|r   50 NAQEVARWALGEWSKVVDLTFEETSIHSDIKFISSNNGYVCTPRYDSRYFMEINFDK-------ADVWRYGIGKGTILSH  122 (665)
Q Consensus        50 ~~~~~~R~Al~~ws~va~ItF~Ev~~~~d~~f~~~~~g~a~~p~~~~~~~~~~~~~~-------~~~~~~~~~~g~y~~~  122 (665)
                      -.|..++..++.||.|..-.=-  +...-.+++.+++|-...|-..........|+.       .+..-.++...  +--
T Consensus        19 wAQ~kvks~y~KySkV~~s~g~--TGaevAr~iLd~nGl~dV~Ve~v~G~LTDHYDP~~kvvrLSe~~y~g~Sia--~~a   94 (226)
T COG2738          19 WAQSKVKSTYKKYSKVRASSGL--TGAEVARMILDENGLYDVPVEEVPGTLTDHYDPRRKVVRLSEANYYGPSIA--AIA   94 (226)
T ss_pred             HHHHHHHHHHHHHHHCCCCCCC--CHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHEEECCCCCCCCCCHH--HHH
T ss_conf             9999999889987631676898--479999999855587665046406876566680451256452014786388--898


Q ss_pred             EHHHHHHHHHCCCC
Q ss_conf             00323465518788
Q gi|254780384|r  123 NALHEIGHALGLMH  136 (665)
Q Consensus       123 t~iHEIGHALGL~H  136 (665)
                      ..-||.|||+-=+|
T Consensus        95 VAAHEVGHAiQd~~  108 (226)
T COG2738          95 VAAHEVGHAIQDQE  108 (226)
T ss_pred             HHHHHHHHHHHHHC
T ss_conf             99987627776220


No 105
>COG3793 TerB Tellurite resistance protein [Inorganic ion transport and metabolism]
Probab=24.86  E-value=32  Score=12.71  Aligned_cols=10  Identities=70%  Similarity=0.943  Sum_probs=3.7

Q ss_pred             HHHHHHHHCC
Q ss_conf             3234655187
Q gi|254780384|r  125 LHEIGHALGL  134 (665)
Q Consensus       125 iHEIGHALGL  134 (665)
                      +|||-|+|||
T Consensus       128 l~eI~~aLGL  137 (144)
T COG3793         128 LREIAGALGL  137 (144)
T ss_pred             HHHHHHHHCC
T ss_conf             9999999388


No 106
>pfam07108 PipA PipA protein. This family consists of several Salmonella PipA (pathogenicity island-encoded protein A) and related phage sequences. PipA is thought to contribute to enteric but not to systemic salmonellosis.
Probab=24.61  E-value=28  Score=13.04  Aligned_cols=14  Identities=21%  Similarity=0.396  Sum_probs=8.2

Q ss_pred             CHHHHHHHHHHHHH
Q ss_conf             51347886887356
Q gi|254780384|r  586 DRSAAAQHYITHGM  599 (665)
Q Consensus       586 d~~~a~~Hyi~~G~  599 (665)
                      +...++.|-|.+-+
T Consensus       175 dt~RsFiHEiVHAL  188 (228)
T pfam07108       175 DTKRSFIHEVVHAL  188 (228)
T ss_pred             HHHHHHHHHHHHHH
T ss_conf             67999999999998


No 107
>pfam02102 Peptidase_M35 Deuterolysin metalloprotease (M35) family.
Probab=23.76  E-value=30  Score=12.88  Aligned_cols=28  Identities=11%  Similarity=0.459  Sum_probs=13.4

Q ss_pred             HCCCCCCCCCCCCHHHHHHHCCCHHHHHHCC
Q ss_conf             7540468863117688767381077776065
Q gi|254780384|r  556 NFGYEEGRSITFNPDLYLTSYDDLWNVFGSD  586 (665)
Q Consensus       556 ~~G~~EGR~~~Fda~~YLa~y~Dl~~~fg~d  586 (665)
                      .||+..-|.++  ++ ..-+|+|-=..|++.
T Consensus       320 aYGy~~~~~Ls--a~-qAlnNADsYalfAna  347 (352)
T pfam02102       320 GYGYDAATALS--AS-QALNNADSYALYANA  347 (352)
T ss_pred             CCCHHHHHCCC--HH-HHHHCHHHHHHHHHH
T ss_conf             33588883259--88-876581769999868


No 108
>TIGR00962 atpA ATP synthase F1, alpha subunit; InterPro: IPR005294   ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , .  F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient.   This entry represents the alpha subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha-subunit contains a highly conserved adenine-specific non-catalytic nucleotide-binding domain, with a conserved amino acid sequence of Gly-X-X-X-X-Gly-Lys. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit .   More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex.
Probab=23.25  E-value=38  Score=12.24  Aligned_cols=30  Identities=30%  Similarity=0.464  Sum_probs=17.9

Q ss_pred             HHHHHCCCHHHH--HHCCHHHHHHHHHHHHHH
Q ss_conf             876738107777--606513478868873564
Q gi|254780384|r  571 LYLTSYDDLWNV--FGSDRSAAAQHYITHGMK  600 (665)
Q Consensus       571 ~YLa~y~Dl~~~--fg~d~~~a~~Hyi~~G~~  600 (665)
                      +-||.|-.|-++  ||+|.+.|.+.=|.+|.|
T Consensus       402 LeLAQYRELeAFsQFaSDLD~ATk~qLerG~R  433 (520)
T TIGR00962       402 LELAQYRELEAFSQFASDLDEATKAQLERGKR  433 (520)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
T ss_conf             55754889988864012234899999860672


No 109
>PRK13262 ureE urease accessory protein UreE; Provisional
Probab=22.97  E-value=31  Score=12.82  Aligned_cols=16  Identities=44%  Similarity=0.584  Sum_probs=11.9

Q ss_pred             HHHHHHHHHCCCCCCC
Q ss_conf             0323465518788887
Q gi|254780384|r  124 ALHEIGHALGLMHPGS  139 (665)
Q Consensus       124 ~iHEIGHALGL~Hp~~  139 (665)
                      .--|+|||||=.|=--
T Consensus       105 tafELGHalGNQHW~a  120 (231)
T PRK13262        105 TCFELGHALGNQHWKA  120 (231)
T ss_pred             HHHHHHHHHHCCCHHH
T ss_conf             9999756650232165


No 110
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=21.68  E-value=31  Score=12.79  Aligned_cols=12  Identities=42%  Similarity=0.515  Sum_probs=6.8

Q ss_pred             EEHHHHHHHHHC
Q ss_conf             300323465518
Q gi|254780384|r  122 HNALHEIGHALG  133 (665)
Q Consensus       122 ~t~iHEIGHALG  133 (665)
                      -++-|||||-.|
T Consensus       132 gViAHEigHv~q  143 (484)
T COG4783         132 GVIAHEIGHVAQ  143 (484)
T ss_pred             HHHHHHHHHHHH
T ss_conf             999999988741


No 111
>pfam09954 DUF2188 Uncharacterized protein conserved in bacteria (DUF2188). This domain, found in various hypothetical bacterial proteins, has no known function.
Probab=21.41  E-value=40  Score=12.14  Aligned_cols=22  Identities=23%  Similarity=0.422  Sum_probs=13.1

Q ss_pred             CCCEEEECCCC-----CCCCCCCCCCH
Q ss_conf             75238801478-----86632222201
Q gi|254780384|r  635 KKVDLYTDYFG-----DGNYYDSDLYS  656 (665)
Q Consensus       635 ~~g~l~~d~dg-----~g~~~~~~~~~  656 (665)
                      +...+||-.||     .|+.-.|+.+.
T Consensus        87 ~~~hV~~edd~W~VkseGAkrAS~tfd  113 (131)
T pfam09954        87 KAEHVYKHDDGWAVKSEGAKRASEVFD  113 (131)
T ss_pred             CCEEEEECCCCEEEECCCHHHHHHHHH
T ss_conf             661688628936784044466664231


No 112
>COG4744 Uncharacterized conserved protein [Function unknown]
Probab=20.95  E-value=18  Score=14.23  Aligned_cols=41  Identities=27%  Similarity=0.397  Sum_probs=33.4

Q ss_pred             CEEEEEEECCCCCCCCCCCCCEECCHHHHHHHHHHHHCCCC
Q ss_conf             02446752356776666552000211221489999860876
Q gi|254780384|r  158 FLTSVMSYFMPQDSMMDASFGYCATPMVSDIVAVQKIYGAP  198 (665)
Q Consensus       158 ~~~TvMSY~~~~~~~~~~~~~~~~tpm~~DI~Alq~lYGan  198 (665)
                      ...-||||.-+.....+.+-....+||..|..++.+.||-.
T Consensus        40 LI~lv~Sy~lp~~lss~~~~~~v~npgeadmk~v~kd~Gqe   80 (121)
T COG4744          40 LIALVMSYHLPELLSSNEDITIVKNPGEADMKIVIKDYGQE   80 (121)
T ss_pred             HHHHHHHCCCCCCCCCCCCCEEEECCCCCCEEEEEHHCCCC
T ss_conf             99998751774456888871577367642206672422916


Done!