Query         psy5221
Match_columns 436
No_of_seqs    384 out of 3006
Neff          8.6 
Searched_HMMs 46136
Date          Fri Aug 16 20:28:40 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5221.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5221hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PRK05457 heat shock protein Ht 100.0 1.2E-43 2.7E-48  337.9  29.6  251   17-279    32-283 (284)
  2 PRK01265 heat shock protein Ht 100.0 4.7E-41   1E-45  322.5  29.8  252   20-281    42-323 (324)
  3 PRK04897 heat shock protein Ht 100.0 2.5E-41 5.5E-46  325.1  27.9  253   19-278    40-297 (298)
  4 PRK02870 heat shock protein Ht 100.0 1.8E-40 3.9E-45  320.4  29.3  248   20-279    71-334 (336)
  5 PRK03072 heat shock protein Ht 100.0 2.2E-40 4.9E-45  316.8  28.8  248   21-282    32-285 (288)
  6 PRK02391 heat shock protein Ht 100.0 2.1E-40 4.5E-45  317.2  27.9  250   20-282    37-292 (296)
  7 PRK03982 heat shock protein Ht 100.0 3.8E-39 8.3E-44  309.6  29.1  251   21-284    30-285 (288)
  8 PRK01345 heat shock protein Ht 100.0 5.2E-39 1.1E-43  310.5  28.0  248   22-284    30-283 (317)
  9 PRK03001 M48 family peptidase; 100.0 5.8E-38 1.3E-42  300.6  28.9  246   22-283    30-281 (283)
 10 PRK11597 heat shock chaperone  100.0 8.9E-33 1.9E-37  233.9  15.8  139  291-431     1-139 (142)
 11 PRK10743 heat shock protein Ib 100.0 5.2E-32 1.1E-36  228.9  15.6  136  292-427     2-137 (137)
 12 COG0501 HtpX Zn-dependent prot 100.0 5.5E-30 1.2E-34  249.2  26.5  240   22-283    56-300 (302)
 13 KOG2719|consensus              100.0 9.2E-30   2E-34  246.5  25.7  224   32-281   192-425 (428)
 14 COG0071 IbpA Molecular chapero  99.9 4.1E-27 8.8E-32  203.5  15.3  124  303-427    19-146 (146)
 15 PF01435 Peptidase_M48:  Peptid  99.9 5.9E-27 1.3E-31  218.2  11.7  185   60-275    29-226 (226)
 16 cd06470 ACD_IbpA-B_like Alpha-  99.9 1.3E-23 2.7E-28  166.2  12.8   89  325-413     1-90  (90)
 17 cd06472 ACD_ScHsp26_like Alpha  99.9 1.4E-22 3.1E-27  161.2  12.2   87  326-413     1-92  (92)
 18 PF00011 HSP20:  Hsp20/alpha cr  99.9 1.4E-21   3E-26  158.9  13.6   99  328-427     1-102 (102)
 19 cd06471 ACD_LpsHSP_like Group   99.9   1E-21 2.2E-26  156.7  12.2   87  325-413     1-93  (93)
 20 cd06497 ACD_alphaA-crystallin_  99.9 2.2E-21 4.8E-26  151.4  11.3   80  328-413     4-86  (86)
 21 cd06478 ACD_HspB4-5-6 Alpha-cr  99.8 1.5E-20 3.3E-25  145.9  11.3   79  329-413     2-83  (83)
 22 cd06479 ACD_HspB7_like Alpha c  99.8 1.2E-20 2.6E-25  144.8  10.0   77  328-413     2-81  (81)
 23 cd06498 ACD_alphaB-crystallin_  99.8 2.2E-20 4.7E-25  145.1  11.4   77  332-413     4-83  (84)
 24 COG4783 Putative Zn-dependent   99.8 1.8E-19 3.9E-24  176.2  18.1  179   64-284    76-264 (484)
 25 cd06476 ACD_HspB2_like Alpha c  99.8 7.4E-20 1.6E-24  141.5  10.9   77  332-413     4-83  (83)
 26 cd06475 ACD_HspB1_like Alpha c  99.8 8.3E-20 1.8E-24  142.5  11.2   80  327-412     3-85  (86)
 27 cd06477 ACD_HspB3_Like Alpha c  99.8 2.2E-19 4.8E-24  138.3  10.9   76  332-412     4-82  (83)
 28 cd06482 ACD_HspB10 Alpha cryst  99.8 2.2E-19 4.7E-24  139.5  10.3   79  333-413     6-87  (87)
 29 cd06481 ACD_HspB9_like Alpha c  99.8 3.1E-19 6.6E-24  139.7  10.6   79  332-413     4-87  (87)
 30 cd06526 metazoan_ACD Alpha-cry  99.8   2E-18 4.4E-23  134.4   9.7   75  334-413     6-83  (83)
 31 cd06464 ACD_sHsps-like Alpha-c  99.7 1.9E-17   4E-22  130.5  11.7   85  328-413     1-88  (88)
 32 KOG2661|consensus               99.7 3.8E-17 8.2E-22  151.5  13.1  187   47-283   201-399 (424)
 33 COG4784 Putative Zn-dependent   99.7 1.7E-16 3.7E-21  147.8  14.7  178   63-283    65-254 (479)
 34 cd06480 ACD_HspB8_like Alpha-c  99.7 1.9E-16 4.1E-21  123.8  10.2   77  332-413    12-91  (91)
 35 KOG0710|consensus               99.6 1.7E-15 3.6E-20  136.7   7.7  106  321-427    81-195 (196)
 36 KOG3591|consensus               99.6 1.5E-14 3.2E-19  127.0  12.1  101  326-432    64-167 (173)
 37 PF05569 Peptidase_M56:  BlaR1   99.5 2.8E-12 6.1E-17  124.5  19.1  154   64-276   143-299 (299)
 38 cd00298 ACD_sHsps_p23-like Thi  99.3 2.5E-11 5.4E-16   92.8  10.3   76  332-413     3-80  (80)
 39 cd06469 p23_DYX1C1_like p23_li  98.9 7.4E-09 1.6E-13   79.4   9.2   65  332-414     3-69  (78)
 40 COG4219 MecR1 Antirepressor re  98.9 3.6E-07 7.8E-12   86.1  20.3  125   64-234   138-265 (337)
 41 PF05455 GvpH:  GvpH;  InterPro  98.8 6.3E-08 1.4E-12   84.0  10.1   74  323-414    90-168 (177)
 42 cd06463 p23_like Proteins cont  98.5 9.8E-07 2.1E-11   68.1   9.4   70  332-414     3-74  (84)
 43 cd06466 p23_CS_SGT1_like p23_l  98.1 1.4E-05   3E-10   62.0   8.1   73  328-414     1-75  (84)
 44 PF04969 CS:  CS domain;  Inter  97.7   0.001 2.2E-08   50.5  11.4   75  325-413     1-79  (79)
 45 PF04228 Zn_peptidase:  Putativ  97.6   0.001 2.2E-08   63.8  12.4   67   64-133    99-185 (292)
 46 PF06114 DUF955:  Domain of unk  97.4  0.0016 3.5E-08   53.5   9.7   33  100-136    28-60  (122)
 47 cd06465 p23_hB-ind1_like p23_l  97.3  0.0037   8E-08   50.9  10.5   76  325-415     1-78  (108)
 48 PF10463 Peptidase_U49:  Peptid  97.2 0.00056 1.2E-08   61.8   5.0   20  116-135    99-118 (206)
 49 PF08190 PIH1:  pre-RNA process  97.1  0.0021 4.6E-08   63.2   8.5   64  333-412   259-327 (328)
 50 cd06489 p23_CS_hSgt1_like p23_  96.9  0.0078 1.7E-07   46.5   8.7   73  328-414     1-75  (84)
 51 PRK09672 phage exclusion prote  96.7  0.0026 5.7E-08   59.9   5.6   21  115-135   162-182 (305)
 52 cd06467 p23_NUDC_like p23_like  96.6   0.018   4E-07   44.4   9.0   69  329-414     3-75  (85)
 53 cd06493 p23_NUDCD1_like p23_NU  96.4   0.038 8.3E-07   42.8   9.2   70  328-414     2-75  (85)
 54 PF01863 DUF45:  Protein of unk  96.3   0.008 1.7E-07   54.8   6.2   68   64-135   114-181 (205)
 55 cd06488 p23_melusin_like p23_l  96.3   0.054 1.2E-06   42.2   9.8   74  327-414     3-78  (87)
 56 PF04298 Zn_peptidase_2:  Putat  96.2    0.68 1.5E-05   42.3  17.7   65   65-134    40-105 (222)
 57 cd06494 p23_NUDCD2_like p23-li  95.9    0.12 2.6E-06   40.7   9.9   74  323-414     4-81  (93)
 58 PRK04351 hypothetical protein;  95.9   0.043 9.4E-07   47.2   7.9   62   64-129     6-72  (149)
 59 cd06468 p23_CacyBP p23_like do  95.7    0.14 3.1E-06   40.1   9.9   75  326-414     3-83  (92)
 60 COG1451 Predicted metal-depend  95.6   0.044 9.5E-07   50.7   7.5   69   64-136   125-193 (223)
 61 COG2321 Predicted metalloprote  95.1    0.17 3.7E-06   47.2   9.5   67   67-133    99-182 (295)
 62 PF13699 DUF4157:  Domain of un  94.9   0.038 8.2E-07   42.2   4.0   60   67-132     6-75  (79)
 63 PF10263 SprT-like:  SprT-like   93.9    0.36 7.8E-06   41.9   8.5   70   64-133     4-75  (157)
 64 smart00731 SprT SprT homologue  93.6    0.18 3.8E-06   43.4   5.9   66   64-131     3-72  (146)
 65 cd00237 p23 p23 binds heat sho  93.6     1.4 3.1E-05   35.6  10.8   74  325-414     2-77  (106)
 66 PF10026 DUF2268:  Predicted Zn  93.3    0.23   5E-06   45.0   6.4   69   64-133     5-80  (195)
 67 PF13203 DUF2201_N:  Putative m  92.8    0.24 5.2E-06   47.9   6.2   38  101-138    43-80  (292)
 68 PF04450 BSP:  Peptidase of pla  92.6    0.28   6E-06   44.7   5.8   70   64-134    30-112 (205)
 69 COG2738 Predicted Zn-dependent  91.4     2.6 5.7E-05   37.5  10.1   70   65-138    43-112 (226)
 70 cd06495 p23_NUDCD3_like p23-li  90.4     4.5 9.7E-05   32.4  10.1   77  324-414     4-85  (102)
 71 COG2856 Predicted Zn peptidase  90.3    0.31 6.6E-06   44.6   3.7   31   99-133    57-87  (213)
 72 KOG1309|consensus               90.3     1.1 2.4E-05   39.3   6.8   42  325-367     4-45  (196)
 73 PLN03088 SGT1,  suppressor of   90.3     1.4   3E-05   43.9   8.7   77  325-415   157-235 (356)
 74 COG4900 Predicted metallopepti  90.1    0.77 1.7E-05   36.7   5.3   67   64-131     9-93  (133)
 75 cd06492 p23_mNUDC_like p23-lik  89.3     5.1 0.00011   31.1   9.4   66  333-414     6-77  (87)
 76 cd06490 p23_NCB5OR p23_like do  89.0       6 0.00013   30.6   9.6   73  327-414     1-78  (87)
 77 PF02031 Peptidase_M7:  Strepto  88.5     1.4   3E-05   36.5   5.8   39   88-131    52-90  (132)
 78 PRK04860 hypothetical protein;  88.4     1.8 3.9E-05   37.8   6.9   66   64-131     9-76  (160)
 79 COG3091 SprT Zn-dependent meta  87.1     1.6 3.5E-05   37.1   5.6   66   63-130     6-73  (156)
 80 PF13349 DUF4097:  Domain of un  84.2      11 0.00023   32.6   9.9   81  324-410    65-147 (166)
 81 PF12388 Peptidase_M57:  Dual-a  82.9     1.1 2.3E-05   40.8   2.9   34  100-135   117-151 (211)
 82 COG3590 PepO Predicted metallo  80.5    0.45 9.8E-06   48.9  -0.5   55   74-131   434-500 (654)
 83 COG3864 Uncharacterized protei  79.7     2.3   5E-05   40.6   3.9   73   64-136     7-87  (396)
 84 COG3824 Predicted Zn-dependent  77.9     2.1 4.5E-05   34.9   2.7   36   95-131    81-122 (136)
 85 COG4324 Predicted aminopeptida  76.0     2.7 5.9E-05   39.2   3.2   33  104-136   180-218 (376)
 86 cd04279 ZnMc_MMP_like_1 Zinc-d  75.7     1.7 3.8E-05   37.5   1.9   17  115-131   101-117 (156)
 87 PF01421 Reprolysin:  Reprolysi  75.3     2.1 4.5E-05   38.7   2.3   40   90-131   105-144 (199)
 88 cd04269 ZnMc_adamalysin_II_lik  74.6     2.6 5.6E-05   37.9   2.8   17  115-131   128-144 (194)
 89 PF14891 Peptidase_M91:  Effect  74.6       8 0.00017   34.2   5.8   67   65-131    31-116 (174)
 90 PF08325 WLM:  WLM domain;  Int  74.5     7.8 0.00017   34.7   5.7   23  113-135    77-99  (186)
 91 PF00413 Peptidase_M10:  Matrix  74.3       2 4.3E-05   36.8   1.9   18  114-131   101-118 (154)
 92 cd04268 ZnMc_MMP_like Zinc-dep  73.2     2.1 4.5E-05   37.2   1.8   18  114-131    90-107 (165)
 93 cd04270 ZnMc_TACE_like Zinc-de  72.9     2.3   5E-05   39.9   2.1   18  114-131   163-180 (244)
 94 COG1164 Oligoendopeptidase F [  70.9     5.9 0.00013   42.3   4.8   50   80-134   345-396 (598)
 95 PF13485 Peptidase_MA_2:  Pepti  70.2     3.1 6.6E-05   33.9   2.0   23  112-134    19-41  (128)
 96 PF09768 Peptidase_M76:  Peptid  69.2      15 0.00033   32.4   6.2   63   64-131    18-84  (173)
 97 PF13688 Reprolysin_5:  Metallo  68.6     2.5 5.4E-05   38.0   1.2   18  114-131   138-155 (196)
 98 PF14247 DUF4344:  Domain of un  67.6      19 0.00042   33.1   6.8   23  261-283   160-182 (220)
 99 PRK06926 flagellar motor prote  67.2 1.1E+02  0.0023   29.3  11.9   30  109-138   120-149 (271)
100 PRK00446 cyaY frataxin-like pr  67.0     7.1 0.00015   31.5   3.4   31  398-435    29-59  (105)
101 PF13582 Reprolysin_3:  Metallo  66.7       3 6.5E-05   34.3   1.2   13  119-131   108-120 (124)
102 TIGR00181 pepF oligoendopeptid  65.9     4.4 9.6E-05   43.3   2.7   42   88-134   353-394 (591)
103 PRK00523 hypothetical protein;  65.2      25 0.00055   26.0   5.6   45   26-79     12-56  (72)
104 PRK13267 archaemetzincin-like   64.9      15 0.00032   32.8   5.3   67   65-131    56-138 (179)
105 PF13574 Reprolysin_2:  Metallo  64.1     3.6 7.9E-05   36.3   1.4   14  118-131   111-124 (173)
106 TIGR03421 FeS_CyaY iron donor   64.1     7.9 0.00017   31.0   3.1   31  398-435    27-57  (102)
107 PF01447 Peptidase_M4:  Thermol  62.1      18 0.00039   31.2   5.2   62   64-130    79-147 (150)
108 cd04278 ZnMc_MMP Zinc-dependen  61.8       4 8.7E-05   35.3   1.2   17  115-131   104-120 (157)
109 PF05572 Peptidase_M43:  Pregna  61.7     4.3 9.4E-05   35.2   1.4   21  114-134    65-86  (154)
110 cd00503 Frataxin Frataxin is a  61.4     9.4  0.0002   30.8   3.2   31  398-435    30-60  (105)
111 PF01433 Peptidase_M1:  Peptida  59.7      34 0.00073   34.1   7.6   66   64-134   234-311 (390)
112 PF10023 DUF2265:  Predicted am  59.6     4.9 0.00011   39.3   1.5   32  106-137   150-187 (337)
113 cd04267 ZnMc_ADAM_like Zinc-de  59.2     3.8 8.2E-05   36.7   0.6   17  115-131   130-146 (192)
114 cd06460 M32_Taq Peptidase fami  58.4      82  0.0018   31.8   9.9   66   64-133   109-174 (396)
115 PRK05518 rpl6p 50S ribosomal p  58.3      38 0.00082   30.2   6.7   45  348-412    13-57  (180)
116 TIGR03422 mito_frataxin fratax  56.4     9.4  0.0002   30.3   2.3   29  399-434    30-58  (97)
117 TIGR02414 pepN_proteo aminopep  56.0     9.5 0.00021   42.6   3.1   20  116-135   281-300 (863)
118 TIGR03653 arch_L6P archaeal ri  55.5      48   0.001   29.2   6.9   45  348-412     7-51  (170)
119 PF13583 Reprolysin_4:  Metallo  55.3     5.7 0.00012   36.2   1.1   12  120-131   139-150 (206)
120 cd04271 ZnMc_ADAM_fungal Zinc-  55.1     4.4 9.5E-05   37.6   0.3   12  120-131   147-158 (228)
121 PF01491 Frataxin_Cyay:  Fratax  55.1      14 0.00031   29.9   3.3   31  397-434    31-61  (109)
122 PF01431 Peptidase_M13:  Peptid  54.8     9.3  0.0002   34.6   2.4   35  100-134     8-52  (206)
123 cd04272 ZnMc_salivary_gland_MP  54.7     7.2 0.00016   35.9   1.6   15  117-131   144-158 (220)
124 cd00203 ZnMc Zinc-dependent me  54.6     5.5 0.00012   34.5   0.8   17  115-131    93-109 (167)
125 TIGR02412 pepN_strep_liv amino  53.0      32 0.00069   38.5   6.6   68   65-136   229-305 (831)
126 PF06262 DUF1025:  Possibl zinc  52.5      25 0.00055   27.9   4.2   36   99-134    49-89  (97)
127 PF04972 BON:  BON domain;  Int  52.1      30 0.00064   24.6   4.3   26  344-369    12-37  (64)
128 cd06477 ACD_HspB3_Like Alpha c  52.1      28 0.00061   26.7   4.3   30  336-365    51-82  (83)
129 PTZ00027 60S ribosomal protein  52.1      48  0.0011   29.8   6.4   48  348-413    13-60  (190)
130 PRK05498 rplF 50S ribosomal pr  52.0      41 0.00089   29.9   5.9   44  348-412    12-55  (178)
131 TIGR03654 L6_bact ribosomal pr  51.0      54  0.0012   29.0   6.5   45  348-413    11-55  (175)
132 PRK01844 hypothetical protein;  50.6      59  0.0013   24.2   5.4   47   24-79      9-55  (72)
133 PF12315 DUF3633:  Protein of u  50.5      16 0.00035   33.1   3.1   20  112-131    87-106 (212)
134 KOG3314|consensus               50.1      70  0.0015   27.8   6.6   68   65-135    40-108 (194)
135 COG1965 CyaY Protein implicate  48.6      14 0.00031   29.5   2.2   31  398-435    30-60  (106)
136 TIGR02289 M3_not_pepF oligoend  48.4      13 0.00029   39.3   2.7   65   64-133   274-352 (549)
137 COG3763 Uncharacterized protei  47.9      54  0.0012   24.1   4.8   40   31-79     16-55  (71)
138 smart00235 ZnMc Zinc-dependent  47.6     9.1  0.0002   32.2   1.1   13  119-131    87-99  (140)
139 cd06471 ACD_LpsHSP_like Group   47.0      29 0.00064   26.8   3.8   31  334-364    61-91  (93)
140 TIGR02411 leuko_A4_hydro leuko  46.6      26 0.00056   37.6   4.5   31  102-134   265-295 (601)
141 PRK14015 pepN aminopeptidase N  46.2      34 0.00074   38.4   5.5   20  117-136   295-314 (875)
142 cd04276 ZnMc_MMP_like_2 Zinc-d  44.0      16 0.00034   33.2   2.0   31  101-131    92-129 (197)
143 PF14730 DUF4468:  Domain of un  44.0 1.1E+02  0.0024   23.7   6.6   17  397-413    69-85  (91)
144 cd06459 M3B_Oligoendopeptidase  43.9      15 0.00032   37.3   2.1   41   88-133   196-237 (427)
145 KOG1047|consensus               43.6      14  0.0003   38.4   1.8   52   79-135   254-305 (613)
146 PF02163 Peptidase_M50:  Peptid  43.0      16 0.00034   32.7   1.9   14  118-131     7-20  (192)
147 TIGR03793 TOMM_pelo TOMM prope  43.0 1.1E+02  0.0024   23.1   6.2   44   69-123    28-76  (77)
148 PTZ00179 60S ribosomal protein  42.4      79  0.0017   28.4   6.2   47  348-412    12-58  (189)
149 cd04327 ZnMc_MMP_like_3 Zinc-d  42.3      17 0.00036   32.9   2.0   16  116-131    90-105 (198)
150 cd04277 ZnMc_serralysin_like Z  41.6      15 0.00034   32.6   1.6   34  101-134    94-130 (186)
151 PRK01379 cyaY frataxin-like pr  41.2      28 0.00062   27.9   2.9   30  398-435    30-59  (103)
152 CHL00140 rpl6 ribosomal protei  40.6      72  0.0016   28.3   5.7   44  348-412    12-55  (178)
153 PF12992 DUF3876:  Domain of un  40.6      89  0.0019   24.7   5.5   41  322-363    23-68  (95)
154 TIGR02290 M3_fam_3 oligoendope  40.2      19 0.00041   38.5   2.3   41   88-133   349-390 (587)
155 PHA00527 hypothetical protein   39.8      69  0.0015   25.5   4.7   49   65-131    28-86  (129)
156 KOG3624|consensus               39.4      43 0.00092   36.6   4.9   43   88-133   481-533 (687)
157 cd04273 ZnMc_ADAMTS_like Zinc-  39.2     5.9 0.00013   36.0  -1.5   14  118-131   140-153 (207)
158 PF06861 BALF1:  BALF1 protein;  39.0      22 0.00048   31.0   2.1   23  113-135   127-149 (182)
159 cd06161 S2P-M50_SpoIVFB SpoIVF  39.0      19 0.00042   32.8   1.8   15  117-131    37-51  (208)
160 cd06163 S2P-M50_PDZ_RseP-like   38.6      18 0.00038   32.3   1.5   13  119-131    10-22  (182)
161 cd05709 S2P-M50 Site-2 proteas  38.4      20 0.00043   31.6   1.8   14  118-131     8-21  (180)
162 cd06472 ACD_ScHsp26_like Alpha  37.1      52  0.0011   25.4   3.8   32  333-364    58-90  (92)
163 cd06481 ACD_HspB9_like Alpha c  37.0      41 0.00089   25.9   3.2   32  334-365    53-86  (87)
164 TIGR02421 QEGLA conserved hypo  36.8      75  0.0016   31.7   5.7   60   61-127   136-197 (366)
165 cd06164 S2P-M50_SpoIVFB_CBS Sp  36.6      22 0.00048   32.9   1.8   15  117-131    52-66  (227)
166 PF01432 Peptidase_M3:  Peptida  36.1      26 0.00056   36.1   2.5   18  113-131   238-255 (458)
167 cd06526 metazoan_ACD Alpha-cry  36.0      51  0.0011   24.9   3.5   31  335-365    50-82  (83)
168 cd06464 ACD_sHsps-like Alpha-c  35.5      61  0.0013   24.2   4.0   33  333-365    54-87  (88)
169 PF00011 HSP20:  Hsp20/alpha cr  34.5      56  0.0012   25.6   3.7   38  333-370    53-91  (102)
170 PF13402 M60-like:  Peptidase M  33.9 1.9E+02  0.0042   27.7   8.1   64   65-133   153-234 (307)
171 PF08014 DUF1704:  Domain of un  33.6   1E+02  0.0022   30.7   6.0   63   63-133   113-181 (349)
172 KOG1046|consensus               33.4      79  0.0017   35.7   5.8   66   66-136   265-342 (882)
173 PF00347 Ribosomal_L6:  Ribosom  33.3      97  0.0021   22.8   4.7   47  348-413     2-48  (77)
174 PF12725 DUF3810:  Protein of u  33.0      24 0.00053   34.5   1.6   19  113-131   191-209 (318)
175 PF01954 DUF104:  Protein of un  32.9      49  0.0011   23.7   2.7   17  397-413     4-20  (60)
176 cd06160 S2P-M50_like_2 Unchara  32.6      29 0.00062   31.0   1.8   16  116-131    39-54  (183)
177 cd06478 ACD_HspB4-5-6 Alpha-cr  32.3      59  0.0013   24.7   3.3   30  336-365    51-82  (83)
178 KOG3607|consensus               32.2      29 0.00063   37.8   2.1   42   88-131   295-336 (716)
179 PF08308 PEGA:  PEGA domain;  I  32.2 1.4E+02  0.0031   21.5   5.4   41  326-366    26-67  (71)
180 PF14913 DPCD:  DPCD protein fa  31.7 2.1E+02  0.0046   25.6   7.0   73  325-413    86-168 (194)
181 KOG3413|consensus               31.2      51  0.0011   28.0   2.9   30  398-427    74-107 (156)
182 cd06159 S2P-M50_PDZ_Arch Uncha  30.9      29 0.00064   32.9   1.7   15  117-131   117-131 (263)
183 cd06498 ACD_alphaB-crystallin_  30.6      72  0.0016   24.4   3.6   31  336-366    51-83  (84)
184 TIGR03296 M6dom_TIGR03296 M6 f  30.1      13 0.00027   35.8  -0.9   12  119-130   166-177 (286)
185 PF13398 Peptidase_M50B:  Pepti  30.0      32 0.00069   31.2   1.7   15  116-130    20-34  (200)
186 PF09471 Peptidase_M64:  IgA Pe  29.6      32  0.0007   32.6   1.8   17  114-130   212-228 (264)
187 cd06480 ACD_HspB8_like Alpha-c  29.6   1E+02  0.0022   24.1   4.2   30  335-364    58-89  (91)
188 COG0097 RplF Ribosomal protein  29.5 1.7E+02  0.0038   25.9   6.1   47  346-412    10-56  (178)
189 COG4004 Uncharacterized protei  28.7 2.7E+02  0.0058   21.8   6.2   48  304-365    12-59  (96)
190 TIGR03750 conj_TIGR03750 conju  28.6 3.2E+02   0.007   22.2   8.0   41   33-81     59-99  (111)
191 cd06476 ACD_HspB2_like Alpha c  28.3   1E+02  0.0022   23.5   4.0   30  336-365    51-82  (83)
192 PF14521 Aspzincin_M35:  Lysine  28.3      80  0.0017   27.0   3.8   40   89-129    66-107 (148)
193 cd06162 S2P-M50_PDZ_SREBP Ster  28.1      35 0.00077   32.6   1.7   15  117-131   134-148 (277)
194 KOG0175|consensus               25.8 1.4E+02   0.003   28.0   5.0   58   79-136   179-241 (285)
195 cd02175 GH16_lichenase lichena  25.7 2.7E+02  0.0058   25.2   7.1   53  348-402    31-88  (212)
196 COG4823 AbiF Abortive infectio  25.5      45 0.00097   31.3   1.8   14  115-128    89-102 (299)
197 PHA02456 zinc metallopeptidase  25.2      38 0.00083   27.3   1.2   30  101-134    66-95  (141)
198 PRK11767 SpoVR family protein;  24.9 1.2E+02  0.0026   31.4   4.9   67   64-130    20-121 (498)
199 cd06455 M3A_TOP Peptidase M3 T  23.7      39 0.00084   35.0   1.3   18  115-133   261-278 (472)
200 PF07873 YabP:  YabP family;  I  23.6      50  0.0011   24.0   1.5   21  346-366    23-43  (66)
201 COG4227 Antirestriction protei  23.6      54  0.0012   30.8   1.9   23  114-136   199-221 (316)
202 PF10988 DUF2807:  Protein of u  23.4 3.3E+02  0.0071   23.5   7.1   61  352-430     4-66  (181)
203 COG0308 PepN Aminopeptidase N   22.9      51  0.0011   37.0   2.0   21  116-136   305-325 (859)
204 COG5504 Predicted Zn-dependent  22.9      67  0.0015   30.1   2.4   19  113-131   135-153 (280)
205 PF10460 Peptidase_M30:  Peptid  22.8      99  0.0021   30.8   3.8   33  100-132   118-153 (366)
206 PRK11198 LysM domain/BON super  22.5 1.1E+02  0.0024   26.1   3.6   28  342-369    36-63  (147)
207 cd06158 S2P-M50_like_1 Unchara  22.1      55  0.0012   29.1   1.7   13  118-130     9-21  (181)
208 cd04283 ZnMc_hatching_enzyme Z  21.9      42  0.0009   30.0   0.9   13  119-131    78-90  (182)
209 PF11694 DUF3290:  Protein of u  21.3 2.6E+02  0.0057   24.0   5.6   27   61-87     78-104 (149)
210 PF01400 Astacin:  Astacin (Pep  21.2      56  0.0012   29.3   1.6   62   63-131    23-92  (191)
211 PF06305 DUF1049:  Protein of u  21.2 2.3E+02   0.005   20.2   4.7   23   16-38     18-40  (68)
212 cd04281 ZnMc_BMP1_TLD Zinc-dep  21.0      44 0.00096   30.3   0.9   13  119-131    88-100 (200)
213 COG2317 Zn-dependent carboxype  21.0 4.2E+02  0.0091   27.4   7.7   64   64-131   209-272 (497)
214 cd06482 ACD_HspB10 Alpha cryst  20.7 1.6E+02  0.0035   22.7   3.9   30  335-364    54-85  (87)
215 PF03672 UPF0154:  Uncharacteri  20.5   2E+02  0.0043   20.9   3.9   37   34-79     12-48  (64)
216 TIGR02856 spore_yqfC sporulati  20.5      71  0.0015   24.6   1.8   43  322-366    17-61  (85)
217 PF05309 TraE:  TraE protein;    20.4 1.8E+02  0.0039   25.8   4.7   48  349-411   131-179 (187)
218 TIGR02921 PEP_integral PEP-CTE  20.3 1.8E+02   0.004   30.8   5.1   59  349-414   419-483 (952)

No 1  
>PRK05457 heat shock protein HtpX; Provisional
Probab=100.00  E-value=1.2e-43  Score=337.90  Aligned_cols=251  Identities=51%  Similarity=0.781  Sum_probs=203.2

Q ss_pred             chHHHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeC
Q psy5221          17 NIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATG   95 (436)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G   95 (436)
                      ..++++++++++++++.++++++++++..+.++++++++|.++++++|++.++++|+++|+|.|++|++|+ .+|||++|
T Consensus        32 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~~~~~NAfa~G  111 (284)
T PRK05457         32 NLGGLLVFAAVFGFGGSFISLLMSKWMAKRSTGAEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYHSPEINAFATG  111 (284)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEeCCCceEEEec
Confidence            36677888899999999999999999999999999998788777778999999999999999999999999 99999999


Q ss_pred             CCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCC
Q psy5221          96 AFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSN  175 (436)
Q Consensus        96 ~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~  175 (436)
                      .++++++|++|+||++.+|++|++||+|||+||++++|+.++.++.++...+..++..++..+...+..   .+.. ..+
T Consensus       112 ~~~~~~~V~vt~gLl~~L~~~El~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~  187 (284)
T PRK05457        112 ASKNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIANGDMVTMTLIQGVVNTFVIFLSRIIAQIVDRFVS---GNEE-GNG  187 (284)
T ss_pred             CCCCCeEEEeehHHhhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cCcc-ccc
Confidence            999899999999999999999999999999999999999988777666566655554444433322221   1111 111


Q ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCccc
Q psy5221         176 YNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFK  255 (436)
Q Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~  255 (436)
                         +..++...++..++.++..++.+++||.+||+||++|++++| |+++++||.|+...+. ...++..+++++.++  
T Consensus       188 ---~~~~~~~~l~~~~~~~~~~ll~~~~SR~rEy~AD~~Aa~ltg-p~~L~~AL~KL~~~~~-~~~~~~~~~~~i~~~--  260 (284)
T PRK05457        188 ---IGYFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE-PQLPGSMAAFGINGK--  260 (284)
T ss_pred             ---HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhC-HHHHHHHHHHHHhhCc-cCCChHHHHhhccCc--
Confidence               222222233334455667788899999999999999999988 7999999999998765 445666777777664  


Q ss_pred             chhhhhhhccCCChHHHHHHHHhc
Q psy5221         256 KKSFLKLFATHPSFEKRIAILSLL  279 (436)
Q Consensus       256 ~~~~~~~~~tHP~~~~RI~~L~~~  279 (436)
                       ..+..|++|||++.+||++|++.
T Consensus       261 -~~~~~lfsTHP~~~eRI~~L~~~  283 (284)
T PRK05457        261 -SGLSELFMSHPPLEKRIAALRSG  283 (284)
T ss_pred             -hhHHHHHcCCcCHHHHHHHHHhC
Confidence             24789999999999999999864


No 2  
>PRK01265 heat shock protein HtpX; Provisional
Probab=100.00  E-value=4.7e-41  Score=322.47  Aligned_cols=252  Identities=25%  Similarity=0.302  Sum_probs=189.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCC
Q psy5221          20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFK   98 (436)
Q Consensus        20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~   98 (436)
                      +.++..+++++++.++.++++++++.++++++.++ |.++++++|++.++++|+++|+|.|++|++|+ .+|||++|.++
T Consensus        42 ~~~~~~~~~~~~~~~~~~~~sp~li~~~~~a~~~~-p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~~~~~NAfa~G~~~  120 (324)
T PRK01265         42 GLILGILIFVFFLNIIQWLFGPYMINAAYRTVEVT-PTDPVYGWLYSIVAEVAKYNGIRVPKVYIADVPFPNAFAYGSPI  120 (324)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhHHHHHHHcCCeeCC-CCCcccHHHHHHHHHHHHHcCCCCCeEEEecCCCCCeEEeccCC
Confidence            33455577788889999999999999999999887 66654566999999999999999999999999 99999999987


Q ss_pred             CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCCCCCCCch
Q psy5221          99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YDDSNYNSNYN  177 (436)
Q Consensus        99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~~~~~~~~~~  177 (436)
                      .++.|++++||++.+++||++||+|||+||++++|+.+++... .+..+..++...+.   ...+++. +.+....    
T Consensus       121 ~~~~Ivvt~gLl~~l~~~El~aVlAHElgHik~~d~~~~~~~~-~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~----  192 (324)
T PRK01265        121 AGKRIAITLPLLKILNRDEIKAVAGHELGHLKHRDVELLMAIG-LIPTLIYYLGYSLF---WGGMFGGGGGGRGNN----  192 (324)
T ss_pred             CCCEEEEehHHHhhCCHHHHHHHHHHHHHHHHcccHHHHHHHH-HHHHHHHHHHHHHH---HHHHhcccccCCCcc----
Confidence            6677999999999999999999999999999999998876543 33333333322211   1122221 1111111    


Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhh-hcCCchHHHHHHHHhhccccc---------CCChhhhhh
Q psy5221         178 HKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAK-LLGTPDPMISALRHLSNISIV---------NDLPKSIAI  247 (436)
Q Consensus       178 ~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~-~~g~~~~l~~aL~kl~~~~~~---------~~~~~~~~~  247 (436)
                       ...+....++..++.+++.++.+++||.+||+||++|++ ++|+|+++++||.|+..+...         ...++..++
T Consensus       193 -~~~l~l~~ll~~~~~~i~~~l~~aiSR~rEy~AD~~aa~~~tg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~~~  271 (324)
T PRK01265        193 -GGLLFLIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPGGAENLQTALAKITLSMDPGALERFKKKSTTNQMASM  271 (324)
T ss_pred             -chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhcCChHHHHHHHHHHHhcccccccccccccccCChhhhh
Confidence             011122223334456667888999999999999999998 569999999999999986542         112344667


Q ss_pred             hcccCcccc------------------hhhhhhhccCCChHHHHHHHHhccc
Q psy5221         248 LGINNNFKK------------------KSFLKLFATHPSFEKRIAILSLLKL  281 (436)
Q Consensus       248 ~~~~~~~~~------------------~~~~~~~~tHP~~~~RI~~L~~~~~  281 (436)
                      +++.++.++                  ..+.++|+|||++++||++|++++.
T Consensus       272 l~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lFsTHPp~e~Ri~rL~~~~~  323 (324)
T PRK01265        272 LFFSNAIEEVPTWDARELVEYWKTTKVPWYADIFSDHPHPAKRIQLLEKLSK  323 (324)
T ss_pred             heeeCcccccccchhhhhhhhhcccchhHHHHHhCCCcChHHHHHHHHHhhh
Confidence            777776422                  2477999999999999999998754


No 3  
>PRK04897 heat shock protein HtpX; Provisional
Probab=100.00  E-value=2.5e-41  Score=325.14  Aligned_cols=253  Identities=28%  Similarity=0.369  Sum_probs=188.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCC
Q psy5221          19 KILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAF   97 (436)
Q Consensus        19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~   97 (436)
                      ...+++++++++++.++.++.+++++.+..++++++ |.  ++|+|++.++++|+++|+|.|+||++++ .+|||++|.+
T Consensus        40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-~~--~~p~L~~~v~~la~~~gip~p~v~v~~~~~~NAfa~G~~  116 (298)
T PRK04897         40 LGGLIIALIIGVIYALIMIFQSTNVVMSMNHAREVT-EE--EAPELWHIVEDMAMVAQIPMPRVFIIDDPSPNAFATGSS  116 (298)
T ss_pred             hhHHHHHHHHHHHHHHHHHHhhHHHHHHhCCCEECC-hh--hhHHHHHHHHHHHHHcCCCCCcEEEecCCCCceEEeccC
Confidence            355677788899999999999999999999999987 33  3456999999999999999999999999 9999999999


Q ss_pred             CCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CC--CCCCCC
Q psy5221          98 KNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YD--DSNYNS  174 (436)
Q Consensus        98 ~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~--~~~~~~  174 (436)
                      +.++.|++|+||++.++++|++||+|||+||++++|+.++++... +..++.++..++..   ..+++. +.  ......
T Consensus       117 ~~~~~v~vt~gLl~~l~~~El~aVlAHElgHi~~~d~~~~~~~~~-~~~~~~~l~~~~~~---~~~~~~~~~~~~~~~~~  192 (298)
T PRK04897        117 PKNAAVAVTTGLLAIMNREELEGVIGHEISHIRNYDIRLSTIAVA-LASAITLLSDIAGR---MMWWGGGSRRRDDDRDG  192 (298)
T ss_pred             CCCcEEEeehHHHhhCCHHHHHHHHHHHHHHHhcCCHHHHHHHHH-HHHHHHHHHHHHHH---HHHhccccccccccccc
Confidence            999999999999999999999999999999999999988776643 33333332222111   111111 01  111111


Q ss_pred             CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC-ChhhhhhhcccCc
Q psy5221         175 NYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND-LPKSIAILGINNN  253 (436)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~-~~~~~~~~~~~~~  253 (436)
                      +...+..++...+...+..+...++.+++||.+||+||++|++++++|+++++||.|+...+.... .....+++++.+|
T Consensus       193 ~~~~~~~~~~~l~~~i~~~~~~~ll~~~~SR~rE~~AD~~A~~lt~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~i~~p  272 (298)
T PRK04897        193 GGLQIILLIVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQPMKHPVDDASAALYISDP  272 (298)
T ss_pred             chhhHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhccccccccChHHHHhccCCC
Confidence            101111122222222333445567789999999999999999999999999999999998754322 2234456677777


Q ss_pred             ccchhhhhhhccCCChHHHHHHHHh
Q psy5221         254 FKKKSFLKLFATHPSFEKRIAILSL  278 (436)
Q Consensus       254 ~~~~~~~~~~~tHP~~~~RI~~L~~  278 (436)
                      ..+..+..||+|||++.+||++|++
T Consensus       273 ~~~~~~~~lfsTHP~~~eRI~~L~~  297 (298)
T PRK04897        273 LKKKGLSKLFDTHPPIEERIERLKN  297 (298)
T ss_pred             cccchHHHHHcCCcCHHHHHHHHHc
Confidence            5444568899999999999999985


No 4  
>PRK02870 heat shock protein HtpX; Provisional
Probab=100.00  E-value=1.8e-40  Score=320.44  Aligned_cols=248  Identities=27%  Similarity=0.411  Sum_probs=186.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCC---cchhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEe
Q psy5221          20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSN---FTEDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFAT   94 (436)
Q Consensus        20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~---~~~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~   94 (436)
                      .++++.+++++++.++.||++.+++.+.++++++. |.+   .++++|++.++++|.++|+| .|+||++++ .+|||++
T Consensus        71 ~~~~~~~~~~~~~~~~~y~~~~~~~l~~~~a~~I~-~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~~~~~NAFA~  149 (336)
T PRK02870         71 TATLIMSLVAVISILVTFQNFDKIMLSGTEYKEIT-PENALSLQERQLYNVVEELLVAAGLRFMPKVYIIDAPYMNAFAS  149 (336)
T ss_pred             hHHHHHHHHHHHHHHHHHHhhHHHHHHHcCCEEcC-CCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEcCCCCceEEe
Confidence            45666778899999999999999999999999997 322   23567999999999999999 899999999 9999999


Q ss_pred             CCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCC
Q psy5221          95 GAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNS  174 (436)
Q Consensus        95 G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~  174 (436)
                      |.+++++.|++++||++.++++|++||||||+||++|+|+..+.+...+. .+..++..   ... ..++|..++ .+. 
T Consensus       150 G~~~~~~~Ivvt~GLL~~L~~dEL~aVlAHELgHik~~di~~~~~~~~l~-~~~~~~~~---~~~-~~~~g~~~~-~~~-  222 (336)
T PRK02870        150 GYSEKSAMVAITTGLLEKLDRDELQAVMAHELSHIRHGDIRLTLCVGVLS-NIMLIVAD---FLF-YSFMGNRRN-SGA-  222 (336)
T ss_pred             cCCCCCcEEEEehHHhhhCCHHHHHHHHHHHHHHHHcccHHHHHHHHHHH-HHHHHHHH---HHH-HHHhcCCcc-ccc-
Confidence            99887888999999999999999999999999999999997765543322 22222211   111 112231111 111 


Q ss_pred             CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC---------Chhhh
Q psy5221         175 NYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND---------LPKSI  245 (436)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~---------~~~~~  245 (436)
                      . ...   ....++..++.++..++.+++||.+||+||++|++++++|+++++||.|++..+....         ..+..
T Consensus       223 ~-~~~---~~~l~l~~~~~~~~~ll~~~iSR~rEy~AD~~Aa~ltg~p~aLasAL~KL~~~~~~~~~~~~~~~~~~~~~~  298 (336)
T PRK02870        223 N-RAR---MIILILRYVLPILTVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDHAQNDEQYAYKHTDHESTR  298 (336)
T ss_pred             c-hhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhccccCccccccccccCChhh
Confidence            1 001   1112333344455667889999999999999999999999999999999998764433         12344


Q ss_pred             hhhcccCcccc--hhhhhhhccCCChHHHHHHHHhc
Q psy5221         246 AILGINNNFKK--KSFLKLFATHPSFEKRIAILSLL  279 (436)
Q Consensus       246 ~~~~~~~~~~~--~~~~~~~~tHP~~~~RI~~L~~~  279 (436)
                      +++++.+|...  ....++|+|||++++||++|+..
T Consensus       299 a~~~i~~p~~~~~~~~~~LfsTHPp~e~RI~rL~~~  334 (336)
T PRK02870        299 RAAYLFDPAGISPGSLSDAFSTHPSIENRLAALGGK  334 (336)
T ss_pred             hhhhccCCcccccccHhHHHcCCCCHHHHHHHHhhc
Confidence            56778777531  24678999999999999999864


No 5  
>PRK03072 heat shock protein HtpX; Provisional
Probab=100.00  E-value=2.2e-40  Score=316.77  Aligned_cols=248  Identities=24%  Similarity=0.325  Sum_probs=187.9

Q ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC
Q psy5221          21 LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN   99 (436)
Q Consensus        21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~   99 (436)
                      .+++++++++++.+++++++++++.+.+++++++ |.+  +|+|++.++++|+++|+|.|++|++|+ .+|||++|..++
T Consensus        32 ~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~-~~~--~p~L~~~v~~la~~~g~p~p~vyv~~~~~~NAFa~G~~~~  108 (288)
T PRK03072         32 GLGIAVLIAVGMNAYVYWNSDKLALRAMHAQPVS-EVQ--APAMYRIVRELSTAARQPMPRLYISPTAAPNAFATGRNPR  108 (288)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECC-hhh--hHHHHHHHHHHHHHcCCCCCCEEEecCCCCceEEecCCCC
Confidence            4566678888999999999999999999999986 544  345999999999999999999999999 999999998777


Q ss_pred             ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhH
Q psy5221         100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHK  179 (436)
Q Consensus       100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~  179 (436)
                      +..|++|+||++.++++|++||||||+||++++|+.++.+...+. .++.++.....++   ..++. ......  ...+
T Consensus       109 ~~~v~vt~gLl~~l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~-~~i~~l~~~~~~~---~~~~~-~~~~~~--~~~~  181 (288)
T PRK03072        109 NAAVCCTEGILQILNERELRGVLGHELSHVYNRDILISSVAGALA-SVITYLANMAMFA---GMFGG-RRDNDG--PNPL  181 (288)
T ss_pred             CcEEEecHHHHHhCCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH-HHHHHHHHHHHHH---HHhcc-cccccc--chHH
Confidence            788999999999999999999999999999999999877654322 2222222111111   11111 111111  1111


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCCh-----hhhhhhcccCcc
Q psy5221         180 IYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLP-----KSIAILGINNNF  254 (436)
Q Consensus       180 ~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~-----~~~~~~~~~~~~  254 (436)
                      +.+    ++..+..+...++.+.+||.+||+||++|++++++|+++++||.|+...+.....+     +...++++.+|.
T Consensus       182 ~~~----~~~l~~~~~~~~~~~~~SR~rE~~AD~~A~~l~~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~~~~~i~~p~  257 (288)
T PRK03072        182 ALL----LVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPF  257 (288)
T ss_pred             HHH----HHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHhCCHHHHHHHHHHHHhccccCCCCccccchhhhhhhhcCcc
Confidence            111    22223334566778999999999999999999999999999999999876433322     245667777775


Q ss_pred             cchhhhhhhccCCChHHHHHHHHhcccc
Q psy5221         255 KKKSFLKLFATHPSFEKRIAILSLLKLS  282 (436)
Q Consensus       255 ~~~~~~~~~~tHP~~~~RI~~L~~~~~~  282 (436)
                      .+..+..+++|||++.+||++|+++..+
T Consensus       258 ~~~~~~~l~sTHP~~~eRI~~L~~~~~~  285 (288)
T PRK03072        258 RAGGIGRLFSTHPPMADRIARLEQMAGR  285 (288)
T ss_pred             cchHHHHHHcCCcCHHHHHHHHHHHhhh
Confidence            4446889999999999999999987665


No 6  
>PRK02391 heat shock protein HtpX; Provisional
Probab=100.00  E-value=2.1e-40  Score=317.24  Aligned_cols=250  Identities=25%  Similarity=0.377  Sum_probs=185.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCC
Q psy5221          20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFK   98 (436)
Q Consensus        20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~   98 (436)
                      ..+++++++++++.++.++++++++.+.+++++++ |.++  |+|++.++++|+++|+|.|++|++++ .+|||++|.++
T Consensus        37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--p~L~~~v~~la~~~~~~~p~v~v~~~~~~NAfa~G~~~  113 (296)
T PRK02391         37 VSLVLIVVIAGGFLLAQYFFSDKLALWSMGARIVS-EDEY--PELHAMVERLCALADLPKPRVAVADSDVPNAFATGRSP  113 (296)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhHHHHHHHcCCEECC-hhhC--HHHHHHHHHHHHHcCCCCCcEEEEeCCCCceEEecCCC
Confidence            34666777888889999999999999999999887 5444  34999999999999999999999999 99999999998


Q ss_pred             CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchh
Q psy5221          99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNH  178 (436)
Q Consensus        99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~  178 (436)
                      ++++|++|+||++.+|++|++||+|||+||++++|+.++.+... +..+..++.   .+.   .+++........ . . 
T Consensus       114 ~~~~V~vt~gLl~~L~~~El~aVlaHElgHi~~~di~~~~i~~~-~~~~~~~l~---~~~---~~~~~~~~~~~~-~-~-  183 (296)
T PRK02391        114 KNAVVCVTTGLMRRLDPDELEAVLAHELSHVKNRDVAVMTIASF-LSTIAFLIV---RWG---FYFGGFGGRGGG-G-G-  183 (296)
T ss_pred             CCcEEEecHHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHH-HHHHHHHHH---HHH---HHhccccCCCCc-c-c-
Confidence            89999999999999999999999999999999999988766532 222222211   111   111110000000 0 0 


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCC-----hhhhhhhcccCc
Q psy5221         179 KIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL-----PKSIAILGINNN  253 (436)
Q Consensus       179 ~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~-----~~~~~~~~~~~~  253 (436)
                      ...++...++..+..++..++.+.+||.+||+||++|++++|+|+++++||.|+.......+.     ......+++.++
T Consensus       184 ~~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~i~p~  263 (296)
T PRK02391        184 GGGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGRPSALASALMKISGRMDRVPTEDLREAEGMNAFFIIPA  263 (296)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHcccccCCchhhcccHHHhhhcccCC
Confidence            111222233334455556677899999999999999999999999999999999976422211     122234445444


Q ss_pred             ccchhhhhhhccCCChHHHHHHHHhcccc
Q psy5221         254 FKKKSFLKLFATHPSFEKRIAILSLLKLS  282 (436)
Q Consensus       254 ~~~~~~~~~~~tHP~~~~RI~~L~~~~~~  282 (436)
                      ..+..+..|++|||++.+||++|++++.+
T Consensus       264 ~~~~~~~~l~sTHP~~~eRI~~L~~~~~~  292 (296)
T PRK02391        264 LSGGSLGRLFSTHPPLEKRIAQLEKLERE  292 (296)
T ss_pred             CCcchHHHHhcCCcCHHHHHHHHHHHHHh
Confidence            23345788999999999999999987655


No 7  
>PRK03982 heat shock protein HtpX; Provisional
Probab=100.00  E-value=3.8e-39  Score=309.55  Aligned_cols=251  Identities=28%  Similarity=0.380  Sum_probs=184.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC
Q psy5221          21 LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN   99 (436)
Q Consensus        21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~   99 (436)
                      ++++.+++++++.++.+++++++..+.+++++++ +.+  +|+|++.++++|+++|++.|++|++|+ .+|||++|++++
T Consensus        30 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~l~-~~~--~p~L~~~v~~la~~~g~~~p~v~v~~~~~~NAfa~G~~~~  106 (288)
T PRK03982         30 GPIIAILLALIPNLISYYYSDKIVLASYNARIVS-EEE--APELYRIVERLAERANIPKPKVAIVPTQTPNAFATGRDPK  106 (288)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECC-hhh--hHHHHHHHHHHHHHcCCCCCeEEEEeCCCcceEEeccCCC
Confidence            3445566677778889999999999999999887 333  455999999999999999999999999 999999999988


Q ss_pred             ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCCCCCCCchh
Q psy5221         100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YDDSNYNSNYNH  178 (436)
Q Consensus       100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~~~~~~~~~~~  178 (436)
                      +++|++++||++.+|+||++||+|||+||++++|+.++.+...+. ..+.++..++   ....+++. +.+.... + . 
T Consensus       107 ~~~V~vt~gLl~~l~~~El~AVlAHElgHi~~~h~~~~~~~~~~~-~~~~~l~~~~---~~~~~~~~~~~~~~~~-~-~-  179 (288)
T PRK03982        107 HAVVAVTEGILNLLNEDELEGVIAHELTHIKNRDTLIQTIAATLA-GAIMYLAQWL---SWGLWFGGGGRDDRNG-G-N-  179 (288)
T ss_pred             CeEEEeehHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH-HHHHHHHHHH---HHHHHhcccCcccccc-c-h-
Confidence            888999999999999999999999999999999998877664322 2222222111   11122221 1111111 0 1 


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCC---hhhhhhhcccCccc
Q psy5221         179 KIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL---PKSIAILGINNNFK  255 (436)
Q Consensus       179 ~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~---~~~~~~~~~~~~~~  255 (436)
                         ++...+...+..+...++.+.+||.+||+||++|++++++|+++++||.|++..+...+.   ....+++++.+|..
T Consensus       180 ---~~~~~~~~~~~~~~~~l~~~~~SR~~E~~AD~~A~~~~~~p~~l~~aL~kL~~~~~~~~~~~~~~~~~~~~~~~p~~  256 (288)
T PRK03982        180 ---PIGSLLLIILAPIAATLIQFAISRQREFSADEGGARLTGNPLALANALQKLEKGVRYIPLKNGNPATAHMFIINPFR  256 (288)
T ss_pred             ---HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhhhccCCCCCCCHHHHhHhhcCCCC
Confidence               111112223333445677899999999999999999999999999999999876432221   12234455555544


Q ss_pred             chhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221         256 KKSFLKLFATHPSFEKRIAILSLLKLSLY  284 (436)
Q Consensus       256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~~  284 (436)
                      +..+..+++|||++.+||++|++.++++.
T Consensus       257 ~~~~~~l~sTHP~~~eRI~~l~~~~~~~~  285 (288)
T PRK03982        257 GQFLANLFSTHPPTEERIERLLEMAQEMG  285 (288)
T ss_pred             CchhhHHhCCCcCHHHHHHHHHHHHHhcc
Confidence            44578999999999999999998877654


No 8  
>PRK01345 heat shock protein HtpX; Provisional
Probab=100.00  E-value=5.2e-39  Score=310.49  Aligned_cols=248  Identities=23%  Similarity=0.307  Sum_probs=184.7

Q ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCc
Q psy5221          22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNS  100 (436)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~  100 (436)
                      +++++++++++.++.+|.+++++.+.+++++++ |.+.  |+|++.++++|+++|++.|+||++|+ .+|||++|.++.+
T Consensus        30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-~~~~--p~L~~~v~~La~~agi~~p~v~vid~~~~NAFa~G~~~~~  106 (317)
T PRK01345         30 MMIALVIAAGMNLFSYWNSDKMVLRMYGAQEVD-ERSA--PELYRMVRDLARRAGLPMPKVYIIDNPQPNAFATGRNPEN  106 (317)
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHcCCeECC-cccC--HHHHHHHHHHHHHcCCCCCcEEEEcCCCcceEEecCCCCC
Confidence            556677888889999999999999999999887 4443  34999999999999999999999999 9999999998888


Q ss_pred             cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHH
Q psy5221         101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI  180 (436)
Q Consensus       101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~  180 (436)
                      +.|++|+||++.+++||++||||||+||++++|+.++.+...+...+ ..+    ..+.  .+++.....  ..++  ..
T Consensus       107 ~~V~vt~gLL~~L~~dEL~aVlAHElgHi~~~d~~~~~l~~~~~~~~-~~l----~~~~--~~~~~~~~~--~~~~--~~  175 (317)
T PRK01345        107 AAVAATTGLLQRLSPEEVAGVMAHELAHVKNRDTLTMTITATLAGAI-SML----ANFA--FFFGGNREN--NNGP--LG  175 (317)
T ss_pred             eEEEechHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-HHH----HHHH--HHhcCCccc--ccch--HH
Confidence            89999999999999999999999999999999998876654322111 111    1110  112111111  1111  11


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccC-----CChhhhhhhcccCccc
Q psy5221         181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVN-----DLPKSIAILGINNNFK  255 (436)
Q Consensus       181 ~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~-----~~~~~~~~~~~~~~~~  255 (436)
                       ++...+...+..+...++.+.+||.+||+||++|++++++|++++++|.|++......     ..+....++++.++..
T Consensus       176 -~i~~~~~~~~~~~~~~l~~~~~SR~rE~~AD~~A~~ltg~p~~L~~AL~KL~~~~~~~~~~~~~~~~~~~~~~~~~~~~  254 (317)
T PRK01345        176 -LVGTLAAMIVAPLAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERGAHGVPNEEAERNPATAHMFIINPLS  254 (317)
T ss_pred             -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhhhccCcccccccChHHHHHHhcCCcc
Confidence             1111122223334445677899999999999999999999999999999999764221     1123445666666654


Q ss_pred             chhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221         256 KKSFLKLFATHPSFEKRIAILSLLKLSLY  284 (436)
Q Consensus       256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~~  284 (436)
                      ......+|+|||++.+||++|+++++++.
T Consensus       255 ~~~~~~lfsTHP~~~eRI~~L~~~~~~~~  283 (317)
T PRK01345        255 GEGMDNLFSTHPATENRIAALQRMAGEMG  283 (317)
T ss_pred             ccchhHHhcCCcChHHHHHHHHHHHHhcC
Confidence            44567899999999999999999887764


No 9  
>PRK03001 M48 family peptidase; Provisional
Probab=100.00  E-value=5.8e-38  Score=300.64  Aligned_cols=246  Identities=24%  Similarity=0.361  Sum_probs=179.0

Q ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCc
Q psy5221          22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNS  100 (436)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~  100 (436)
                      +++++++++++.++.++++++++.+.++++++++ .  +.|.|++.++++|+++|+|.|++|++|+ .+|||++|.++.+
T Consensus        30 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~l~~-~--~~p~L~~~v~~la~~~g~~~p~v~v~~~~~~NAfa~G~~~~~  106 (283)
T PRK03001         30 MLIALLFALGMNFFSYWFSDKMVLKMYNAQEVDE-N--TAPQFYRMVRELAQRAGLPMPKVYLINEDQPNAFATGRNPEH  106 (283)
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHcCCEECCc-c--ccHHHHHHHHHHHHHcCCCCCeEEEecCCCcceEEecCCCCC
Confidence            4445667778889999999999999999987662 2  2345999999999999999999999999 9999999998877


Q ss_pred             cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHH
Q psy5221         101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI  180 (436)
Q Consensus       101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~  180 (436)
                      +.|++++||++.+|++|++||||||+||++++|+.++.+...+...+ ..    +..+.  .+++.... +.. ....+.
T Consensus       107 ~~Ivvt~gLl~~l~~~El~aVlAHElgHi~~~h~~~~~~~~~~~~~~-~~----~~~~~--~~~~~~~~-~~~-~~~~~~  177 (283)
T PRK03001        107 AAVAATTGILRVLSEREIRGVMAHELAHVKHRDILISTISATMAGAI-SA----LANFA--MFFGGRDE-NGR-PVNPIA  177 (283)
T ss_pred             eEEEecHHHHhhCCHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHH-HH----HHHHH--HHhcCCCc-ccc-ccchHH
Confidence            88999999999999999999999999999999998876543221111 11    11111  11221011 111 001111


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC-----ChhhhhhhcccCccc
Q psy5221         181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND-----LPKSIAILGINNNFK  255 (436)
Q Consensus       181 ~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~-----~~~~~~~~~~~~~~~  255 (436)
                      .+    ++..+......++.+.+||.+||+||++|++++++|+++++||.|+...+...+     ......++++.+|..
T Consensus       178 ~~----~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~~p~~l~~AL~Kl~~~~~~~p~~~~~~~~~~~~~~~~~p~~  253 (283)
T PRK03001        178 GI----AVAILAPLAASLIQMAISRAREFEADRGGARISGDPQALASALDKIHRYASGIPFQAAEAHPATAQMMIINPLS  253 (283)
T ss_pred             HH----HHHHHHHHHHHHHHHHHhHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhhhccCCcccccCCHHHHHHHhcCCCC
Confidence            11    112222334556778999999999999999999999999999999998764321     111223455555543


Q ss_pred             chhhhhhhccCCChHHHHHHHHhccccc
Q psy5221         256 KKSFLKLFATHPSFEKRIAILSLLKLSL  283 (436)
Q Consensus       256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~  283 (436)
                      +..+..+++|||++.+||++|++.+.+.
T Consensus       254 ~~~~~~l~~THP~~~eRI~~l~~~~~~~  281 (283)
T PRK03001        254 GGGLANLFSTHPSTEERIARLMAMARTG  281 (283)
T ss_pred             cchHHHHHcCCcCHHHHHHHHHHHHHhc
Confidence            3457899999999999999999876653


No 10 
>PRK11597 heat shock chaperone IbpB; Provisional
Probab=100.00  E-value=8.9e-33  Score=233.92  Aligned_cols=139  Identities=41%  Similarity=0.741  Sum_probs=121.6

Q ss_pred             ccccccccCccCcchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc
Q psy5221         291 MTVSAFAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT  370 (436)
Q Consensus       291 m~~~~~~p~~~~~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~  370 (436)
                      |+...++||++...+||+|++.|.+..  ....+|++||+++++++|+|.++|||++||||+|++++|.|+|+|+++.++
T Consensus         1 ~~~~~~~~~~~~~~~~d~l~~~~~~~~--~~~~~P~vdI~e~~~~~y~v~adlPGv~kedi~V~v~~~~LtI~ge~~~~~   78 (142)
T PRK11597          1 MRNYDLSPLLRQWIGFDKLANALQNAG--ESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLEGTRLTVKGTPEQPE   78 (142)
T ss_pred             CCccccchhhcccccHHHHHHHhcccC--ccCCCCcEEEEEcCCCEEEEEEEeCCCCHHHeEEEEECCEEEEEEEEcccc
Confidence            455678999999999999998776532  245689999995467899999999999999999999999999999987766


Q ss_pred             CCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCC
Q psy5221         371 QQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVPEALKPRKIYINTTKN  431 (436)
Q Consensus       371 ~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~  431 (436)
                      ++.+|+++||.+|+|+|+|.||++||.-+|+|+||||+|++||..+++.++|+|+|+..++
T Consensus        79 ~~~~~~~~Er~~g~F~R~f~LP~~vd~~~A~~~nGVL~I~lPK~~~~~~~~rkI~I~~~~~  139 (142)
T PRK11597         79 KEVKWLHQGLVNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEPEAIAPQRIAISERPA  139 (142)
T ss_pred             CCCcEEEEEEeCcEEEEEEECCCCcccCcCEEcCCEEEEEEeccCccccCCcEEEECCccc
Confidence            7789999999999999999999999977999999999999999655777899999986553


No 11 
>PRK10743 heat shock protein IbpA; Provisional
Probab=100.00  E-value=5.2e-32  Score=228.91  Aligned_cols=136  Identities=49%  Similarity=0.862  Sum_probs=118.8

Q ss_pred             cccccccCccCcchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC
Q psy5221         292 TVSAFAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ  371 (436)
Q Consensus       292 ~~~~~~p~~~~~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~  371 (436)
                      +..+++|++++..+||+|++.|..........+|++||+++++++|+|.++|||++||||+|++++|.|+|+|+++.+.+
T Consensus         2 ~~~~~~~~~~~~~~~d~lf~~~~~~~~~~~~~~p~~di~ee~~~~~~v~aelPGv~kedi~V~v~~~~LtI~ge~~~~~~   81 (137)
T PRK10743          2 RNFDLSPLYRSAIGFDRLFNLLENNQSQSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITAQDNLLVVKGAHADEQK   81 (137)
T ss_pred             CccccChhhhcccCHHHHhhhhhhhhhcccCCCCcEEEEEcCCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEECcccc
Confidence            34578888888899999988887643333455699999954899999999999999999999999999999999887777


Q ss_pred             CCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEecCCCCCCCCeEeeec
Q psy5221         372 QRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVPEALKPRKIYIN  427 (436)
Q Consensus       372 ~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~  427 (436)
                      +.+|+++||.+|+|+|+|.||++||.-+|+|+||||+|++||..+++.++|+|+|+
T Consensus        82 ~~~~~~~Er~~g~F~R~~~LP~~Vd~~~A~~~dGVL~I~lPK~~~~~~~~r~I~I~  137 (137)
T PRK10743         82 ERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVIPEAKKPRRIEIN  137 (137)
T ss_pred             CCcEEEEEEECCEEEEEEECCCCcccCcCEEeCCEEEEEEeCCCccccCCeEEeeC
Confidence            78899999999999999999999996679999999999999965677789999985


No 12 
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.97  E-value=5.5e-30  Score=249.19  Aligned_cols=240  Identities=30%  Similarity=0.458  Sum_probs=174.0

Q ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCC---CcchhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEeCC
Q psy5221          22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPS---NFTEDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFATGA   96 (436)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~---~~~~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~G~   96 (436)
                      +++..+++....++.+++++.+..+.++...+..+.   ...+.++...+.+++.+.+.+ .|+++++++ .+|||++|.
T Consensus        56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~~~~~NAFa~g~  135 (302)
T COG0501          56 LLIALLLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILETPQPNAFALGG  135 (302)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEecCCCccceecCC
Confidence            557777777888899999999999999886665332   222344556999999999999 899999999 999999998


Q ss_pred             CCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCc
Q psy5221          97 FKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNY  176 (436)
Q Consensus        97 ~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~  176 (436)
                      ++.++.|++++||++.+|+||++||||||+||++++|+.++.............+.   .........+     .     
T Consensus       136 ~~~~~~V~vt~gLl~~l~~dEl~aVlaHElgHi~~rd~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-----~-----  202 (302)
T COG0501         136 GPKNGRVVVTTGLLDLLNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASAFVLLA---TLALAAGLLG-----E-----  202 (302)
T ss_pred             CCCCeeEEecHHHHhhCCHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHH---HHHHHHHhhc-----c-----
Confidence            87778899999999999999999999999999999999984444333333222221   1111110000     0     


Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccc
Q psy5221         177 NHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKK  256 (436)
Q Consensus       177 ~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~  256 (436)
                         .......+........+..+.+.+||.+|++||++|+++++ |+.++++|.|+...+...     .......++..+
T Consensus       203 ---~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~-~~~l~~aL~kl~~~~~~~-----~~~~~~~~~~~~  273 (302)
T COG0501         203 ---AALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQKLARLSGRA-----NSKAFIASGFSG  273 (302)
T ss_pred             ---hHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhchhhccC-hHHHHHHHHHHHhhhccc-----chhhhhcCcccc
Confidence               11222334444555667788999999999999999999999 799999999999865320     011111122111


Q ss_pred             hhhhhhhccCCChHHHHHHHHhccccc
Q psy5221         257 KSFLKLFATHPSFEKRIAILSLLKLSL  283 (436)
Q Consensus       257 ~~~~~~~~tHP~~~~RI~~L~~~~~~~  283 (436)
                      .....+++|||++.+||++|++.....
T Consensus       274 ~~~~~~~stHP~~~~Ri~~L~~~~~~~  300 (302)
T COG0501         274 GRLQALFSTHPPLAERIAALRQLALTV  300 (302)
T ss_pred             cchHHHHhcCCCHHHHHHHHHHHHHhh
Confidence            223689999999999999999876543


No 13 
>KOG2719|consensus
Probab=99.97  E-value=9.2e-30  Score=246.48  Aligned_cols=224  Identities=17%  Similarity=0.218  Sum_probs=170.3

Q ss_pred             HHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-----CccEEEeCCCCCccEEEEc
Q psy5221          32 GSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-----EINAFATGAFKNSSLIAIS  106 (436)
Q Consensus        32 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-----~~NA~~~G~~~~~~~v~it  106 (436)
                      +.++...+.+-......+..   .|.+++.  |++.++++|++.|+|..+++++++     ..||+.+|++..+|+++.+
T Consensus       192 ~~l~l~ti~p~~i~P~f~K~---~PLe~g~--l~~~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyD  266 (428)
T KOG2719|consen  192 LSLLLLTIYPGFIAPLFGKF---TPLEEGD--LKEKIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYD  266 (428)
T ss_pred             HHHHHHHHHHHhhhhhhcCC---CCCCCCc--hHHHHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEeh
Confidence            33444444555555555554   4787777  999999999999999999999994     9999999999999999999


Q ss_pred             HHHH--hh-cCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hhhccCCCCCCCCCchhHHH
Q psy5221         107 ANLL--ES-MTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDC--FLLNSYDDSNYNSNYNHKIY  181 (436)
Q Consensus       107 ~gLl--~~-l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~g~~~~~~~~~~~~~~~~  181 (436)
                      + |+  +. +++||+.||+|||+||++++|+.++.++..+...+.+++   ++.+..+  .+...|...   ..+..++.
T Consensus       267 t-Ll~~~~~~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~ll---F~~~~~~~~ly~a~Gf~~---~~P~~ig~  339 (428)
T KOG2719|consen  267 T-LLLEEEHLNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLL---FGFLYRNPKLYAAFGFID---EQPSLIGF  339 (428)
T ss_pred             h-hhhhhhccccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHH---HHHHHcCcchheeecCCC---CCcchhHH
Confidence            8 55  33 489999999999999999999999888755433333222   2222221  111111211   12344566


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhh
Q psy5221         182 GLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLK  261 (436)
Q Consensus       182 ~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  261 (436)
                      ++.+.....+...+....++..||+.||+||++|.++ |..+.+.+||.|+..++.+...+++             .|+.
T Consensus       340 livf~~~l~py~~l~~~~~n~~sR~fEyqAD~fA~kl-GYg~~L~~AL~KL~~dnlsf~~~D~-------------LYs~  405 (428)
T KOG2719|consen  340 LIVFQFVLAPYRALLNFLMNLISRRFEYQADAFAKKL-GYGKDLRQALIKLFVDNLSFPVSDP-------------LYSA  405 (428)
T ss_pred             HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHc-CCchhHHHHHHHHhhhhcCCCCCcH-------------HHHH
Confidence            6555566666777888899999999999999999999 7779999999999999877666664             4999


Q ss_pred             hhccCCChHHHHHHHHhccc
Q psy5221         262 LFATHPSFEKRIAILSLLKL  281 (436)
Q Consensus       262 ~~~tHP~~~~RI~~L~~~~~  281 (436)
                      |..|||++.|||++++....
T Consensus       406 ~~~~HPtvleRl~~l~~~~~  425 (428)
T KOG2719|consen  406 WHHLHPTVLERLDALDYLSE  425 (428)
T ss_pred             HHhcCchHHHHHHHHHHHHh
Confidence            99999999999999987654


No 14 
>COG0071 IbpA Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=99.95  E-value=4.1e-27  Score=203.48  Aligned_cols=124  Identities=35%  Similarity=0.659  Sum_probs=109.0

Q ss_pred             cchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecc--cCCCceEEEee
Q psy5221         303 TIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQAD--TQQRNFLHRGI  380 (436)
Q Consensus       303 ~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~--~~~~~~~~~e~  380 (436)
                      ..+||++++.+.+.......++|++||++ +++.|+|.++||||+|+||+|+++++.|+|+|+++.+  .++..++++|+
T Consensus        19 ~~~fd~~~~~~~~~~~~~~~~~P~vdi~e-~~~~~~I~~elPG~~kedI~I~~~~~~l~I~g~~~~~~~~~~~~~~~~e~   97 (146)
T COG0071          19 SPGFDRLFREFGNLPESRPTGTPPVDIEE-TDDEYRITAELPGVDKEDIEITVEGNTLTIRGEREEEEEEEEEGYLRRER   97 (146)
T ss_pred             ccchhhhhhhhhcccccccCCCCcEEEEE-cCCEEEEEEEcCCCChHHeEEEEECCEEEEEEEecccccccCCceEEEEE
Confidence            35788888888765433335789999995 8899999999999999999999999999999999863  56789999999


Q ss_pred             eeeeEEEEEECCCccc--eEeEEeeCcEEEEEEecCCCCCCCCeEeeec
Q psy5221         381 ATRNFEHRFQLADHVH--VVAAFFDQGILNIDLIREVPEALKPRKIYIN  427 (436)
Q Consensus       381 ~~~~f~r~~~lP~~v~--~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~  427 (436)
                      .+|+|+|+|.||+.|+  .++|+|+||+|+|++||..++..++++|+|+
T Consensus        98 ~~~~f~r~~~Lp~~v~~~~~~A~~~nGvL~I~lpk~~~~~~~~~~i~I~  146 (146)
T COG0071          98 AYGEFERTFRLPEKVDPEVIKAKYKNGLLTVTLPKAEPEEKKPKRIEIE  146 (146)
T ss_pred             EeeeEEEEEECcccccccceeeEeeCcEEEEEEeccccccccCceeecC
Confidence            9999999999999999  5999999999999999976666678999885


No 15 
>PF01435 Peptidase_M48:  Peptidase family M48 This is family M48 in the peptidase classification. ;  InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated.  The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases [].  HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.94  E-value=5.9e-27  Score=218.22  Aligned_cols=185  Identities=30%  Similarity=0.466  Sum_probs=95.5

Q ss_pred             chhHHHHHHHHHHHhc--CCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221          60 TEDWLLKTVNKISKNA--NISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus        60 ~~~~L~~~l~~l~~~~--g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~  136 (436)
                      +.++|++.++++|+++  +.+.+++|++++ .+|||++|.++ ++.|+++.||++.++++|++||||||+||++++|..+
T Consensus        29 ~~~~L~~~v~~l~~~~~~~~~~~~v~v~~~~~~NA~~~g~~~-~~~I~v~~~ll~~~~~~el~aVlaHElgH~~~~h~~~  107 (226)
T PF01435_consen   29 EDPELRRIVEELARRAGLGIPPPRVYVIDSPSPNAFATGGGP-RKRIVVTSGLLESLSEDELAAVLAHELGHIKHRHILK  107 (226)
T ss_dssp             HHHHHHHHHHHHHHHHHCTSS--EEEEE--SSEEEEEETTTC---EEEEEHHHHHHSSHHHHHHHHHHHHHHHHTTHCCC
T ss_pred             HHHHHHHHHHHHHHHhcCCCCCCeEEEEcCCCCcEEEEccCC-CcEEEEeChhhhcccHHHHHHHHHHHHHHHHcCCcch
Confidence            3456999999999999  888889999999 99999999887 6779999999999999999999999999999999876


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhh
Q psy5221         137 MLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSA  216 (436)
Q Consensus       137 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~  216 (436)
                      ......+...+..+....+.... ....              ...  .......++..+..++.+.+||.+|++||++|+
T Consensus       108 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~--------------~~~--~~~~~~~~~~~~~~~~~~~~sr~~E~~AD~~a~  170 (226)
T PF01435_consen  108 SLLISLLLSILFFALLALLIGSM-SLFS--------------AFG--FIDILGILIAFLFQLLTNAFSRRQEYEADRYAA  170 (226)
T ss_dssp             CCCHHH-HHHHHHHHHT-----H-HHHH--------------HHH------------HHSTT------HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhh-hhhh--------------hhh--hcccchhhHHHHHHHhhcchhHHHHHHHHHHHH
Confidence            53332222111111110000000 0000              000  000000111134456779999999999999999


Q ss_pred             hhcCCchHH----------HHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHH
Q psy5221         217 KLLGTPDPM----------ISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAI  275 (436)
Q Consensus       217 ~~~g~~~~l----------~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~  275 (436)
                      +++|++..+          .++|.++...+.....             ....+..+++|||++.+||++
T Consensus       171 ~~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~-------------~~~~~~~~~~tHP~~~~Ri~~  226 (226)
T PF01435_consen  171 RLGGDPALLARALYKPAAAISALEKLAEANSMRPD-------------SDWRYSSLFSTHPSTEERIAA  226 (226)
T ss_dssp             HH------HHHTTS-TTHHHHHHHHHT------------------------------------HHHHHH
T ss_pred             HhcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccC-------------CccccchhcCCCcCHHHHhCc
Confidence            998885554          4455555442111111             123467899999999999985


No 16 
>cd06470 ACD_IbpA-B_like Alpha-crystallin domain (ACD) found in Escherichia coli inclusion body-associated proteins IbpA and IbpB, and similar proteins.  IbpA and IbpB are 16 kDa small heat shock proteins (sHsps). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. IbpA and IbpB are produced during high-level production of various heterologous proteins, specifically human prorenin, renin and bovine insulin-like growth factor 2 (bIGF-2), and are strongly associated with inclusion bodies containing these heterologous proteins. IbpA and IbpB work as an integrated system to stabilize thermally aggregated proteins in a disaggregation competent state.  The chaperone activity of IbpB is also significantly elevated as the temperature increases from normal to heat shock. The high temperature results in the disassociation of 2-3-MDa IbpB oligomers into smaller approximately 6
Probab=99.91  E-value=1.3e-23  Score=166.17  Aligned_cols=89  Identities=56%  Similarity=0.960  Sum_probs=82.3

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc-CCCceEEEeeeeeeEEEEEECCCccceEeEEee
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT-QQRNFLHRGIATRNFEHRFQLADHVHVVAAFFD  403 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~-~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~  403 (436)
                      |++||+++++++|+|.++|||++||||+|+++++.|+|+|+++.+. ++.+|+++|+.+|+|+|+|.||++++..+|+|+
T Consensus         1 p~~di~e~~~~~~~v~~~lPG~~kedi~v~~~~~~L~I~g~~~~~~~~~~~~~~~e~~~g~f~R~~~LP~~vd~~~A~~~   80 (90)
T cd06470           1 PPYNIEKTGENNYRITLAVAGFSEDDLEIEVENNQLTVTGKKADEENEEREYLHRGIAKRAFERSFNLADHVKVKGAELE   80 (90)
T ss_pred             CCeeeEEcCCCeEEEEEECCCCCHHHeEEEEECCEEEEEEEEcccccCCCcEEEEEEeceEEEEEEECCCCceECeeEEe
Confidence            7899995345899999999999999999999999999999988764 567899999999999999999999998899999


Q ss_pred             CcEEEEEEec
Q psy5221         404 QGILNIDLIR  413 (436)
Q Consensus       404 ~GvL~i~~pk  413 (436)
                      ||+|+|++|+
T Consensus        81 ~GvL~I~l~~   90 (90)
T cd06470          81 NGLLTIDLER   90 (90)
T ss_pred             CCEEEEEEEC
Confidence            9999999986


No 17 
>cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st
Probab=99.89  E-value=1.4e-22  Score=161.18  Aligned_cols=87  Identities=20%  Similarity=0.450  Sum_probs=79.5

Q ss_pred             ceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEEecc--cCCCceEEEeeeeeeEEEEEECCCccc--eEeE
Q psy5221         326 PYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRKQAD--TQQRNFLHRGIATRNFEHRFQLADHVH--VVAA  400 (436)
Q Consensus       326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~~~~--~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A  400 (436)
                      ++|++ |++++|+|.++|||++||||+|+++++ .|+|+|+++.+  .++.+++++|+.+|+|+|+|.||++++  .++|
T Consensus         1 ~~dv~-E~~~~~~i~~~lPGv~~edi~i~v~~~~~L~I~g~~~~~~~~~~~~~~~~e~~~g~f~r~i~LP~~v~~~~i~A   79 (92)
T cd06472           1 RVDWK-ETPEAHVFKADVPGVKKEDVKVEVEDGRVLRISGERKKEEEKKGDDWHRVERSSGRFVRRFRLPENADADEVKA   79 (92)
T ss_pred             CccEE-EcCCeEEEEEECCCCChHhEEEEEeCCCEEEEEEEecccccccCCCEEEEEEeccEEEEEEECCCCCCHHHCEE
Confidence            47999 599999999999999999999999975 99999998754  346789999999999999999999998  4999


Q ss_pred             EeeCcEEEEEEec
Q psy5221         401 FFDQGILNIDLIR  413 (436)
Q Consensus       401 ~~~~GvL~i~~pk  413 (436)
                      +|+||+|+|++||
T Consensus        80 ~~~nGvL~I~lPK   92 (92)
T cd06472          80 FLENGVLTVTVPK   92 (92)
T ss_pred             EEECCEEEEEecC
Confidence            9999999999998


No 18 
>PF00011 HSP20:  Hsp20/alpha crystallin family This prints entry is a subset of the Pfam entry.;  InterPro: IPR002068 Prokaryotic and eukaryotic organisms respond to heat shock or other environmental stress by inducing the synthesis of proteins collectively known as heat-shock proteins (hsp) []. Amongst them is a family of proteins with an average molecular weight of 20 Kd, known as the hsp20 proteins []. These seem to act as chaperones that can protect other proteins against heat-induced denaturation and aggregation. Hsp20 proteins seem to form large heterooligomeric aggregates. Structurally, this family is characterised by the presence of a conserved C-terminal domain of about 100 residues.; PDB: 2BOL_B 3N3E_B 2H50_P 2H53_F 2BYU_L 1GME_D 3VQM_J 3VQK_E 3VQL_A 3AAC_A ....
Probab=99.87  E-value=1.4e-21  Score=158.88  Aligned_cols=99  Identities=24%  Similarity=0.566  Sum_probs=81.9

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG  405 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G  405 (436)
                      ||. |++++|.|.++|||+++|||+|+++++.|+|+|+++....+..+.++|+.+++|.|+|.||+++|  .++|.|+||
T Consensus         1 di~-e~~~~~~i~~~lpG~~~edi~I~~~~~~L~I~g~~~~~~~~~~~~~~~~~~~~f~r~~~lP~~vd~~~i~a~~~~G   79 (102)
T PF00011_consen    1 DIK-EDEDEYIIKVDLPGFDKEDIKIKVDDNKLVISGKRKEEEEDDRYYRSERRYGSFERSIRLPEDVDPDKIKASYENG   79 (102)
T ss_dssp             EEE-ESSSEEEEEEE-TTS-GGGEEEEEETTEEEEEEEEEGEECTTCEEEE-S-SEEEEEEEE-STTB-GGG-EEEETTS
T ss_pred             CeE-ECCCEEEEEEECCCCChHHEEEEEecCccceeceeeeeeeeeeeeecccccceEEEEEcCCCcCCcceEEEEecCC
Confidence            787 59999999999999999999999999999999999965667788889999999999999999999  499999999


Q ss_pred             EEEEEEecCCCCCC-CCeEeeec
Q psy5221         406 ILNIDLIREVPEAL-KPRKIYIN  427 (436)
Q Consensus       406 vL~i~~pk~~~~~~-~~~~I~I~  427 (436)
                      +|+|++||..+.+. .+++|+|+
T Consensus        80 vL~I~~pk~~~~~~~~~~~I~I~  102 (102)
T PF00011_consen   80 VLTITIPKKEEEEDSQPKRIPIK  102 (102)
T ss_dssp             EEEEEEEBSSSCTTSSSCEE-ET
T ss_pred             EEEEEEEccccccCCCCeEEEeC
Confidence            99999999755443 68999986


No 19 
>cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18.  Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me
Probab=99.87  E-value=1e-21  Score=156.70  Aligned_cols=87  Identities=21%  Similarity=0.489  Sum_probs=79.1

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc----CCCceEEEeeeeeeEEEEEECCCccc--eE
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT----QQRNFLHRGIATRNFEHRFQLADHVH--VV  398 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~----~~~~~~~~e~~~~~f~r~~~lP~~v~--~~  398 (436)
                      |++||+ |++++|+|.++|||++|+||+|+++++.|+|+|+++...    .+.+++++|+.+|+|+|+|.|| +++  .+
T Consensus         1 ~~~di~-e~~~~~~i~~~lPGv~~edi~v~~~~~~L~I~g~~~~~~~~~~~~~~~~~~e~~~g~f~r~~~lp-~v~~~~i   78 (93)
T cd06471           1 MKTDIK-ETDDEYIVEADLPGFKKEDIKLDYKDGYLTISAKRDESKDEKDKKGNYIRRERYYGSFSRSFYLP-NVDEEEI   78 (93)
T ss_pred             CceeEE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEccccccccccCCEEEEeeeccEEEEEEECC-CCCHHHC
Confidence            479999 599999999999999999999999999999999997532    2357999999999999999999 677  49


Q ss_pred             eEEeeCcEEEEEEec
Q psy5221         399 AAFFDQGILNIDLIR  413 (436)
Q Consensus       399 ~A~~~~GvL~i~~pk  413 (436)
                      +|+|+||+|+|++||
T Consensus        79 ~A~~~dGvL~I~lPK   93 (93)
T cd06471          79 KAKYENGVLKITLPK   93 (93)
T ss_pred             EEEEECCEEEEEEcC
Confidence            999999999999998


No 20 
>cd06497 ACD_alphaA-crystallin_HspB4 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaA-crystallin (HspB4, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1.  Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 does not belong to this group. Mutations inHspB4 have been associated with Autosomal Dominant Congenital Cataract (ADCC). The chaperone-like functions of HspB4 are considered important for maintaining lens transparency and preventing cataract.
Probab=99.86  E-value=2.2e-21  Score=151.40  Aligned_cols=80  Identities=20%  Similarity=0.495  Sum_probs=72.1

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eC
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQ  404 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~  404 (436)
                      +|+ |++++|.|.++||||+||||+|++++|.|+|+|+++++.++.+|.++|     |+|+|.||++||  .++|+| +|
T Consensus         4 ~v~-e~~~~~~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~~~e-----f~R~~~LP~~Vd~~~i~A~~~~d   77 (86)
T cd06497           4 EVR-SDRDKFTIYLDVKHFSPEDLTVKVLDDYVEIHGKHSERQDDHGYISRE-----FHRRYRLPSNVDQSAITCSLSAD   77 (86)
T ss_pred             eEE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEcceeCCCCEEEEE-----EEEEEECCCCCChHHeEEEeCCC
Confidence            677 599999999999999999999999999999999986655566676654     999999999999  499999 89


Q ss_pred             cEEEEEEec
Q psy5221         405 GILNIDLIR  413 (436)
Q Consensus       405 GvL~i~~pk  413 (436)
                      |+|+|++||
T Consensus        78 GvL~I~~PK   86 (86)
T cd06497          78 GMLTFSGPK   86 (86)
T ss_pred             CEEEEEecC
Confidence            999999998


No 21 
>cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1.  Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer.  HspB5's functions include effects on the apoptotic pathway and on metastasis.  Phosphorylation of HspB5 reduces its ol
Probab=99.84  E-value=1.5e-20  Score=145.88  Aligned_cols=79  Identities=20%  Similarity=0.533  Sum_probs=69.9

Q ss_pred             EEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCc
Q psy5221         329 IELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQG  405 (436)
Q Consensus       329 i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~G  405 (436)
                      |. +++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.+|+++|     |+|+|.||++||  .++|+| +||
T Consensus         2 ~~-~~~~~~~v~~dlpG~~~edI~V~v~~~~L~I~g~~~~~~~~~~~~~~e-----f~R~~~LP~~vd~~~i~A~~~~dG   75 (83)
T cd06478           2 VR-LDKDRFSVNLDVKHFSPEELSVKVLGDFVEIHGKHEERQDEHGFISRE-----FHRRYRLPPGVDPAAITSSLSADG   75 (83)
T ss_pred             ee-ecCceEEEEEECCCCCHHHeEEEEECCEEEEEEEEceEcCCCCEEEEE-----EEEEEECCCCcChHHeEEEECCCC
Confidence            44 589999999999999999999999999999999986555556666554     999999999999  499999 699


Q ss_pred             EEEEEEec
Q psy5221         406 ILNIDLIR  413 (436)
Q Consensus       406 vL~i~~pk  413 (436)
                      +|+|++||
T Consensus        76 vL~I~~PK   83 (83)
T cd06478          76 VLTISGPR   83 (83)
T ss_pred             EEEEEecC
Confidence            99999998


No 22 
>cd06479 ACD_HspB7_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB7, also known as cardiovascular small heat shock protein (cvHsp), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB7 is a 25-kDa protein, preferentially expressed in heart and skeletal muscle. It binds the cytoskeleton protein alpha-filamin (also known as actin-binding protein 280). The expression of HspB7 is increased during rat muscle aging.  Its expression is also modulated in obesity implicating this protein in this and related metabolic disorders. As the human gene encoding HspB7 is mapped to chromosome 1p36.23-p34.3 it is a positional candidate for several dystrophies and myopathies.
Probab=99.84  E-value=1.2e-20  Score=144.76  Aligned_cols=77  Identities=22%  Similarity=0.454  Sum_probs=69.5

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eC
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQ  404 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~  404 (436)
                      ||. |++++|.|.++||||+||||+|++++|.|+|+|+++.+.+        ..+|+|+|+|.||++||  .++|.| +|
T Consensus         2 ~v~-e~~~~~~v~~dlpG~~pedi~V~v~~~~L~I~ger~~~~~--------~~~g~F~R~~~LP~~vd~e~v~A~l~~~   72 (81)
T cd06479           2 NVK-TLGDTYQFAVDVSDFSPEDIIVTTSNNQIEVHAEKLASDG--------TVMNTFTHKCQLPEDVDPTSVSSSLGED   72 (81)
T ss_pred             Ccc-CcCCeEEEEEECCCCCHHHeEEEEECCEEEEEEEEeccCC--------CEEEEEEEEEECCCCcCHHHeEEEecCC
Confidence            677 5999999999999999999999999999999999875431        24899999999999999  499997 99


Q ss_pred             cEEEEEEec
Q psy5221         405 GILNIDLIR  413 (436)
Q Consensus       405 GvL~i~~pk  413 (436)
                      |+|+|+++|
T Consensus        73 GvL~I~~~~   81 (81)
T cd06479          73 GTLTIKARR   81 (81)
T ss_pred             CEEEEEecC
Confidence            999999986


No 23 
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1.  HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer.  Its functions include effects on the apoptotic pathway and on metastasis.  Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=99.84  E-value=2.2e-20  Score=145.05  Aligned_cols=77  Identities=17%  Similarity=0.526  Sum_probs=68.9

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN  408 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~  408 (436)
                      +++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.+|++     ++|+|+|.||++||  .++|+|+ ||+|+
T Consensus         4 ~~~~~~~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~~-----~eF~R~~~LP~~vd~~~i~A~~~~dGvL~   78 (84)
T cd06498           4 LEKDKFSVNLDVKHFSPEELKVKVLGDFIEIHGKHEERQDEHGFIS-----REFQRKYRIPADVDPLTITSSLSPDGVLT   78 (84)
T ss_pred             eCCceEEEEEECCCCCHHHeEEEEECCEEEEEEEEcceeCCCCEEE-----EEEEEEEECCCCCChHHcEEEeCCCCEEE
Confidence            4789999999999999999999999999999999876555566654     35999999999999  4999995 99999


Q ss_pred             EEEec
Q psy5221         409 IDLIR  413 (436)
Q Consensus       409 i~~pk  413 (436)
                      |++||
T Consensus        79 I~lPk   83 (84)
T cd06498          79 VCGPR   83 (84)
T ss_pred             EEEeC
Confidence            99998


No 24 
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.83  E-value=1.8e-19  Score=176.16  Aligned_cols=179  Identities=22%  Similarity=0.196  Sum_probs=121.0

Q ss_pred             HHHHHHHHHHhcCCC--CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHHHHHHHHHHHhhhcChHHHHHH
Q psy5221          64 LLKTVNKISKNANIS--TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEIKAIIAHEISHISNGDMVTMLL  139 (436)
Q Consensus        64 L~~~l~~l~~~~g~~--~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El~aVLAHElgHi~~~h~~~~~~  139 (436)
                      +.+.-+.+++.++.+  +-+++++++ ..|||++.    ++.|++++||+-.. ||+|+++|||||+||+..+|..+..-
T Consensus        76 v~~~g~rL~~~a~~~~~~f~f~lV~d~~iNAFA~~----Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~aR~~e  151 (484)
T COG4783          76 VNSLGQRLAAAADLVKTPFTFFLVNDDSINAFATP----GGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLARSME  151 (484)
T ss_pred             HHHHHHHHHHhcCCCCCCeEEEEecCCccchhhcC----CceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHHHHHH
Confidence            445555666677776  447889999 99999993    34588999999988 99999999999999999999877553


Q ss_pred             HHH--HHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhh
Q psy5221         140 IQG--VINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAK  217 (436)
Q Consensus       140 ~~~--~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~  217 (436)
                      -+.  ...++...+.+++...     .   ....+.   .+++.-    .      .....-...|||.+|.+||++|+.
T Consensus       152 ~~~r~~~~~i~~ml~gi~aa~-----a---~~~ag~---a~iag~----~------a~~~~g~L~~sR~~E~eADr~Gi~  210 (484)
T COG4783         152 QQQRAAPMAIAGMLLGILAAL-----A---GADAGM---AGIAGA----L------AGAAQGQLNFSRQNEQEADRIGIT  210 (484)
T ss_pred             HHhhhchhHHHHHHHHHHHHH-----h---CccccH---HHHHHH----H------HHhhhhhhhcchhhHHHHHHHHHH
Confidence            311  0111111111111111     1   111111   111110    0      011112577999999999999998


Q ss_pred             hc----CCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221         218 LL----GTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSLY  284 (436)
Q Consensus       218 ~~----g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L~~~~~~~~  284 (436)
                      .+    .+|.+|.++++|+.......                 .....|+.|||-+.+||..+++++.+++
T Consensus       211 ~L~raGydp~gM~~ff~rl~~~~~~~-----------------~~~p~yl~THPlp~~RIa~lr~ra~q~p  264 (484)
T COG4783         211 TLVRAGYDPQGMPEFFERLADQLRYG-----------------GQPPEYLLTHPLPEERIADLRNRAEQSP  264 (484)
T ss_pred             HHHHcCCCchhHHHHHHHHHHHHhcC-----------------CCCChHHhcCCCchhHHHHHHHHHHhCC
Confidence            76    58999999999998643111                 1235689999999999999998888875


No 25 
>cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits.  HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing  for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)]  is selectively up-regulated in skeletal muscles from myotonic dystrophy patients.
Probab=99.82  E-value=7.4e-20  Score=141.48  Aligned_cols=77  Identities=23%  Similarity=0.507  Sum_probs=67.7

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN  408 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~  408 (436)
                      .++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.++.     .++|+|+|.||++||  .++|.|+ ||+|+
T Consensus         4 ~~~d~y~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~-----~~eF~R~~~LP~~vd~~~v~A~~~~dGvL~   78 (83)
T cd06476           4 SEDDKYQVFLDVCHFTPDEITVRTVDNLLEVSARHPQRMDRHGFV-----SREFTRTYILPMDVDPLLVRASLSHDGILC   78 (83)
T ss_pred             ccCCeEEEEEEcCCCCHHHeEEEEECCEEEEEEEEcceecCCCEE-----EEEEEEEEECCCCCChhhEEEEecCCCEEE
Confidence            368899999999999999999999999999999986554444454     456999999999999  4999996 99999


Q ss_pred             EEEec
Q psy5221         409 IDLIR  413 (436)
Q Consensus       409 i~~pk  413 (436)
                      |++||
T Consensus        79 I~~Pr   83 (83)
T cd06476          79 IQAPR   83 (83)
T ss_pred             EEecC
Confidence            99997


No 26 
>cd06475 ACD_HspB1_like Alpha crystallin domain (ACD) found in mammalian small (s)heat shock protein (Hsp)-27 (also denoted HspB1 in human) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Hsp27 shows enhanced synthesis in response to stress. It is a molecular chaperone which interacts with a large number of different proteins. It is found in many types of human cells including breast, uterus, cervix, platelets and cancer cells. Hsp27 has diverse cellular functions including, chaperoning, regulation of actin polymerization, keratinocyte differentiation, regulation of inflammatory pathways in keratinocytes, and protection from oxidative stress through modulating glutathione levels. It is also a subunit of AUF1-containing protein complexes. It has been linked to several transduction pathways regulating cellular functions including differentiat
Probab=99.82  E-value=8.3e-20  Score=142.53  Aligned_cols=80  Identities=20%  Similarity=0.511  Sum_probs=70.8

Q ss_pred             eeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-
Q psy5221         327 YNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-  403 (436)
Q Consensus       327 ~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-  403 (436)
                      .||+ |++++|.|.++|||++||||+|+++++.|+|+|+++.+.++..+.     .++|+|+|+||++||  .++|.|+ 
T Consensus         3 ~~i~-e~~~~~~v~~dlPG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~-----~~~f~R~f~LP~~vd~~~v~A~~~~   76 (86)
T cd06475           3 SEIR-QTADRWKVSLDVNHFAPEELVVKTKDGVVEITGKHEEKQDEHGFV-----SRCFTRKYTLPPGVDPTAVTSSLSP   76 (86)
T ss_pred             ceEE-EcCCeEEEEEECCCCCHHHEEEEEECCEEEEEEEECcCcCCCCEE-----EEEEEEEEECCCCCCHHHcEEEECC
Confidence            4898 599999999999999999999999999999999987654444442     458999999999999  4999997 


Q ss_pred             CcEEEEEEe
Q psy5221         404 QGILNIDLI  412 (436)
Q Consensus       404 ~GvL~i~~p  412 (436)
                      ||+|+|++|
T Consensus        77 dGvL~I~lP   85 (86)
T cd06475          77 DGILTVEAP   85 (86)
T ss_pred             CCeEEEEec
Confidence            999999998


No 27 
>cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues.  In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8).
Probab=99.81  E-value=2.2e-19  Score=138.30  Aligned_cols=76  Identities=24%  Similarity=0.461  Sum_probs=67.5

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCcEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQGILN  408 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~GvL~  408 (436)
                      |++++|+|.++||||+||||+|+++++.|+|+|+++.+.++.++.     .++|+|+|.||++|+  .++|+| +||||+
T Consensus         4 e~~~~~~v~~dlpG~~~edI~V~v~~~~L~I~ge~~~~~~~~~~~-----~r~F~R~~~LP~~Vd~~~v~A~~~~dGvL~   78 (83)
T cd06477           4 EGKPMFQILLDVVQFRPEDIIIQVFEGWLLIKGQHGVRMDEHGFI-----SRSFTRQYQLPDGVEHKDLSAMLCHDGILV   78 (83)
T ss_pred             cCCceEEEEEEcCCCCHHHeEEEEECCEEEEEEEEccccCCCCEE-----EEEEEEEEECCCCcchheEEEEEcCCCEEE
Confidence            589999999999999999999999999999999987755445553     349999999999999  499997 899999


Q ss_pred             EEEe
Q psy5221         409 IDLI  412 (436)
Q Consensus       409 i~~p  412 (436)
                      |+.|
T Consensus        79 I~~~   82 (83)
T cd06477          79 VETK   82 (83)
T ss_pred             EEec
Confidence            9976


No 28 
>cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role.
Probab=99.81  E-value=2.2e-19  Score=139.52  Aligned_cols=79  Identities=13%  Similarity=0.282  Sum_probs=69.4

Q ss_pred             cCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc-EEEE
Q psy5221         333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG-ILNI  409 (436)
Q Consensus       333 ~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G-vL~i  409 (436)
                      ++++|+|.+||||++||||+|++++|.|+|+|+++.+.++..  .+||.+|+|+|+|.||++||  .++|+|+|| +|+|
T Consensus         6 ~~~~~~v~adlPG~~kedI~V~v~~~~L~I~ger~~~~e~~~--~~er~~g~F~R~f~LP~~Vd~d~i~A~~~~~~~l~i   83 (87)
T cd06482           6 DSSNVLASVDVCGFEPDQVKVKVKDGKVQVSAERENRYDCLG--SKKYSYMNICKEFSLPPGVDEKDVTYSYGLGSVVKI   83 (87)
T ss_pred             cCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEecccccCC--ccEEEEEEEEEEEECCCCcChHHcEEEEcCCCEEEE
Confidence            578999999999999999999999999999999976543222  24789999999999999999  499999987 9999


Q ss_pred             EEec
Q psy5221         410 DLIR  413 (436)
Q Consensus       410 ~~pk  413 (436)
                      .-|.
T Consensus        84 ~~~~   87 (87)
T cd06482          84 ETPC   87 (87)
T ss_pred             eeCC
Confidence            9874


No 29 
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9  interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=99.80  E-value=3.1e-19  Score=139.66  Aligned_cols=79  Identities=23%  Similarity=0.462  Sum_probs=69.2

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC--CCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCcE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ--QRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQGI  406 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~--~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~Gv  406 (436)
                      |.+++|.|.++||||+||||+|++++|.|+|+|+++.+.+  ..+|.   +.+|+|+|+|.||++||  .++|.| +||+
T Consensus         4 ~~~d~~~v~~dlpG~~~edI~V~v~~~~L~I~g~~~~~~~~~~~~~~---~~~~~F~R~~~LP~~Vd~~~i~A~~~~dGv   80 (87)
T cd06481           4 DGKEGFSLKLDVRGFSPEDLSVRVDGRKLVVTGKREKKNEDEKGSFS---YEYQEFVREAQLPEHVDPEAVTCSLSPSGH   80 (87)
T ss_pred             CccceEEEEEECCCCChHHeEEEEECCEEEEEEEEeeecccCCCcEE---EEeeEEEEEEECCCCcChHHeEEEeCCCce
Confidence            4678999999999999999999999999999999875432  24443   35899999999999999  499999 9999


Q ss_pred             EEEEEec
Q psy5221         407 LNIDLIR  413 (436)
Q Consensus       407 L~i~~pk  413 (436)
                      |+|++|+
T Consensus        81 L~I~~P~   87 (87)
T cd06481          81 LHIRAPR   87 (87)
T ss_pred             EEEEcCC
Confidence            9999996


No 30 
>cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain  (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=99.77  E-value=2e-18  Score=134.42  Aligned_cols=75  Identities=24%  Similarity=0.610  Sum_probs=66.9

Q ss_pred             CceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC-cEEEEE
Q psy5221         334 ENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ-GILNID  410 (436)
Q Consensus       334 ~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~-GvL~i~  410 (436)
                      +++|.|.++||||+||||+|+++++.|+|+|+++.+.++.     ++.+++|+|+|.||++||.  ++|+|+| |+|+|+
T Consensus         6 ~~~~~v~~dlpG~~~edI~v~v~~~~L~I~g~~~~~~~~~-----~~~~~~f~r~~~LP~~vd~~~i~A~~~~~GvL~I~   80 (83)
T cd06526           6 DEKFQVTLDVKGFKPEELKVKVSDNKLVVEGKHEEREDEH-----GYVSREFTRRYQLPEGVDPDSVTSSLSSDGVLTIE   80 (83)
T ss_pred             CeeEEEEEECCCCCHHHcEEEEECCEEEEEEEEeeeccCC-----CEEEEEEEEEEECCCCCChHHeEEEeCCCcEEEEE
Confidence            4599999999999999999999999999999987654422     3457899999999999994  9999998 999999


Q ss_pred             Eec
Q psy5221         411 LIR  413 (436)
Q Consensus       411 ~pk  413 (436)
                      +||
T Consensus        81 ~Pk   83 (83)
T cd06526          81 APK   83 (83)
T ss_pred             ecC
Confidence            997


No 31 
>cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain  (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=99.74  E-value=1.9e-17  Score=130.52  Aligned_cols=85  Identities=25%  Similarity=0.585  Sum_probs=76.9

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC-CCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ-QRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ  404 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~-~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~  404 (436)
                      |++ |++++|.+.++|||++++||+|+++++.|.|+|+++.... ...+...++.+|.|+|+|.||..+|.  ++|.|+|
T Consensus         1 ~i~-e~~~~~~i~~~lpg~~~~~i~V~v~~~~l~I~g~~~~~~~~~~~~~~~~~~~~~f~r~~~LP~~vd~~~i~a~~~~   79 (88)
T cd06464           1 DVY-ETDDAYVVEADLPGFKKEDIKVEVEDGVLTISGEREEEEEEEENYLRRERSYGSFSRSFRLPEDVDPDKIKASLEN   79 (88)
T ss_pred             CcE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEecccccCCcEEEEEEeCcEEEEEEECCCCcCHHHcEEEEeC
Confidence            466 4889999999999999999999999999999999987643 33678888999999999999999994  9999999


Q ss_pred             cEEEEEEec
Q psy5221         405 GILNIDLIR  413 (436)
Q Consensus       405 GvL~i~~pk  413 (436)
                      |+|+|++||
T Consensus        80 G~L~I~~pk   88 (88)
T cd06464          80 GVLTITLPK   88 (88)
T ss_pred             CEEEEEEcC
Confidence            999999997


No 32 
>KOG2661|consensus
Probab=99.73  E-value=3.8e-17  Score=151.48  Aligned_cols=187  Identities=16%  Similarity=0.140  Sum_probs=123.4

Q ss_pred             hCCCEEecCCCCcchhHHHHHHHHHHHhcC----CC--CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHH
Q psy5221          47 SMNLMVIKNPSNFTEDWLLKTVNKISKNAN----IS--TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEI  118 (436)
Q Consensus        47 ~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g----~~--~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El  118 (436)
                      .++.+.++ +..+..-.+.+++.++.+..+    +.  .-++.|+++ .+|||+.+   ++. |++-+|+|..+ +++++
T Consensus       201 eyq~~mLp-~~hp~sl~V~~vlk~iIea~~~~~slsgIkWeihVVndPipNAFvLP---gGK-vfVFtgiLn~ck~ddgl  275 (424)
T KOG2661|consen  201 EYQNDMLP-EKHPRSLAVKEVLKHIIEANKDVPSLSGIKWEIHVVNDPIPNAFVLP---GGK-VFVFTGILNSCKDDDGL  275 (424)
T ss_pred             HhcCccCC-cCCchhhHHHHHHHHHHHHhccCCcccCceeEEEEecCCCCceeecc---CCe-EEEEechhhcccChHHH
Confidence            35555555 333444446677777777666    33  237889999 99999995   345 44555799999 99999


Q ss_pred             HHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHH
Q psy5221         119 KAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASI  198 (436)
Q Consensus       119 ~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (436)
                      ++|||||+||-..||.....-..-++.    ++..+++.+....                        ...-  -.....
T Consensus       276 AtvLgHE~aHaVarH~AEki~k~~~~s----iLgLvlyt~~~a~------------------------~~n~--~Ll~~f  325 (424)
T KOG2661|consen  276 ATVLGHEIAHAVARHAAEKIGKVHLLS----ILGLVLYTMIWAI------------------------CPND--KLLEYF  325 (424)
T ss_pred             HHHHHHHHHHHHHHHHHHhhhhhHHHH----HHHHHHHHHHhhc------------------------cchH--HHHHHH
Confidence            999999999999999877543322211    1111111111000                        0000  011224


Q ss_pred             HHHHHhhHHHHHhcHhhhhh----cCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHH
Q psy5221         199 VVAWFSRNREFRADHGSAKL----LGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIA  274 (436)
Q Consensus       199 ~~~~~SR~~E~~AD~~a~~~----~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~  274 (436)
                      +.+.+||.+|.|||.+|.-+    |.+|++....+++|.....+...               .....|.||||+..+||+
T Consensus       326 lrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~~egqmg~---------------~~~~eflSTHPSskkRie  390 (424)
T KOG2661|consen  326 LRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEFVEGQMGQ---------------PKMPEFLSTHPSSKKRIE  390 (424)
T ss_pred             hcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHHhhhhcCC---------------CCCchhhhcCCCccchhH
Confidence            56889999999999998755    46889999999998775422211               124678999999999999


Q ss_pred             HHHhccccc
Q psy5221         275 ILSLLKLSL  283 (436)
Q Consensus       275 ~L~~~~~~~  283 (436)
                      .+.++-.+.
T Consensus       391 ~~~~lLpqA  399 (424)
T KOG2661|consen  391 YLDRLLPQA  399 (424)
T ss_pred             HHHHhchHH
Confidence            998765554


No 33 
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=99.71  E-value=1.7e-16  Score=147.76  Aligned_cols=178  Identities=24%  Similarity=0.302  Sum_probs=123.1

Q ss_pred             HHHHHHHHHHHhcCCC------CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHHHHHHHHHHHhhhcChH
Q psy5221          63 WLLKTVNKISKNANIS------TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus        63 ~L~~~l~~l~~~~g~~------~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El~aVLAHElgHi~~~h~  134 (436)
                      .|.+++.++..++-..      ..++.|+|+ .+|||+.+    +++++++.||+..+ |+.|++|||+||+||+..+|.
T Consensus        65 Kler~Vari~g~lt~~S~~p~q~YriTilnSP~INAFALP----GGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHg  140 (479)
T COG4784          65 KLERMVARIVGALTAVSENPQQTYRITILNSPNINAFALP----GGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHG  140 (479)
T ss_pred             HHHHHHHHHHhHhhhhccCCCceEEEEEecCCCccccccC----CceEEEehhHHHHcCCHHHHHHHHHhhhhheecchh
Confidence            3777777777765432      236789999 99999995    56799999999999 889999999999999999998


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHh
Q psy5221         135 VTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHG  214 (436)
Q Consensus       135 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~  214 (436)
                      ....-..    .-..+...+....+.       ....+.                 .......+-...|||.+|.+||.+
T Consensus       141 i~rQ~~e----~a~~ia~rvva~vl~-------~~~agk-----------------~A~~rGklrla~fsRnqELqAD~i  192 (479)
T COG4784         141 IQRQQRE----AAEVIASRVVAEVLG-------SDAAGK-----------------QALIRGKLRLAQFSRNQELQADAI  192 (479)
T ss_pred             HHHHHHH----HHHHHHHHHHHHHhC-------Ccchhh-----------------HHHhhhhHHHhhhccchhhhhhhh
Confidence            7654432    111222222222111       111111                 111223345688999999999999


Q ss_pred             hhhhc----CCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHHHhccccc
Q psy5221         215 SAKLL----GTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSL  283 (436)
Q Consensus       215 a~~~~----g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L~~~~~~~  283 (436)
                      ++++.    .+|-++++.|..|+.+..-..          .++.... -..+++|||.+.+||+.....+.+.
T Consensus       193 G~~~lgeAGYDP~A~~rfl~sm~ay~~F~s----------~~g~adq-sldfl~sHPntpqRiqla~~hARq~  254 (479)
T COG4784         193 GIKMLGEAGYDPYAAARFLQSMAAYTDFRS----------VSGAADQ-SLDFLASHPNTPQRIQLARRHARQF  254 (479)
T ss_pred             hHHHHHhcCCChHHHHHHHHHHHhhhhhcc----------cCcchhh-hcchhhcCCCChHHHHHHHHHHHhh
Confidence            99987    478999999999988753211          1111011 2568999999999999888766654


No 34 
>cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II.
Probab=99.69  E-value=1.9e-16  Score=123.78  Aligned_cols=77  Identities=18%  Similarity=0.459  Sum_probs=68.2

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN  408 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~  408 (436)
                      +++++|.|.+++.||+||||+|++.++.|+|+|+++.+.++.++.     .++|+|+|.||++||  .++|.+. ||+|+
T Consensus        12 ~~~~~f~v~ldv~gF~pEDL~Vkv~~~~L~V~Gkh~~~~~e~g~~-----~r~F~R~~~LP~~Vd~~~v~s~l~~dGvL~   86 (91)
T cd06480          12 NSSEPWKVCVNVHSFKPEELTVKTKDGFVEVSGKHEEQQKEGGIV-----SKNFTKKIQLPPEVDPVTVFASLSPEGLLI   86 (91)
T ss_pred             CCCCcEEEEEEeCCCCHHHcEEEEECCEEEEEEEECcccCCCCEE-----EEEEEEEEECCCCCCchhEEEEeCCCCeEE
Confidence            467899999999999999999999999999999998765444443     589999999999999  3999998 99999


Q ss_pred             EEEec
Q psy5221         409 IDLIR  413 (436)
Q Consensus       409 i~~pk  413 (436)
                      |++|.
T Consensus        87 IeaP~   91 (91)
T cd06480          87 IEAPQ   91 (91)
T ss_pred             EEcCC
Confidence            99983


No 35 
>KOG0710|consensus
Probab=99.60  E-value=1.7e-15  Score=136.74  Aligned_cols=106  Identities=21%  Similarity=0.476  Sum_probs=90.8

Q ss_pred             CCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEEeccc----CCCceEEEeeeeeeEEEEEECCCcc
Q psy5221         321 QQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRKQADT----QQRNFLHRGIATRNFEHRFQLADHV  395 (436)
Q Consensus       321 ~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~~~~~----~~~~~~~~e~~~~~f~r~~~lP~~v  395 (436)
                      ....++.||. |++++|++.+++||++++|++|+++++ +|+|+|+++.+.    +...+...|+..|.|.|+|.||+++
T Consensus        81 ~~~~~~~~v~-e~~~~~~~~~~~Pgl~ke~iKv~~~~~~~l~isGe~~~e~e~~~~~~~~~~~E~~~g~F~r~~~lPenv  159 (196)
T KOG0710|consen   81 SEARVPWDVK-ESPDAHEFKVDLPGLKKEDIKVEVEDEKVLTISGERKKEEEESGSGKKWKRVERKLGKFKRRFELPENV  159 (196)
T ss_pred             ccccCCcccc-cCCCceEEEeeCCCCCchhceEEeccCcEEEEecccccccccccCCccceeehhcccceEeeecCCccc
Confidence            3456788998 599999999999999999999999988 799999988763    3456788899999999999999999


Q ss_pred             ce--EeEEeeCcEEEEEEecCCC--CCCCCeEeeec
Q psy5221         396 HV--VAAFFDQGILNIDLIREVP--EALKPRKIYIN  427 (436)
Q Consensus       396 ~~--~~A~~~~GvL~i~~pk~~~--~~~~~~~I~I~  427 (436)
                      +.  |+|.|+||||+|++||..+  +..+.+.|+|+
T Consensus       160 ~~d~ikA~~~nGVL~VvvpK~~~~~~~~~v~~i~i~  195 (196)
T KOG0710|consen  160 DVDEIKAEMENGVLTVVVPKLEPLLKKPKVRQIAIS  195 (196)
T ss_pred             cHHHHHHHhhCCeEEEEEecccccccCCccceeecc
Confidence            83  9999999999999999766  34555666654


No 36 
>KOG3591|consensus
Probab=99.58  E-value=1.5e-14  Score=127.02  Aligned_cols=101  Identities=21%  Similarity=0.494  Sum_probs=84.9

Q ss_pred             ceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe-
Q psy5221         326 PYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF-  402 (436)
Q Consensus       326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~-  402 (436)
                      ..++. .++++|.|.+|+..|+||||+|++.|+.|.|+|++.+..++.++.     .++|.|+|.||++||.  |++.+ 
T Consensus        64 ~~~~~-~~~~~F~V~lDV~~F~PeEl~Vk~~~~~l~V~gkHeer~d~~G~v-----~R~F~R~y~LP~~vdp~~V~S~LS  137 (173)
T KOG3591|consen   64 ASEIV-NDKDKFEVNLDVHQFKPEELKVKTDDNTLEVEGKHEEKEDEHGYV-----SRSFVRKYLLPEDVDPTSVTSTLS  137 (173)
T ss_pred             ccccc-cCCCcEEEEEEcccCcccceEEEeCCCEEEEEeeeccccCCCCeE-----EEEEEEEecCCCCCChhheEEeeC
Confidence            45665 588999999999999999999999999999999998887777776     6689999999999993  77777 


Q ss_pred             eCcEEEEEEecCCCCCCCCeEeeecCCCCC
Q psy5221         403 DQGILNIDLIREVPEALKPRKIYINTTKNT  432 (436)
Q Consensus       403 ~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~  432 (436)
                      .||+|+|+.||..+.....|.|+|+..+..
T Consensus       138 ~dGvLtI~ap~~~~~~~~er~ipI~~~~~~  167 (173)
T KOG3591|consen  138 SDGVLTIEAPKPPPKQDNERSIPIEQVGPS  167 (173)
T ss_pred             CCceEEEEccCCCCcCccceEEeEeecCcc
Confidence            589999999995443335899999865443


No 37 
>PF05569 Peptidase_M56:  BlaR1 peptidase M56;  InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription. 
Probab=99.47  E-value=2.8e-12  Score=124.55  Aligned_cols=154  Identities=19%  Similarity=0.303  Sum_probs=110.4

Q ss_pred             HHHHHHHHHHhcCCCC-CeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHH
Q psy5221          64 LLKTVNKISKNANIST-PEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQG  142 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~-p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~  142 (436)
                      +.+.+++++.+.|++. .++++.+.....+++|+.  ++.|++..++++.++++|+.+|++||+.|++++|.....+...
T Consensus       143 ~~~~l~~~~~~~~~~~~~~i~~s~~i~sP~~~G~~--~p~I~lP~~~~~~~~~~el~~il~HEl~Hikr~D~~~~~l~~l  220 (299)
T PF05569_consen  143 LQALLEECKEELGIKRPIRIRVSSGISSPFVFGFL--RPVIVLPESLLEDLSEEELRAILLHELAHIKRRDLLWKLLAEL  220 (299)
T ss_pred             HHHHHHHHHHHhCCCCceEEEEcCCCCCCeeecCc--ceEEEecCccccccCHHHHHHHHHHHHHHHHCCChHHHHHHHH
Confidence            7899999999999874 455554546678899976  5679999999999999999999999999999999877544321


Q ss_pred             HHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhc--C
Q psy5221         143 VINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLL--G  220 (436)
Q Consensus       143 ~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~--g  220 (436)
                      +  .       .+.|                                  +..++.++...+.+.+|+.||+.+++.+  +
T Consensus       221 ~--~-------~l~W----------------------------------fnP~~~~~~~~~~~~~E~~cD~~vl~~l~~~  257 (299)
T PF05569_consen  221 L--C-------ALHW----------------------------------FNPLVWLLRRRIRRDRELACDEAVLRNLGKE  257 (299)
T ss_pred             H--H-------HHHH----------------------------------hhHHHHHHHHHHHHHHHHhhhHHHHHhcCch
Confidence            0  0       1111                                  1122335668889999999999999994  4


Q ss_pred             CchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHH
Q psy5221         221 TPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL  276 (436)
Q Consensus       221 ~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L  276 (436)
                      +..+.+++|.++.........+.              ....+..+.-.+.+||..|
T Consensus       258 ~~~~Y~~~Ll~~~~~~~~~~~~~--------------~~~~~~~~~~~lk~RI~~I  299 (299)
T PF05569_consen  258 ERKAYAETLLKVAKRSQQFKRPP--------------LASSFAFSKSQLKRRIKMI  299 (299)
T ss_pred             hHHHHHHHHHHHHHhhcCCCcch--------------hhhhccCChHHHHHHHHhC
Confidence            55788999999987753311111              1122334677788888753


No 38 
>cd00298 ACD_sHsps_p23-like This domain family includes the alpha-crystallin domain (ACD) of alpha-crystallin-type small heat shock proteins (sHsps) and a similar domain found in p23-like proteins.  sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is this ACD. sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps. p23 is a cochaperone of the Hsp90 chaperoning pathway. It binds Hsp90 and participates in the folding of a number of Hsp90 clients including the progesterone receptor. p23 also has a passive chaperoning activity. p23 in addition may act as the cytosolic prostaglandin E2 synthase. Included in this family is the p23-like C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1) and  the p23-like domains of human butyrate-induced transcript 1 (hB-ind
Probab=99.29  E-value=2.5e-11  Score=92.80  Aligned_cols=76  Identities=25%  Similarity=0.554  Sum_probs=68.5

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCcEEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQGILNI  409 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~GvL~i  409 (436)
                      ++++.|.+.+++||+.+++++|+++++.|.|+|++....+      .+...+.|.+.+.||..++.  ++|.+.+|+|+|
T Consensus         3 q~~~~v~i~i~~~~~~~~~i~v~~~~~~l~v~~~~~~~~~------~~~~~~~~~~~~~L~~~i~~~~~~~~~~~~~l~i   76 (80)
T cd00298           3 QTDDEVVVTVDLPGVKKEDIKVEVEDNVLTISGKREEEEE------RERSYGEFERSFELPEDVDPEKSKASLENGVLEI   76 (80)
T ss_pred             EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEcCCCc------ceEeeeeEEEEEECCCCcCHHHCEEEEECCEEEE
Confidence            4788999999999999999999999999999999875443      45567899999999999994  999999999999


Q ss_pred             EEec
Q psy5221         410 DLIR  413 (436)
Q Consensus       410 ~~pk  413 (436)
                      ++||
T Consensus        77 ~l~K   80 (80)
T cd00298          77 TLPK   80 (80)
T ss_pred             EEcC
Confidence            9997


No 39 
>cd06469 p23_DYX1C1_like p23_like domain found in proteins similar to dyslexia susceptibility 1 (DYX1) candidate 1 (C1) protein, DYX1C1. The human gene encoding this protein is a positional candidate gene for developmental dyslexia (DD), it is located on 15q21.3 by the DYX1 DD susceptibility locus (15q15-21). Independent association studies have reported conflicting results. However, association of short-term memory, which plays a role in DD, with a variant within the DYX1C1 gene has been reported. Most proteins belonging to this group contain a C-terminal tetratricopeptide repeat (TPR) protein binding region.
Probab=98.93  E-value=7.4e-09  Score=79.43  Aligned_cols=65  Identities=17%  Similarity=0.320  Sum_probs=59.3

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCcEEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQGILNI  409 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~GvL~i  409 (436)
                      ++++.+.|.+++||++++|++|+++++.|.|+|                  ..|.+.+.||..++  ..+|++++|.|.|
T Consensus         3 Qt~~~v~i~i~~p~v~~~~v~v~~~~~~l~i~~------------------~~~~~~~~l~~~I~~e~~~~~~~~~~l~i   64 (78)
T cd06469           3 QTDEDVKISVPLKGVKTSKVDIFCSDLYLKVNF------------------PPYLFELDLAAPIDDEKSSAKIGNGVLVF   64 (78)
T ss_pred             ccCCEEEEEEEeCCCccccceEEEecCEEEEcC------------------CCEEEEEeCcccccccccEEEEeCCEEEE
Confidence            378899999999999999999999999999986                  14788999999999  4999999999999


Q ss_pred             EEecC
Q psy5221         410 DLIRE  414 (436)
Q Consensus       410 ~~pk~  414 (436)
                      ++||.
T Consensus        65 ~L~K~   69 (78)
T cd06469          65 TLVKK   69 (78)
T ss_pred             EEEeC
Confidence            99994


No 40 
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=98.88  E-value=3.6e-07  Score=86.09  Aligned_cols=125  Identities=18%  Similarity=0.241  Sum_probs=86.6

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHH
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGV  143 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~  143 (436)
                      ..+.+..+..+.....+.+..-+...+-.++|..  +..|++-+-+.+.++++|++-|++||++|+++||...+.+..  
T Consensus       138 ~~~~~~~~~~~~~~k~i~ir~s~~i~~P~v~gl~--kp~IvlP~d~~~r~~~ee~~yIilHEl~Hlk~gD~i~n~i~~--  213 (337)
T COG4219         138 KRKIVTILKNHQYKKHILIRKSKAIDGPMVFGLV--KPCIVLPADFVERLTDEELKYIILHELSHLKRGDAIINLIVV--  213 (337)
T ss_pred             HHHHHHHHHHhhhccCeeEeecccCCCceeeccC--cceEEccHHHHhhcCHHhhhhhHhHHHhhhhcccHHHHHHHH--
Confidence            5778888888888776554444436677778865  556888888999999999999999999999999987644431  


Q ss_pred             HHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCc-
Q psy5221         144 INTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTP-  222 (436)
Q Consensus       144 ~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~-  222 (436)
                            .+. ++.|      +                            ..+..+....+--.+|..||+.+.... +| 
T Consensus       214 ------~~~-~l~W------f----------------------------NP~v~l~~~~~~~D~E~aCDa~vL~~~-~~~  251 (337)
T COG4219         214 ------VLG-VLFW------F----------------------------NPLVHLGKRKIRIDQEIACDAAVLARI-NPE  251 (337)
T ss_pred             ------HHh-HHhh------c----------------------------ChHHHHHHHHHHhhHHHHhhHHHHhcc-ChH
Confidence                  111 1111      1                            112234556777889999999999874 33 


Q ss_pred             --hHHHHHHHHhhc
Q psy5221         223 --DPMISALRHLSN  234 (436)
Q Consensus       223 --~~l~~aL~kl~~  234 (436)
                        ...++++.++..
T Consensus       252 err~YaEsil~~l~  265 (337)
T COG4219         252 ERRTYAESILKLLL  265 (337)
T ss_pred             HHHHHHHHHHHHHh
Confidence              445566666444


No 41 
>PF05455 GvpH:  GvpH;  InterPro: IPR008633 This family consists of archaeal GvpH proteins which are thought to be involved in gas vesicle synthesis [].
Probab=98.76  E-value=6.3e-08  Score=83.99  Aligned_cols=74  Identities=19%  Similarity=0.333  Sum_probs=57.7

Q ss_pred             CCCceeEEEecCc-eEEEEEecCCCCCCc-eEEEEECC--EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc-e
Q psy5221         323 SYPPYNIELVDEN-KYRITMAIAGFNRSD-LEIETERD--TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH-V  397 (436)
Q Consensus       323 ~~p~~~i~~e~~~-~~~v~~~lpG~~~ed-i~v~~~~~--~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~-~  397 (436)
                      ..+-+++.+ .++ .++|.+||||++++| |+|.++.+  .|+|+.                 .+.+.+++.||...- .
T Consensus        90 ~~~~vdtre-~dDge~~VvAdLPGVs~dd~idV~l~~d~~~L~i~~-----------------~~~~~krv~L~~~~~e~  151 (177)
T PF05455_consen   90 ESIHVDTRE-RDDGELVVVADLPGVSDDDAIDVTLDDDEGALTIRV-----------------GEKYLKRVALPWPDPEI  151 (177)
T ss_pred             ceeeeeeEe-cCCCcEEEEEeCCCCCcccceeeEeecCCceEEEec-----------------CCceEeeEecCCCccce
Confidence            346678874 455 799999999999998 99999854  666652                 223668899997633 5


Q ss_pred             EeEEeeCcEEEEEEecC
Q psy5221         398 VAAFFDQGILNIDLIRE  414 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~  414 (436)
                      +++.|+||||+|++-|.
T Consensus       152 ~~~t~nNgILEIri~~~  168 (177)
T PF05455_consen  152 TSATFNNGILEIRIRRT  168 (177)
T ss_pred             eeEEEeCceEEEEEeec
Confidence            89999999999999883


No 42 
>cd06463 p23_like Proteins containing this p23_like domain include p23 and its Saccharomyces cerevisiae (Sc) homolog Sba1. Both are co-chaperones for the heat shock protein (Hsp) 90.  p23 binds Hsp90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis.  Both p23 and Sba1p can regulate telomerase activity. This group includes domains similar to the C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1). Sgt1 interacts with multiple protein complexes and has the features of a co-chaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain.  Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants.  This group also includes the p23_like domains of
Probab=98.50  E-value=9.8e-07  Score=68.14  Aligned_cols=70  Identities=11%  Similarity=0.192  Sum_probs=61.3

Q ss_pred             ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCcEEEE
Q psy5221         332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQGILNI  409 (436)
Q Consensus       332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~GvL~i  409 (436)
                      ++++.+.|.+.+||..+++++|.++++.|+|++...   +          .+.|...+.|+..++.  .++++++|.|.|
T Consensus         3 Q~~~~v~i~v~~~~~~~~~~~v~~~~~~l~i~~~~~---~----------~~~~~~~~~L~~~I~~~~s~~~~~~~~l~i   69 (84)
T cd06463           3 QTLDEVTITIPLKDVTKKDVKVEFTPKSLTVSVKGG---G----------GKEYLLEGELFGPIDPEESKWTVEDRKIEI   69 (84)
T ss_pred             ccccEEEEEEEcCCCCccceEEEEecCEEEEEeeCC---C----------CCceEEeeEccCccchhhcEEEEeCCEEEE
Confidence            378899999999999999999999999999998753   1          2457788899999994  999999999999


Q ss_pred             EEecC
Q psy5221         410 DLIRE  414 (436)
Q Consensus       410 ~~pk~  414 (436)
                      +++|.
T Consensus        70 ~L~K~   74 (84)
T cd06463          70 TLKKK   74 (84)
T ss_pred             EEEEC
Confidence            99995


No 43 
>cd06466 p23_CS_SGT1_like p23_like domain similar to the C-terminal CHORD-SGT1 (CS) domain of Sgt1 (suppressor of G2 allele of Skp1). Sgt1 interacts with multiple protein complexes and has the features of a cochaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain.  Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants. ScSgt1 is needed for the G1/S and G2/M cell-cycle transitions, and for assembly of the core kinetochore complex (CBF3) via activation of Ctf13, the F-box protein. Binding of Hsp82 (a yeast Hsp90 homologue) to ScSgt1, promotes the binding of Sgt1 to Skp1 and of Skp1 to Ctf13.  Some proteins in this group have an SGT1-specific (SGS) domain at the extreme C-terminus. The ScSgt1-SGS domain binds adenylate cyclase.  The hSgt1-SGS domain interacts with some S100 family proteins, and studies sug
Probab=98.12  E-value=1.4e-05  Score=61.97  Aligned_cols=73  Identities=10%  Similarity=0.151  Sum_probs=61.5

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCc
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQG  405 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~G  405 (436)
                      |.++ +++.+.|.+.+||+.++|++|.++++.|.|++...   +          .+.|...+.|+..|+.  .++.+.+|
T Consensus         1 dW~Q-t~~~v~i~v~~~~~~~~~v~v~~~~~~l~i~~~~~---~----------~~~~~~~~~L~~~I~~~~s~~~~~~~   66 (84)
T cd06466           1 DWYQ-TDTSVTVTIYAKNVDKEDVKVEFNEQSLSVSIILP---G----------GSEYQLELDLFGPIDPEQSKVSVLPT   66 (84)
T ss_pred             Cccc-cCCEEEEEEEECCCCHHHCEEEEecCEEEEEEECC---C----------CCeEEEecccccccCchhcEEEEeCe
Confidence            3454 88999999999999999999999999999986642   0          2357778889999884  88999999


Q ss_pred             EEEEEEecC
Q psy5221         406 ILNIDLIRE  414 (436)
Q Consensus       406 vL~i~~pk~  414 (436)
                      .+.|+++|.
T Consensus        67 ~vei~L~K~   75 (84)
T cd06466          67 KVEITLKKA   75 (84)
T ss_pred             EEEEEEEcC
Confidence            999999994


No 44 
>PF04969 CS:  CS domain;  InterPro: IPR017447 The function of the CS domain is unknown. The CS domain is sometimes found C-terminal to the CHORD domain (IPR007051 from INTERPRO) in metazoan proteins, but occurs separately from the CHORD domain in plants. This association is thought to be indicative of an functional interaction between CS and CHORD domains [].; PDB: 1WGV_A 2KMW_A 2O30_B 1WH0_A 1EJF_A 2RH0_B 1RL1_A 2CR0_A 1WFI_A 2XCM_D ....
Probab=97.67  E-value=0.001  Score=50.48  Aligned_cols=75  Identities=11%  Similarity=0.258  Sum_probs=59.9

Q ss_pred             CceeEEEecCceEEEEEecCCCC--CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeE
Q psy5221         325 PPYNIELVDENKYRITMAIAGFN--RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAA  400 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~--~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A  400 (436)
                      |.++.++ +++.+.|.+.+++..  ++|++|+++++.+.++......             ..|.-...|...|+.  .+.
T Consensus         1 ~~y~W~Q-t~~~V~v~i~~~~~~~~~~dv~v~~~~~~l~v~~~~~~~-------------~~~~~~~~L~~~I~~~~s~~   66 (79)
T PF04969_consen    1 PRYDWYQ-TDDEVTVTIPVKPVDISKEDVKVDFTDTSLSVSIKSGDG-------------KEYLLEGELFGEIDPDESTW   66 (79)
T ss_dssp             SSEEEEE-ESSEEEEEEE-TTTTSSGGGEEEEEETTEEEEEEEETTS-------------CEEEEEEEBSS-BECCCEEE
T ss_pred             CCeEEEE-CCCEEEEEEEEcCCCCChHHeEEEEEeeEEEEEEEccCC-------------ceEEEEEEEeeeEcchhcEE
Confidence            6788886 899999999996664  9999999999999999654211             245566789999884  899


Q ss_pred             EeeCcEEEEEEec
Q psy5221         401 FFDQGILNIDLIR  413 (436)
Q Consensus       401 ~~~~GvL~i~~pk  413 (436)
                      .++++.|.|+|.|
T Consensus        67 ~~~~~~i~i~L~K   79 (79)
T PF04969_consen   67 KVKDNKIEITLKK   79 (79)
T ss_dssp             EEETTEEEEEEEB
T ss_pred             EEECCEEEEEEEC
Confidence            9999999999987


No 45 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=97.58  E-value=0.001  Score=63.76  Aligned_cols=67  Identities=15%  Similarity=0.296  Sum_probs=44.8

Q ss_pred             HHHHHHHHHHhcCCC--CCeEEEECC-----------CccEEEeCCCCCccEEEEcHHHHhhcC-------HHHHHHHHH
Q psy5221          64 LLKTVNKISKNANIS--TPEVAIYNG-----------EINAFATGAFKNSSLIAISANLLESMT-------KLEIKAIIA  123 (436)
Q Consensus        64 L~~~l~~l~~~~g~~--~p~v~v~~~-----------~~NA~~~G~~~~~~~v~it~gLl~~l~-------~~El~aVLA  123 (436)
                      +.+......+++|.+  .|++++++.           ..|+|=   +|....|+++..+++.|.       +--...|||
T Consensus        99 le~~W~~~~~~~g~~y~~P~lv~~~~~~~t~CG~a~s~~gpFY---Cp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlA  175 (292)
T PF04228_consen   99 LEDVWTPQFPQAGLPYRPPKLVLFSGSVQTGCGTASSATGPFY---CPADQTIYLDLSFFDELQQRFGASGDFAQAYVLA  175 (292)
T ss_pred             HHHHHHHHHHHcCCCCCCCEEEEeCCCCCCCCCCCcCCCCCEe---CCCCCEEEechHHHHHHHHHhCCccHHHHHHHHH
Confidence            334445666677876  688888776           123332   466778999988776552       234577999


Q ss_pred             HHHHhhhcCh
Q psy5221         124 HEISHISNGD  133 (436)
Q Consensus       124 HElgHi~~~h  133 (436)
                      ||+||..++.
T Consensus       176 HEyGHHVQ~l  185 (292)
T PF04228_consen  176 HEYGHHVQNL  185 (292)
T ss_pred             HHHHHHHHHH
Confidence            9999987754


No 46 
>PF06114 DUF955:  Domain of unknown function (DUF955);  InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.37  E-value=0.0016  Score=53.50  Aligned_cols=33  Identities=21%  Similarity=0.277  Sum_probs=26.7

Q ss_pred             ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221         100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus       100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~  136 (436)
                      ..+|+++.    .+++.+..+++|||+||+..+|...
T Consensus        28 ~~~I~in~----~~~~~~~~f~laHELgH~~~~~~~~   60 (122)
T PF06114_consen   28 NPIIFINS----NLSPERQRFTLAHELGHILLHHGDE   60 (122)
T ss_dssp             TTEEEEES----SS-HHHHHHHHHHHHHHHHHHH-HH
T ss_pred             CCEEEECC----CCCHHHHHHHHHHHHHHHHhhhccc
Confidence            67788888    3789999999999999999988643


No 47 
>cd06465 p23_hB-ind1_like p23_like domain found in human (h) butyrate-induced transcript 1 (B-ind1) and similar proteins. hB-ind1 participates in signaling by the small GTPase Rac1. It binds to Rac1 and enhances different Rac1 effects including activation of nuclear factor (NF) kappaB and activation of c-Jun N-terminal kinase (JNK). hB-ind1 also plays a part in the RNA replication and particle production of Hepatitis C virus (HCV)  through its interaction with heat shock protein Hsp90, HCV nonstructural protein 5A (NS5A), and the immunophilin FKBP8.  hB-ind1 is upregulated in the outer layer of Chinese hamster V79 cells grown as multicell spheroids, versus in the same cells grown as monolayers. This group includes the Saccharomyces cerevisiae Sba1, a co-chaperone of the Hsp90. Sba1 has been shown to be is required for telomere length maintenance, and may modulate telomerase DNA-binding activity.
Probab=97.27  E-value=0.0037  Score=50.90  Aligned_cols=76  Identities=12%  Similarity=0.190  Sum_probs=61.4

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF  402 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~  402 (436)
                      |+++-++ +++.+.|.+.+||+  ++++|++..+.|.|++....  ++          ..|.-.+.|...|+.  .+.++
T Consensus         1 p~~~W~Q-t~~~V~i~i~~~~~--~~~~V~~~~~~l~v~~~~~~--~~----------~~y~~~~~L~~~I~pe~s~~~v   65 (108)
T cd06465           1 PPVLWAQ-RSDVVYLTIELPDA--KDPKIKLEPTSLSFKAKGGG--GG----------KKYEFDLEFYKEIDPEESKYKV   65 (108)
T ss_pred             CceeeeE-CCCEEEEEEEeCCC--CCcEEEEECCEEEEEEEcCC--CC----------eeEEEEeEhhhhccccccEEEe
Confidence            6889997 89999999999998  88999999999999985321  11          124556688888883  78888


Q ss_pred             eCcEEEEEEecCC
Q psy5221         403 DQGILNIDLIREV  415 (436)
Q Consensus       403 ~~GvL~i~~pk~~  415 (436)
                      .++.+.|+|.|..
T Consensus        66 ~~~kveI~L~K~~   78 (108)
T cd06465          66 TGRQIEFVLRKKE   78 (108)
T ss_pred             cCCeEEEEEEECC
Confidence            8999999999953


No 48 
>PF10463 Peptidase_U49:  Peptidase U49;  InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported.  This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death. 
Probab=97.16  E-value=0.00056  Score=61.79  Aligned_cols=20  Identities=30%  Similarity=0.429  Sum_probs=17.5

Q ss_pred             HHHHHHHHHHHHhhhcChHH
Q psy5221         116 LEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus       116 ~El~aVLAHElgHi~~~h~~  135 (436)
                      --+..+++||++|+..+|..
T Consensus        99 ~A~~fil~HE~~Hv~~~h~~  118 (206)
T PF10463_consen   99 CAIAFILLHELAHVVLGHEG  118 (206)
T ss_pred             HHHHHHHHHHHHHHHHcCcc
Confidence            46789999999999999964


No 49 
>PF08190 PIH1:  pre-RNA processing PIH1/Nop17
Probab=97.06  E-value=0.0021  Score=63.20  Aligned_cols=64  Identities=20%  Similarity=0.374  Sum_probs=54.9

Q ss_pred             cCceEEEEEecCCC-CCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee--CcEE
Q psy5221         333 DENKYRITMAIAGF-NRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD--QGIL  407 (436)
Q Consensus       333 ~~~~~~v~~~lpG~-~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~--~GvL  407 (436)
                      ..+.++|+++|||+ +..+|+++|.++.|.|+....                .|+-.+.||..||  ..+|.|.  .+.|
T Consensus       259 ~p~~lvv~i~LP~~~s~~~i~LdV~~~~l~l~~~~~----------------~y~L~l~LP~~V~~~~~~Akf~~~~~~L  322 (328)
T PF08190_consen  259 SPEELVVEIELPGVESASDIDLDVSEDRLSLSSPKP----------------KYRLDLPLPYPVDEDNGKAKFDKKTKTL  322 (328)
T ss_pred             CCceEEEEEECCCcCccceeEEEEeCCEEEEEeCCC----------------ceEEEccCCCcccCCCceEEEccCCCEE
Confidence            36789999999999 899999999999999984431                3667799999999  4999996  5999


Q ss_pred             EEEEe
Q psy5221         408 NIDLI  412 (436)
Q Consensus       408 ~i~~p  412 (436)
                      +||+|
T Consensus       323 ~vtlp  327 (328)
T PF08190_consen  323 TVTLP  327 (328)
T ss_pred             EEEEE
Confidence            99998


No 50 
>cd06489 p23_CS_hSgt1_like p23_like domain similar to the C-terminal CS (CHORD-SGT1) domain of human (h) Sgt1 and related proteins. hSgt1 is a co-chaperone which has been shown to be elevated in HEp-2 cells as a result of stress conditions such as heat shock. It interacts with the heat shock proteins (HSPs) Hsp70 and Hsp90, and it expression pattern is synchronized with these two Hsps. The interaction with HSP90 has been shown to involve the hSgt1_CS domain, and appears to be required for correct kinetochore assembly and efficient cell division.  Some proteins in this subgroup contain a tetratricopeptide repeat (TPR) HSP-binding domain N-terminal to this CS domain, and most proteins in this subgroup contain a Sgt1-specific (SGS) domain C-terminal to the CS domain. The SGS domain interacts with some S100 family proteins. Studies suggest that S100A6 modulates in a Ca2+ dependent manner the interactions of hSgt1 with Hsp90 and Hsp70. The yeast Sgt1 CS domain is not found in this subgroup.
Probab=96.90  E-value=0.0078  Score=46.51  Aligned_cols=73  Identities=16%  Similarity=0.205  Sum_probs=56.0

Q ss_pred             eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCc
Q psy5221         328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQG  405 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~G  405 (436)
                      |.++ +++.+.|.+.++|+.+++++|+++++.|.+++...   ++.          .|.-.+.|...|+.  .+.+..++
T Consensus         1 dW~Q-~~~~V~iti~~k~~~~~~~~v~~~~~~l~~~~~~~---~~~----------~y~~~~~L~~~I~p~~s~~~v~~~   66 (84)
T cd06489           1 DWYQ-TESQVVITILIKNVKPEDVSVEFEKRELSATVKLP---SGN----------DYSLKLHLLHPIVPEQSSYKILST   66 (84)
T ss_pred             Cccc-cCCEEEEEEEECCCCHHHCEEEEeCCEEEEEEECC---CCC----------cEEEeeecCceecchhcEEEEeCc
Confidence            3455 78889999999999999999999999999997652   111          24445677777663  56666688


Q ss_pred             EEEEEEecC
Q psy5221         406 ILNIDLIRE  414 (436)
Q Consensus       406 vL~i~~pk~  414 (436)
                      -+.|++.|.
T Consensus        67 kiei~L~K~   75 (84)
T cd06489          67 KIEIKLKKT   75 (84)
T ss_pred             EEEEEEEcC
Confidence            899999994


No 51 
>PRK09672 phage exclusion protein Lit; Provisional
Probab=96.74  E-value=0.0026  Score=59.94  Aligned_cols=21  Identities=24%  Similarity=0.298  Sum_probs=18.0

Q ss_pred             HHHHHHHHHHHHHhhhcChHH
Q psy5221         115 KLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~~h~~  135 (436)
                      -..++.|+.||++|+..+|..
T Consensus       162 l~A~a~i~~HEiaHv~~~h~~  182 (305)
T PRK09672        162 LCALAWILLHEIAHVEFQHSS  182 (305)
T ss_pred             HHHHHHHHHHHHHHHHhcccc
Confidence            346889999999999999963


No 52 
>cd06467 p23_NUDC_like p23_like domain of NUD (nuclear distribution) C and similar proteins. Aspergillus nidulas (An) NUDC is needed for nuclear movement. AnNUDC is localized at the hyphal cortex, and binds NUDF at spindle pole bodies (SPBs) and in the cytoplasm at different stages in the cell cycle. At the SPBs it is part of the dynein molecular motor/NUDF complex that regulates microtubule dynamics.  Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I).  mNUDC is important for cell proliferation both in normal and tumor tissues.  Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors.  For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its ext
Probab=96.63  E-value=0.018  Score=44.39  Aligned_cols=69  Identities=16%  Similarity=0.214  Sum_probs=52.2

Q ss_pred             EEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC-
Q psy5221         329 IELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ-  404 (436)
Q Consensus       329 i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~-  404 (436)
                      -++ +++.+.|.+.+| +++++|++|++.++.|.|+.+.      ..+          .-.-.|...|+.  ..-.+++ 
T Consensus         3 W~Q-t~~~V~i~i~~~~~~~~~dv~v~~~~~~l~v~~~~------~~~----------~l~~~L~~~I~~~~s~w~~~~~   65 (85)
T cd06467           3 WTQ-TLDEVTVTIPLPEGTKSKDVKVEITPKHLKVGVKG------GEP----------LLDGELYAKVKVDESTWTLEDG   65 (85)
T ss_pred             EEe-eCCEEEEEEECCCCCcceeEEEEEEcCEEEEEECC------CCc----------eEcCcccCceeEcCCEEEEeCC
Confidence            454 788999999997 7899999999999999998542      011          111246777774  5557788 


Q ss_pred             cEEEEEEecC
Q psy5221         405 GILNIDLIRE  414 (436)
Q Consensus       405 GvL~i~~pk~  414 (436)
                      ..+.|+++|.
T Consensus        66 ~~v~i~L~K~   75 (85)
T cd06467          66 KLLEITLEKR   75 (85)
T ss_pred             CEEEEEEEEC
Confidence            9999999994


No 53 
>cd06493 p23_NUDCD1_like p23_NUDCD1: p23-like NUD (nuclear distribution) C-like domain found in human NUD (nuclear distribution) C domain-containing protein 1, NUDCD1 (also known as CML66), and similar proteins. NUDCD1/CML66 is a broadly immunogenic tumor associated antigen, which is highly expressed in a variety of solid tumors and in leukemias. In normal tissues high expression of NUDCD1/CML66 is limited to testis and heart.
Probab=96.35  E-value=0.038  Score=42.77  Aligned_cols=70  Identities=16%  Similarity=0.248  Sum_probs=51.2

Q ss_pred             eEEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221         328 NIELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ  404 (436)
Q Consensus       328 ~i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~  404 (436)
                      +-++ +++.+.|.+.+| |++++|++|++..+.|.++...     +..          + -.-.|...|+.  .+-..++
T Consensus         2 ~W~Q-t~~~V~v~i~~p~~~~~~dv~v~~~~~~l~v~~~~-----~~~----------~-~~g~L~~~I~~d~Stw~i~~   64 (85)
T cd06493           2 YWQQ-TEEDLTLTIRLPEDTTKEDIRIKFLPDHISIALKD-----QAP----------L-LEGKLYSSIDHESSTWIIKE   64 (85)
T ss_pred             ccEE-eCCEEEEEEECCCCCChhhEEEEEecCEEEEEeCC-----CCe----------E-EeCcccCcccccCcEEEEeC
Confidence            4455 888999999996 9999999999999999997421     001          1 12256777774  5555666


Q ss_pred             c-EEEEEEecC
Q psy5221         405 G-ILNIDLIRE  414 (436)
Q Consensus       405 G-vL~i~~pk~  414 (436)
                      | .|.|++.|.
T Consensus        65 ~~~l~i~L~K~   75 (85)
T cd06493          65 NKSLEVSLIKK   75 (85)
T ss_pred             CCEEEEEEEEC
Confidence            6 699999994


No 54 
>PF01863 DUF45:  Protein of unknown function DUF45;  InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=96.34  E-value=0.008  Score=54.80  Aligned_cols=68  Identities=18%  Similarity=0.301  Sum_probs=53.0

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      +.+.+...++..|++++++.+-+...   ..|.++.++.|.++.-|+ .++++-+..|+.||++|+.+.+.-
T Consensus       114 l~~~~~~~~~~~~~~~~~i~ir~~ks---rWGsc~~~~~I~ln~~L~-~~P~~~idYVvvHEL~Hl~~~nHs  181 (205)
T PF01863_consen  114 LPERLKKYAKKLGLPPPKIKIRDMKS---RWGSCSSKGNITLNWRLV-MAPPEVIDYVVVHELCHLRHPNHS  181 (205)
T ss_pred             HHHHHHHHHHHcCCCcceEEEeehhh---ccccCCCCCcEEeecccc-cCCccHHHHHHHHHHHHhccCCCC
Confidence            66777888999999999988877511   256556677798998444 479999999999999999875543


No 55 
>cd06488 p23_melusin_like p23_like domain similar to the C-terminal (tail) domain of vertebrate Melusin and related proteins. Melusin's tail domain interacts with the cytoplasmic domain of beta1-A and beta1-D isoforms of beta1 integrin, it does not bind other integrin beta subunits. Melusin is a muscle-specific protein expressed in skeletal and cardiac muscles but not in smooth muscle or other tissues. It is needed for heart hypertrophy following mechanical overload. The integrin-binding portion of this domain appears to be sequestered in the full length melusin protein, Ca2+ may modulate the protein's conformation exposing this binding site. This group includes Chordc1, also known as Chp-1, which is conserved from vertebrates to humans.  Mammalian Chordc1 interacts with the heat shock protein (HSP) Hsp90 and is implicated in circadian and/or homeostatic mechanisms in the brain. The N-terminal portions of proteins belonging to this group contain two cysteine and histidine rich domain (C
Probab=96.30  E-value=0.054  Score=42.16  Aligned_cols=74  Identities=12%  Similarity=0.091  Sum_probs=58.9

Q ss_pred             eeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221         327 YNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ  404 (436)
Q Consensus       327 ~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~  404 (436)
                      +|.++ +++.+.|.+-+.|++++++++.++++.|+++...+.   +          ..|.-.+.|-..|+.  .+-....
T Consensus         3 ~dW~Q-s~~~V~ItI~~k~~~~~~~~v~~~~~~l~v~~~~~~---~----------~~y~~~l~L~~~I~~~~s~~~v~~   68 (87)
T cd06488           3 HDWHQ-TGSHVVVSVYAKNSNPELSVVEANSTVLTIHIVFEG---N----------KEFQLDIELWGVIDVEKSSVNMLP   68 (87)
T ss_pred             ccEee-CCCEEEEEEEECcCCccceEEEecCCEEEEEEECCC---C----------ceEEEEeeccceEChhHcEEEecC
Confidence            57886 889999999999999999999999999998755421   1          125566777777774  6666788


Q ss_pred             cEEEEEEecC
Q psy5221         405 GILNIDLIRE  414 (436)
Q Consensus       405 GvL~i~~pk~  414 (436)
                      +-+.|++.|.
T Consensus        69 ~kvei~L~K~   78 (87)
T cd06488          69 TKVEIKLRKA   78 (87)
T ss_pred             cEEEEEEEeC
Confidence            9999999994


No 56 
>PF04298 Zn_peptidase_2:  Putative neutral zinc metallopeptidase;  InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=96.22  E-value=0.68  Score=42.33  Aligned_cols=65  Identities=15%  Similarity=0.205  Sum_probs=44.3

Q ss_pred             HHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221          65 LKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      .+.-+.+.++.|+.--+|..+++ --+    =..|..+.|-++..-.+.-| =--.||-|||.||-.+...
T Consensus        40 ae~Ar~iL~~~gl~~V~Ve~~~G~LtD----HYdP~~k~vrLS~~vy~~~S-iaAvaVAAHEvGHAiQ~a~  105 (222)
T PF04298_consen   40 AEVARHILDRNGLSDVRVERVPGELTD----HYDPRNKVVRLSEDVYNGRS-IAAVAVAAHEVGHAIQHAE  105 (222)
T ss_pred             HHHHHHHHHHCCCCCeeEEEeCCCCCC----CcCCCCCEEEeCCccCCCCC-HHHHHHHHHHHhHHHhccc
Confidence            57888999999998656666654 101    12366777888875444322 2456899999999998774


No 57 
>cd06494 p23_NUDCD2_like p23-like NUD (nuclear distribution) C-like found in human NUDC domain-containing protein 2 (NUDCD2) and similar proteins.  Little is known about the function of the proteins in this subgroup.
Probab=95.86  E-value=0.12  Score=40.74  Aligned_cols=74  Identities=12%  Similarity=0.115  Sum_probs=54.8

Q ss_pred             CCCceeEEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEe
Q psy5221         323 SYPPYNIELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVA  399 (436)
Q Consensus       323 ~~p~~~i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~  399 (436)
                      .++.++-++ +.+.+.|.+.+| |++++|++|++..+.|.|.-+-      .     +.-.|      .|+..|+  ...
T Consensus         4 ~~~~y~W~Q-T~~eV~v~i~lp~~~~~kdv~V~i~~~~l~V~~~g------~-----~~l~G------~L~~~I~~dest   65 (93)
T cd06494           4 KTPWGCWYQ-TMDEVFIEVNVPPGTRAKDVKCKLGSRDISLAVKG------Q-----EVLKG------KLFDSVVADECT   65 (93)
T ss_pred             cCCCcEEEe-EcCEEEEEEECCCCCceeeEEEEEEcCEEEEEECC------E-----EEEcC------cccCccCcccCE
Confidence            456788887 889999999888 8999999999999999997411      0     11133      3455555  356


Q ss_pred             EEeeCcE-EEEEEecC
Q psy5221         400 AFFDQGI-LNIDLIRE  414 (436)
Q Consensus       400 A~~~~Gv-L~i~~pk~  414 (436)
                      -.+++|- |.|+|.|.
T Consensus        66 Wtled~k~l~I~L~K~   81 (93)
T cd06494          66 WTLEDRKLIRIVLTKS   81 (93)
T ss_pred             EEEECCcEEEEEEEeC
Confidence            6677775 89999994


No 58 
>PRK04351 hypothetical protein; Provisional
Probab=95.86  E-value=0.043  Score=47.24  Aligned_cols=62  Identities=24%  Similarity=0.295  Sum_probs=47.1

Q ss_pred             HHHHHHHHHHh-cCCCCCeEEEECC----CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhh
Q psy5221          64 LLKTVNKISKN-ANISTPEVAIYNG----EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHI  129 (436)
Q Consensus        64 L~~~l~~l~~~-~g~~~p~v~v~~~----~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi  129 (436)
                      |.+.+++++.. .+.+.+.-..++.    ..-++..    ....|-++..+++..+++++..||+||++|+
T Consensus         6 l~~l~~~~s~~~F~~~f~~~v~~n~RlrttgG~~~l----~~~~I~lnp~ll~~~~~~~l~~vv~HElcH~   72 (149)
T PRK04351          6 LQRLVEEISLEYFGKPFRHQAYFNKRLRTTGGRYLL----KDHHIEFNPKMLEEYGLEELIGIIKHELCHY   72 (149)
T ss_pred             HHHHHHHHHHHHhCCCCCcEEEEeccchhhhheeec----CCCeEEeCHHHHhhccHHHHHhhHHHHHHHH
Confidence            99999999986 5766554455555    2233322    2456889998999999999999999999997


No 59 
>cd06468 p23_CacyBP p23_like domain found in proteins similar to Calcyclin-Binding Protein(CacyBP)/Siah-1-interacting protein (SIP). CacyBP/SIP interacts with S100A6 (calcyclin), with some other members of the S100 family, with tubulin, and with Siah-1 and Skp-1. The latter two are components of the ubiquitin ligase that regulates beta-catenin degradation. The beta-catenin gene is an oncogene participating in tumorigenesis in many different cancers. Overexpression of CacyBP/SIP, in part through its effect on the expression of beta-catenin, inhibits the proliferation, tumorigenicity, and invasion of gastric cancer cells. CacyBP/SIP is abundant in neurons and neuroblastoma NB2a cells. An extensive re-organization of microtubules accompanies the differentiation of NB2a cells. CacyBP/SIP may contribute to NB2a cell differentiation through binding to and increasing the oligomerization of tubulin. CacyBP/SIP is also implicated in differentiation of erythroid cells, rat neonatal cardiomyocytes
Probab=95.70  E-value=0.14  Score=40.08  Aligned_cols=75  Identities=11%  Similarity=0.286  Sum_probs=56.3

Q ss_pred             ceeEEEecCceEEEEEecCCCCC---CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEE-ECCCccce--Ee
Q psy5221         326 PYNIELVDENKYRITMAIAGFNR---SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRF-QLADHVHV--VA  399 (436)
Q Consensus       326 ~~~i~~e~~~~~~v~~~lpG~~~---edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~-~lP~~v~~--~~  399 (436)
                      .++-++ +++.+.|.+.+|+..+   +|++|+++++.|.|++...   ++.          .|.-.+ +|-..|+.  .+
T Consensus         3 ~y~W~Q-t~~~V~i~i~~~~~~~~~~~~v~v~~~~~~l~v~~~~~---~~~----------~~~~~~~~L~~~I~~e~s~   68 (92)
T cd06468           3 KYAWDQ-SDKFVKIYITLKGVHQLPKENIQVEFTERSFELKVHDL---NGK----------NYRFTINRLLKKIDPEKSS   68 (92)
T ss_pred             eeeeec-CCCEEEEEEEccCCCcCCcccEEEEecCCEEEEEEECC---CCc----------EEEEEehHhhCccCccccE
Confidence            466775 8899999999999987   9999999999999987421   111          122233 36677773  67


Q ss_pred             EEeeCcEEEEEEecC
Q psy5221         400 AFFDQGILNIDLIRE  414 (436)
Q Consensus       400 A~~~~GvL~i~~pk~  414 (436)
                      ....++-+.|++.|.
T Consensus        69 ~~~~~~ki~i~L~K~   83 (92)
T cd06468          69 FKVKTDRIVITLAKK   83 (92)
T ss_pred             EEEeCCEEEEEEEeC
Confidence            777889999999994


No 60 
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=95.61  E-value=0.044  Score=50.65  Aligned_cols=69  Identities=14%  Similarity=0.187  Sum_probs=50.0

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~  136 (436)
                      +...+...+...|.+...+.+-+.   --..|.+...+.|.++. .+..++++.+..|++||++|++..+.-.
T Consensus       125 l~~~~~~~~~~l~~~~~~~~ik~~---k~~WGScs~~~~i~~~~-~l~~~p~~~i~YVvvHELaHLke~nHs~  193 (223)
T COG1451         125 LEIRLKEYAKKLGVPPRAIKLKNM---KRRWGSCSKAGEIRFNW-RLVMAPEEVIDYVVVHELAHLKEKNHSK  193 (223)
T ss_pred             HHHHHHHHHHHhCCCccceeeeec---cceeeeecCCCcEEeeh-hhhcCCHHHHHHHHHHHHHHHhhhhccH
Confidence            556677778888888777777653   11233333333677887 7777899999999999999999977544


No 61 
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=95.12  E-value=0.17  Score=47.22  Aligned_cols=67  Identities=18%  Similarity=0.343  Sum_probs=44.1

Q ss_pred             HHHHHHHhcCCC--CCeEEEECC------CccEEEeC--CCCCccEEEEcHHHHhhcC-------HHHHHHHHHHHHHhh
Q psy5221          67 TVNKISKNANIS--TPEVAIYNG------EINAFATG--AFKNSSLIAISANLLESMT-------KLEIKAIIAHEISHI  129 (436)
Q Consensus        67 ~l~~l~~~~g~~--~p~v~v~~~------~~NA~~~G--~~~~~~~v~it~gLl~~l~-------~~El~aVLAHElgHi  129 (436)
                      ....+-+..|..  .|++.++..      ..+--+.|  .+|..+.++++..+++.|.       +=.-+-|+|||.||.
T Consensus        99 ~W~~if~~~~~~Y~~Ptlvlf~~~v~t~CG~assasGPFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHH  178 (295)
T COG2321          99 TWMQIFQESGRTYQKPTLVLFSGQVRTGCGFASSASGPFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHH  178 (295)
T ss_pred             HHHHHHHHhcccccCCeEEEecCccccCcCCCCcCCCCeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHH
Confidence            334455555543  678877765      11222232  2577888999999998874       335678999999998


Q ss_pred             hcCh
Q psy5221         130 SNGD  133 (436)
Q Consensus       130 ~~~h  133 (436)
                      .++-
T Consensus       179 VQnl  182 (295)
T COG2321         179 VQNL  182 (295)
T ss_pred             HHHH
Confidence            8754


No 62 
>PF13699 DUF4157:  Domain of unknown function (DUF4157)
Probab=94.92  E-value=0.038  Score=42.19  Aligned_cols=60  Identities=20%  Similarity=0.342  Sum_probs=40.4

Q ss_pred             HHHHHHHhcCCCCCeEEEECC----------CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcC
Q psy5221          67 TVNKISKNANISTPEVAIYNG----------EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNG  132 (436)
Q Consensus        67 ~l~~l~~~~g~~~p~v~v~~~----------~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~  132 (436)
                      .-.++....|.+..+|.|-.+          ...||+.|     ..|++..|-++ .+..+=.++||||++|+.+.
T Consensus         6 ~r~~~e~~~G~dl~~Vrvh~~~~a~~~~~~~~A~A~T~G-----~~I~f~~g~~~-~~s~~~~~llaHEl~Hv~Qq   75 (79)
T PF13699_consen    6 IRSRLERAFGADLSDVRVHTGPAASRAAAALGARAFTVG-----NDIYFAPGKYN-PDSPEGRALLAHELAHVVQQ   75 (79)
T ss_pred             HHHHHHHHhCCCccceEEEeCCchhhhhhccCCeEEEEC-----CEEEEcCCCcC-CCCCCcchhHhHHHHHHHhh
Confidence            445666677888777777543          67788886     23777554332 24445678999999999874


No 63 
>PF10263 SprT-like:  SprT-like family;  InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases. 
Probab=93.89  E-value=0.36  Score=41.88  Aligned_cols=70  Identities=16%  Similarity=0.137  Sum_probs=47.3

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC-ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN-SSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~-~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      +++.+++..=.-.++.+.|.+-.. .-.+-..-..+. ...|.++..+++..+++++..+|.|||.|.....
T Consensus         4 ~f~~~n~~~F~~~L~~~~i~~~~~~~~~~G~~~~~~~~~~~I~ls~~~~~~~~~~~~~~tL~HEm~H~~~~~   75 (157)
T PF10263_consen    4 LFDRFNEEYFGGKLPPIPITWSKRMKRTAGRCRYKRRSPCEIRLSPKLLDRNPEEELIDTLLHEMAHAAAYV   75 (157)
T ss_pred             HHHHHHHHHhCCCCCCeEEEEECCCCCceEEEEECCCCceEEEECHHHHHhhHHHHHHHHHHHHHHHHHhhh
Confidence            677777777767777655555523 222222211111 2369999999998899999999999999987743


No 64 
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=93.60  E-value=0.18  Score=43.43  Aligned_cols=66  Identities=18%  Similarity=0.259  Sum_probs=43.2

Q ss_pred             HHHHHHHHHHh-c--CCCCCeEEEECCCccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          64 LLKTVNKISKN-A--NISTPEVAIYNGEINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        64 L~~~l~~l~~~-~--g~~~p~v~v~~~~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      |.+.++++-.+ .  .++.|.|..-...-.+  .|... ....|.++..|++..+++++..||.|||+|...
T Consensus         3 L~~~~~~~n~~~F~~~l~~~~i~w~~r~~~~--~G~~~~~~~~I~ln~~l~~~~~~~~l~~~l~HEm~H~~~   72 (146)
T smart00731        3 LTQRLEDASLRVFGRKLPHPKVVWNKRLRKT--GGRCLLKSAEIRLNPKLLTENGRDRLRETLLHELCHAAL   72 (146)
T ss_pred             HHHHHHHHHHHHHCCCCCCCEEEEehhhhhh--hHHhhcCCCEEEeCHHHHhhccHHHHHhhHHHHHHHHHH
Confidence            66666666532 3  3455544333321111  22222 367799999898888999999999999999986


No 65 
>cd00237 p23 p23 binds heat shock protein (Hsp)90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis.
Probab=93.58  E-value=1.4  Score=35.55  Aligned_cols=74  Identities=11%  Similarity=0.056  Sum_probs=51.5

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF  402 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~  402 (436)
                      |+++-++ +++.+.|.+++|+  .+|++|+++++.|+++|...   ++..|          .-.+.|=..|+.  .+-..
T Consensus         2 p~v~WaQ-r~~~V~ltI~v~d--~~d~~v~l~~~~l~f~~~~~---~g~~y----------~~~l~l~~~I~pe~Sk~~v   65 (106)
T cd00237           2 AKTLWYD-RRDYVFIEFCVED--SKDVKVDFEKSKLTFSCLNG---DNVKI----------YNEIELYDRVDPNDSKHKR   65 (106)
T ss_pred             CcceeeE-CCCEEEEEEEeCC--CCCcEEEEecCEEEEEEECC---CCcEE----------EEEEEeecccCcccCeEEe
Confidence            6788886 8889999999999  58999999999999998432   22222          223445455553  33333


Q ss_pred             eCcEEEEEEecC
Q psy5221         403 DQGILNIDLIRE  414 (436)
Q Consensus       403 ~~GvL~i~~pk~  414 (436)
                      ..--+.|.+.|.
T Consensus        66 ~~r~ve~~L~K~   77 (106)
T cd00237          66 TDRSILCCLRKG   77 (106)
T ss_pred             CCceEEEEEEeC
Confidence            455677788884


No 66 
>PF10026 DUF2268:  Predicted Zn-dependent protease (DUF2268);  InterPro: IPR018728  This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function. 
Probab=93.29  E-value=0.23  Score=44.96  Aligned_cols=69  Identities=20%  Similarity=0.125  Sum_probs=43.4

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECC-Ccc------EEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNG-EIN------AFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~-~~N------A~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      +.+.+.++.+....+--+|++... ..|      --..|.+.....|++.- +-...+.+++.+++|||+-|..+..
T Consensus         5 i~~~~~~~~~~~~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~-~~~~~~~~~l~~~iaHE~hH~~r~~   80 (195)
T PF10026_consen    5 IEEALEKSIELLPGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFL-LPNDYSLEELPALIAHEYHHNCRYE   80 (195)
T ss_pred             HHHHHHHHHHHcCCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEe-cCCcccHHHHHHHHHHHHHHHHHHh
Confidence            567777777777776555665554 112      11112222234465654 4555688999999999999997644


No 67 
>PF13203 DUF2201_N:  Putative metallopeptidase domain
Probab=92.83  E-value=0.24  Score=47.88  Aligned_cols=38  Identities=18%  Similarity=0.135  Sum_probs=34.1

Q ss_pred             cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221         101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTML  138 (436)
Q Consensus       101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~  138 (436)
                      ..|+++...+..++.+|+.+|++||+-|+..+|..+..
T Consensus        43 ~~l~~nP~~~~~l~~~~~~~~l~HevlH~~~~H~~r~~   80 (292)
T PF13203_consen   43 RRLYYNPEFLESLSPEERVGLLLHEVLHCLLRHPWRRG   80 (292)
T ss_pred             cEEEECcHHHhcCCHHHHHHHHHHHHHHHHccchhhhc
Confidence            36999999999999999999999999999999986643


No 68 
>PF04450 BSP:  Peptidase of plants and bacteria;  InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=92.56  E-value=0.28  Score=44.72  Aligned_cols=70  Identities=21%  Similarity=0.212  Sum_probs=49.8

Q ss_pred             HHHHHHHHHHhcCCC------CCeEE-EECC-CccEEEeCCCCCccEEEEcHHHHhhcC-----HHHHHHHHHHHHHhhh
Q psy5221          64 LLKTVNKISKNANIS------TPEVA-IYNG-EINAFATGAFKNSSLIAISANLLESMT-----KLEIKAIIAHEISHIS  130 (436)
Q Consensus        64 L~~~l~~l~~~~g~~------~p~v~-v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~-----~~El~aVLAHElgHi~  130 (436)
                      |.+.+..+.+.+-.+      ...|. ++++ ..=|++.| .+....|.++...++...     ..|+..||-||+.|+.
T Consensus        30 L~~a~~~V~~~ly~~~~~~~~v~~Vt~~~~~~~gVA~t~g-d~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~  108 (205)
T PF04450_consen   30 LRDASRFVWRLLYQSPADRKPVRSVTLILDDMDGVAYTSG-DDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCW  108 (205)
T ss_pred             HHHHHHHHHHHhCCCCCCCCcccEEEEEEECCCeeEEEec-CCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHh
Confidence            555555555544332      23444 4555 77788888 455677999999999885     3599999999999999


Q ss_pred             cChH
Q psy5221         131 NGDM  134 (436)
Q Consensus       131 ~~h~  134 (436)
                      +.+.
T Consensus       109 Q~~~  112 (205)
T PF04450_consen  109 QWDG  112 (205)
T ss_pred             hcCC
Confidence            9775


No 69 
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=91.36  E-value=2.6  Score=37.46  Aligned_cols=70  Identities=13%  Similarity=0.091  Sum_probs=45.5

Q ss_pred             HHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221          65 LKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTML  138 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~  138 (436)
                      .+.-+.+.+..|+-.-.|-.+++..   +--..|++++|.++..-+..-+- .-.||-|||.||-.+.+..+..
T Consensus        43 aevAr~iLd~nGl~dV~Ve~v~G~L---TDHYDP~~kvvrLSe~~y~g~Si-a~~aVAAHEVGHAiQd~~~Y~~  112 (226)
T COG2738          43 AEVARMILDENGLYDVPVEEVPGTL---TDHYDPRRKVVRLSEANYYGPSI-AAIAVAAHEVGHAIQDQEDYAF  112 (226)
T ss_pred             HHHHHHHHhhcCCccceeeeecCCc---ccccChhhheeeccccccCCccH-HHHHHHHHHhhHHHhhhcccHH
Confidence            4677788888998644455555410   11124778888888755444333 4468999999999887755433


No 70 
>cd06495 p23_NUDCD3_like p23-like NUD (nuclear distribution) C-like domain found in human NUDC domain-containing protein 3 (NUDCD3) and similar proteins.   Little is known about the function of the proteins in this subgroup.
Probab=90.41  E-value=4.5  Score=32.44  Aligned_cols=77  Identities=8%  Similarity=0.226  Sum_probs=53.0

Q ss_pred             CCceeEEEecCceEEEEEecC-CC-CCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEe
Q psy5221         324 YPPYNIELVDENKYRITMAIA-GF-NRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVA  399 (436)
Q Consensus       324 ~p~~~i~~e~~~~~~v~~~lp-G~-~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~  399 (436)
                      .+.|.-.+ +.+.+.|.+.+| |. +..|++|++..+.|.|.-+..  .++..++     .|      .|+..|+  -..
T Consensus         4 ~e~Y~WtQ-Tl~eV~V~i~lp~~~~~~kdv~v~i~~~~l~v~~~~~--~~~~~~i-----~G------~L~~~V~~des~   69 (102)
T cd06495           4 RENYTWSQ-DYTDVEVRVPVPKDVVKGRQVSVDLQSSSIRVSVRDG--GGEKVLM-----EG------EFTHKINTENSL   69 (102)
T ss_pred             CCceEEEe-ECCeEEEEEECCCCCccceEEEEEEEcCEEEEEEecC--CCCceEE-----eC------cccCcccCccce
Confidence            45666776 888999999999 53 578999999999999874310  0001111     33      3455555  366


Q ss_pred             EEeeCcE-EEEEEecC
Q psy5221         400 AFFDQGI-LNIDLIRE  414 (436)
Q Consensus       400 A~~~~Gv-L~i~~pk~  414 (436)
                      -.++||- |.|+|-|.
T Consensus        70 Wtled~~~l~I~L~K~   85 (102)
T cd06495          70 WSLEPGKCVLLSLSKC   85 (102)
T ss_pred             EEEeCCCEEEEEEEEC
Confidence            6788864 89999994


No 71 
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=90.29  E-value=0.31  Score=44.63  Aligned_cols=31  Identities=23%  Similarity=0.243  Sum_probs=24.6

Q ss_pred             CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      .++.|+++.    ..+++.-.+++|||+||+..+.
T Consensus        57 ~~~~I~iN~----n~~~~r~rFtlAHELGH~llH~   87 (213)
T COG2856          57 EKPVIYINA----NNSLERKRFTLAHELGHALLHT   87 (213)
T ss_pred             cCceEEEeC----CCCHHHHHHHHHHHHhHHHhcc
Confidence            345677776    4489999999999999988754


No 72 
>KOG1309|consensus
Probab=90.27  E-value=1.1  Score=39.28  Aligned_cols=42  Identities=19%  Similarity=0.167  Sum_probs=37.8

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEe
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQ  367 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~  367 (436)
                      +..|.|+ +++..+|.+-.+|+.++|++|.+.++.|.|.-+.+
T Consensus         4 ~r~DwyQ-t~~~vvIti~~k~v~~~~v~v~~s~~~l~~~~~~~   45 (196)
T KOG1309|consen    4 IRHDWYQ-TETSVVITIFAKNVPKEDVNVEISENTLSIVIQLP   45 (196)
T ss_pred             ccceeec-CCceEEEEEEecCCCccceeEEeecceEEEEEecC
Confidence            5689996 99999999999999999999999999999886654


No 73 
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=90.26  E-value=1.4  Score=43.87  Aligned_cols=77  Identities=8%  Similarity=0.051  Sum_probs=58.2

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF  402 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~  402 (436)
                      +.+|.|+ +++.+.|.+-+.|+.+++++|++.++.|.|+-...   .+.          .|.-.+.|-..|+.  .+.+.
T Consensus       157 ~r~dWyQ-s~~~V~i~i~~k~~~~~~~~v~~~~~~l~v~~~~~---~~~----------~y~~~~~L~~~I~p~~s~~~v  222 (356)
T PLN03088        157 YRHEFYQ-KPEEVVVTVFAKGVPAENVNVDFGEQILSVVIEVP---GED----------AYHLQPRLFGKIIPDKCKYEV  222 (356)
T ss_pred             cccceee-cCCEEEEEEEecCCChHHcEEEeecCEEEEEEecC---CCc----------ceeecccccccccccccEEEE
Confidence            5688997 89999999999999999999999999999986432   111          24444677777773  55555


Q ss_pred             eCcEEEEEEecCC
Q psy5221         403 DQGILNIDLIREV  415 (436)
Q Consensus       403 ~~GvL~i~~pk~~  415 (436)
                      ..--+.|+|.|..
T Consensus       223 ~~~Kiei~l~K~~  235 (356)
T PLN03088        223 LSTKIEIRLAKAE  235 (356)
T ss_pred             ecceEEEEEecCC
Confidence            5668899998853


No 74 
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=90.14  E-value=0.77  Score=36.75  Aligned_cols=67  Identities=16%  Similarity=0.184  Sum_probs=44.4

Q ss_pred             HHHHHHHHHHhcCCC---CCeEEEECC---CccEEE------------eCCCCCccEEEEcHHHHhhcCHHHHHHHHHHH
Q psy5221          64 LLKTVNKISKNANIS---TPEVAIYNG---EINAFA------------TGAFKNSSLIAISANLLESMTKLEIKAIIAHE  125 (436)
Q Consensus        64 L~~~l~~l~~~~g~~---~p~v~v~~~---~~NA~~------------~G~~~~~~~v~it~gLl~~l~~~El~aVLAHE  125 (436)
                      ..+.++.+..++|+.   ..+|+++-+   ..-|++            .|..|.--+-+++. =++.|+.+|-.-|++||
T Consensus         9 ve~~~~~~V~~lgLdyi~~~rv~vVys~gS~~~A~ARIwg~pki~~e~lglnP~YviEl~se-kF~rLs~~ekvKviiHE   87 (133)
T COG4900           9 VEADIKNAVVRLGLDYIFQVRVVVVYSPGSHSKAVARIWGIPKIFQEVLGLNPVYVIELLSE-KFKRLSCAEKVKVIIHE   87 (133)
T ss_pred             HHHHHHHHHHHhCcceeeeeeEEEEECCCCcceehhhhhcccHHHHHHhCCCCeeeeeeehh-hcCCCChHHHHHHHHHH
Confidence            457788888889987   456666654   223322            24444322223333 57788999999999999


Q ss_pred             HHhhhc
Q psy5221         126 ISHISN  131 (436)
Q Consensus       126 lgHi~~  131 (436)
                      +.||-.
T Consensus        88 llHIP~   93 (133)
T COG4900          88 LLHIPA   93 (133)
T ss_pred             HhcCcc
Confidence            999865


No 75 
>cd06492 p23_mNUDC_like p23-like NUD (nuclear distribution) C-like domain of mammalian(m) NUDC and similar proteins. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I).  mNUDC is important for cell proliferation both in normal and tumor tissues.  Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its extracellular domain, and promoting cell proliferation and differentiation.
Probab=89.30  E-value=5.1  Score=31.06  Aligned_cols=66  Identities=11%  Similarity=0.279  Sum_probs=46.3

Q ss_pred             cCceEEEEEecC---CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc-E
Q psy5221         333 DENKYRITMAIA---GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG-I  406 (436)
Q Consensus       333 ~~~~~~v~~~lp---G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G-v  406 (436)
                      +.++..|.+.+|   |+++.|++|++..+.|.|.-+.      ...+    -.|      .|...|+  ...-.+++| .
T Consensus         6 T~~ev~v~v~l~~~~~~~~kdv~v~i~~~~l~v~~~g------~~~~----i~G------~L~~~V~~des~Wtled~~~   69 (87)
T cd06492           6 TLSEVELKVPFKVSFRLKGKDVVVDIQRKHLKVGLKG------QPPI----IDG------ELYNEVKVEESSWLIEDGKV   69 (87)
T ss_pred             ecCEEEEEEECCCCCCccceEEEEEEecCEEEEEECC------CceE----EeC------cccCcccccccEEEEeCCCE
Confidence            667789999986   3889999999999999986321      1111    123      3445555  366667786 8


Q ss_pred             EEEEEecC
Q psy5221         407 LNIDLIRE  414 (436)
Q Consensus       407 L~i~~pk~  414 (436)
                      |.|++-|.
T Consensus        70 l~i~L~K~   77 (87)
T cd06492          70 VTVNLEKI   77 (87)
T ss_pred             EEEEEEEC
Confidence            99999994


No 76 
>cd06490 p23_NCB5OR p23_like domain found in NAD(P)H cytochrome b5 (NCB5) oxidoreductase (OR) and similar proteins.  NCB5OR is widely expressed in human organs and tissues and is localized in the ER (endoplasmic reticulum). It appears to play a critical role in maintaining viable pancreatic beta cells. Mice homozygous for a targeted knockout (KO) of the gene encoding NCB5OR develop an early-onset nonautoimmune diabetes phenotype with a non-inflammatory beta-cell deficiency.  The role of NCB5OR in beta cells may be in maintaining or regulating their redox status. Proteins in this group in addition contain an N-terminal cytochrome b5 domain and a C-terminal cytochrome b5 oxidoreductase domain.  The gene encoding NCB5OR has been considered as a positional candidate for type II diabetes and other diabetes subtypes related to B-cell dysfunction, however variation in its coding region does not appear not to be a major contributor to the pathogenesis of these diseases.
Probab=88.96  E-value=6  Score=30.60  Aligned_cols=73  Identities=11%  Similarity=0.189  Sum_probs=49.0

Q ss_pred             eeEEEecCceEEEEEecCC--CCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce-EeEEee
Q psy5221         327 YNIELVDENKYRITMAIAG--FNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV-VAAFFD  403 (436)
Q Consensus       327 ~~i~~e~~~~~~v~~~lpG--~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~-~~A~~~  403 (436)
                      +|.|+ +++.++|.+-.++  .+++++.+..+++.|+|+-...    +          ..|...+.|=..++. .+.++.
T Consensus         1 ~DWyQ-t~~~Vtitiy~K~~~~~~~~v~v~~~~~~l~v~~~~~----~----------~~~~~~~~L~~~I~~~~~~~~~   65 (87)
T cd06490           1 YDWFQ-TDSEVTIVVYTKSKGNPADIVIVDDQQRELRVEIILG----D----------KSYLLHLDLSNEVQWPCEVRIS   65 (87)
T ss_pred             CCceE-CCCEEEEEEEEcccCCCCccEEEECCCCEEEEEEECC----C----------ceEEEeeeccccCCCCcEEEEc
Confidence            47787 8999999999885  4555666666777888874332    1          124555666666662 234444


Q ss_pred             --CcEEEEEEecC
Q psy5221         404 --QGILNIDLIRE  414 (436)
Q Consensus       404 --~GvL~i~~pk~  414 (436)
                        -|.++|+++|.
T Consensus        66 ~~~~KVEI~L~K~   78 (87)
T cd06490          66 TETGKIELVLKKK   78 (87)
T ss_pred             ccCceEEEEEEcC
Confidence              78999999994


No 77 
>PF02031 Peptidase_M7:  Streptomyces extracellular neutral proteinase (M7) family;  InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=88.46  E-value=1.4  Score=36.46  Aligned_cols=39  Identities=26%  Similarity=0.217  Sum_probs=23.3

Q ss_pred             CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      ..-|.+.|.+  +.+|.++..--+..+.-   -|.+||+||+..
T Consensus        52 ~sya~~~g~G--~G~I~l~~~~~qgy~~~---RIaaHE~GHiLG   90 (132)
T PF02031_consen   52 GSYASTDGLG--SGYIFLDYQQNQGYNST---RIAAHELGHILG   90 (132)
T ss_dssp             --EEEE-SSS---EEEEEEHHHHHHS-HH---HHHHHHHHHHHT
T ss_pred             CcccccCCCC--cEEEEechHHhhCCccc---eeeeehhccccC
Confidence            4446666654  67799987544444432   299999999865


No 78 
>PRK04860 hypothetical protein; Provisional
Probab=88.36  E-value=1.8  Score=37.76  Aligned_cols=66  Identities=18%  Similarity=0.191  Sum_probs=41.4

Q ss_pred             HHHHHHHHHHhcCCC--CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          64 LLKTVNKISKNANIS--TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        64 L~~~l~~l~~~~g~~--~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      +...++.-....|-+  .|.+..-.-.-.|-.+..  ...-|-++..++..-+++++..||.||++|+.-
T Consensus         9 ~~~~~~~a~~~f~~~f~~p~~~f~~R~rtaG~~~l--~~~~I~~Np~ll~~~~~~~l~~~v~HEl~H~~~   76 (160)
T PRK04860          9 LRECLAQANLYFKRTFPEPKVSYTQRGTSAGTAWL--QSNEIRLNPVLLLENQQAFIDEVVPHELAHLLV   76 (160)
T ss_pred             HHHHHHHHHHHhCCCCCCCEEEEeecchhhcchhH--hcCCeeeCHHHHhhCcHHHHHhHHHHHHHHHHH
Confidence            344444444455655  455543222222433322  234488898898888999999999999999854


No 79 
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=87.14  E-value=1.6  Score=37.14  Aligned_cols=66  Identities=21%  Similarity=0.188  Sum_probs=44.5

Q ss_pred             HHHHHHHHHHHhcCCC--CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhh
Q psy5221          63 WLLKTVNKISKNANIS--TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHIS  130 (436)
Q Consensus        63 ~L~~~l~~l~~~~g~~--~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~  130 (436)
                      .|.+.++++.....-+  .|+++.-.-.-+|-..-+ . +.-|-++.-++....++.+.-|+.||++|+.
T Consensus         6 ~L~~~~~~as~~~~r~~~~p~~~~n~Rg~taG~ayL-~-~~~I~lNP~ll~en~~~f~~~vV~HELaHl~   73 (156)
T COG3091           6 KLQQCVEQASLKFFRKFFRPKASYNQRGRTAGGAYL-L-KSEIRLNPKLLEENGEDFIEQVVPHELAHLH   73 (156)
T ss_pred             HHHHHHHHHHHHhcCCCCCcceehhhhhhhcchhhc-c-ccccccCHHHHHHccHHHHHHHHHHHHHHHH
Confidence            4788888887776543  555543222333322212 2 2257888889999999999999999999974


No 80 
>PF13349 DUF4097:  Domain of unknown function (DUF4097)
Probab=84.22  E-value=11  Score=32.63  Aligned_cols=81  Identities=17%  Similarity=0.210  Sum_probs=48.2

Q ss_pred             CCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCcc--ceEeEE
Q psy5221         324 YPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHV--HVVAAF  401 (436)
Q Consensus       324 ~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v--~~~~A~  401 (436)
                      ...+.|...+++.+.+  +.   +.+.++++.++++|.|+.+.....-...+..... ...-+-.+.||++.  +.++..
T Consensus        65 ~~~V~I~~~~~~~i~v--~~---~~k~~~~~~~~~~L~I~~~~~~~~~~~~~~~~~~-~~~~~i~I~lP~~~~l~~i~i~  138 (166)
T PF13349_consen   65 NGDVEIKPSDDDKIKV--EY---NGKKPEISVEGGTLTIKSKDRESFFFKGFNFNNS-DNKSKITIYLPKDYKLDKIDIK  138 (166)
T ss_pred             ceeEEEEEcCCccEEE--EE---cCcEEEEEEcCCEEEEEEecccccccceEEEccc-CCCcEEEEEECCCCceeEEEEE
Confidence            3557776533444444  44   3227999999999999977221111111111111 23455668999986  468888


Q ss_pred             eeCcEEEEE
Q psy5221         402 FDQGILNID  410 (436)
Q Consensus       402 ~~~GvL~i~  410 (436)
                      ..+|-+++.
T Consensus       139 ~~~G~i~i~  147 (166)
T PF13349_consen  139 TSSGDITIE  147 (166)
T ss_pred             eccccEEEE
Confidence            889966654


No 81 
>PF12388 Peptidase_M57:  Dual-action HEIGH metallo-peptidase;  InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=82.87  E-value=1.1  Score=40.81  Aligned_cols=34  Identities=26%  Similarity=0.313  Sum_probs=25.2

Q ss_pred             ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc-ChHH
Q psy5221         100 SSLIAISANLLESMTKLEIKAIIAHEISHISN-GDMV  135 (436)
Q Consensus       100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~-~h~~  135 (436)
                      .+.|.+. + ++..+.+..+.||+|||||... ||.-
T Consensus       117 ~~~I~I~-~-~~~~~~~~~~hvi~HEiGH~IGfRHTD  151 (211)
T PF12388_consen  117 YKFIQIY-G-LSNYSVNVIEHVITHEIGHCIGFRHTD  151 (211)
T ss_pred             CceEEEE-e-cCCCchhHHHHHHHHHhhhhccccccC
Confidence            4456663 2 4666888999999999999765 6654


No 82 
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=80.48  E-value=0.45  Score=48.93  Aligned_cols=55  Identities=22%  Similarity=0.215  Sum_probs=37.5

Q ss_pred             hcCCCCCe-EEEECC-CccEEEeCCCCCccEEEEcHHHHhhc--CH--------HHHHHHHHHHHHhhhc
Q psy5221          74 NANISTPE-VAIYNG-EINAFATGAFKNSSLIAISANLLESM--TK--------LEIKAIIAHEISHISN  131 (436)
Q Consensus        74 ~~g~~~p~-v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l--~~--------~El~aVLAHElgHi~~  131 (436)
                      +.|-|+.+ -..++. .+|||--   |.+..|++-.++|+.-  ++        .-+-|||||||||---
T Consensus       434 K~~kPVDr~eW~M~pq~VNAYYn---p~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFD  500 (654)
T COG3590         434 KIGKPVDRDEWEMPPQTVNAYYN---PQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFD  500 (654)
T ss_pred             HhCCCCchhhcCCCHHHhhhhcC---CCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhccccc
Confidence            44555432 233344 8899866   6677899999898752  22        2488999999999754


No 83 
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=79.75  E-value=2.3  Score=40.64  Aligned_cols=73  Identities=16%  Similarity=0.165  Sum_probs=48.2

Q ss_pred             HHHHHHHHHHhcCCCCC------eEEEECC--CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221          64 LLKTVNKISKNANISTP------EVAIYNG--EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p------~v~v~~~--~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      +...+.++.+...+...      .+.++..  .+|+-.+-.+...-.++++.-.+..++.+++.|+|-||+-|+...|+.
T Consensus         7 ~~k~flr~~~es~f~l~~~~~ffa~~Li~~k~i~s~~~~~a~t~~~~~y~NPei~~~~p~~~~~aLl~HEV~Hi~l~Hi~   86 (396)
T COG3864           7 FRKKFLRFIEESNFDLMNEDNFFALFLIQIKDIASACETKASTSYFTMYFNPEIFLNCPISEMKALLKHEVYHIMLNHIK   86 (396)
T ss_pred             HHHHHHHHhccCCceeeccchhhhhhhhhhccchhHHHhhccCCceEEEeCHHHHccCCHHHHHHHHHHHHHHHHHHHHH
Confidence            55666666665554321      1122333  334433322233556888888999999999999999999999999985


Q ss_pred             H
Q psy5221         136 T  136 (436)
Q Consensus       136 ~  136 (436)
                      +
T Consensus        87 r   87 (396)
T COG3864          87 R   87 (396)
T ss_pred             h
Confidence            4


No 84 
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=77.89  E-value=2.1  Score=34.90  Aligned_cols=36  Identities=19%  Similarity=0.350  Sum_probs=25.0

Q ss_pred             CCCCCccEEEEcHHHHhhcCH------HHHHHHHHHHHHhhhc
Q psy5221          95 GAFKNSSLIAISANLLESMTK------LEIKAIIAHEISHISN  131 (436)
Q Consensus        95 G~~~~~~~v~it~gLl~~l~~------~El~aVLAHElgHi~~  131 (436)
                      |-+| .++-+....+++.+.+      +++.-|+-||+||.-.
T Consensus        81 G~~P-~rItlYRrailDywae~eetlgd~vthvliHEIgHhFG  122 (136)
T COG3824          81 GELP-DRITLYRRALLDYWAENEETLGDQVTHVLIHEIGHHFG  122 (136)
T ss_pred             CCCC-ceeeeeHHHHHHHHhhhhhhHhhHhhhhhhhhhhhhcC
Confidence            4345 4556667778888744      4567889999999754


No 85 
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=76.00  E-value=2.7  Score=39.18  Aligned_cols=33  Identities=27%  Similarity=0.396  Sum_probs=26.1

Q ss_pred             EEcHHHHhhc---CHHHHHHHHHHHHHhhh---cChHHH
Q psy5221         104 AISANLLESM---TKLEIKAIIAHEISHIS---NGDMVT  136 (436)
Q Consensus       104 ~it~gLl~~l---~~~El~aVLAHElgHi~---~~h~~~  136 (436)
                      .++++||+.|   ++.+|+.+|-||++|.+   ++|+..
T Consensus       180 wFdDPlLstmlr~dd~~lA~LIFHELAHQk~Y~~~DtAF  218 (376)
T COG4324         180 WFDDPLLSTMLRQDDTYLASLIFHELAHQKIYVNNDTAF  218 (376)
T ss_pred             ccccHHHHHHhcCChHHHHHHHHHHHhhheEeecCcchH
Confidence            3566777776   89999999999999976   466644


No 86 
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=75.66  E-value=1.7  Score=37.54  Aligned_cols=17  Identities=29%  Similarity=0.446  Sum_probs=15.1

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy5221         115 KLEIKAIIAHEISHISN  131 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~  131 (436)
                      ..++.+|+.|||||...
T Consensus       101 ~~~~~~~~~HEiGHaLG  117 (156)
T cd04279         101 AENLQAIALHELGHALG  117 (156)
T ss_pred             chHHHHHHHHHhhhhhc
Confidence            56899999999999876


No 87 
>PF01421 Reprolysin:  Reprolysin (M12B) family zinc metalloprotease  This Prosite motif covers only the active site.;  InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=75.30  E-value=2.1  Score=38.74  Aligned_cols=40  Identities=30%  Similarity=0.310  Sum_probs=23.2

Q ss_pred             cEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          90 NAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        90 NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      -|+..|.+..+.-+.+..  ...-+...++.++|||+||.-.
T Consensus       105 ~a~~~~~C~~~~s~~i~~--~~~~~~~~~a~~~AHelGH~lG  144 (199)
T PF01421_consen  105 LAYVGGMCSPSRSCGIVE--DHSRSGLSFAVIIAHELGHNLG  144 (199)
T ss_dssp             EE-TT-TTSTTTSEEEEE---SSSSHHHHHHHHHHHHHHHTT
T ss_pred             eEeCCCCCCcCCCCcEee--eccchhHHHHHHHHHHHHHhcC
Confidence            355555554433343333  1124577889999999999754


No 88 
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=74.64  E-value=2.6  Score=37.88  Aligned_cols=17  Identities=29%  Similarity=0.362  Sum_probs=14.4

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy5221         115 KLEIKAIIAHEISHISN  131 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~  131 (436)
                      ....+.++|||+||.-.
T Consensus       128 ~~~~a~~~AHElGH~lG  144 (194)
T cd04269         128 LLLFAVTMAHELGHNLG  144 (194)
T ss_pred             hHHHHHHHHHHHHhhcC
Confidence            45778999999999874


No 89 
>PF14891 Peptidase_M91:  Effector protein
Probab=74.58  E-value=8  Score=34.22  Aligned_cols=67  Identities=13%  Similarity=0.198  Sum_probs=35.3

Q ss_pred             HHHHHHHHHhcCCCCCeEEEECC-CccE-EEeCCC-----CCccEEEEcHHHHh------------hcCHHHHHHHHHHH
Q psy5221          65 LKTVNKISKNANISTPEVAIYNG-EINA-FATGAF-----KNSSLIAISANLLE------------SMTKLEIKAIIAHE  125 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p~v~v~~~-~~NA-~~~G~~-----~~~~~v~it~gLl~------------~l~~~El~aVLAHE  125 (436)
                      +++|+.|.+.......+|+|... ..|. ...+..     .....|.++.....            .-....=..+|+||
T Consensus        31 ~~mL~~l~~~~~~~~~~vtI~~~~~~~~~~~~~~~~~~~~G~~~~I~~np~~~~~~~~~~~~~~~~~~~~~~p~v~L~HE  110 (174)
T PF14891_consen   31 QQMLRELEKRSDSGEKKVTISEIDSDNGPYSPGSNSASQLGSDATINWNPSKYSIGDGTEGSPINGGWSPRPPFVVLYHE  110 (174)
T ss_pred             HHHHHHHHhhccCCCCCeEEecCCCCCCccccCccccccCCCceEEEECCCcccccCCCCcccccccchhHHHHHHHHHH
Confidence            56777776655544556666655 2121 111100     12344555553000            01244567999999


Q ss_pred             HHhhhc
Q psy5221         126 ISHISN  131 (436)
Q Consensus       126 lgHi~~  131 (436)
                      |+|..+
T Consensus       111 L~HA~~  116 (174)
T PF14891_consen  111 LIHAYD  116 (174)
T ss_pred             HHHHHH
Confidence            999876


No 90 
>PF08325 WLM:  WLM domain;  InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=74.47  E-value=7.8  Score=34.75  Aligned_cols=23  Identities=22%  Similarity=0.286  Sum_probs=19.6

Q ss_pred             cCHHHHHHHHHHHHHhhhcChHH
Q psy5221         113 MTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus       113 l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      ++-+++..|+-|||+|+.++..-
T Consensus        77 l~~~~i~~t~lHELaH~~~~~H~   99 (186)
T PF08325_consen   77 LPYETILGTMLHELAHNVHGPHD   99 (186)
T ss_pred             eeHHHHHHHHHHHHHhcccCCcc
Confidence            37799999999999999986543


No 91 
>PF00413 Peptidase_M10:  Matrixin This Prosite motif covers only the active site.;  InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)).  The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=74.28  E-value=2  Score=36.81  Aligned_cols=18  Identities=22%  Similarity=0.455  Sum_probs=15.0

Q ss_pred             CHHHHHHHHHHHHHhhhc
Q psy5221         114 TKLEIKAIIAHEISHISN  131 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~  131 (436)
                      +..++..|+.|||||...
T Consensus       101 ~~~~~~~v~~HEiGHaLG  118 (154)
T PF00413_consen  101 SGNDLQSVAIHEIGHALG  118 (154)
T ss_dssp             SSEEHHHHHHHHHHHHTT
T ss_pred             hhhhhhhhhhhccccccC
Confidence            445799999999999854


No 92 
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=73.19  E-value=2.1  Score=37.20  Aligned_cols=18  Identities=17%  Similarity=0.366  Sum_probs=15.2

Q ss_pred             CHHHHHHHHHHHHHhhhc
Q psy5221         114 TKLEIKAIIAHEISHISN  131 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~  131 (436)
                      +.+.+.++++|||||...
T Consensus        90 ~~~~~~~~~~HEiGHaLG  107 (165)
T cd04268          90 SGARLRNTAEHELGHALG  107 (165)
T ss_pred             HHHHHHHHHHHHHHHHhc
Confidence            446799999999999765


No 93 
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=72.87  E-value=2.3  Score=39.90  Aligned_cols=18  Identities=28%  Similarity=0.329  Sum_probs=14.9

Q ss_pred             CHHHHHHHHHHHHHhhhc
Q psy5221         114 TKLEIKAIIAHEISHISN  131 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~  131 (436)
                      .....+.++||||||--.
T Consensus       163 ~~~~~a~t~AHElGHnlG  180 (244)
T cd04270         163 PTKESDLVTAHELGHNFG  180 (244)
T ss_pred             chhHHHHHHHHHHHHhcC
Confidence            345688999999999876


No 94 
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=70.89  E-value=5.9  Score=42.32  Aligned_cols=50  Identities=20%  Similarity=0.304  Sum_probs=34.2

Q ss_pred             CeEEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221          80 PEVAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus        80 p~v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      .+++--++ .+-||+.|..+ ....|+.+-  ...+   .=..+|+||+||..|.-.
T Consensus       345 iD~~~~~gKrsGaYs~~~~~~~~p~IlmN~--~gt~---~dV~TLaHElGHs~Hs~~  396 (598)
T COG1164         345 IDVYPRKGKRSGAYSIGFYKGDHPFILMNY--DGTL---RDVFTLAHELGHSVHSYF  396 (598)
T ss_pred             eeccCCCCCCCCcccCCCCCCCCCeEEEeC--CCch---hHHHHHHHHccHHHHHHH
Confidence            34444445 89999999877 677776653  2222   225789999999988643


No 95 
>PF13485 Peptidase_MA_2:  Peptidase MA superfamily
Probab=70.19  E-value=3.1  Score=33.89  Aligned_cols=23  Identities=22%  Similarity=0.331  Sum_probs=18.9

Q ss_pred             hcCHHHHHHHHHHHHHhhhcChH
Q psy5221         112 SMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus       112 ~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      ..+.+.+..||+||++|....+.
T Consensus        19 ~~~~~~~~~~l~HE~~H~~~~~~   41 (128)
T PF13485_consen   19 GSDEDWLDRVLAHELAHQWFGNY   41 (128)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHH
Confidence            35778889999999999987654


No 96 
>PF09768 Peptidase_M76:  Peptidase M76 family;  InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=69.19  E-value=15  Score=32.44  Aligned_cols=63  Identities=8%  Similarity=0.138  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHhcCCC--CCeEEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          64 LLKTVNKISKNANIS--TPEVAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        64 L~~~l~~l~~~~g~~--~p~v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      ++-+++.+. ++|-+  .++...... ..+ ...|+.| .+++++...-   .-+..++.-+|+|||-|..-
T Consensus        18 v~fl~~~~~-~~gc~~~~~~~i~c~~C~~~-~~Ggf~p~~~~I~lC~N~---~~~~~~l~~~l~HELIHayD   84 (173)
T PF09768_consen   18 VRFLMEALK-KLGCPPVPPRHIKCEPCDSS-VSGGFDPSKKGIVLCQNR---IRSQGHLEDTLTHELIHAYD   84 (173)
T ss_pred             HHHHHHHHH-HcCCCCCCCCCeEEEECcCC-CcCCccCCCCCEEEeeCC---CCCHHHHHHHHHHHHHHHHH
Confidence            444444444 45543  233333333 333 4456666 5666665541   24999999999999999764


No 97 
>PF13688 Reprolysin_5:  Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=68.61  E-value=2.5  Score=37.98  Aligned_cols=18  Identities=28%  Similarity=0.255  Sum_probs=14.7

Q ss_pred             CHHHHHHHHHHHHHhhhc
Q psy5221         114 TKLEIKAIIAHEISHISN  131 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~  131 (436)
                      +...-.-++|||+||.-.
T Consensus       138 ~~~~~~~~~AHEiGH~lG  155 (196)
T PF13688_consen  138 PTYNGAITFAHEIGHNLG  155 (196)
T ss_dssp             -HHHHHHHHHHHHHHHTT
T ss_pred             CCCceehhhHHhHHHhcC
Confidence            467888999999999765


No 98 
>PF14247 DUF4344:  Domain of unknown function (DUF4344)
Probab=67.64  E-value=19  Score=33.10  Aligned_cols=23  Identities=17%  Similarity=0.051  Sum_probs=18.1

Q ss_pred             hhhccCCChHHHHHHHHhccccc
Q psy5221         261 KLFATHPSFEKRIAILSLLKLSL  283 (436)
Q Consensus       261 ~~~~tHP~~~~RI~~L~~~~~~~  283 (436)
                      .+++.|..-.+|-..+..+.-..
T Consensus       160 ~~~deHsld~QR~yn~lCl~yGs  182 (220)
T PF14247_consen  160 PFWDEHSLDLQRYYNLLCLMYGS  182 (220)
T ss_pred             cccccCCchHHHHHhhHHhhccC
Confidence            47889999999999988765443


No 99 
>PRK06926 flagellar motor protein MotP; Reviewed
Probab=67.22  E-value=1.1e+02  Score=29.26  Aligned_cols=30  Identities=13%  Similarity=0.158  Sum_probs=24.6

Q ss_pred             HHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221         109 LLESMTKLEIKAIIAHEISHISNGDMVTML  138 (436)
Q Consensus       109 Ll~~l~~~El~aVLAHElgHi~~~h~~~~~  138 (436)
                      +.+..++++++.+|-+|+-....+|....-
T Consensus       120 vvDG~~~~~i~~iLe~ei~~~~~r~~~~~~  149 (271)
T PRK06926        120 AIDGWEPETIRDIMMAEIAAMEERHRKGRR  149 (271)
T ss_pred             HHCCCCHHHHHHHHHHHHHHHHHHHHhHHH
Confidence            344579999999999999999999975433


No 100
>PRK00446 cyaY frataxin-like protein; Provisional
Probab=66.98  E-value=7.1  Score=31.49  Aligned_cols=31  Identities=19%  Similarity=0.312  Sum_probs=22.7

Q ss_pred             EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221         398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI  435 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~  435 (436)
                      +.+.+.+|||+|+++.       ..+|-|+..++.+||
T Consensus        29 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI   59 (105)
T PRK00446         29 IDCERNGGVLTLTFEN-------GSKIIINRQEPLHEL   59 (105)
T ss_pred             eeeeccCCEEEEEECC-------CCEEEEeCCCchhhe
Confidence            7788899999999986       255666666665553


No 101
>PF13582 Reprolysin_3:  Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=66.66  E-value=3  Score=34.32  Aligned_cols=13  Identities=38%  Similarity=0.419  Sum_probs=10.8

Q ss_pred             HHHHHHHHHhhhc
Q psy5221         119 KAIIAHEISHISN  131 (436)
Q Consensus       119 ~aVLAHElgHi~~  131 (436)
                      ..+++||+||.-.
T Consensus       108 ~~~~~HEiGH~lG  120 (124)
T PF13582_consen  108 VDTFAHEIGHNLG  120 (124)
T ss_dssp             TTHHHHHHHHHTT
T ss_pred             ceEeeehhhHhcC
Confidence            3899999999753


No 102
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=65.88  E-value=4.4  Score=43.28  Aligned_cols=42  Identities=21%  Similarity=0.311  Sum_probs=29.0

Q ss_pred             CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221          88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus        88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      ...||++|..+..++|+.+-     -..-.=...||||+||-.|.-.
T Consensus       353 ~~Ga~~~~~~~~~p~il~N~-----~~~~~dv~TLaHElGHa~H~~~  394 (591)
T TIGR00181       353 RSGAYSIGGYKVKPYILMNW-----DGTLNSVFTLAHELGHSMHSYF  394 (591)
T ss_pred             CCCcccCCCCCCCCeEEEec-----CCCcchHHHHHHHhhhHHHHHH
Confidence            77899999877677775542     1112235689999999976543


No 103
>PRK00523 hypothetical protein; Provisional
Probab=65.24  E-value=25  Score=26.03  Aligned_cols=45  Identities=20%  Similarity=0.201  Sum_probs=26.0

Q ss_pred             HHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221          26 IIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST   79 (436)
Q Consensus        26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~   79 (436)
                      ++..+++.+..+|++.+.....++..|   |.++      +.++.+....|-++
T Consensus        12 i~~li~G~~~Gffiark~~~k~l~~NP---pine------~mir~M~~QMGqKP   56 (72)
T PRK00523         12 IPLLIVGGIIGYFVSKKMFKKQIRENP---PITE------NMIRAMYMQMGRKP   56 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHCc---CCCH------HHHHHHHHHhCCCc
Confidence            333444556667777777776666543   3332      46666666666653


No 104
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=64.93  E-value=15  Score=32.76  Aligned_cols=67  Identities=16%  Similarity=0.128  Sum_probs=32.7

Q ss_pred             HHHHHHHHHhcCCCC-CeEEEECC----CccEEEeCCCCC-ccEEEEcHHHHhhc----------CHHHHHHHHHHHHHh
Q psy5221          65 LKTVNKISKNANIST-PEVAIYNG----EINAFATGAFKN-SSLIAISANLLESM----------TKLEIKAIIAHEISH  128 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~-p~v~v~~~----~~NA~~~G~~~~-~~~v~it~gLl~~l----------~~~El~aVLAHElgH  128 (436)
                      ..+++.+......+. .-+.|.+.    ....|+.|.... .+.-+++..-++.-          -.+-+.-...||+||
T Consensus        56 ~~iL~~l~~~~~~~~~~vl~vt~~DLy~~~~nfvfG~a~~~~~~aVvS~~Rl~~~~~~~~~~~~l~~~R~~k~~~HElGH  135 (179)
T PRK13267         56 EKFLPLLSRIGRFNGDKNIGITDCDLYYRGLNFVFGLAYPNLRGAVISTYRLRPEFYGNKPDSELFEERVRKEVTHELGH  135 (179)
T ss_pred             HHHHHHHHhhCCCCCceEEEEEccccCCCCCCceeEEeecCCCEEEEEccccChhhccccchHHHHHHHHHHHHHHHHHH
Confidence            345555544433232 23444444    344578886432 22233333222211          123345558999999


Q ss_pred             hhc
Q psy5221         129 ISN  131 (436)
Q Consensus       129 i~~  131 (436)
                      .-.
T Consensus       136 ~lG  138 (179)
T PRK13267        136 TLG  138 (179)
T ss_pred             HcC
Confidence            854


No 105
>PF13574 Reprolysin_2:  Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=64.14  E-value=3.6  Score=36.34  Aligned_cols=14  Identities=29%  Similarity=0.266  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHhhhc
Q psy5221         118 IKAIIAHEISHISN  131 (436)
Q Consensus       118 l~aVLAHElgHi~~  131 (436)
                      -.-++|||+||.-.
T Consensus       111 ~~~~~aHElGH~lG  124 (173)
T PF13574_consen  111 GIDTFAHELGHQLG  124 (173)
T ss_dssp             HHHHHHHHHHHHHT
T ss_pred             eeeeehhhhHhhcC
Confidence            34569999999876


No 106
>TIGR03421 FeS_CyaY iron donor protein CyaY. Members of this protein family are the iron-sulfur cluster (FeS) metabolism protein CyaY, a homolog of eukaryotic frataxin. ISC is one of several bacterial systems for FeS assembly; we find by Partial Phylogenetic Profiling vs. the ISC system that CyaY most like work with the ISC system for FeS cluster biosynthesis. A study of of cyaY mutants in Salmonella enterica bears this out. Although the trusted cutoff is set low enough to include eukaryotic frataxin sequences, a narrower, exception-type model (TIGR03421) identifies identifies members of that specific set.
Probab=64.14  E-value=7.9  Score=31.03  Aligned_cols=31  Identities=19%  Similarity=0.295  Sum_probs=22.6

Q ss_pred             EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221         398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI  435 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~  435 (436)
                      +.+.+.+|||+|+++.       ..+|-|+..++.+||
T Consensus        27 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI   57 (102)
T TIGR03421        27 IDCERAGGVLTLTFEN-------GSQIIINKQEPLHQI   57 (102)
T ss_pred             eeeecCCCEEEEEECC-------CCEEEEeCCchhhhh
Confidence            7788889999999985       356666666655543


No 107
>PF01447 Peptidase_M4:  Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ;  InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=62.11  E-value=18  Score=31.17  Aligned_cols=62  Identities=18%  Similarity=0.213  Sum_probs=28.9

Q ss_pred             HHHHHHHHHHhcCCC---CCeEEEE--CC-CccEEEeCCCCCccEEEEcHHHHhhcCHHH-HHHHHHHHHHhhh
Q psy5221          64 LLKTVNKISKNANIS---TPEVAIY--NG-EINAFATGAFKNSSLIAISANLLESMTKLE-IKAIIAHEISHIS  130 (436)
Q Consensus        64 L~~~l~~l~~~~g~~---~p~v~v~--~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~E-l~aVLAHElgHi~  130 (436)
                      .++...+.-.+-++.   .+-+-.+  .. ..|||--|    .. +++..|--..+.+=. -.=|+||||+|-.
T Consensus        79 v~d~y~~~~gr~siD~~G~~~~~~Vhyg~~~~NAfW~g----~~-m~yGdG~~~~f~~~~~~lDVvaHEltHGV  147 (150)
T PF01447_consen   79 VYDYYKNVFGRNSIDGNGMPIISRVHYGKNYNNAFWNG----SQ-MVYGDGDGQIFKPFASSLDVVAHELTHGV  147 (150)
T ss_dssp             HHHHHHHHHSS-STTSS-S-EEEEESESSSTT-EEE-S----SS-EEEE---SSSBS-GGG-HHHHHHHHHHHH
T ss_pred             HHHHHHHHHCCCCcCCCCcEEEEEEeCCCCccCccccC----CE-EEEECCCCcccccCccccceeeecccccc
Confidence            344455555555554   2222223  22 78999876    22 666765443332211 1349999999953


No 108
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=61.75  E-value=4  Score=35.29  Aligned_cols=17  Identities=29%  Similarity=0.454  Sum_probs=14.4

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy5221         115 KLEIKAIIAHEISHISN  131 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~  131 (436)
                      ...+..|+.|||||...
T Consensus       104 ~~~~~~~~~HEiGHaLG  120 (157)
T cd04278         104 GTDLFSVAAHEIGHALG  120 (157)
T ss_pred             cchHHHHHHHHhccccc
Confidence            35699999999999865


No 109
>PF05572 Peptidase_M43:  Pregnancy-associated plasma protein-A;  InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=61.73  E-value=4.3  Score=35.17  Aligned_cols=21  Identities=14%  Similarity=0.281  Sum_probs=14.5

Q ss_pred             CHHHHHHHHHHHHHhhhc-ChH
Q psy5221         114 TKLEIKAIIAHEISHISN-GDM  134 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~-~h~  134 (436)
                      +.....-+|+||+||+-. .|+
T Consensus        65 ~~~~~g~TltHEvGH~LGL~Ht   86 (154)
T PF05572_consen   65 SQYNFGKTLTHEVGHWLGLYHT   86 (154)
T ss_dssp             TTS-SSHHHHHHHHHHTT---T
T ss_pred             Cccccccchhhhhhhhhccccc
Confidence            445567899999999976 554


No 110
>cd00503 Frataxin Frataxin is a nuclear-encoded mitochondrial protein implicated in Friedreich's ataxia (FRDA), an human autosomal recessive neurodegenerative disease; Frataxin is found in eukaryotes and in purple bacteria; lack of frataxin causes iron to accumulate in the mitochondrial matrix suggesting that frataxin is involved in mitochondrial iron homeostasis and possibly in iron transport; the domain has an alpha-beta fold consisting of two helices flanking an antiparallel beta sheet.
Probab=61.41  E-value=9.4  Score=30.76  Aligned_cols=31  Identities=19%  Similarity=0.265  Sum_probs=23.1

Q ss_pred             EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221         398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI  435 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~  435 (436)
                      +.+.+.+|||+|+++.       ..+|-|+..++.+||
T Consensus        30 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI   60 (105)
T cd00503          30 IDVETQGGVLTLTFGN-------GSTIVINRQEPLRQI   60 (105)
T ss_pred             EeeeccCCEEEEEECC-------CCEEEEeCCchhhhh
Confidence            7778889999999984       356667766665554


No 111
>PF01433 Peptidase_M1:  Peptidase family M1 This is family M1 in the peptidase classification.;  InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.  Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=59.67  E-value=34  Score=34.13  Aligned_cols=66  Identities=23%  Similarity=0.277  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHhcCCCC--C--eEEEECC-CccEEEeCCCCCccEEEEcH-HHHhh------cCHHHHHHHHHHHHHhhhc
Q psy5221          64 LLKTVNKISKNANIST--P--EVAIYNG-EINAFATGAFKNSSLIAISA-NLLES------MTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~--p--~v~v~~~-~~NA~~~G~~~~~~~v~it~-gLl~~------l~~~El~aVLAHElgHi~~  131 (436)
                      ..+.+.-+.+..|.+.  +  ++..+++ ..++.-.     -+.|.+.. .++..      -+..++..++|||++|..-
T Consensus       234 ~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~-----~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWf  308 (390)
T PF01433_consen  234 APKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMEN-----WGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWF  308 (390)
T ss_dssp             HHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE-------TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTB
T ss_pred             hHHHHHHHHhhccccceecceeEEEEeccccccccc-----cccccccccccccCcccccchhhhhhHHHHHHHHHHHHh
Confidence            4456666777788753  3  3455555 3344322     22344443 23321      1245789999999999999


Q ss_pred             ChH
Q psy5221         132 GDM  134 (436)
Q Consensus       132 ~h~  134 (436)
                      |+.
T Consensus       309 Gn~  311 (390)
T PF01433_consen  309 GNL  311 (390)
T ss_dssp             TTT
T ss_pred             ccC
Confidence            885


No 112
>PF10023 DUF2265:  Predicted aminopeptidase (DUF2265);  InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=59.58  E-value=4.9  Score=39.26  Aligned_cols=32  Identities=31%  Similarity=0.477  Sum_probs=24.3

Q ss_pred             cHHHHhhc---CHHHHHHHHHHHHHhhh---cChHHHH
Q psy5221         106 SANLLESM---TKLEIKAIIAHEISHIS---NGDMVTM  137 (436)
Q Consensus       106 t~gLl~~l---~~~El~aVLAHElgHi~---~~h~~~~  137 (436)
                      +++++..+   ++.+|+.+|=||++|-.   .+|+..+
T Consensus       150 ~DPlLSt~l~~~~~~LA~LIfHELaHq~~Yv~~dt~FN  187 (337)
T PF10023_consen  150 DDPLLSTMLRYPDGELARLIFHELAHQTLYVKGDTAFN  187 (337)
T ss_pred             CCcccccccCCCchHHHHHHHHHHhhceeecCCCchhh
Confidence            44566655   89999999999999954   5776543


No 113
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=59.17  E-value=3.8  Score=36.71  Aligned_cols=17  Identities=29%  Similarity=0.348  Sum_probs=14.1

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy5221         115 KLEIKAIIAHEISHISN  131 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~  131 (436)
                      ......++|||+||.-.
T Consensus       130 ~~~~~~~~aHElGH~lG  146 (192)
T cd04267         130 TLLTALTMAHELGHNLG  146 (192)
T ss_pred             ceeehhhhhhhHHhhcC
Confidence            45678899999999875


No 114
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=58.36  E-value=82  Score=31.84  Aligned_cols=66  Identities=9%  Similarity=-0.008  Sum_probs=40.4

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      =.+...++++.+|....+-+ +|..++.|+.|+++...  .+++- .+.-+--.=..-++||+||-.+-.
T Consensus       109 Q~~~~~~~~~~~g~df~~gr-iD~s~hpF~~~~~~~dv--RItt~-y~~~d~~~~l~t~iHE~GHalye~  174 (396)
T cd06460         109 QEALGRELLEALGFDFDRGR-LDVSAHPFTGGLGPGDV--RITTR-YDENDFRSALFSTIHETGHALYEQ  174 (396)
T ss_pred             HHHHHHHHHHHhCCcccCCe-eecCCCCCCCCCCCCCc--eEEee-eCCcchHHHHHHHHHHhhHHHHHh
Confidence            34556788999999865433 44455679998866543  33330 111112233446899999988866


No 115
>PRK05518 rpl6p 50S ribosomal protein L6P; Reviewed
Probab=58.27  E-value=38  Score=30.19  Aligned_cols=45  Identities=18%  Similarity=0.252  Sum_probs=32.6

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      |++++|+++++.++|+|.+                |+.++.|.-|    .++...+||-+.|+..
T Consensus        13 P~~V~v~i~~~~v~VkGp~----------------G~L~~~~~~~----~v~i~~~~~~i~v~~~   57 (180)
T PRK05518         13 PEGVTVEIEGLVVTVKGPK----------------GELTRDFWYP----GVTISVEDGKVVIETE   57 (180)
T ss_pred             CCCCEEEEECCEEEEECCC----------------eEEEEEecCC----cEEEEEECCEEEEEEC
Confidence            6899999999999999885                5566555432    2444568887777754


No 116
>TIGR03422 mito_frataxin frataxin. Frataxin is a mitochondrial protein, mutation of which leads to the disease Friedreich's ataxia. Its orthologs are widely distributed in the bacteria, associated with the ISC system for iron-sulfur cluster assembly, and designated CyaY. This exception-type model allows those examples of frataxin per se that score above the trusted cutoff to the CyaY equivalog-type model (TIGR03421) to be named appropriately.
Probab=56.43  E-value=9.4  Score=30.29  Aligned_cols=29  Identities=17%  Similarity=0.312  Sum_probs=20.4

Q ss_pred             eEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCC
Q psy5221         399 AAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDR  434 (436)
Q Consensus       399 ~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~  434 (436)
                      .+.+.+|||+|+++.       ..++-|+..++.+|
T Consensus        30 D~e~~~gVLti~~~~-------~~~~VINkQ~p~~Q   58 (97)
T TIGR03422        30 DVEYSSGVLTLELPS-------VGTYVINKQPPNKQ   58 (97)
T ss_pred             ccccCCCEEEEEECC-------CCEEEEeCCChhhH
Confidence            777889999999975       24555655555444


No 117
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=55.98  E-value=9.5  Score=42.60  Aligned_cols=20  Identities=40%  Similarity=0.507  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHHHhhhcChHH
Q psy5221         116 LEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus       116 ~El~aVLAHElgHi~~~h~~  135 (436)
                      ..+..|+|||++|-..|+..
T Consensus       281 ~~i~~VIaHElaHqWfGNlV  300 (863)
T TIGR02414       281 ERIESVIAHEYFHNWTGNRV  300 (863)
T ss_pred             HHHHHHHHHHHHHHHhccee
Confidence            34789999999999988854


No 118
>TIGR03653 arch_L6P archaeal ribosomal protein L6P. Members of this protein family are the archaeal ribosomal protein L6P. The top-scoring proteins not selected by this model are eukaryotic cytosolic ribosomal protein L9. Bacterial ribosomal protein L6 scores lower and is described by a distinct model.
Probab=55.52  E-value=48  Score=29.23  Aligned_cols=45  Identities=11%  Similarity=0.198  Sum_probs=32.2

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      |++++|+++++.++|+|.+                |+.++.|. |..   ++...+++-|.|+.+
T Consensus         7 P~~V~v~i~~~~i~vkGp~----------------G~L~~~~~-~~~---v~i~~~~~~i~v~~~   51 (170)
T TIGR03653         7 PEGVSVTIEGNIVTVKGPK----------------GEVTRELW-YPG---IEISVEDGKVVIETD   51 (170)
T ss_pred             CCCCEEEEeCCEEEEECCC----------------eEEEEEEe-CCc---EEEEEeCCEEEEEeC
Confidence            6899999999999999885                55665553 333   344557887777754


No 119
>PF13583 Reprolysin_4:  Metallo-peptidase family M12B Reprolysin-like
Probab=55.31  E-value=5.7  Score=36.21  Aligned_cols=12  Identities=42%  Similarity=0.504  Sum_probs=10.1

Q ss_pred             HHHHHHHHhhhc
Q psy5221         120 AIIAHEISHISN  131 (436)
Q Consensus       120 aVLAHElgHi~~  131 (436)
                      -++|||+||.-.
T Consensus       139 ~~~aHEiGH~lG  150 (206)
T PF13583_consen  139 QTFAHEIGHNLG  150 (206)
T ss_pred             hHHHHHHHHHhc
Confidence            459999999876


No 120
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=55.14  E-value=4.4  Score=37.62  Aligned_cols=12  Identities=42%  Similarity=0.648  Sum_probs=10.5

Q ss_pred             HHHHHHHHhhhc
Q psy5221         120 AIIAHEISHISN  131 (436)
Q Consensus       120 aVLAHElgHi~~  131 (436)
                      .++||||||.-.
T Consensus       147 ~t~AHElGHnLG  158 (228)
T cd04271         147 QVFAHEIGHTFG  158 (228)
T ss_pred             eehhhhhhhhcC
Confidence            599999999874


No 121
>PF01491 Frataxin_Cyay:  Frataxin-like domain;  InterPro: IPR002908 The eukaryotic proteins in this entry include frataxin, the protein that is mutated in Friedreich's ataxia [], and related sequences. Friedreich's ataxia is a progressive neurodegenerative disorder caused by loss of function mutations in the gene encoding frataxin (FRDA). Frataxin mRNA is predominantly expressed in tissues with a high metabolic rate (including liver, kidney, brown fat and heart). Mouse and yeast frataxin homologues contain a potential N-terminal mitochondrial targeting sequence, and human frataxin has been observed to co-localise with a mitochondrial protein. Furthermore, disruption of the yeast gene has been shown to result in mitochondrial dysfunction. Friedreich's ataxia is thus believed to be a mitochondrial disease caused by a mutation in the nuclear genome (specifically, expansion of an intronic GAA triplet repeat) [, , ]. The bacterial proteins in this entry are iron-sulphur cluster (FeS) metabolism CyaY proteins hmologous to eukaryotic frataxin. Partial Phylogenetic Profiling [] suggests that CyaY most likely functions as part of the ISC system for FeS cluster biosynthesis, and is supported by expermimental data in some species [, ]. ; PDB: 1EW4_A 2P1X_A 1SOY_A 2EFF_A 3T3T_B 3S4M_A 3T3K_A 3S5D_A 1LY7_A 3T3X_B ....
Probab=55.06  E-value=14  Score=29.89  Aligned_cols=31  Identities=16%  Similarity=0.191  Sum_probs=22.2

Q ss_pred             eEeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCC
Q psy5221         397 VVAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDR  434 (436)
Q Consensus       397 ~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~  434 (436)
                      -+.+.+.+|||+|++|.       ..++-|+..++.+|
T Consensus        31 d~d~e~~~gVLti~~~~-------~~~~VINkQ~p~~Q   61 (109)
T PF01491_consen   31 DIDVERSGGVLTIEFPD-------GGQYVINKQPPNRQ   61 (109)
T ss_dssp             TEEEEEETTEEEEEETT-------SEEEEEEEECCCTE
T ss_pred             ceEEEccCCEEEEEECC-------CCEEEEeCCCHHHH
Confidence            37899999999999975       25555655555444


No 122
>PF01431 Peptidase_M13:  Peptidase family M13 This is family M13 in the peptidase classification. ;  InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell [].  Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=54.82  E-value=9.3  Score=34.57  Aligned_cols=35  Identities=26%  Similarity=0.325  Sum_probs=23.8

Q ss_pred             ccEEEEcHHHHhh----------cCHHHHHHHHHHHHHhhhcChH
Q psy5221         100 SSLIAISANLLES----------MTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus       100 ~~~v~it~gLl~~----------l~~~El~aVLAHElgHi~~~h~  134 (436)
                      ...|++..|++..          ++=.-+-+||||||+|.-..+.
T Consensus         8 ~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g   52 (206)
T PF01431_consen    8 FNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEG   52 (206)
T ss_dssp             TTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTG
T ss_pred             cCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhH
Confidence            4446666666553          2335688999999999876654


No 123
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=54.71  E-value=7.2  Score=35.86  Aligned_cols=15  Identities=20%  Similarity=0.268  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHhhhc
Q psy5221         117 EIKAIIAHEISHISN  131 (436)
Q Consensus       117 El~aVLAHElgHi~~  131 (436)
                      ....++|||+||.-.
T Consensus       144 ~~~~~~AHElGH~lG  158 (220)
T cd04272         144 YGVYTMTHELAHLLG  158 (220)
T ss_pred             ccHHHHHHHHHHHhC
Confidence            458999999999874


No 124
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=54.58  E-value=5.5  Score=34.52  Aligned_cols=17  Identities=35%  Similarity=0.366  Sum_probs=14.5

Q ss_pred             HHHHHHHHHHHHHhhhc
Q psy5221         115 KLEIKAIIAHEISHISN  131 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~  131 (436)
                      ......+++||+||...
T Consensus        93 ~~~~~~~~~HElGH~LG  109 (167)
T cd00203          93 TKEGAQTIAHELGHALG  109 (167)
T ss_pred             cccchhhHHHHHHHHhC
Confidence            45789999999999875


No 125
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=53.03  E-value=32  Score=38.46  Aligned_cols=68  Identities=25%  Similarity=0.410  Sum_probs=38.0

Q ss_pred             HHHHHHHHHhcCCCCC----eEEEECCCccEEEeCCCCCccEEEEcHHHHhh--cC---HHHHHHHHHHHHHhhhcChHH
Q psy5221          65 LKTVNKISKNANISTP----EVAIYNGEINAFATGAFKNSSLIAISANLLES--MT---KLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p----~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~--l~---~~El~aVLAHElgHi~~~h~~  135 (436)
                      .+.++-+.+..|+|.|    ++..+++    |..|.-.+-+.|.+...++..  .+   .+.+..|+|||++|...|+..
T Consensus       229 ~~~l~~~e~~fg~pYP~~k~d~V~vP~----f~~GaMEn~Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlV  304 (831)
T TIGR02412       229 RQGLAFFHRKFGYPYPFKKYDQIFVPE----FNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLV  304 (831)
T ss_pred             HHHHHHHHHHhCCCCCcccCCEEEcCC----CCCCcccccceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEe
Confidence            4455666677887633    3333443    111211222345555544421  12   335778999999999999865


Q ss_pred             H
Q psy5221         136 T  136 (436)
Q Consensus       136 ~  136 (436)
                      .
T Consensus       305 T  305 (831)
T TIGR02412       305 T  305 (831)
T ss_pred             c
Confidence            4


No 126
>PF06262 DUF1025:  Possibl zinc metallo-peptidase;  InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=52.51  E-value=25  Score=27.85  Aligned_cols=36  Identities=19%  Similarity=0.235  Sum_probs=24.7

Q ss_pred             CccEEEEcHHHHhhc-CHHH----HHHHHHHHHHhhhcChH
Q psy5221          99 NSSLIAISANLLESM-TKLE----IKAIIAHEISHISNGDM  134 (436)
Q Consensus        99 ~~~~v~it~gLl~~l-~~~E----l~aVLAHElgHi~~~h~  134 (436)
                      ..+++++-..+...+ +.+|    +.-++-||+||+-..+-
T Consensus        49 p~rI~lyR~pl~~~~~~~~eL~~~I~~tlvhEiah~fG~~~   89 (97)
T PF06262_consen   49 PDRIVLYRRPLERRARSREELAELIRDTLVHEIAHHFGISD   89 (97)
T ss_dssp             -EEEEEEHHHHHHT-SSHHHHHHHHHHHHHHHHHHHTT--H
T ss_pred             CCEEEEehHHHHHHhCCHHHHHHHHHHHHHHHHHHHcCCCH
Confidence            467888888888877 6554    56688899999877553


No 127
>PF04972 BON:  BON domain;  InterPro: IPR007055 The BON domain is typically ~60 residues long and has an alpha/beta predicted fold. There is a conserved glycine residue and several hydrophobic regions. This pattern of conservation is more suggestive of a binding or structural function rather than a catalytic function. Most proteobacteria seem to possess one or two BON-containing proteins, typically of the OsmY-type proteins; outside of this group the distribution is more disparate.  The OsmY protein is an Escherichia coli 20 kDa outer membrane or periplasmic protein that is expressed in response to a variety of stress conditions, in particular, helping to provide protection against osmotic shock. One hypothesis is that OsmY prevents shrinkage of the cytoplasmic compartment by contacting the phospholipid interfaces surrounding the periplasmic space. The domain architecture of two BON domains alone suggests that these domains contact the surfaces of phospholipids, with each domain contacting a membrane [].; PDB: 2L26_A 2KGS_A 2KSM_A.
Probab=52.14  E-value=30  Score=24.56  Aligned_cols=26  Identities=12%  Similarity=0.319  Sum_probs=20.8

Q ss_pred             CCCCCCceEEEEECCEEEEEEEEecc
Q psy5221         344 AGFNRSDLEIETERDTLRITGRKQAD  369 (436)
Q Consensus       344 pG~~~edi~v~~~~~~l~i~g~~~~~  369 (436)
                      |+++..+|+|.+.++.++++|.-...
T Consensus        12 ~~~~~~~i~v~v~~g~v~L~G~v~s~   37 (64)
T PF04972_consen   12 PWLPDSNISVSVENGVVTLSGEVPSQ   37 (64)
T ss_dssp             -CTT-TTEEEEEECTEEEEEEEESSC
T ss_pred             cccCCCeEEEEEECCEEEEEeeCcHH
Confidence            46777799999999999999997543


No 128
>cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues.  In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8).
Probab=52.13  E-value=28  Score=26.69  Aligned_cols=30  Identities=13%  Similarity=0.275  Sum_probs=27.1

Q ss_pred             eEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221         336 KYRITMAIA-GFNRSDLEIET-ERDTLRITGR  365 (436)
Q Consensus       336 ~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~  365 (436)
                      .|.=.+.|| +++++.|+=+. +++.|+|.|+
T Consensus        51 ~F~R~~~LP~~Vd~~~v~A~~~~dGvL~I~~~   82 (83)
T cd06477          51 SFTRQYQLPDGVEHKDLSAMLCHDGILVVETK   82 (83)
T ss_pred             EEEEEEECCCCcchheEEEEEcCCCEEEEEec
Confidence            788899999 99999999997 7999999975


No 129
>PTZ00027 60S ribosomal protein L6; Provisional
Probab=52.07  E-value=48  Score=29.78  Aligned_cols=48  Identities=8%  Similarity=0.185  Sum_probs=34.0

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR  413 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk  413 (436)
                      |++++|+++++.++|+|.+                |+.++.|.  ..-..++...+||.|.|+-+.
T Consensus        13 P~~V~V~i~~~~v~VkGp~----------------G~L~~~~~--~~~~~i~i~~~~~~i~v~~~~   60 (190)
T PTZ00027         13 PEGVTVTVKSRKVTVTGKY----------------GELTRSFR--HLPVDIKLSKDGKYIKVEMWF   60 (190)
T ss_pred             CCCCEEEEECCEEEEECCC----------------ceEEEEec--CCCceEEEEeCCCEEEEEeCC
Confidence            7999999999999999874                55665443  211245566788877777543


No 130
>PRK05498 rplF 50S ribosomal protein L6; Validated
Probab=51.96  E-value=41  Score=29.88  Aligned_cols=44  Identities=9%  Similarity=0.234  Sum_probs=32.2

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      |++++|+++++.++|+|.+                |+.++.|  |..++   ...+|+-|.|+..
T Consensus        12 P~~V~v~~~~~~v~vkGp~----------------G~l~~~~--~~~v~---i~~~~~~i~v~~~   55 (178)
T PRK05498         12 PAGVEVTINGNVVTVKGPK----------------GELSRTL--NPDVT---VKVEDNEITVTRP   55 (178)
T ss_pred             CCCCEEEEECCEEEEECCC----------------EEEEEEc--CCCeE---EEEECCEEEEEcC
Confidence            6999999999999999874                5666666  54433   3457787777654


No 131
>TIGR03654 L6_bact ribosomal protein L6, bacterial type.
Probab=51.03  E-value=54  Score=29.04  Aligned_cols=45  Identities=13%  Similarity=0.299  Sum_probs=32.8

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR  413 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk  413 (436)
                      |++++|+++++.++|+|.+                |+.++.|  |..+   +...+|+.|.|+.+.
T Consensus        11 P~~V~v~~~~~~v~v~Gp~----------------G~l~~~l--~~~i---~i~~~~~~i~v~~~~   55 (175)
T TIGR03654        11 PAGVEVTIDGNVVTVKGPK----------------GELSRTL--HPGV---TVKVEDGQLTVSRPN   55 (175)
T ss_pred             CCCcEEEEeCCEEEEEcCC----------------eEEEEEc--CCCe---EEEEECCEEEEEecC
Confidence            6999999999999999874                5666666  5443   345577877777543


No 132
>PRK01844 hypothetical protein; Provisional
Probab=50.57  E-value=59  Score=24.16  Aligned_cols=47  Identities=9%  Similarity=0.121  Sum_probs=26.3

Q ss_pred             HHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221          24 SSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST   79 (436)
Q Consensus        24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~   79 (436)
                      +.++..+++++..+|++.+.....++..|   |.++      +.++.+....|-++
T Consensus         9 l~I~~li~G~~~Gff~ark~~~k~lk~NP---pine------~mir~Mm~QMGqkP   55 (72)
T PRK01844          9 VGVVALVAGVALGFFIARKYMMNYLQKNP---PINE------QMLKMMMMQMGQKP   55 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCCH------HHHHHHHHHhCCCc
Confidence            33334444556667777777766665553   3332      46666666666543


No 133
>PF12315 DUF3633:  Protein of unknown function (DUF3633);  InterPro: IPR022087  This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM. 
Probab=50.52  E-value=16  Score=33.09  Aligned_cols=20  Identities=30%  Similarity=0.429  Sum_probs=17.0

Q ss_pred             hcCHHHHHHHHHHHHHhhhc
Q psy5221         112 SMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus       112 ~l~~~El~aVLAHElgHi~~  131 (436)
                      .|+.--..+|||||++|..-
T Consensus        87 GLPrll~gsiLAHE~mHa~L  106 (212)
T PF12315_consen   87 GLPRLLTGSILAHELMHAWL  106 (212)
T ss_pred             CCCHHHHhhHHHHHHHHHHh
Confidence            46778889999999999865


No 134
>KOG3314|consensus
Probab=50.10  E-value=70  Score=27.84  Aligned_cols=68  Identities=10%  Similarity=0.092  Sum_probs=38.9

Q ss_pred             HHHHHHHHHhcCCCCCeEEEECCCccEEEe-CCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221          65 LKTVNKISKNANISTPEVAIYNGEINAFAT-GAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~-G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      -..+-+..++.|-...+-.+.-..-|+.+. |+-|.+++++....|   .+.+++.-|++|||-|.--.+-.
T Consensus        40 Vkfl~~am~k~~c~~~d~~isc~~C~~~~~GGy~~~~gIvlCqN~l---~~q~h~n~vv~HElIH~fDd~r~  108 (194)
T KOG3314|consen   40 VKFLMEAMEKSGCRVGDNFISCVVCTGPVAGGYTPGRGIVLCQNRL---TIQDHVNQVVIHELIHAFDDCRA  108 (194)
T ss_pred             HHHHHHHHHHcCCCccCCceEEeeCCCCccCCccCCCceEEecccc---chHHHHHHHHHHHHHHHHHhhhh
Confidence            344455555666443222222112233333 455667766655522   37899999999999998766543


No 135
>COG1965 CyaY Protein implicated in iron transport, frataxin homolog [Inorganic ion transport and metabolism]
Probab=48.55  E-value=14  Score=29.54  Aligned_cols=31  Identities=19%  Similarity=0.347  Sum_probs=23.4

Q ss_pred             EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221         398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI  435 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~  435 (436)
                      +.+.+.+|||+|+++.       .-+|-|+..+|.+||
T Consensus        30 ~D~d~qg~VlTl~f~n-------gs~iiINkQ~P~~qi   60 (106)
T COG1965          30 IDCEIQGGVLTLTFDN-------GSQIIINKQEPLQQI   60 (106)
T ss_pred             cceecCCCEEEEEECC-------CcEEEEeCCChHHHH
Confidence            6677889999999986       256777777765553


No 136
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=48.43  E-value=13  Score=39.29  Aligned_cols=65  Identities=20%  Similarity=0.196  Sum_probs=37.1

Q ss_pred             HHHHHHHHHHhcCCCCCe------------EEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhh
Q psy5221          64 LLKTVNKISKNANISTPE------------VAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHI  129 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~------------v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi  129 (436)
                      +.+.+.++...+|...-+            ++--++ ...||++++.+ ..++|+.+-  -.  +-++ ..+|+||+||.
T Consensus       274 ~~~~~~~~~~~l~~~~~e~~~~~~~~~~iD~~~r~gK~~Gayc~~~~~~~~P~I~~Nf--~~--t~~d-v~TL~HElGHa  348 (549)
T TIGR02289       274 LLEKAEKMYKELSLEFYEFFNFMREKNLLDLESRKGKAAGGYCTYLPKYKAPFIFSNF--NG--TSGD-IDVLTHEAGHA  348 (549)
T ss_pred             HHHHHHHHhcCCCcHHHHHHHHHHhcCccccCCCCCCCCCcccCCCCCCCCcEEEEeC--CC--ChhH-HHHHHHHhhHH
Confidence            555666666655554222            122223 78899999765 345554331  11  3333 45799999999


Q ss_pred             hcCh
Q psy5221         130 SNGD  133 (436)
Q Consensus       130 ~~~h  133 (436)
                      .|..
T Consensus       349 ~H~~  352 (549)
T TIGR02289       349 FHVY  352 (549)
T ss_pred             HHHH
Confidence            8754


No 137
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.89  E-value=54  Score=24.14  Aligned_cols=40  Identities=15%  Similarity=0.164  Sum_probs=23.6

Q ss_pred             HHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221          31 TGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST   79 (436)
Q Consensus        31 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~   79 (436)
                      ++.+..+|++.+...+.++-.|   |.++      +.++.+..+.|-++
T Consensus        16 ~G~~~G~fiark~~~k~lk~NP---pine------~~iR~M~~qmGqKp   55 (71)
T COG3763          16 AGLIGGFFIARKQMKKQLKDNP---PINE------EMIRMMMAQMGQKP   55 (71)
T ss_pred             HHHHHHHHHHHHHHHHHHhhCC---CCCH------HHHHHHHHHhCCCc
Confidence            3345557778888777766653   4443      35555555556543


No 138
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=47.65  E-value=9.1  Score=32.22  Aligned_cols=13  Identities=31%  Similarity=0.555  Sum_probs=10.8

Q ss_pred             HHHHHHHHHhhhc
Q psy5221         119 KAIIAHEISHISN  131 (436)
Q Consensus       119 ~aVLAHElgHi~~  131 (436)
                      .+|++||+||...
T Consensus        87 ~~~~~HEigHaLG   99 (140)
T smart00235       87 TGVAAHELGHALG   99 (140)
T ss_pred             cccHHHHHHHHhc
Confidence            3599999999864


No 139
>cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18.  Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me
Probab=46.96  E-value=29  Score=26.82  Aligned_cols=31  Identities=13%  Similarity=0.277  Sum_probs=26.9

Q ss_pred             CceEEEEEecCCCCCCceEEEEECCEEEEEE
Q psy5221         334 ENKYRITMAIAGFNRSDLEIETERDTLRITG  364 (436)
Q Consensus       334 ~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g  364 (436)
                      -..|.-.+.+|.+++++++-+.+++.|+|+-
T Consensus        61 ~g~f~r~~~lp~v~~~~i~A~~~dGvL~I~l   91 (93)
T cd06471          61 YGSFSRSFYLPNVDEEEIKAKYENGVLKITL   91 (93)
T ss_pred             ccEEEEEEECCCCCHHHCEEEEECCEEEEEE
Confidence            3467777899999999999999999999974


No 140
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=46.61  E-value=26  Score=37.61  Aligned_cols=31  Identities=26%  Similarity=0.414  Sum_probs=21.7

Q ss_pred             EEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221         102 LIAISANLLESMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus       102 ~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      +.+.+..++.  .+.+...|+|||++|-.-|+.
T Consensus       265 ltf~~~~ll~--~d~s~~~viaHElAHqWfGNl  295 (601)
T TIGR02411       265 LTFATPTLIA--GDRSNVDVIAHELAHSWSGNL  295 (601)
T ss_pred             ceeecccccc--CChhhhhhHHHHHHhhccCce
Confidence            4455554442  344567899999999999875


No 141
>PRK14015 pepN aminopeptidase N; Provisional
Probab=46.16  E-value=34  Score=38.40  Aligned_cols=20  Identities=45%  Similarity=0.595  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHhhhcChHHH
Q psy5221         117 EIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus       117 El~aVLAHElgHi~~~h~~~  136 (436)
                      .+..|+|||++|-..|+...
T Consensus       295 ~i~~vIaHElaHqWFGNlVT  314 (875)
T PRK14015        295 RIESVIAHEYFHNWTGNRVT  314 (875)
T ss_pred             HHHHHHHHHHHHHHHhCcce
Confidence            48899999999999888643


No 142
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=44.02  E-value=16  Score=33.16  Aligned_cols=31  Identities=16%  Similarity=0.333  Sum_probs=21.4

Q ss_pred             cEEEEcHHHHhhc-------CHHHHHHHHHHHHHhhhc
Q psy5221         101 SLIAISANLLESM-------TKLEIKAIIAHEISHISN  131 (436)
Q Consensus       101 ~~v~it~gLl~~l-------~~~El~aVLAHElgHi~~  131 (436)
                      ..|.+..+.+...       ..+-+..+++||+||...
T Consensus        92 a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lG  129 (197)
T cd04276          92 ADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLG  129 (197)
T ss_pred             EEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhc
Confidence            3456666665443       235588999999999765


No 143
>PF14730 DUF4468:  Domain of unknown function (DUF4468) with TBP-like fold
Probab=43.98  E-value=1.1e+02  Score=23.65  Aligned_cols=17  Identities=6%  Similarity=0.128  Sum_probs=14.9

Q ss_pred             eEeEEeeCcEEEEEEec
Q psy5221         397 VVAAFFDQGILNIDLIR  413 (436)
Q Consensus       397 ~~~A~~~~GvL~i~~pk  413 (436)
                      .+++.++||-.++++-+
T Consensus        69 ~l~i~~kDgk~r~~~~~   85 (91)
T PF14730_consen   69 TLIIDCKDGKYRLTITN   85 (91)
T ss_pred             EEEEEEECCEEEEEEEE
Confidence            49999999999999865


No 144
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=43.95  E-value=15  Score=37.33  Aligned_cols=41  Identities=22%  Similarity=0.259  Sum_probs=24.8

Q ss_pred             CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          88 EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        88 ~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      ..-||+.++.+ ...+|+.+  .-.  +-++ ...|+||+||..|..
T Consensus       196 ~~gaf~~~~~~~~~p~i~~n--~~~--~~~~-v~tl~HE~GHa~h~~  237 (427)
T cd06459         196 RSGAYCTGLPPGKHPFILMN--FNG--TLDD-VFTLAHELGHAFHSY  237 (427)
T ss_pred             CCCeecCCCCCCCCCeEEec--CCC--Chhh-HHHHHHHhhHHHHHH
Confidence            66789998863 33334332  122  3333 567999999976644


No 145
>KOG1047|consensus
Probab=43.65  E-value=14  Score=38.40  Aligned_cols=52  Identities=27%  Similarity=0.477  Sum_probs=33.0

Q ss_pred             CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221          79 TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus        79 ~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      .-++.|++.   -|=+|.-.+-...++|..|+.  .+.-+.-|||||++|-..|...
T Consensus       254 ryDllvlPp---SFP~gGMENPcltF~TpTlla--GDrsl~~vIaHEIAHSWtGNlV  305 (613)
T KOG1047|consen  254 RYDLLVLPP---SFPFGGMENPCLTFVTPTLLA--GDRSLVDVIAHEIAHSWTGNLV  305 (613)
T ss_pred             cceEEEecC---CCCcccccCcceeeecchhhc--CCcchhhHHHHHhhhhhccccc
Confidence            346666664   112222233455667775654  5667899999999998877653


No 146
>PF02163 Peptidase_M50:  Peptidase family M50;  InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM).  Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=43.03  E-value=16  Score=32.65  Aligned_cols=14  Identities=29%  Similarity=0.555  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHhhhc
Q psy5221         118 IKAIIAHEISHISN  131 (436)
Q Consensus       118 l~aVLAHElgHi~~  131 (436)
                      +.+++.||+||..-
T Consensus         7 ~i~i~~HE~gH~~~   20 (192)
T PF02163_consen    7 LISIVLHELGHALA   20 (192)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             cccccccccccccc
Confidence            46889999999864


No 147
>TIGR03793 TOMM_pelo TOMM propeptide domain. This model represents a domain that is conserved among a large number of putative thiazole/oxazole-modified microcins (TOMM). Oddly, most of this seqence region appears homologous to nitrile hydratase subunits. This family is expanded especially in Pelotomaculum thermopropionicum SI.
Probab=43.03  E-value=1.1e+02  Score=23.08  Aligned_cols=44  Identities=11%  Similarity=0.205  Sum_probs=28.5

Q ss_pred             HHHHHhcCCCCC---eEEEECC--CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHH
Q psy5221          69 NKISKNANISTP---EVAIYNG--EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIA  123 (436)
Q Consensus        69 ~~l~~~~g~~~p---~v~v~~~--~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLA  123 (436)
                      ...++..|+..|   ++.|+++  ..--++++..|.        |   .+|++||.+|-+
T Consensus        28 raaL~e~G~~~P~~~~i~VvE~t~~~~~lVlP~~P~--------~---~lse~~L~~vag   76 (77)
T TIGR03793        28 KEALEREGVQVPAEVEVKVVEESPTVLYLVLPVNPD--------I---ELTDEQLDAVAG   76 (77)
T ss_pred             HHHHHHhCCCCCCceEEEEEEcCCCeEEEEecCCCC--------C---CCCHHHHHHhhC
Confidence            455566688754   6777776  444445654443        2   689999999864


No 148
>PTZ00179 60S ribosomal protein L9; Provisional
Probab=42.35  E-value=79  Score=28.41  Aligned_cols=47  Identities=6%  Similarity=0.078  Sum_probs=32.0

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      |++++|+++++.++|+|.+                |+.++.  +|..--.+....+|+.|.|+-+
T Consensus        12 P~~V~V~i~~~~ItVkGpk----------------G~Ls~~--~~~~~~~i~i~~~~~~I~v~~~   58 (189)
T PTZ00179         12 PEDVTVSVKDRIVTVKGKR----------------GTLTKD--LRHLQLDFRVNKKNRTFTAVRW   58 (189)
T ss_pred             CCCCEEEEeCCEEEEECCC----------------cEEEEE--cCCCCcEEEEEecCCEEEEEeC
Confidence            6999999999999999875                445543  3432112455667787777744


No 149
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=42.32  E-value=17  Score=32.88  Aligned_cols=16  Identities=25%  Similarity=0.445  Sum_probs=13.4

Q ss_pred             HHHHHHHHHHHHhhhc
Q psy5221         116 LEIKAIIAHEISHISN  131 (436)
Q Consensus       116 ~El~aVLAHElgHi~~  131 (436)
                      .+..+++.||+||...
T Consensus        90 ~~~~~~i~HElgHaLG  105 (198)
T cd04327          90 PEFSRVVLHEFGHALG  105 (198)
T ss_pred             hhHHHHHHHHHHHHhc
Confidence            4567899999999876


No 150
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=41.65  E-value=15  Score=32.57  Aligned_cols=34  Identities=18%  Similarity=0.010  Sum_probs=21.0

Q ss_pred             cEEEEcHHHHhhc--CHHHHHHHHHHHHHhhhc-ChH
Q psy5221         101 SLIAISANLLESM--TKLEIKAIIAHEISHISN-GDM  134 (436)
Q Consensus       101 ~~v~it~gLl~~l--~~~El~aVLAHElgHi~~-~h~  134 (436)
                      ..|.++.......  .......++.||+||... +|.
T Consensus        94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~  130 (186)
T cd04277          94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHP  130 (186)
T ss_pred             ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCC
Confidence            3455554322221  245679999999999866 443


No 151
>PRK01379 cyaY frataxin-like protein; Provisional
Probab=41.20  E-value=28  Score=27.89  Aligned_cols=30  Identities=23%  Similarity=0.279  Sum_probs=20.0

Q ss_pred             EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221         398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI  435 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~  435 (436)
                      +.+.+.+|||+|+.+.       . .|-|+..++.+||
T Consensus        30 ~D~e~~~gVLtl~~~~-------g-t~VINkQ~p~~QI   59 (103)
T PRK01379         30 IDVDLQGDILNLDTDK-------G-IYVINKQSAAKEI   59 (103)
T ss_pred             eeeeccCCEEEEEeCC-------c-EEEEeCCChhhhh
Confidence            6677789999998643       1 2666666665553


No 152
>CHL00140 rpl6 ribosomal protein L6; Validated
Probab=40.61  E-value=72  Score=28.31  Aligned_cols=44  Identities=9%  Similarity=0.289  Sum_probs=30.7

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      |++++|+++++.++|+|.+                |+.++  .+|..   ++...+|+.+.|+.+
T Consensus        12 P~~V~v~i~~~~v~vkGp~----------------G~l~~--~~~~~---v~i~~~~~~i~v~~~   55 (178)
T CHL00140         12 PDNVNVSIDDQIIKVKGPK----------------GTLSR--KIPDL---ITIEIQDNSLFVSKK   55 (178)
T ss_pred             CCCCEEEEECCEEEEECCC----------------EEEEE--ECCCC---eEEEEeCCEEEEEcC
Confidence            6899999999999999875                44443  44553   334557777666654


No 153
>PF12992 DUF3876:  Domain of unknown function, B. Theta Gene description (DUF3876);  InterPro: IPR024452 This bacterial family of conserved proteins has no known function. 
Probab=40.55  E-value=89  Score=24.68  Aligned_cols=41  Identities=20%  Similarity=0.272  Sum_probs=32.9

Q ss_pred             CCCCceeEEEecCceEEEEEecCCC-----CCCceEEEEECCEEEEE
Q psy5221         322 QSYPPYNIELVDENKYRITMAIAGF-----NRSDLEIETERDTLRIT  363 (436)
Q Consensus       322 ~~~p~~~i~~e~~~~~~v~~~lpG~-----~~edi~v~~~~~~l~i~  363 (436)
                      ..-|++.|++ +++.|.|.+--+.-     +++.-.|.-+++.+-|.
T Consensus        23 ~~~P~v~I~r-~g~~Y~vti~~~~~~~~~~~p~tY~i~~~~g~~fI~   68 (95)
T PF12992_consen   23 NGKPDVTIYR-NGGSYKVTITYRSGYTGRAKPETYPIQEEDGNLFIE   68 (95)
T ss_pred             CCCCCEEEEE-CCCeEEEEEEEEcCcCCcccceEEEEEEeCCEEEEe
Confidence            4469999996 78889998877665     77888888888888876


No 154
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=40.17  E-value=19  Score=38.47  Aligned_cols=41  Identities=20%  Similarity=0.289  Sum_probs=25.5

Q ss_pred             CccEEEeCCCCC-ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221          88 EINAFATGAFKN-SSLIAISANLLESMTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        88 ~~NA~~~G~~~~-~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h  133 (436)
                      ...||+.|+.+. ..+|+.+-  -.  +-++ ...|+||+||-.|.-
T Consensus       349 ~~Ga~~~~~~~~~~p~i~~N~--~~--~~~~-v~TL~HE~GHa~H~~  390 (587)
T TIGR02290       349 RGGAFCTGFPPSKEPRVLMNY--DG--SRRD-VSTLAHELGHAYHSE  390 (587)
T ss_pred             CCCcccCCCCCCCCCEEEEec--CC--Cchh-HHHHHHHhhHHHHHH
Confidence            778999886553 24444431  11  2222 356999999999843


No 155
>PHA00527 hypothetical protein
Probab=39.85  E-value=69  Score=25.50  Aligned_cols=49  Identities=14%  Similarity=0.177  Sum_probs=28.9

Q ss_pred             HHHHHHHHHhcCCCCCeEE---------EECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          65 LKTVNKISKNANISTPEVA---------IYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p~v~---------v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      +...+++|.++|.+..+--         -..+ .-|.+++|..       -+           -++-|+||.+|...
T Consensus        28 ~~EWQ~~~~~LGVD~~~~~mla~~~~S~~~s~~~~~L~~~GVF-------NG-----------K~~T~~HECAH~AF   86 (129)
T PHA00527         28 KEEWQQACIHLGVDSGGNEMLAGATQSYCNTETGENLYLLGVF-------NG-----------KAATLVHECAHVAF   86 (129)
T ss_pred             hHHHHHHHHHhCCCcCCchhhhccccccccccCCCeEEEEEEe-------cc-----------HHHHHHHHHHHHHH
Confidence            3567889999998753321         1111 4455555421       11           24679999999754


No 156
>KOG3624|consensus
Probab=39.38  E-value=43  Score=36.55  Aligned_cols=43  Identities=28%  Similarity=0.421  Sum_probs=30.0

Q ss_pred             CccEEEeCCCCCccEEEEcHHHHhh----------cCHHHHHHHHHHHHHhhhcCh
Q psy5221          88 EINAFATGAFKNSSLIAISANLLES----------MTKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        88 ~~NA~~~G~~~~~~~v~it~gLl~~----------l~~~El~aVLAHElgHi~~~h  133 (436)
                      ..|||-.   +.+..|.+--|+++.          ++=.-+-+|||||++|---.+
T Consensus       481 ~~na~Y~---~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~  533 (687)
T KOG3624|consen  481 QVNAFYS---PEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQ  533 (687)
T ss_pred             eeecccc---CCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhccccc
Confidence            7777655   334556677777764          455668899999999955433


No 157
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.20  E-value=5.9  Score=36.04  Aligned_cols=14  Identities=36%  Similarity=0.513  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHhhhc
Q psy5221         118 IKAIIAHEISHISN  131 (436)
Q Consensus       118 l~aVLAHElgHi~~  131 (436)
                      .+.++|||+||.-.
T Consensus       140 ~a~~~aHElGH~LG  153 (207)
T cd04273         140 SAFTIAHELGHVLG  153 (207)
T ss_pred             eEEeeeeechhhcC
Confidence            57899999999873


No 158
>PF06861 BALF1:  BALF1 protein;  InterPro: IPR010677  Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members []. 
Probab=39.04  E-value=22  Score=30.99  Aligned_cols=23  Identities=13%  Similarity=0.218  Sum_probs=20.6

Q ss_pred             cCHHHHHHHHHHHHHhhhcChHH
Q psy5221         113 MTKLEIKAIIAHEISHISNGDMV  135 (436)
Q Consensus       113 l~~~El~aVLAHElgHi~~~h~~  135 (436)
                      .++.|-.++++|++||+..+|-.
T Consensus       127 ~~d~e~~s~v~~~lA~Fy~~~r~  149 (182)
T PF06861_consen  127 LNDHENASLVSHALAHFYLRYRR  149 (182)
T ss_pred             cCchHHHHHHHHHHHHHHHHHHH
Confidence            48999999999999999998843


No 159
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.99  E-value=19  Score=32.80  Aligned_cols=15  Identities=20%  Similarity=0.348  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHHhhhc
Q psy5221         117 EIKAIIAHEISHISN  131 (436)
Q Consensus       117 El~aVLAHElgHi~~  131 (436)
                      =+.+|+.||+||..-
T Consensus        37 l~~~v~iHElgH~~~   51 (208)
T cd06161          37 LFLSVLLHELGHALV   51 (208)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            467899999999753


No 160
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP.  Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=38.58  E-value=18  Score=32.32  Aligned_cols=13  Identities=31%  Similarity=0.445  Sum_probs=10.8

Q ss_pred             HHHHHHHHHhhhc
Q psy5221         119 KAIIAHEISHISN  131 (436)
Q Consensus       119 ~aVLAHElgHi~~  131 (436)
                      ..|+-||+||+.-
T Consensus        10 ~~v~iHElGH~~~   22 (182)
T cd06163          10 ILIFVHELGHFLV   22 (182)
T ss_pred             HHHHHHHHHHHHH
Confidence            4689999999865


No 161
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=38.41  E-value=20  Score=31.63  Aligned_cols=14  Identities=21%  Similarity=0.432  Sum_probs=11.4

Q ss_pred             HHHHHHHHHHhhhc
Q psy5221         118 IKAIIAHEISHISN  131 (436)
Q Consensus       118 l~aVLAHElgHi~~  131 (436)
                      +.+|+.||+||..-
T Consensus         8 ~i~i~iHE~gH~~~   21 (180)
T cd05709           8 LISVTVHELGHALV   21 (180)
T ss_pred             HHHHHHHHHHHHHH
Confidence            45789999999863


No 162
>cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st
Probab=37.09  E-value=52  Score=25.42  Aligned_cols=32  Identities=9%  Similarity=0.194  Sum_probs=28.0

Q ss_pred             cCceEEEEEecC-CCCCCceEEEEECCEEEEEE
Q psy5221         333 DENKYRITMAIA-GFNRSDLEIETERDTLRITG  364 (436)
Q Consensus       333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g  364 (436)
                      ....|.-.+.|| ++++++|+-+.+++.|+|+-
T Consensus        58 ~~g~f~r~i~LP~~v~~~~i~A~~~nGvL~I~l   90 (92)
T cd06472          58 SSGRFVRRFRLPENADADEVKAFLENGVLTVTV   90 (92)
T ss_pred             eccEEEEEEECCCCCCHHHCEEEEECCEEEEEe
Confidence            356899999999 68999999999999999973


No 163
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9  interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=37.00  E-value=41  Score=25.90  Aligned_cols=32  Identities=6%  Similarity=0.091  Sum_probs=28.0

Q ss_pred             CceEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221         334 ENKYRITMAIA-GFNRSDLEIET-ERDTLRITGR  365 (436)
Q Consensus       334 ~~~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~  365 (436)
                      ...|.=.+.|| +++++.|+=++ .++.|+|+.-
T Consensus        53 ~~~F~R~~~LP~~Vd~~~i~A~~~~dGvL~I~~P   86 (87)
T cd06481          53 YQEFVREAQLPEHVDPEAVTCSLSPSGHLHIRAP   86 (87)
T ss_pred             eeEEEEEEECCCCcChHHeEEEeCCCceEEEEcC
Confidence            35799999998 79999999999 8999999853


No 164
>TIGR02421 QEGLA conserved hypothetical protein. Members of this family include a possible metal-binding motif HEXXXH and, nearby, a perfectly conserved motif QEGLA. All members belong to the Proteobacteria, including Agrobacterium tumefaciens and several species of Vibrio and Pseudomonas, and are found in only one copy per chromosome (Vibrio vulnificus, with two chromosomes, has two). The function is unknown.
Probab=36.80  E-value=75  Score=31.70  Aligned_cols=60  Identities=15%  Similarity=0.244  Sum_probs=43.5

Q ss_pred             hhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHH
Q psy5221          61 EDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEIS  127 (436)
Q Consensus        61 ~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElg  127 (436)
                      .....+.+++.++..+.. ..+|.+-++ ..+|.+.|    + .|.+..+  ..+++.++.+++-||+|
T Consensus       136 A~~a~~~~~~~~~~y~~~~~~~V~~sd~l~a~a~v~~----~-~l~i~~~--a~fs~~~l~~L~~HEig  197 (366)
T TIGR02421       136 ATEAAEILQQRLEDYFGEETIRVTLSDDLPAGAMVSG----D-KLKLNSD--AMFSERDLEALIHHEIG  197 (366)
T ss_pred             HHHHHHHHHHHHHHhCCCCceEEEECcchhHHHhccC----C-eEEECCC--CCcCHHHHHHHHHHhHH
Confidence            344788888888888875 335555566 67887664    2 3767762  34699999999999999


No 165
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=36.58  E-value=22  Score=32.95  Aligned_cols=15  Identities=20%  Similarity=0.364  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHhhhc
Q psy5221         117 EIKAIIAHEISHISN  131 (436)
Q Consensus       117 El~aVLAHElgHi~~  131 (436)
                      =+.+|+.||+||..-
T Consensus        52 l~~~v~iHElgH~~~   66 (227)
T cd06164          52 LFASVLLHELGHSLV   66 (227)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            457899999999753


No 166
>PF01432 Peptidase_M3:  Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.;  InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains:  mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=36.12  E-value=26  Score=36.05  Aligned_cols=18  Identities=17%  Similarity=0.420  Sum_probs=14.0

Q ss_pred             cCHHHHHHHHHHHHHhhhc
Q psy5221         113 MTKLEIKAIIAHEISHISN  131 (436)
Q Consensus       113 l~~~El~aVLAHElgHi~~  131 (436)
                      ++-+++ ..|+||+||..|
T Consensus       238 l~~~~v-~tLfHE~GHa~H  255 (458)
T PF01432_consen  238 LSHDDV-ETLFHEFGHAMH  255 (458)
T ss_dssp             -SHHHH-HHHHHHHHHHHH
T ss_pred             cChhhH-HHHHHHHhHHHH
Confidence            356676 789999999876


No 167
>cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain  (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=35.96  E-value=51  Score=24.91  Aligned_cols=31  Identities=10%  Similarity=0.235  Sum_probs=27.0

Q ss_pred             ceEEEEEecC-CCCCCceEEEEEC-CEEEEEEE
Q psy5221         335 NKYRITMAIA-GFNRSDLEIETER-DTLRITGR  365 (436)
Q Consensus       335 ~~~~v~~~lp-G~~~edi~v~~~~-~~l~i~g~  365 (436)
                      ..|.=.+.|| ++++++++-++.+ +.|+|+..
T Consensus        50 ~~f~r~~~LP~~vd~~~i~A~~~~~GvL~I~~P   82 (83)
T cd06526          50 REFTRRYQLPEGVDPDSVTSSLSSDGVLTIEAP   82 (83)
T ss_pred             EEEEEEEECCCCCChHHeEEEeCCCcEEEEEec
Confidence            4788899998 7799999999998 99999864


No 168
>cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain  (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=35.54  E-value=61  Score=24.17  Aligned_cols=33  Identities=12%  Similarity=0.204  Sum_probs=28.6

Q ss_pred             cCceEEEEEecC-CCCCCceEEEEECCEEEEEEE
Q psy5221         333 DENKYRITMAIA-GFNRSDLEIETERDTLRITGR  365 (436)
Q Consensus       333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~  365 (436)
                      ....|.-.+.|| ++++++++-+++++.|+|+..
T Consensus        54 ~~~~f~r~~~LP~~vd~~~i~a~~~~G~L~I~~p   87 (88)
T cd06464          54 SYGSFSRSFRLPEDVDPDKIKASLENGVLTITLP   87 (88)
T ss_pred             eCcEEEEEEECCCCcCHHHcEEEEeCCEEEEEEc
Confidence            467899999999 678999999999999999853


No 169
>PF00011 HSP20:  Hsp20/alpha crystallin family This prints entry is a subset of the Pfam entry.;  InterPro: IPR002068 Prokaryotic and eukaryotic organisms respond to heat shock or other environmental stress by inducing the synthesis of proteins collectively known as heat-shock proteins (hsp) []. Amongst them is a family of proteins with an average molecular weight of 20 Kd, known as the hsp20 proteins []. These seem to act as chaperones that can protect other proteins against heat-induced denaturation and aggregation. Hsp20 proteins seem to form large heterooligomeric aggregates. Structurally, this family is characterised by the presence of a conserved C-terminal domain of about 100 residues.; PDB: 2BOL_B 3N3E_B 2H50_P 2H53_F 2BYU_L 1GME_D 3VQM_J 3VQK_E 3VQL_A 3AAC_A ....
Probab=34.54  E-value=56  Score=25.57  Aligned_cols=38  Identities=13%  Similarity=0.241  Sum_probs=28.8

Q ss_pred             cCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEeccc
Q psy5221         333 DENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADT  370 (436)
Q Consensus       333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~  370 (436)
                      ....|.-.+.|| +++.+.|+-.+++|.|+|+..+....
T Consensus        53 ~~~~f~r~~~lP~~vd~~~i~a~~~~GvL~I~~pk~~~~   91 (102)
T PF00011_consen   53 RYGSFERSIRLPEDVDPDKIKASYENGVLTITIPKKEEE   91 (102)
T ss_dssp             -SEEEEEEEE-STTB-GGG-EEEETTSEEEEEEEBSSSC
T ss_pred             ccceEEEEEcCCCcCCcceEEEEecCCEEEEEEEccccc
Confidence            345788899998 67899999999999999999887654


No 170
>PF13402 M60-like:  Peptidase M60-like family; PDB: 4FCA_A.
Probab=33.90  E-value=1.9e+02  Score=27.66  Aligned_cols=64  Identities=9%  Similarity=0.045  Sum_probs=30.5

Q ss_pred             HHHHHHHHHhcCCCCC----------e-EEEECC---CccEEEeCCCCCccEEEEcHHHHhh-cCHHHH---HHHHHHHH
Q psy5221          65 LKTVNKISKNANISTP----------E-VAIYNG---EINAFATGAFKNSSLIAISANLLES-MTKLEI---KAIIAHEI  126 (436)
Q Consensus        65 ~~~l~~l~~~~g~~~p----------~-v~v~~~---~~NA~~~G~~~~~~~v~it~gLl~~-l~~~El---~aVLAHEl  126 (436)
                      .++++...+-+|++..          . -++.+.   ...+++.|    .++.+... -.+. ++.+.+   .=-++||+
T Consensus       153 D~ii~~~~el~Gl~~~~~~~~~~~~~~~r~v~~v~~~~g~m~a~g----~~i~~~~~-~~~~~l~~~~~~~~~WG~~HE~  227 (307)
T PF13402_consen  153 DRIIDAEYELAGLDKSSPGPENNPMPNNRFVFDVQISAGYMHASG----YPIGFPPN-WMNELLNPNPLRKGGWGPWHEL  227 (307)
T ss_dssp             HHHHHHHHHHTT-BSS--GGGB--S--EEEEEETT----SEEEET----TEEEEETT---HHHH-HHHHHHH-HHHHHHH
T ss_pred             HHHHHHHHHHhCCCcccCCccccCcccceEEEeccccccceeecC----CcEEeeCc-HHhcccCHhHcCCCCeeehhhh
Confidence            3455555556777641          1 233333   24555555    23333333 3333 366655   44689999


Q ss_pred             HhhhcCh
Q psy5221         127 SHISNGD  133 (436)
Q Consensus       127 gHi~~~h  133 (436)
                      ||..+.-
T Consensus       228 GH~~Q~~  234 (307)
T PF13402_consen  228 GHNHQQG  234 (307)
T ss_dssp             HHHH-BG
T ss_pred             hhhcCcc
Confidence            9998866


No 171
>PF08014 DUF1704:  Domain of unknown function (DUF1704);  InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=33.62  E-value=1e+02  Score=30.67  Aligned_cols=63  Identities=17%  Similarity=0.295  Sum_probs=42.0

Q ss_pred             HHHHHHHHHHHhc----CCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHH-hhhcCh
Q psy5221          63 WLLKTVNKISKNA----NISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEIS-HISNGD  133 (436)
Q Consensus        63 ~L~~~l~~l~~~~----g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElg-Hi~~~h  133 (436)
                      .+...+++.+++.    +++ ..|.+-++ ..+|.+.+    ++ |.+..+  ..+++.++.+++-||+| |+....
T Consensus       113 ~~~~~~~~~~~~y~~~~~~~-~~V~~sddl~a~A~v~~----~~-l~I~~~--~~fs~~~l~~L~~HEigvH~lt~~  181 (349)
T PF08014_consen  113 EAVSRLQERLKKYFGKEGFE-VKVELSDDLLARAMVSG----DR-LKINKN--AMFSERDLEALLHHEIGVHLLTTL  181 (349)
T ss_pred             HHHHHHHHHHHHHhcccCce-EEEEEcCCcchhhcccC----Ce-eEEcCC--CCcCHHHHHHHHHHhhhhhhcccc
Confidence            3667777777776    443 35666666 78886664    22 666652  24599999999999996 555443


No 172
>KOG1046|consensus
Probab=33.36  E-value=79  Score=35.66  Aligned_cols=66  Identities=24%  Similarity=0.373  Sum_probs=37.7

Q ss_pred             HHHHHHHHhcCCC--CC--eEEEECC-CccEEEeCCCCCccEEEEcHH-HH-hh----c-CHHHHHHHHHHHHHhhhcCh
Q psy5221          66 KTVNKISKNANIS--TP--EVAIYNG-EINAFATGAFKNSSLIAISAN-LL-ES----M-TKLEIKAIIAHEISHISNGD  133 (436)
Q Consensus        66 ~~l~~l~~~~g~~--~p--~v~v~~~-~~NA~~~G~~~~~~~v~it~g-Ll-~~----l-~~~El~aVLAHElgHi~~~h  133 (436)
                      +.++-..+-.|++  .|  ++..+++ ...|+--     -+.|.+... |+ +.    . +...++-|+|||+||..-|.
T Consensus       265 ~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMEN-----wGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGN  339 (882)
T KOG1046|consen  265 KVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMEN-----WGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGN  339 (882)
T ss_pred             HHHHHHHHHhCCCCCCccccEEecCCccccchhc-----CcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcC
Confidence            4555555666775  44  4455555 4444322     122333322 21 11    1 35669999999999999998


Q ss_pred             HHH
Q psy5221         134 MVT  136 (436)
Q Consensus       134 ~~~  136 (436)
                      ...
T Consensus       340 LVT  342 (882)
T KOG1046|consen  340 LVT  342 (882)
T ss_pred             ccc
Confidence            654


No 173
>PF00347 Ribosomal_L6:  Ribosomal protein L6;  InterPro: IPR020040 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. L6 is a protein from the large (50S) subunit. In Escherichia coli, it is located in the aminoacyl-tRNA binding site of the peptidyltransferase centre, and is known to bind directly to 23S rRNA. It belongs to a family of ribosomal proteins, including L6 from bacteria, cyanelles (structures that perform similar functions to chloroplasts, but have structural and biochemical characteristics of Cyanobacteria) and mitochondria; and L9 from mammals, Drosophila, plants and yeast. L6 contains two domains with almost identical folds, suggesting that is was derived by the duplication of an ancient RNA-binding protein gene. Analysis reveals several sites on the protein surface where interactions with other ribosome components may occur, the N terminus being involved in protein-protein interactions and the C terminus containing possible RNA-binding sites []. This entry represents the alpha-beta domain found duplicated in ribosomal L6 proteins. This domain consists of two beta-sheets and one alpha-helix packed around single core [].; GO: 0003735 structural constituent of ribosome, 0019843 rRNA binding, 0006412 translation, 0005840 ribosome; PDB: 2HGJ_H 2HGQ_H 2HGU_H 1S1I_H 3O5H_I 3O58_I 3J16_F 3IZS_F 2V47_H 2WDJ_H ....
Probab=33.28  E-value=97  Score=22.79  Aligned_cols=47  Identities=9%  Similarity=0.222  Sum_probs=32.7

Q ss_pred             CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR  413 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk  413 (436)
                      |+.++|+++++.++++|.+                |.  .++.+|+.+ .++...+|+.+++....
T Consensus         2 P~gV~v~~~~~~i~v~G~~----------------g~--l~~~~~~~v-~v~~~~~~~~~~~~~~~   48 (77)
T PF00347_consen    2 PEGVKVTIKGNIITVKGPK----------------GE--LSRPIPPGV-KVEIKVEDNKITVSVLS   48 (77)
T ss_dssp             STTCEEEEETTEEEEESSS----------------SE--EEEEETTTE-EEEEEEETTSEEEEEEE
T ss_pred             CCcEEEEEeCcEEEEECCC----------------Ee--EEEECCCCe-eEEEEcCCCceEEEECc
Confidence            6789999999999999764                22  446777762 23333668877776654


No 174
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=32.98  E-value=24  Score=34.48  Aligned_cols=19  Identities=21%  Similarity=0.391  Sum_probs=16.2

Q ss_pred             cCHHHHHHHHHHHHHhhhc
Q psy5221         113 MTKLEIKAIIAHEISHISN  131 (436)
Q Consensus       113 l~~~El~aVLAHElgHi~~  131 (436)
                      +.+-.+=+.++||+||...
T Consensus       191 ~p~~~~P~T~~HElAHq~G  209 (318)
T PF12725_consen  191 LPPYSLPFTICHELAHQLG  209 (318)
T ss_pred             CCcccccHHHHHHHHHHhC
Confidence            4566788999999999987


No 175
>PF01954 DUF104:  Protein of unknown function DUF104;  InterPro: IPR008203 This family includes short archaebacterial proteins of unknown function. Archaeoglobus fulgidus has twelve copies of this protein, with several being clustered together in the genome.; PDB: 2NWT_A.
Probab=32.89  E-value=49  Score=23.70  Aligned_cols=17  Identities=24%  Similarity=0.372  Sum_probs=12.0

Q ss_pred             eEeEEeeCcEEEEEEec
Q psy5221         397 VVAAFFDQGILNIDLIR  413 (436)
Q Consensus       397 ~~~A~~~~GvL~i~~pk  413 (436)
                      .|+|.|+||+|+--=|-
T Consensus         4 ~I~aiYe~GvlkPl~~~   20 (60)
T PF01954_consen    4 VIEAIYENGVLKPLEPV   20 (60)
T ss_dssp             -EEEEEETTEEEECS--
T ss_pred             eEEEEEECCEEEECCCC
Confidence            48999999999855443


No 176
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=32.56  E-value=29  Score=31.03  Aligned_cols=16  Identities=19%  Similarity=0.220  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHhhhc
Q psy5221         116 LEIKAIIAHEISHISN  131 (436)
Q Consensus       116 ~El~aVLAHElgHi~~  131 (436)
                      -=+..+..||+||..-
T Consensus        39 ~l~~~l~iHElgH~~~   54 (183)
T cd06160          39 ALLAILGIHEMGHYLA   54 (183)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3466788999999753


No 177
>cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1.  Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer.  HspB5's functions include effects on the apoptotic pathway and on metastasis.  Phosphorylation of HspB5 reduces its ol
Probab=32.30  E-value=59  Score=24.73  Aligned_cols=30  Identities=13%  Similarity=0.335  Sum_probs=25.6

Q ss_pred             eEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221         336 KYRITMAIA-GFNRSDLEIET-ERDTLRITGR  365 (436)
Q Consensus       336 ~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~  365 (436)
                      +|.=.+.|| +++++.|+=++ .+|.|+|+..
T Consensus        51 ef~R~~~LP~~vd~~~i~A~~~~dGvL~I~~P   82 (83)
T cd06478          51 EFHRRYRLPPGVDPAAITSSLSADGVLTISGP   82 (83)
T ss_pred             EEEEEEECCCCcChHHeEEEECCCCEEEEEec
Confidence            388889998 69999999999 5999999853


No 178
>KOG3607|consensus
Probab=32.22  E-value=29  Score=37.84  Aligned_cols=42  Identities=19%  Similarity=0.184  Sum_probs=26.0

Q ss_pred             CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      ...||+.|+.......-+..  ...-+....+.|+||||||...
T Consensus       295 ~G~a~~~~mCs~~~s~gv~~--~~~~~~~~~a~v~AhelgH~lG  336 (716)
T KOG3607|consen  295 VGLAYFGGMCSPGHSGGVNK--FHSDILLAFAVVLAHELGHNLG  336 (716)
T ss_pred             eceeecccccCcccccceee--cCcccchhHHHHHHHHHHhhcC
Confidence            55677777654332232222  1111467889999999999865


No 179
>PF08308 PEGA:  PEGA domain;  InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=32.20  E-value=1.4e+02  Score=21.45  Aligned_cols=41  Identities=10%  Similarity=0.191  Sum_probs=29.5

Q ss_pred             ceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEE
Q psy5221         326 PYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRK  366 (436)
Q Consensus       326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~  366 (436)
                      |+.+..-....|.|++..||+.+..-+|.+..+ ...|+.+-
T Consensus        26 p~~~~~l~~G~~~v~v~~~Gy~~~~~~v~v~~~~~~~v~~~L   67 (71)
T PF08308_consen   26 PLTLKDLPPGEHTVTVEKPGYEPYTKTVTVKPGETTTVNVTL   67 (71)
T ss_pred             cceeeecCCccEEEEEEECCCeeEEEEEEECCCCEEEEEEEE
Confidence            344542256789999999999988888888744 66666553


No 180
>PF14913 DPCD:  DPCD protein family
Probab=31.72  E-value=2.1e+02  Score=25.59  Aligned_cols=73  Identities=8%  Similarity=0.235  Sum_probs=50.1

Q ss_pred             CceeEEEecCceEEEEEecCCCCCCceEEEEECC--EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCC--------c
Q psy5221         325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERD--TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLAD--------H  394 (436)
Q Consensus       325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~--~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~--------~  394 (436)
                      -|+-+.+++..+|+-++-===+.+|--+|+++++  .++|+-.      .++          |.++|.+|+        +
T Consensus        86 nP~~~r~dTk~~fqWRIRNLPYP~dvYsVtvd~~~r~ivvRTt------NKK----------YyKk~~IPDl~R~~l~l~  149 (194)
T PF14913_consen   86 NPIFVRRDTKTSFQWRIRNLPYPKDVYSVTVDEDERCIVVRTT------NKK----------YYKKFSIPDLDRCGLPLE  149 (194)
T ss_pred             CCEEEEEcCccceEEEEccCCCCccceEEEEcCCCcEEEEECc------Ccc----------ceeEecCCcHHhhCCCcc
Confidence            4555556778888888753335678788888764  6888732      222          445666664        2


Q ss_pred             cceEeEEeeCcEEEEEEec
Q psy5221         395 VHVVAAFFDQGILNIDLIR  413 (436)
Q Consensus       395 v~~~~A~~~~GvL~i~~pk  413 (436)
                      -+.++-...|..|.|+-.|
T Consensus       150 ~~~ls~~h~nNTLIIsYkK  168 (194)
T PF14913_consen  150 QSALSFAHQNNTLIISYKK  168 (194)
T ss_pred             hhhceeeeecCeEEEEecC
Confidence            2248999999999999988


No 181
>KOG3413|consensus
Probab=31.16  E-value=51  Score=28.03  Aligned_cols=30  Identities=13%  Similarity=0.350  Sum_probs=18.8

Q ss_pred             EeEEeeCcEEEEEEecC----CCCCCCCeEeeec
Q psy5221         398 VAAFFDQGILNIDLIRE----VPEALKPRKIYIN  427 (436)
Q Consensus       398 ~~A~~~~GvL~i~~pk~----~~~~~~~~~I~I~  427 (436)
                      -.+.|.||||+|.++-.    ..+....++|=..
T Consensus        74 ~Dv~y~~GVLTl~lg~~GTYViNKQ~PnkQIWlS  107 (156)
T KOG3413|consen   74 FDVDYADGVLTLKLGSVGTYVINKQPPNKQIWLS  107 (156)
T ss_pred             cccccccceEEEEecCceeEEecCCCCcceeeee
Confidence            45679999999999741    1233344555554


No 182
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=30.95  E-value=29  Score=32.91  Aligned_cols=15  Identities=27%  Similarity=0.372  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHhhhc
Q psy5221         117 EIKAIIAHEISHISN  131 (436)
Q Consensus       117 El~aVLAHElgHi~~  131 (436)
                      =+.+|+.||+||..-
T Consensus       117 L~isv~iHElgHa~~  131 (263)
T cd06159         117 LVVGVVVHELSHGIL  131 (263)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456799999999753


No 183
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1.  HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer.  Its functions include effects on the apoptotic pathway and on metastasis.  Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=30.62  E-value=72  Score=24.36  Aligned_cols=31  Identities=10%  Similarity=0.249  Sum_probs=26.7

Q ss_pred             eEEEEEecC-CCCCCceEEEEE-CCEEEEEEEE
Q psy5221         336 KYRITMAIA-GFNRSDLEIETE-RDTLRITGRK  366 (436)
Q Consensus       336 ~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g~~  366 (436)
                      +|.=.+.|| +++++.|+=+.+ +|.|+|+..+
T Consensus        51 eF~R~~~LP~~vd~~~i~A~~~~dGvL~I~lPk   83 (84)
T cd06498          51 EFQRKYRIPADVDPLTITSSLSPDGVLTVCGPR   83 (84)
T ss_pred             EEEEEEECCCCCChHHcEEEeCCCCEEEEEEeC
Confidence            488889998 799999999995 9999998654


No 184
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=30.12  E-value=13  Score=35.84  Aligned_cols=12  Identities=42%  Similarity=0.590  Sum_probs=10.3

Q ss_pred             HHHHHHHHHhhh
Q psy5221         119 KAIIAHEISHIS  130 (436)
Q Consensus       119 ~aVLAHElgHi~  130 (436)
                      ..|++||+||.-
T Consensus       166 igv~~HE~gH~l  177 (286)
T TIGR03296       166 VGVIAHELGHDL  177 (286)
T ss_pred             eeeeehhhhccc
Confidence            699999999953


No 185
>PF13398 Peptidase_M50B:  Peptidase M50B-like
Probab=30.00  E-value=32  Score=31.17  Aligned_cols=15  Identities=20%  Similarity=0.437  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHHHhhh
Q psy5221         116 LEIKAIIAHEISHIS  130 (436)
Q Consensus       116 ~El~aVLAHElgHi~  130 (436)
                      =.+..++.||+||..
T Consensus        20 ~~~l~t~~HE~gHal   34 (200)
T PF13398_consen   20 FRLLVTFVHELGHAL   34 (200)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            356679999999964


No 186
>PF09471 Peptidase_M64:  IgA Peptidase M64;  InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=29.64  E-value=32  Score=32.64  Aligned_cols=17  Identities=24%  Similarity=0.421  Sum_probs=13.3

Q ss_pred             CHHHHHHHHHHHHHhhh
Q psy5221         114 TKLEIKAIIAHEISHIS  130 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~  130 (436)
                      +......|+.||+||--
T Consensus       212 ~~~~~~~v~vHE~GHsf  228 (264)
T PF09471_consen  212 DNPSFKQVVVHEFGHSF  228 (264)
T ss_dssp             TSTTHHHHHHHHHHHHT
T ss_pred             Ccccccceeeeeccccc
Confidence            33478999999999943


No 187
>cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II.
Probab=29.57  E-value=1e+02  Score=24.10  Aligned_cols=30  Identities=7%  Similarity=0.123  Sum_probs=26.6

Q ss_pred             ceEEEEEecC-CCCCCceEEEEE-CCEEEEEE
Q psy5221         335 NKYRITMAIA-GFNRSDLEIETE-RDTLRITG  364 (436)
Q Consensus       335 ~~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g  364 (436)
                      .+|.=.+.|| +++.++|+=.+. ++.|+|++
T Consensus        58 r~F~R~~~LP~~Vd~~~v~s~l~~dGvL~Iea   89 (91)
T cd06480          58 KNFTKKIQLPPEVDPVTVFASLSPEGLLIIEA   89 (91)
T ss_pred             EEEEEEEECCCCCCchhEEEEeCCCCeEEEEc
Confidence            5688899998 899999999998 88999985


No 188
>COG0097 RplF Ribosomal protein L6P/L9E [Translation, ribosomal structure and biogenesis]
Probab=29.47  E-value=1.7e+02  Score=25.93  Aligned_cols=47  Identities=19%  Similarity=0.357  Sum_probs=31.6

Q ss_pred             CCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221         346 FNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI  412 (436)
Q Consensus       346 ~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p  412 (436)
                      ..|++++|+++++.++++|.+                |+.+|.|  +..+  ++-..+|+.+.+...
T Consensus        10 ~~P~gV~V~i~~~~v~vkGpk----------------GeL~~~~--~~~~--v~v~~~~~~~vv~~~   56 (178)
T COG0097          10 VIPAGVTVSIEGQVVTVKGPK----------------GELTREF--HDNV--VKVEVEDNILVVRPV   56 (178)
T ss_pred             ecCCCeEEEEeccEEEEECCC----------------cEEEEEe--cCcc--eEEEecCCEEEEeec
Confidence            348999999999999999875                3444333  3333  566666776655543


No 189
>COG4004 Uncharacterized protein conserved in archaea [Function unknown]
Probab=28.71  E-value=2.7e+02  Score=21.76  Aligned_cols=48  Identities=21%  Similarity=0.369  Sum_probs=35.3

Q ss_pred             chHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEE
Q psy5221         304 IGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGR  365 (436)
Q Consensus       304 ~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~  365 (436)
                      ..-|++.+.++..+         +++.+ .+|  .+....||+  +.|+|+.+++.|.|.+.
T Consensus        12 ~~~dri~~~l~e~g---------~~v~~-eGD--~ivas~pgi--s~ieik~E~kkL~v~t~   59 (96)
T COG4004          12 PDPDRIMRGLSELG---------WTVSE-EGD--RIVASSPGI--SRIEIKPENKKLLVNTT   59 (96)
T ss_pred             CCHHHHHHHHHHhC---------eeEee-ccc--EEEEecCCc--eEEEEecccceEEEecc
Confidence            34577777776643         46664 444  778899999  56889999999999873


No 190
>TIGR03750 conj_TIGR03750 conjugative transfer region protein, TIGR03750 family. Members of this protein family are found occasionally on plasmids. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=28.56  E-value=3.2e+02  Score=22.21  Aligned_cols=41  Identities=15%  Similarity=0.217  Sum_probs=28.6

Q ss_pred             HHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCe
Q psy5221          33 SIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPE   81 (436)
Q Consensus        33 ~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~   81 (436)
                      .+...++..+.+.+..+.+    |    +.++++.++....+.++....
T Consensus        59 ~~l~v~~gg~~l~rlKRGr----P----e~yl~r~l~~~~~~~~l~~~~   99 (111)
T TIGR03750        59 PILVVLIGGKLLARLKRGK----P----EGYLYRKLEWKLARLGLGRHR   99 (111)
T ss_pred             HHHHHHHhHHHHHHHHcCC----C----chHHHHHHHHHHHHcCCCCCC
Confidence            3455666777777777766    3    334999999999998875433


No 191
>cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits.  HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing  for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)]  is selectively up-regulated in skeletal muscles from myotonic dystrophy patients.
Probab=28.31  E-value=1e+02  Score=23.51  Aligned_cols=30  Identities=10%  Similarity=0.100  Sum_probs=25.7

Q ss_pred             eEEEEEecC-CCCCCceEEEEE-CCEEEEEEE
Q psy5221         336 KYRITMAIA-GFNRSDLEIETE-RDTLRITGR  365 (436)
Q Consensus       336 ~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g~  365 (436)
                      +|.=.+.|| +++++.|+=+.. +|.|+|+..
T Consensus        51 eF~R~~~LP~~vd~~~v~A~~~~dGvL~I~~P   82 (83)
T cd06476          51 EFTRTYILPMDVDPLLVRASLSHDGILCIQAP   82 (83)
T ss_pred             EEEEEEECCCCCChhhEEEEecCCCEEEEEec
Confidence            488889999 799999999996 999999853


No 192
>PF14521 Aspzincin_M35:  Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=28.27  E-value=80  Score=27.03  Aligned_cols=40  Identities=18%  Similarity=0.174  Sum_probs=24.1

Q ss_pred             ccEEEeCCCCCccEEEEcHHHHhhc--CHHHHHHHHHHHHHhh
Q psy5221          89 INAFATGAFKNSSLIAISANLLESM--TKLEIKAIIAHEISHI  129 (436)
Q Consensus        89 ~NA~~~G~~~~~~~v~it~gLl~~l--~~~El~aVLAHElgHi  129 (436)
                      .-|++....+ ...|.+-......-  ..+.-+++|.||+.|+
T Consensus        66 ~~a~~~~~~~-~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf  107 (148)
T PF14521_consen   66 TYAYVYPDSP-TYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF  107 (148)
T ss_dssp             SSEEE-TTST--TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred             eEEEEECCCC-ceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence            3444444322 24688888777742  4567799999999994


No 193
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family.  This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=28.07  E-value=35  Score=32.58  Aligned_cols=15  Identities=20%  Similarity=0.233  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHhhhc
Q psy5221         117 EIKAIIAHEISHISN  131 (436)
Q Consensus       117 El~aVLAHElgHi~~  131 (436)
                      =+.+|+.||+||..-
T Consensus       134 l~isvvvHElgHal~  148 (277)
T cd06162         134 LLISGVVHEMGHGVA  148 (277)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            377899999999754


No 194
>KOG0175|consensus
Probab=25.80  E-value=1.4e+02  Score=28.03  Aligned_cols=58  Identities=12%  Similarity=0.157  Sum_probs=48.7

Q ss_pred             CCeEEEECC-----CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221          79 TPEVAIYNG-----EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus        79 ~p~v~v~~~-----~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~  136 (436)
                      -|.+|.+|+     .-|.|..|.+....+=++++|.=-.|+++|-..+----+.|-.+||...
T Consensus       179 GP~lyYVDseG~Rl~G~~FSVGSGs~yAYGVLDsgYr~dls~eEA~~L~rrAI~hAThRDayS  241 (285)
T KOG0175|consen  179 GPGLYYVDSEGTRLSGDLFSVGSGSTYAYGVLDSGYRYDLSDEEAYDLARRAIYHATHRDAYS  241 (285)
T ss_pred             CCceEEEcCCCCEecCceEeecCCCceeEEeeccCCCCCCCHHHHHHHHHHHHHHHHhccccc
Confidence            478999988     7899999998877777888888778999998888888888888888653


No 195
>cd02175 GH16_lichenase lichenase, member of glycosyl hydrolase family 16. Lichenase, also known as 1,3-1,4-beta-glucanase, is a member of glycosyl hydrolase family 16, that specifically cleaves 1,4-beta-D-glucosidic bonds in mixed-linked beta glucans that also contain 1,3-beta-D-glucosidic linkages.  Natural substrates of beta-glucanase are beta-glucans from grain endosperm cell walls or lichenan from the Islandic moss, Cetraria islandica.  This protein is found not only in bacteria but also in anaerobic fungi.  This domain includes two seven-stranded antiparallel beta-sheets that are adjacent to one another forming a compact, jellyroll beta-sandwich structure.
Probab=25.72  E-value=2.7e+02  Score=25.18  Aligned_cols=53  Identities=23%  Similarity=0.324  Sum_probs=31.2

Q ss_pred             CCceEEEEECCEEEEEEEEecccC----CCceEEEe-eeeeeEEEEEECCCccceEeEEe
Q psy5221         348 RSDLEIETERDTLRITGRKQADTQ----QRNFLHRG-IATRNFEHRFQLADHVHVVAAFF  402 (436)
Q Consensus       348 ~edi~v~~~~~~l~i~g~~~~~~~----~~~~~~~e-~~~~~f~r~~~lP~~v~~~~A~~  402 (436)
                      +++++  ++++.|.|+..++....    ......++ ..+|.|+-++++|...-...|-+
T Consensus        31 ~~nv~--v~~g~L~l~~~~~~~~~~~~tsg~i~S~~~f~yG~~ear~k~~~~~G~~~Afw   88 (212)
T cd02175          31 ADNVE--FSDGGLALTLTNDTYGEKPYACGEYRTRGFYGYGRYEVRMKPAKGSGVVSSFF   88 (212)
T ss_pred             cccEE--EECCeEEEEEeCCcCCCCccccceEEECceEEeeEEEEEEEcCCCCeEEEEEE
Confidence            45544  44788888877654211    11121112 57999999999997533444444


No 196
>COG4823 AbiF Abortive infection bacteriophage resistance protein [Defense mechanisms]
Probab=25.54  E-value=45  Score=31.27  Aligned_cols=14  Identities=43%  Similarity=0.608  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHh
Q psy5221         115 KLEIKAIIAHEISH  128 (436)
Q Consensus       115 ~~El~aVLAHElgH  128 (436)
                      +-+++.||||++||
T Consensus        89 Et~iKs~iAyllg~  102 (299)
T COG4823          89 ETEIKSVIAYLLGH  102 (299)
T ss_pred             HHHHHHHHHHHhcc
Confidence            45789999999999


No 197
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=25.19  E-value=38  Score=27.28  Aligned_cols=30  Identities=20%  Similarity=0.276  Sum_probs=18.5

Q ss_pred             cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221         101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDM  134 (436)
Q Consensus       101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~  134 (436)
                      ..|.++..    ..+.--+--|+||++|+.+...
T Consensus        66 ~~i~IDP~----~~~KGC~~TL~HEL~H~WQ~Rs   95 (141)
T PHA02456         66 GWIEIDPD----YANKGCRDTLAHELNHAWQFRT   95 (141)
T ss_pred             eEEEECCc----ccccchHHHHHHHHHHHHhhhc
Confidence            34555541    2233455678999999987553


No 198
>PRK11767 SpoVR family protein; Provisional
Probab=24.91  E-value=1.2e+02  Score=31.44  Aligned_cols=67  Identities=16%  Similarity=0.314  Sum_probs=41.1

Q ss_pred             HHHHHHHHHHhcCCC--CCeEEEECC--CccEEEe-CC------------------------CC-CccEEEEcHHHHhhc
Q psy5221          64 LLKTVNKISKNANIS--TPEVAIYNG--EINAFAT-GA------------------------FK-NSSLIAISANLLESM  113 (436)
Q Consensus        64 L~~~l~~l~~~~g~~--~p~v~v~~~--~~NA~~~-G~------------------------~~-~~~~v~it~gLl~~l  113 (436)
                      ..+.|+++|+.+|+.  +..+-|++.  ...|++. |+                        .. .--+|+-|.+-+.-|
T Consensus        20 ~~~~I~~iA~~~GLD~yp~~~EIi~~eqml~~~as~GmP~rY~HWsfGk~y~~~~~~Y~~gl~glaYEiVINSnPciAyL   99 (498)
T PRK11767         20 YLDEIERVAKEYGLDTYPNQIEVITAEQMMDAYSSVGMPINYRHWSFGKHFIETEQLYRRGQMGLAYEIVINSNPCIAYL   99 (498)
T ss_pred             HHHHHHHHHHHcCCCCCCceEEEECHHHHHHHHHhcCCCCCCCcccccHHHHHHHHHHhcCCCCCceEEEecCchHHHHH
Confidence            467788899999997  345556665  4444443 21                        11 123455555554444


Q ss_pred             ----CHHHHHHHHHH-HHHhhh
Q psy5221         114 ----TKLEIKAIIAH-EISHIS  130 (436)
Q Consensus       114 ----~~~El~aVLAH-ElgHi~  130 (436)
                          +.-.-+-|+|| ++||..
T Consensus       100 me~Ntl~~q~LViAHv~yGHnd  121 (498)
T PRK11767        100 MEENTMTMQALVIAHACYGHNS  121 (498)
T ss_pred             hccCcHHHHHHHHHHHHHhhhh
Confidence                55567899999 699963


No 199
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits.  TOP has been shown to degrade peptides released by the proteasom
Probab=23.70  E-value=39  Score=35.02  Aligned_cols=18  Identities=22%  Similarity=0.289  Sum_probs=13.4

Q ss_pred             HHHHHHHHHHHHHhhhcCh
Q psy5221         115 KLEIKAIIAHEISHISNGD  133 (436)
Q Consensus       115 ~~El~aVLAHElgHi~~~h  133 (436)
                      -+++ ..|+||+||..|.-
T Consensus       261 ~~~V-~TLfHEfGHalH~~  278 (472)
T cd06455         261 HDEV-ETFFHEFGHVIHHL  278 (472)
T ss_pred             HHHH-HHHHHHHHHHHHHH
Confidence            3554 56899999998743


No 200
>PF07873 YabP:  YabP family;  InterPro: IPR022476 Members of this protein family are the YabP and YqfC proteins of the bacterial sporulation program, as found in Bacillus subtilis, Clostridium tetani, and other spore-forming members of the Firmicutes. ; PDB: 2KYI_B 3IPF_B 2KS0_A.
Probab=23.63  E-value=50  Score=23.96  Aligned_cols=21  Identities=24%  Similarity=0.621  Sum_probs=18.2

Q ss_pred             CCCCceEEEEECCEEEEEEEE
Q psy5221         346 FNRSDLEIETERDTLRITGRK  366 (436)
Q Consensus       346 ~~~edi~v~~~~~~l~i~g~~  366 (436)
                      ++++.|.+....+.|.|+|+.
T Consensus        23 f~~~~I~l~t~~g~l~I~G~~   43 (66)
T PF07873_consen   23 FDDEEIRLNTKKGKLTIKGEG   43 (66)
T ss_dssp             EETTEEEEEETTEEEEEEEEE
T ss_pred             ECCCEEEEEeCCEEEEEECce
Confidence            567889999999999999984


No 201
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=23.58  E-value=54  Score=30.84  Aligned_cols=23  Identities=17%  Similarity=0.293  Sum_probs=19.2

Q ss_pred             CHHHHHHHHHHHHHhhhcChHHH
Q psy5221         114 TKLEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus       114 ~~~El~aVLAHElgHi~~~h~~~  136 (436)
                      +..--.+.++||+||+..+....
T Consensus       199 da~~yyaTl~HElghwtgh~~rl  221 (316)
T COG4227         199 DAINYYATLLHELGHWTGHEARL  221 (316)
T ss_pred             chHhHHHHHHHHhccccCchhhh
Confidence            67788999999999998876543


No 202
>PF10988 DUF2807:  Protein of unknown function (DUF2807);  InterPro: IPR021255  This bacterial family of proteins has no known function. ; PDB: 3JX8_A 3LJY_C 3LYC_A 3PET_A.
Probab=23.43  E-value=3.3e+02  Score=23.53  Aligned_cols=61  Identities=13%  Similarity=0.245  Sum_probs=0.0

Q ss_pred             EEEEECC-EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCc-cceEeEEeeCcEEEEEEecCCCCCCCCeEeeecCC
Q psy5221         352 EIETERD-TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADH-VHVVAAFFDQGILNIDLIREVPEALKPRKIYINTT  429 (436)
Q Consensus       352 ~v~~~~~-~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~-v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~  429 (436)
                      +|++.++ .+.+.                 ....++-++.-|++ ++.++...+||.|.|...+. -...++-+|.|...
T Consensus         4 ~i~v~~~~~V~l~-----------------~g~~~~v~v~~~~~l~~~i~~~v~~g~L~I~~~~~-~~~~~~~~v~V~~~   65 (181)
T PF10988_consen    4 KIEVSGNIEVELV-----------------QGDSPSVEVEADENLLDRIKVEVKDGTLKISYKKN-ISGSKPVKVRVTAP   65 (181)
T ss_dssp             EEEEESSCEEEEE-----------------E-SS-EEEEEEEHHHHCCEEEEEETTEEEEEE-SC-CTCTSTEEEEEEES
T ss_pred             EEEEcCCeEEEEE-----------------ECCCcEEEEEEChhhcceEEEEEECCEEEEEECCC-cCCCccEEEEEEcC


Q ss_pred             C
Q psy5221         430 K  430 (436)
Q Consensus       430 ~  430 (436)
                      .
T Consensus        66 ~   66 (181)
T PF10988_consen   66 S   66 (181)
T ss_dssp             -
T ss_pred             c


No 203
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=22.95  E-value=51  Score=37.04  Aligned_cols=21  Identities=38%  Similarity=0.645  Sum_probs=17.9

Q ss_pred             HHHHHHHHHHHHhhhcChHHH
Q psy5221         116 LEIKAIIAHEISHISNGDMVT  136 (436)
Q Consensus       116 ~El~aVLAHElgHi~~~h~~~  136 (436)
                      +..+.|+|||++|-..|+...
T Consensus       305 ~~~~~viaHElaHqWfGnlVT  325 (859)
T COG0308         305 ENVEEVIAHELAHQWFGNLVT  325 (859)
T ss_pred             HHHHHHHHHHHhhhcccceee
Confidence            567779999999999999654


No 204
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=22.92  E-value=67  Score=30.06  Aligned_cols=19  Identities=37%  Similarity=0.427  Sum_probs=17.1

Q ss_pred             cCHHHHHHHHHHHHHhhhc
Q psy5221         113 MTKLEIKAIIAHEISHISN  131 (436)
Q Consensus       113 l~~~El~aVLAHElgHi~~  131 (436)
                      .|..++.|++|||+-|..+
T Consensus       135 ~~~~~v~aliaHE~HH~~R  153 (280)
T COG5504         135 STITSVPALIAHEYHHNCR  153 (280)
T ss_pred             CCccchHHHHHHHHHhhhe
Confidence            6888999999999999766


No 205
>PF10460 Peptidase_M30:  Peptidase M30;  InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases [].  This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue. 
Probab=22.85  E-value=99  Score=30.84  Aligned_cols=33  Identities=24%  Similarity=0.195  Sum_probs=24.0

Q ss_pred             ccEEEEcHHHHhhc---CHHHHHHHHHHHHHhhhcC
Q psy5221         100 SSLIAISANLLESM---TKLEIKAIIAHEISHISNG  132 (436)
Q Consensus       100 ~~~v~it~gLl~~l---~~~El~aVLAHElgHi~~~  132 (436)
                      ..++.++...+..-   ..+...+.||||+=|..+-
T Consensus       118 ~e~~YiD~~~~~~~~~~~~~~~~sTlAHEfQHmInf  153 (366)
T PF10460_consen  118 SEYFYIDSETLYLGGNSGPDTVYSTLAHEFQHMINF  153 (366)
T ss_pred             eeEEEEecHHhhccCCccHHHHHHHHHHHHHHHHHH
Confidence            45566666554432   3678999999999999874


No 206
>PRK11198 LysM domain/BON superfamily protein; Provisional
Probab=22.48  E-value=1.1e+02  Score=26.14  Aligned_cols=28  Identities=14%  Similarity=0.420  Sum_probs=23.3

Q ss_pred             ecCCCCCCceEEEEECCEEEEEEEEecc
Q psy5221         342 AIAGFNRSDLEIETERDTLRITGRKQAD  369 (436)
Q Consensus       342 ~lpG~~~edi~v~~~~~~l~i~g~~~~~  369 (436)
                      .=.|....+++|.++++.++++|.-...
T Consensus        36 ~~~~~~~~~i~V~v~~G~v~l~G~v~s~   63 (147)
T PRK11198         36 SKQGLGDADVNVQVEDGKATVSGDAASQ   63 (147)
T ss_pred             HhcCCCcCCceEEEeCCEEEEEEEeCCH
Confidence            3357888899999999999999987643


No 207
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=22.06  E-value=55  Score=29.10  Aligned_cols=13  Identities=38%  Similarity=0.521  Sum_probs=11.0

Q ss_pred             HHHHHHHHHHhhh
Q psy5221         118 IKAIIAHEISHIS  130 (436)
Q Consensus       118 l~aVLAHElgHi~  130 (436)
                      +-++..||+||-.
T Consensus         9 ~~~i~~HE~aHa~   21 (181)
T cd06158           9 LLAITLHEFAHAY   21 (181)
T ss_pred             HHHHHHHHHHHHH
Confidence            4588999999976


No 208
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related  metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different  substrate specificities and cooperatively digest the chorion.
Probab=21.95  E-value=42  Score=29.97  Aligned_cols=13  Identities=46%  Similarity=0.649  Sum_probs=11.3

Q ss_pred             HHHHHHHHHhhhc
Q psy5221         119 KAIIAHEISHISN  131 (436)
Q Consensus       119 ~aVLAHElgHi~~  131 (436)
                      .+++.||+||...
T Consensus        78 ~G~i~HEl~HaLG   90 (182)
T cd04283          78 KGIIQHELLHALG   90 (182)
T ss_pred             cchHHHHHHHHhC
Confidence            5899999999765


No 209
>PF11694 DUF3290:  Protein of unknown function (DUF3290);  InterPro: IPR021707  This family of proteins with unknown function appears to be restricted to Firmicutes. 
Probab=21.29  E-value=2.6e+02  Score=24.03  Aligned_cols=27  Identities=15%  Similarity=0.311  Sum_probs=22.7

Q ss_pred             hhHHHHHHHHHHHhcCCCCCeEEEECC
Q psy5221          61 EDWLLKTVNKISKNANISTPEVAIYNG   87 (436)
Q Consensus        61 ~~~L~~~l~~l~~~~g~~~p~v~v~~~   87 (436)
                      ....-..++.+|+..+.+..+|++=.+
T Consensus        78 ~~~~~~fi~~vA~~~~V~~~~v~VNst  104 (149)
T PF11694_consen   78 SSQMVHFIESVAKDLGVSKEEVYVNST  104 (149)
T ss_pred             HHHHHHHHHHHHHHhCCChheEEEecc
Confidence            445788999999999999999988655


No 210
>PF01400 Astacin:  Astacin (Peptidase family M12A) This Prosite motif covers only the active site.;  InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=21.25  E-value=56  Score=29.30  Aligned_cols=62  Identities=19%  Similarity=0.171  Sum_probs=32.3

Q ss_pred             HHHHHHHHHHHhcCCC---CC-----eEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          63 WLLKTVNKISKNANIS---TP-----EVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        63 ~L~~~l~~l~~~~g~~---~p-----~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      .+++.++.+.+...+.   ..     -+.+.++....-.+|...+...|.++.+    +   .-.+++.||+||...
T Consensus        23 ~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~~~~gC~S~vG~~~g~q~i~l~~~----c---~~~~~i~HEl~HaLG   92 (191)
T PF01400_consen   23 RIRKAMDEWEKNTCIRFVERTENEDDYISFSNGSGCWSYVGRQGGEQTINLGDG----C---FSVGTILHELGHALG   92 (191)
T ss_dssp             HHHHHHHHHHHHSSEEEEE-SSSSSSEEEEESSSSEEEESS--SSEEEEEE-TT----C----SHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHhCCCEEEEECCCCCceEEEeecCccccchhhhcCcceeEEecce----e---CCccchHHHHHHHHh
Confidence            4777888888776664   11     1222222222233454444455555531    2   135799999999765


No 211
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=21.15  E-value=2.3e+02  Score=20.17  Aligned_cols=23  Identities=17%  Similarity=0.273  Sum_probs=10.3

Q ss_pred             CchHHHHHHHHHHHHHHHHHHHH
Q psy5221          16 LNIKILTISSIIFGFTGSIISLL   38 (436)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~   38 (436)
                      ..+..++++++++|+++.++..+
T Consensus        18 ~pl~l~il~~f~~G~llg~l~~~   40 (68)
T PF06305_consen   18 LPLGLLILIAFLLGALLGWLLSL   40 (68)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHH
Confidence            33444445555555444443333


No 212
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=21.02  E-value=44  Score=30.30  Aligned_cols=13  Identities=31%  Similarity=0.639  Sum_probs=11.3

Q ss_pred             HHHHHHHHHhhhc
Q psy5221         119 KAIIAHEISHISN  131 (436)
Q Consensus       119 ~aVLAHElgHi~~  131 (436)
                      .+++.|||+|...
T Consensus        88 ~Gti~HEl~HaLG  100 (200)
T cd04281          88 FGIVVHELGHVIG  100 (200)
T ss_pred             CchHHHHHHHHhc
Confidence            5899999999766


No 213
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=21.00  E-value=4.2e+02  Score=27.37  Aligned_cols=64  Identities=13%  Similarity=0.027  Sum_probs=43.6

Q ss_pred             HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221          64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN  131 (436)
Q Consensus        64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~  131 (436)
                      -.+.-..+.+.+|++..+ -.+|..+--|++|.+++.  |-+|+ =.+.-+.-.-..=+-||.||-..
T Consensus       209 q~~~~~~vl~~lGfdf~~-GRlD~S~HPF~~g~~~~D--VRITT-Ry~~~df~~aL~g~iHE~GHAlY  272 (497)
T COG2317         209 QEALGLRVLELLGFDFER-GRLDVSVHPFTTGLPIND--VRITT-RYNEQDFRSALFGTIHETGHALY  272 (497)
T ss_pred             HHHHHHHHHHHhCCCccC-ccccCCCCCccCCCCCCc--eeEEe-ecCCccHHHHHHHHHHhhhhHHH
Confidence            556778888999998654 334447778999987544  66777 34444544444555699999765


No 214
>cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role.
Probab=20.75  E-value=1.6e+02  Score=22.69  Aligned_cols=30  Identities=10%  Similarity=0.192  Sum_probs=26.3

Q ss_pred             ceEEEEEecC-CCCCCceEEEEECC-EEEEEE
Q psy5221         335 NKYRITMAIA-GFNRSDLEIETERD-TLRITG  364 (436)
Q Consensus       335 ~~~~v~~~lp-G~~~edi~v~~~~~-~l~i~g  364 (436)
                      ..|.=.+.|| +++.+.|+=+.+++ .|+|.+
T Consensus        54 g~F~R~f~LP~~Vd~d~i~A~~~~~~~l~i~~   85 (87)
T cd06482          54 MNICKEFSLPPGVDEKDVTYSYGLGSVVKIET   85 (87)
T ss_pred             EEEEEEEECCCCcChHHcEEEEcCCCEEEEee
Confidence            4677889999 79999999999887 999985


No 215
>PF03672 UPF0154:  Uncharacterised protein family (UPF0154);  InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=20.51  E-value=2e+02  Score=20.93  Aligned_cols=37  Identities=14%  Similarity=0.216  Sum_probs=19.4

Q ss_pred             HHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221          34 IISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST   79 (436)
Q Consensus        34 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~   79 (436)
                      +..+|++.+.....+...+   |.++      +.++.+....|-.+
T Consensus        12 ~~Gff~ar~~~~k~l~~NP---pine------~mir~M~~QMG~kp   48 (64)
T PF03672_consen   12 VIGFFIARKYMEKQLKENP---PINE------KMIRAMMMQMGRKP   48 (64)
T ss_pred             HHHHHHHHHHHHHHHHHCC---CCCH------HHHHHHHHHhCCCc
Confidence            3444555555555444332   3332      46777777777654


No 216
>TIGR02856 spore_yqfC sporulation protein YqfC. This small protein, designated YqfC in Bacillus subtilis, is both restricted to and universal in sporulating species of the Firmcutes, such as Bacillus subtilis and Clostridium perfringens. It is part of the sigma(E)-controlled regulon, and its mutation leads to a sporulation defect.
Probab=20.51  E-value=71  Score=24.61  Aligned_cols=43  Identities=12%  Similarity=0.334  Sum_probs=28.9

Q ss_pred             CCCCceeEEEecCceEEEE--EecCCCCCCceEEEEECCEEEEEEEE
Q psy5221         322 QSYPPYNIELVDENKYRIT--MAIAGFNRSDLEIETERDTLRITGRK  366 (436)
Q Consensus       322 ~~~p~~~i~~e~~~~~~v~--~~lpG~~~edi~v~~~~~~l~i~g~~  366 (436)
                      -..|.+.+.  +.+...|+  -.+=-|+.+.|.++...+.|+|+|+.
T Consensus        17 ~~~p~itl~--gr~~~~Ien~k~I~~y~~~~I~l~t~~G~l~I~G~~   61 (85)
T TIGR02856        17 LDLPRITLI--GNEHIYIENHRGLVVFSPEEVKLNSTNGKITIEGKN   61 (85)
T ss_pred             cCCCEEEEE--CCcEEEEECccceEEECCCEEEEEcCceEEEEEccc
Confidence            345666664  34444443  12334578999999999999999984


No 217
>PF05309 TraE:  TraE protein;  InterPro: IPR007973 This family consists of several bacterial sex pilus assembly and synthesis proteins (TraE). Conjugal transfer of plasmids from donor to recipient cells is a complex process in which a cell-to-cell contact plays a key role. Many genes encoded by self-transmissible plasmids are required for various processes of conjugation, including pilus formation, stabilisation of mating pairs, conjugative DNA metabolism, surface exclusion and regulation of transfer gene expression []. The exact function of the TraE protein is unknown.; GO: 0000746 conjugation
Probab=20.37  E-value=1.8e+02  Score=25.85  Aligned_cols=48  Identities=15%  Similarity=0.200  Sum_probs=25.9

Q ss_pred             CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc-eEeEEeeCcEEEEEE
Q psy5221         349 SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH-VVAAFFDQGILNIDL  411 (436)
Q Consensus       349 edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~-~~~A~~~~GvL~i~~  411 (436)
                      ++++++-+++.+.|+|..+.-.+++..               -++... .++-.++||.|.++=
T Consensus       131 ~~i~~d~~~~~V~V~G~l~t~~g~~~~---------------~~~~~~y~~~~~~~~g~~~L~~  179 (187)
T PF05309_consen  131 KSIEVDPETLTVFVTGTLKTWIGDKKV---------------SSEDKTYRLQFKYRNGRLWLKS  179 (187)
T ss_pred             eEEEEecCCCEEEEEEEEEEEECCccc---------------cceeEEEEEEEEEeCCEEEEee
Confidence            455555667777778876432111110               011111 377788888887753


No 218
>TIGR02921 PEP_integral PEP-CTERM family integral membrane protein. Members of this protein family, found in three different species so far, have a PEP-CTERM sequence at the carboxyl-terminus (see model TIGR02595), but are unusual among PEP-CTERM proteins in having multiple predicted transmembrane segments. The function is unknown. It is proposed that a member of the EpsH family, to be designated exosortase (see TIGR02602), recognizes and cleaves PEP-CTERM proteins in a manner analogous to the cleavage of LPXTG proteins by sortase (see Haft, et al., 2006).
Probab=20.25  E-value=1.8e+02  Score=30.82  Aligned_cols=59  Identities=15%  Similarity=0.175  Sum_probs=35.1

Q ss_pred             CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEe-eCcEEE-----EEEecC
Q psy5221         349 SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFF-DQGILN-----IDLIRE  414 (436)
Q Consensus       349 edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~-~~GvL~-----i~~pk~  414 (436)
                      -+|+|+++++.-+.+-+-.-.+...+.+     -|.|+  |+||++.....--+ .||+|.     +-+||.
T Consensus       419 h~VtVeIeg~iA~TEIEqTF~NPN~r~L-----EGEls--FPLPEgAtVTGyALdvdGkL~Daw~~VVVEKE  483 (952)
T TIGR02921       419 MAITVEEHGDNADIEIVETLENQTPENH-----EVFFH--FSLPEEAAITGLWLGDDAKDDDKFAFALAPRG  483 (952)
T ss_pred             eeEEEEEECCeEEEEEEEEEECCCCCce-----eEEEE--ecCCCCCeeeeeeecCCccccccccceeccHH
Confidence            4677777777555544333222222222     45555  99999988544333 477887     888883


Done!