Query psy5221
Match_columns 436
No_of_seqs 384 out of 3006
Neff 8.6
Searched_HMMs 46136
Date Fri Aug 16 20:28:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5221.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5221hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK05457 heat shock protein Ht 100.0 1.2E-43 2.7E-48 337.9 29.6 251 17-279 32-283 (284)
2 PRK01265 heat shock protein Ht 100.0 4.7E-41 1E-45 322.5 29.8 252 20-281 42-323 (324)
3 PRK04897 heat shock protein Ht 100.0 2.5E-41 5.5E-46 325.1 27.9 253 19-278 40-297 (298)
4 PRK02870 heat shock protein Ht 100.0 1.8E-40 3.9E-45 320.4 29.3 248 20-279 71-334 (336)
5 PRK03072 heat shock protein Ht 100.0 2.2E-40 4.9E-45 316.8 28.8 248 21-282 32-285 (288)
6 PRK02391 heat shock protein Ht 100.0 2.1E-40 4.5E-45 317.2 27.9 250 20-282 37-292 (296)
7 PRK03982 heat shock protein Ht 100.0 3.8E-39 8.3E-44 309.6 29.1 251 21-284 30-285 (288)
8 PRK01345 heat shock protein Ht 100.0 5.2E-39 1.1E-43 310.5 28.0 248 22-284 30-283 (317)
9 PRK03001 M48 family peptidase; 100.0 5.8E-38 1.3E-42 300.6 28.9 246 22-283 30-281 (283)
10 PRK11597 heat shock chaperone 100.0 8.9E-33 1.9E-37 233.9 15.8 139 291-431 1-139 (142)
11 PRK10743 heat shock protein Ib 100.0 5.2E-32 1.1E-36 228.9 15.6 136 292-427 2-137 (137)
12 COG0501 HtpX Zn-dependent prot 100.0 5.5E-30 1.2E-34 249.2 26.5 240 22-283 56-300 (302)
13 KOG2719|consensus 100.0 9.2E-30 2E-34 246.5 25.7 224 32-281 192-425 (428)
14 COG0071 IbpA Molecular chapero 99.9 4.1E-27 8.8E-32 203.5 15.3 124 303-427 19-146 (146)
15 PF01435 Peptidase_M48: Peptid 99.9 5.9E-27 1.3E-31 218.2 11.7 185 60-275 29-226 (226)
16 cd06470 ACD_IbpA-B_like Alpha- 99.9 1.3E-23 2.7E-28 166.2 12.8 89 325-413 1-90 (90)
17 cd06472 ACD_ScHsp26_like Alpha 99.9 1.4E-22 3.1E-27 161.2 12.2 87 326-413 1-92 (92)
18 PF00011 HSP20: Hsp20/alpha cr 99.9 1.4E-21 3E-26 158.9 13.6 99 328-427 1-102 (102)
19 cd06471 ACD_LpsHSP_like Group 99.9 1E-21 2.2E-26 156.7 12.2 87 325-413 1-93 (93)
20 cd06497 ACD_alphaA-crystallin_ 99.9 2.2E-21 4.8E-26 151.4 11.3 80 328-413 4-86 (86)
21 cd06478 ACD_HspB4-5-6 Alpha-cr 99.8 1.5E-20 3.3E-25 145.9 11.3 79 329-413 2-83 (83)
22 cd06479 ACD_HspB7_like Alpha c 99.8 1.2E-20 2.6E-25 144.8 10.0 77 328-413 2-81 (81)
23 cd06498 ACD_alphaB-crystallin_ 99.8 2.2E-20 4.7E-25 145.1 11.4 77 332-413 4-83 (84)
24 COG4783 Putative Zn-dependent 99.8 1.8E-19 3.9E-24 176.2 18.1 179 64-284 76-264 (484)
25 cd06476 ACD_HspB2_like Alpha c 99.8 7.4E-20 1.6E-24 141.5 10.9 77 332-413 4-83 (83)
26 cd06475 ACD_HspB1_like Alpha c 99.8 8.3E-20 1.8E-24 142.5 11.2 80 327-412 3-85 (86)
27 cd06477 ACD_HspB3_Like Alpha c 99.8 2.2E-19 4.8E-24 138.3 10.9 76 332-412 4-82 (83)
28 cd06482 ACD_HspB10 Alpha cryst 99.8 2.2E-19 4.7E-24 139.5 10.3 79 333-413 6-87 (87)
29 cd06481 ACD_HspB9_like Alpha c 99.8 3.1E-19 6.6E-24 139.7 10.6 79 332-413 4-87 (87)
30 cd06526 metazoan_ACD Alpha-cry 99.8 2E-18 4.4E-23 134.4 9.7 75 334-413 6-83 (83)
31 cd06464 ACD_sHsps-like Alpha-c 99.7 1.9E-17 4E-22 130.5 11.7 85 328-413 1-88 (88)
32 KOG2661|consensus 99.7 3.8E-17 8.2E-22 151.5 13.1 187 47-283 201-399 (424)
33 COG4784 Putative Zn-dependent 99.7 1.7E-16 3.7E-21 147.8 14.7 178 63-283 65-254 (479)
34 cd06480 ACD_HspB8_like Alpha-c 99.7 1.9E-16 4.1E-21 123.8 10.2 77 332-413 12-91 (91)
35 KOG0710|consensus 99.6 1.7E-15 3.6E-20 136.7 7.7 106 321-427 81-195 (196)
36 KOG3591|consensus 99.6 1.5E-14 3.2E-19 127.0 12.1 101 326-432 64-167 (173)
37 PF05569 Peptidase_M56: BlaR1 99.5 2.8E-12 6.1E-17 124.5 19.1 154 64-276 143-299 (299)
38 cd00298 ACD_sHsps_p23-like Thi 99.3 2.5E-11 5.4E-16 92.8 10.3 76 332-413 3-80 (80)
39 cd06469 p23_DYX1C1_like p23_li 98.9 7.4E-09 1.6E-13 79.4 9.2 65 332-414 3-69 (78)
40 COG4219 MecR1 Antirepressor re 98.9 3.6E-07 7.8E-12 86.1 20.3 125 64-234 138-265 (337)
41 PF05455 GvpH: GvpH; InterPro 98.8 6.3E-08 1.4E-12 84.0 10.1 74 323-414 90-168 (177)
42 cd06463 p23_like Proteins cont 98.5 9.8E-07 2.1E-11 68.1 9.4 70 332-414 3-74 (84)
43 cd06466 p23_CS_SGT1_like p23_l 98.1 1.4E-05 3E-10 62.0 8.1 73 328-414 1-75 (84)
44 PF04969 CS: CS domain; Inter 97.7 0.001 2.2E-08 50.5 11.4 75 325-413 1-79 (79)
45 PF04228 Zn_peptidase: Putativ 97.6 0.001 2.2E-08 63.8 12.4 67 64-133 99-185 (292)
46 PF06114 DUF955: Domain of unk 97.4 0.0016 3.5E-08 53.5 9.7 33 100-136 28-60 (122)
47 cd06465 p23_hB-ind1_like p23_l 97.3 0.0037 8E-08 50.9 10.5 76 325-415 1-78 (108)
48 PF10463 Peptidase_U49: Peptid 97.2 0.00056 1.2E-08 61.8 5.0 20 116-135 99-118 (206)
49 PF08190 PIH1: pre-RNA process 97.1 0.0021 4.6E-08 63.2 8.5 64 333-412 259-327 (328)
50 cd06489 p23_CS_hSgt1_like p23_ 96.9 0.0078 1.7E-07 46.5 8.7 73 328-414 1-75 (84)
51 PRK09672 phage exclusion prote 96.7 0.0026 5.7E-08 59.9 5.6 21 115-135 162-182 (305)
52 cd06467 p23_NUDC_like p23_like 96.6 0.018 4E-07 44.4 9.0 69 329-414 3-75 (85)
53 cd06493 p23_NUDCD1_like p23_NU 96.4 0.038 8.3E-07 42.8 9.2 70 328-414 2-75 (85)
54 PF01863 DUF45: Protein of unk 96.3 0.008 1.7E-07 54.8 6.2 68 64-135 114-181 (205)
55 cd06488 p23_melusin_like p23_l 96.3 0.054 1.2E-06 42.2 9.8 74 327-414 3-78 (87)
56 PF04298 Zn_peptidase_2: Putat 96.2 0.68 1.5E-05 42.3 17.7 65 65-134 40-105 (222)
57 cd06494 p23_NUDCD2_like p23-li 95.9 0.12 2.6E-06 40.7 9.9 74 323-414 4-81 (93)
58 PRK04351 hypothetical protein; 95.9 0.043 9.4E-07 47.2 7.9 62 64-129 6-72 (149)
59 cd06468 p23_CacyBP p23_like do 95.7 0.14 3.1E-06 40.1 9.9 75 326-414 3-83 (92)
60 COG1451 Predicted metal-depend 95.6 0.044 9.5E-07 50.7 7.5 69 64-136 125-193 (223)
61 COG2321 Predicted metalloprote 95.1 0.17 3.7E-06 47.2 9.5 67 67-133 99-182 (295)
62 PF13699 DUF4157: Domain of un 94.9 0.038 8.2E-07 42.2 4.0 60 67-132 6-75 (79)
63 PF10263 SprT-like: SprT-like 93.9 0.36 7.8E-06 41.9 8.5 70 64-133 4-75 (157)
64 smart00731 SprT SprT homologue 93.6 0.18 3.8E-06 43.4 5.9 66 64-131 3-72 (146)
65 cd00237 p23 p23 binds heat sho 93.6 1.4 3.1E-05 35.6 10.8 74 325-414 2-77 (106)
66 PF10026 DUF2268: Predicted Zn 93.3 0.23 5E-06 45.0 6.4 69 64-133 5-80 (195)
67 PF13203 DUF2201_N: Putative m 92.8 0.24 5.2E-06 47.9 6.2 38 101-138 43-80 (292)
68 PF04450 BSP: Peptidase of pla 92.6 0.28 6E-06 44.7 5.8 70 64-134 30-112 (205)
69 COG2738 Predicted Zn-dependent 91.4 2.6 5.7E-05 37.5 10.1 70 65-138 43-112 (226)
70 cd06495 p23_NUDCD3_like p23-li 90.4 4.5 9.7E-05 32.4 10.1 77 324-414 4-85 (102)
71 COG2856 Predicted Zn peptidase 90.3 0.31 6.6E-06 44.6 3.7 31 99-133 57-87 (213)
72 KOG1309|consensus 90.3 1.1 2.4E-05 39.3 6.8 42 325-367 4-45 (196)
73 PLN03088 SGT1, suppressor of 90.3 1.4 3E-05 43.9 8.7 77 325-415 157-235 (356)
74 COG4900 Predicted metallopepti 90.1 0.77 1.7E-05 36.7 5.3 67 64-131 9-93 (133)
75 cd06492 p23_mNUDC_like p23-lik 89.3 5.1 0.00011 31.1 9.4 66 333-414 6-77 (87)
76 cd06490 p23_NCB5OR p23_like do 89.0 6 0.00013 30.6 9.6 73 327-414 1-78 (87)
77 PF02031 Peptidase_M7: Strepto 88.5 1.4 3E-05 36.5 5.8 39 88-131 52-90 (132)
78 PRK04860 hypothetical protein; 88.4 1.8 3.9E-05 37.8 6.9 66 64-131 9-76 (160)
79 COG3091 SprT Zn-dependent meta 87.1 1.6 3.5E-05 37.1 5.6 66 63-130 6-73 (156)
80 PF13349 DUF4097: Domain of un 84.2 11 0.00023 32.6 9.9 81 324-410 65-147 (166)
81 PF12388 Peptidase_M57: Dual-a 82.9 1.1 2.3E-05 40.8 2.9 34 100-135 117-151 (211)
82 COG3590 PepO Predicted metallo 80.5 0.45 9.8E-06 48.9 -0.5 55 74-131 434-500 (654)
83 COG3864 Uncharacterized protei 79.7 2.3 5E-05 40.6 3.9 73 64-136 7-87 (396)
84 COG3824 Predicted Zn-dependent 77.9 2.1 4.5E-05 34.9 2.7 36 95-131 81-122 (136)
85 COG4324 Predicted aminopeptida 76.0 2.7 5.9E-05 39.2 3.2 33 104-136 180-218 (376)
86 cd04279 ZnMc_MMP_like_1 Zinc-d 75.7 1.7 3.8E-05 37.5 1.9 17 115-131 101-117 (156)
87 PF01421 Reprolysin: Reprolysi 75.3 2.1 4.5E-05 38.7 2.3 40 90-131 105-144 (199)
88 cd04269 ZnMc_adamalysin_II_lik 74.6 2.6 5.6E-05 37.9 2.8 17 115-131 128-144 (194)
89 PF14891 Peptidase_M91: Effect 74.6 8 0.00017 34.2 5.8 67 65-131 31-116 (174)
90 PF08325 WLM: WLM domain; Int 74.5 7.8 0.00017 34.7 5.7 23 113-135 77-99 (186)
91 PF00413 Peptidase_M10: Matrix 74.3 2 4.3E-05 36.8 1.9 18 114-131 101-118 (154)
92 cd04268 ZnMc_MMP_like Zinc-dep 73.2 2.1 4.5E-05 37.2 1.8 18 114-131 90-107 (165)
93 cd04270 ZnMc_TACE_like Zinc-de 72.9 2.3 5E-05 39.9 2.1 18 114-131 163-180 (244)
94 COG1164 Oligoendopeptidase F [ 70.9 5.9 0.00013 42.3 4.8 50 80-134 345-396 (598)
95 PF13485 Peptidase_MA_2: Pepti 70.2 3.1 6.6E-05 33.9 2.0 23 112-134 19-41 (128)
96 PF09768 Peptidase_M76: Peptid 69.2 15 0.00033 32.4 6.2 63 64-131 18-84 (173)
97 PF13688 Reprolysin_5: Metallo 68.6 2.5 5.4E-05 38.0 1.2 18 114-131 138-155 (196)
98 PF14247 DUF4344: Domain of un 67.6 19 0.00042 33.1 6.8 23 261-283 160-182 (220)
99 PRK06926 flagellar motor prote 67.2 1.1E+02 0.0023 29.3 11.9 30 109-138 120-149 (271)
100 PRK00446 cyaY frataxin-like pr 67.0 7.1 0.00015 31.5 3.4 31 398-435 29-59 (105)
101 PF13582 Reprolysin_3: Metallo 66.7 3 6.5E-05 34.3 1.2 13 119-131 108-120 (124)
102 TIGR00181 pepF oligoendopeptid 65.9 4.4 9.6E-05 43.3 2.7 42 88-134 353-394 (591)
103 PRK00523 hypothetical protein; 65.2 25 0.00055 26.0 5.6 45 26-79 12-56 (72)
104 PRK13267 archaemetzincin-like 64.9 15 0.00032 32.8 5.3 67 65-131 56-138 (179)
105 PF13574 Reprolysin_2: Metallo 64.1 3.6 7.9E-05 36.3 1.4 14 118-131 111-124 (173)
106 TIGR03421 FeS_CyaY iron donor 64.1 7.9 0.00017 31.0 3.1 31 398-435 27-57 (102)
107 PF01447 Peptidase_M4: Thermol 62.1 18 0.00039 31.2 5.2 62 64-130 79-147 (150)
108 cd04278 ZnMc_MMP Zinc-dependen 61.8 4 8.7E-05 35.3 1.2 17 115-131 104-120 (157)
109 PF05572 Peptidase_M43: Pregna 61.7 4.3 9.4E-05 35.2 1.4 21 114-134 65-86 (154)
110 cd00503 Frataxin Frataxin is a 61.4 9.4 0.0002 30.8 3.2 31 398-435 30-60 (105)
111 PF01433 Peptidase_M1: Peptida 59.7 34 0.00073 34.1 7.6 66 64-134 234-311 (390)
112 PF10023 DUF2265: Predicted am 59.6 4.9 0.00011 39.3 1.5 32 106-137 150-187 (337)
113 cd04267 ZnMc_ADAM_like Zinc-de 59.2 3.8 8.2E-05 36.7 0.6 17 115-131 130-146 (192)
114 cd06460 M32_Taq Peptidase fami 58.4 82 0.0018 31.8 9.9 66 64-133 109-174 (396)
115 PRK05518 rpl6p 50S ribosomal p 58.3 38 0.00082 30.2 6.7 45 348-412 13-57 (180)
116 TIGR03422 mito_frataxin fratax 56.4 9.4 0.0002 30.3 2.3 29 399-434 30-58 (97)
117 TIGR02414 pepN_proteo aminopep 56.0 9.5 0.00021 42.6 3.1 20 116-135 281-300 (863)
118 TIGR03653 arch_L6P archaeal ri 55.5 48 0.001 29.2 6.9 45 348-412 7-51 (170)
119 PF13583 Reprolysin_4: Metallo 55.3 5.7 0.00012 36.2 1.1 12 120-131 139-150 (206)
120 cd04271 ZnMc_ADAM_fungal Zinc- 55.1 4.4 9.5E-05 37.6 0.3 12 120-131 147-158 (228)
121 PF01491 Frataxin_Cyay: Fratax 55.1 14 0.00031 29.9 3.3 31 397-434 31-61 (109)
122 PF01431 Peptidase_M13: Peptid 54.8 9.3 0.0002 34.6 2.4 35 100-134 8-52 (206)
123 cd04272 ZnMc_salivary_gland_MP 54.7 7.2 0.00016 35.9 1.6 15 117-131 144-158 (220)
124 cd00203 ZnMc Zinc-dependent me 54.6 5.5 0.00012 34.5 0.8 17 115-131 93-109 (167)
125 TIGR02412 pepN_strep_liv amino 53.0 32 0.00069 38.5 6.6 68 65-136 229-305 (831)
126 PF06262 DUF1025: Possibl zinc 52.5 25 0.00055 27.9 4.2 36 99-134 49-89 (97)
127 PF04972 BON: BON domain; Int 52.1 30 0.00064 24.6 4.3 26 344-369 12-37 (64)
128 cd06477 ACD_HspB3_Like Alpha c 52.1 28 0.00061 26.7 4.3 30 336-365 51-82 (83)
129 PTZ00027 60S ribosomal protein 52.1 48 0.0011 29.8 6.4 48 348-413 13-60 (190)
130 PRK05498 rplF 50S ribosomal pr 52.0 41 0.00089 29.9 5.9 44 348-412 12-55 (178)
131 TIGR03654 L6_bact ribosomal pr 51.0 54 0.0012 29.0 6.5 45 348-413 11-55 (175)
132 PRK01844 hypothetical protein; 50.6 59 0.0013 24.2 5.4 47 24-79 9-55 (72)
133 PF12315 DUF3633: Protein of u 50.5 16 0.00035 33.1 3.1 20 112-131 87-106 (212)
134 KOG3314|consensus 50.1 70 0.0015 27.8 6.6 68 65-135 40-108 (194)
135 COG1965 CyaY Protein implicate 48.6 14 0.00031 29.5 2.2 31 398-435 30-60 (106)
136 TIGR02289 M3_not_pepF oligoend 48.4 13 0.00029 39.3 2.7 65 64-133 274-352 (549)
137 COG3763 Uncharacterized protei 47.9 54 0.0012 24.1 4.8 40 31-79 16-55 (71)
138 smart00235 ZnMc Zinc-dependent 47.6 9.1 0.0002 32.2 1.1 13 119-131 87-99 (140)
139 cd06471 ACD_LpsHSP_like Group 47.0 29 0.00064 26.8 3.8 31 334-364 61-91 (93)
140 TIGR02411 leuko_A4_hydro leuko 46.6 26 0.00056 37.6 4.5 31 102-134 265-295 (601)
141 PRK14015 pepN aminopeptidase N 46.2 34 0.00074 38.4 5.5 20 117-136 295-314 (875)
142 cd04276 ZnMc_MMP_like_2 Zinc-d 44.0 16 0.00034 33.2 2.0 31 101-131 92-129 (197)
143 PF14730 DUF4468: Domain of un 44.0 1.1E+02 0.0024 23.7 6.6 17 397-413 69-85 (91)
144 cd06459 M3B_Oligoendopeptidase 43.9 15 0.00032 37.3 2.1 41 88-133 196-237 (427)
145 KOG1047|consensus 43.6 14 0.0003 38.4 1.8 52 79-135 254-305 (613)
146 PF02163 Peptidase_M50: Peptid 43.0 16 0.00034 32.7 1.9 14 118-131 7-20 (192)
147 TIGR03793 TOMM_pelo TOMM prope 43.0 1.1E+02 0.0024 23.1 6.2 44 69-123 28-76 (77)
148 PTZ00179 60S ribosomal protein 42.4 79 0.0017 28.4 6.2 47 348-412 12-58 (189)
149 cd04327 ZnMc_MMP_like_3 Zinc-d 42.3 17 0.00036 32.9 2.0 16 116-131 90-105 (198)
150 cd04277 ZnMc_serralysin_like Z 41.6 15 0.00034 32.6 1.6 34 101-134 94-130 (186)
151 PRK01379 cyaY frataxin-like pr 41.2 28 0.00062 27.9 2.9 30 398-435 30-59 (103)
152 CHL00140 rpl6 ribosomal protei 40.6 72 0.0016 28.3 5.7 44 348-412 12-55 (178)
153 PF12992 DUF3876: Domain of un 40.6 89 0.0019 24.7 5.5 41 322-363 23-68 (95)
154 TIGR02290 M3_fam_3 oligoendope 40.2 19 0.00041 38.5 2.3 41 88-133 349-390 (587)
155 PHA00527 hypothetical protein 39.8 69 0.0015 25.5 4.7 49 65-131 28-86 (129)
156 KOG3624|consensus 39.4 43 0.00092 36.6 4.9 43 88-133 481-533 (687)
157 cd04273 ZnMc_ADAMTS_like Zinc- 39.2 5.9 0.00013 36.0 -1.5 14 118-131 140-153 (207)
158 PF06861 BALF1: BALF1 protein; 39.0 22 0.00048 31.0 2.1 23 113-135 127-149 (182)
159 cd06161 S2P-M50_SpoIVFB SpoIVF 39.0 19 0.00042 32.8 1.8 15 117-131 37-51 (208)
160 cd06163 S2P-M50_PDZ_RseP-like 38.6 18 0.00038 32.3 1.5 13 119-131 10-22 (182)
161 cd05709 S2P-M50 Site-2 proteas 38.4 20 0.00043 31.6 1.8 14 118-131 8-21 (180)
162 cd06472 ACD_ScHsp26_like Alpha 37.1 52 0.0011 25.4 3.8 32 333-364 58-90 (92)
163 cd06481 ACD_HspB9_like Alpha c 37.0 41 0.00089 25.9 3.2 32 334-365 53-86 (87)
164 TIGR02421 QEGLA conserved hypo 36.8 75 0.0016 31.7 5.7 60 61-127 136-197 (366)
165 cd06164 S2P-M50_SpoIVFB_CBS Sp 36.6 22 0.00048 32.9 1.8 15 117-131 52-66 (227)
166 PF01432 Peptidase_M3: Peptida 36.1 26 0.00056 36.1 2.5 18 113-131 238-255 (458)
167 cd06526 metazoan_ACD Alpha-cry 36.0 51 0.0011 24.9 3.5 31 335-365 50-82 (83)
168 cd06464 ACD_sHsps-like Alpha-c 35.5 61 0.0013 24.2 4.0 33 333-365 54-87 (88)
169 PF00011 HSP20: Hsp20/alpha cr 34.5 56 0.0012 25.6 3.7 38 333-370 53-91 (102)
170 PF13402 M60-like: Peptidase M 33.9 1.9E+02 0.0042 27.7 8.1 64 65-133 153-234 (307)
171 PF08014 DUF1704: Domain of un 33.6 1E+02 0.0022 30.7 6.0 63 63-133 113-181 (349)
172 KOG1046|consensus 33.4 79 0.0017 35.7 5.8 66 66-136 265-342 (882)
173 PF00347 Ribosomal_L6: Ribosom 33.3 97 0.0021 22.8 4.7 47 348-413 2-48 (77)
174 PF12725 DUF3810: Protein of u 33.0 24 0.00053 34.5 1.6 19 113-131 191-209 (318)
175 PF01954 DUF104: Protein of un 32.9 49 0.0011 23.7 2.7 17 397-413 4-20 (60)
176 cd06160 S2P-M50_like_2 Unchara 32.6 29 0.00062 31.0 1.8 16 116-131 39-54 (183)
177 cd06478 ACD_HspB4-5-6 Alpha-cr 32.3 59 0.0013 24.7 3.3 30 336-365 51-82 (83)
178 KOG3607|consensus 32.2 29 0.00063 37.8 2.1 42 88-131 295-336 (716)
179 PF08308 PEGA: PEGA domain; I 32.2 1.4E+02 0.0031 21.5 5.4 41 326-366 26-67 (71)
180 PF14913 DPCD: DPCD protein fa 31.7 2.1E+02 0.0046 25.6 7.0 73 325-413 86-168 (194)
181 KOG3413|consensus 31.2 51 0.0011 28.0 2.9 30 398-427 74-107 (156)
182 cd06159 S2P-M50_PDZ_Arch Uncha 30.9 29 0.00064 32.9 1.7 15 117-131 117-131 (263)
183 cd06498 ACD_alphaB-crystallin_ 30.6 72 0.0016 24.4 3.6 31 336-366 51-83 (84)
184 TIGR03296 M6dom_TIGR03296 M6 f 30.1 13 0.00027 35.8 -0.9 12 119-130 166-177 (286)
185 PF13398 Peptidase_M50B: Pepti 30.0 32 0.00069 31.2 1.7 15 116-130 20-34 (200)
186 PF09471 Peptidase_M64: IgA Pe 29.6 32 0.0007 32.6 1.8 17 114-130 212-228 (264)
187 cd06480 ACD_HspB8_like Alpha-c 29.6 1E+02 0.0022 24.1 4.2 30 335-364 58-89 (91)
188 COG0097 RplF Ribosomal protein 29.5 1.7E+02 0.0038 25.9 6.1 47 346-412 10-56 (178)
189 COG4004 Uncharacterized protei 28.7 2.7E+02 0.0058 21.8 6.2 48 304-365 12-59 (96)
190 TIGR03750 conj_TIGR03750 conju 28.6 3.2E+02 0.007 22.2 8.0 41 33-81 59-99 (111)
191 cd06476 ACD_HspB2_like Alpha c 28.3 1E+02 0.0022 23.5 4.0 30 336-365 51-82 (83)
192 PF14521 Aspzincin_M35: Lysine 28.3 80 0.0017 27.0 3.8 40 89-129 66-107 (148)
193 cd06162 S2P-M50_PDZ_SREBP Ster 28.1 35 0.00077 32.6 1.7 15 117-131 134-148 (277)
194 KOG0175|consensus 25.8 1.4E+02 0.003 28.0 5.0 58 79-136 179-241 (285)
195 cd02175 GH16_lichenase lichena 25.7 2.7E+02 0.0058 25.2 7.1 53 348-402 31-88 (212)
196 COG4823 AbiF Abortive infectio 25.5 45 0.00097 31.3 1.8 14 115-128 89-102 (299)
197 PHA02456 zinc metallopeptidase 25.2 38 0.00083 27.3 1.2 30 101-134 66-95 (141)
198 PRK11767 SpoVR family protein; 24.9 1.2E+02 0.0026 31.4 4.9 67 64-130 20-121 (498)
199 cd06455 M3A_TOP Peptidase M3 T 23.7 39 0.00084 35.0 1.3 18 115-133 261-278 (472)
200 PF07873 YabP: YabP family; I 23.6 50 0.0011 24.0 1.5 21 346-366 23-43 (66)
201 COG4227 Antirestriction protei 23.6 54 0.0012 30.8 1.9 23 114-136 199-221 (316)
202 PF10988 DUF2807: Protein of u 23.4 3.3E+02 0.0071 23.5 7.1 61 352-430 4-66 (181)
203 COG0308 PepN Aminopeptidase N 22.9 51 0.0011 37.0 2.0 21 116-136 305-325 (859)
204 COG5504 Predicted Zn-dependent 22.9 67 0.0015 30.1 2.4 19 113-131 135-153 (280)
205 PF10460 Peptidase_M30: Peptid 22.8 99 0.0021 30.8 3.8 33 100-132 118-153 (366)
206 PRK11198 LysM domain/BON super 22.5 1.1E+02 0.0024 26.1 3.6 28 342-369 36-63 (147)
207 cd06158 S2P-M50_like_1 Unchara 22.1 55 0.0012 29.1 1.7 13 118-130 9-21 (181)
208 cd04283 ZnMc_hatching_enzyme Z 21.9 42 0.0009 30.0 0.9 13 119-131 78-90 (182)
209 PF11694 DUF3290: Protein of u 21.3 2.6E+02 0.0057 24.0 5.6 27 61-87 78-104 (149)
210 PF01400 Astacin: Astacin (Pep 21.2 56 0.0012 29.3 1.6 62 63-131 23-92 (191)
211 PF06305 DUF1049: Protein of u 21.2 2.3E+02 0.005 20.2 4.7 23 16-38 18-40 (68)
212 cd04281 ZnMc_BMP1_TLD Zinc-dep 21.0 44 0.00096 30.3 0.9 13 119-131 88-100 (200)
213 COG2317 Zn-dependent carboxype 21.0 4.2E+02 0.0091 27.4 7.7 64 64-131 209-272 (497)
214 cd06482 ACD_HspB10 Alpha cryst 20.7 1.6E+02 0.0035 22.7 3.9 30 335-364 54-85 (87)
215 PF03672 UPF0154: Uncharacteri 20.5 2E+02 0.0043 20.9 3.9 37 34-79 12-48 (64)
216 TIGR02856 spore_yqfC sporulati 20.5 71 0.0015 24.6 1.8 43 322-366 17-61 (85)
217 PF05309 TraE: TraE protein; 20.4 1.8E+02 0.0039 25.8 4.7 48 349-411 131-179 (187)
218 TIGR02921 PEP_integral PEP-CTE 20.3 1.8E+02 0.004 30.8 5.1 59 349-414 419-483 (952)
No 1
>PRK05457 heat shock protein HtpX; Provisional
Probab=100.00 E-value=1.2e-43 Score=337.90 Aligned_cols=251 Identities=51% Similarity=0.781 Sum_probs=203.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeC
Q psy5221 17 NIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATG 95 (436)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G 95 (436)
..++++++++++++++.++++++++++..+.++++++++|.++++++|++.++++|+++|+|.|++|++|+ .+|||++|
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~a~~i~~~~~~~~~~L~~~v~~la~~~g~p~p~v~v~~~~~~NAfa~G 111 (284)
T PRK05457 32 NLGGLLVFAAVFGFGGSFISLLMSKWMAKRSTGAEVIEQPRNETERWLVETVARQARQAGIGMPEVAIYHSPEINAFATG 111 (284)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCeECCCCCCchHHHHHHHHHHHHHhCCCCCCCEEEEeCCCceEEEec
Confidence 36677888899999999999999999999999999998788777778999999999999999999999999 99999999
Q ss_pred CCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCC
Q psy5221 96 AFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSN 175 (436)
Q Consensus 96 ~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~ 175 (436)
.++++++|++|+||++.+|++|++||+|||+||++++|+.++.++.++...+..++..++..+...+.. .+.. ..+
T Consensus 112 ~~~~~~~V~vt~gLl~~L~~~El~aVlAHElgHi~~~d~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~ 187 (284)
T PRK05457 112 ASKNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIANGDMVTMTLIQGVVNTFVIFLSRIIAQIVDRFVS---GNEE-GNG 187 (284)
T ss_pred CCCCCeEEEeehHHhhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cCcc-ccc
Confidence 999899999999999999999999999999999999999988777666566655554444433322221 1111 111
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCccc
Q psy5221 176 YNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFK 255 (436)
Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~ 255 (436)
+..++...++..++.++..++.+++||.+||+||++|++++| |+++++||.|+...+. ...++..+++++.++
T Consensus 188 ---~~~~~~~~l~~~~~~~~~~ll~~~~SR~rEy~AD~~Aa~ltg-p~~L~~AL~KL~~~~~-~~~~~~~~~~~i~~~-- 260 (284)
T PRK05457 188 ---IGYFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE-PQLPGSMAAFGINGK-- 260 (284)
T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhC-HHHHHHHHHHHHhhCc-cCCChHHHHhhccCc--
Confidence 222222233334455667788899999999999999999988 7999999999998765 445666777777664
Q ss_pred chhhhhhhccCCChHHHHHHHHhc
Q psy5221 256 KKSFLKLFATHPSFEKRIAILSLL 279 (436)
Q Consensus 256 ~~~~~~~~~tHP~~~~RI~~L~~~ 279 (436)
..+..|++|||++.+||++|++.
T Consensus 261 -~~~~~lfsTHP~~~eRI~~L~~~ 283 (284)
T PRK05457 261 -SGLSELFMSHPPLEKRIAALRSG 283 (284)
T ss_pred -hhHHHHHcCCcCHHHHHHHHHhC
Confidence 24789999999999999999864
No 2
>PRK01265 heat shock protein HtpX; Provisional
Probab=100.00 E-value=4.7e-41 Score=322.47 Aligned_cols=252 Identities=25% Similarity=0.302 Sum_probs=189.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCC
Q psy5221 20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFK 98 (436)
Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~ 98 (436)
+.++..+++++++.++.++++++++.++++++.++ |.++++++|++.++++|+++|+|.|++|++|+ .+|||++|.++
T Consensus 42 ~~~~~~~~~~~~~~~~~~~~sp~li~~~~~a~~~~-p~~~~~~~L~~~v~~la~~~g~~~p~vyv~~~~~~NAfa~G~~~ 120 (324)
T PRK01265 42 GLILGILIFVFFLNIIQWLFGPYMINAAYRTVEVT-PTDPVYGWLYSIVAEVAKYNGIRVPKVYIADVPFPNAFAYGSPI 120 (324)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHcCCeeCC-CCCcccHHHHHHHHHHHHHcCCCCCeEEEecCCCCCeEEeccCC
Confidence 33455577788889999999999999999999887 66654566999999999999999999999999 99999999987
Q ss_pred CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCCCCCCCch
Q psy5221 99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YDDSNYNSNYN 177 (436)
Q Consensus 99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~~~~~~~~~~ 177 (436)
.++.|++++||++.+++||++||+|||+||++++|+.+++... .+..+..++...+. ...+++. +.+....
T Consensus 121 ~~~~Ivvt~gLl~~l~~~El~aVlAHElgHik~~d~~~~~~~~-~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~---- 192 (324)
T PRK01265 121 AGKRIAITLPLLKILNRDEIKAVAGHELGHLKHRDVELLMAIG-LIPTLIYYLGYSLF---WGGMFGGGGGGRGNN---- 192 (324)
T ss_pred CCCEEEEehHHHhhCCHHHHHHHHHHHHHHHHcccHHHHHHHH-HHHHHHHHHHHHHH---HHHHhcccccCCCcc----
Confidence 6677999999999999999999999999999999998876543 33333333322211 1122221 1111111
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhh-hcCCchHHHHHHHHhhccccc---------CCChhhhhh
Q psy5221 178 HKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAK-LLGTPDPMISALRHLSNISIV---------NDLPKSIAI 247 (436)
Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~-~~g~~~~l~~aL~kl~~~~~~---------~~~~~~~~~ 247 (436)
...+....++..++.+++.++.+++||.+||+||++|++ ++|+|+++++||.|+..+... ...++..++
T Consensus 193 -~~~l~l~~ll~~~~~~i~~~l~~aiSR~rEy~AD~~aa~~~tg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~~~ 271 (324)
T PRK01265 193 -GGLLFLIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPGGAENLQTALAKITLSMDPGALERFKKKSTTNQMASM 271 (324)
T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhcCChHHHHHHHHHHHhcccccccccccccccCChhhhh
Confidence 011122223334456667888999999999999999998 569999999999999986542 112344667
Q ss_pred hcccCcccc------------------hhhhhhhccCCChHHHHHHHHhccc
Q psy5221 248 LGINNNFKK------------------KSFLKLFATHPSFEKRIAILSLLKL 281 (436)
Q Consensus 248 ~~~~~~~~~------------------~~~~~~~~tHP~~~~RI~~L~~~~~ 281 (436)
+++.++.++ ..+.++|+|||++++||++|++++.
T Consensus 272 l~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lFsTHPp~e~Ri~rL~~~~~ 323 (324)
T PRK01265 272 LFFSNAIEEVPTWDARELVEYWKTTKVPWYADIFSDHPHPAKRIQLLEKLSK 323 (324)
T ss_pred heeeCcccccccchhhhhhhhhcccchhHHHHHhCCCcChHHHHHHHHHhhh
Confidence 777776422 2477999999999999999998754
No 3
>PRK04897 heat shock protein HtpX; Provisional
Probab=100.00 E-value=2.5e-41 Score=325.14 Aligned_cols=253 Identities=28% Similarity=0.369 Sum_probs=188.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCC
Q psy5221 19 KILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAF 97 (436)
Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~ 97 (436)
...+++++++++++.++.++.+++++.+..++++++ |. ++|+|++.++++|+++|+|.|+||++++ .+|||++|.+
T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-~~--~~p~L~~~v~~la~~~gip~p~v~v~~~~~~NAfa~G~~ 116 (298)
T PRK04897 40 LGGLIIALIIGVIYALIMIFQSTNVVMSMNHAREVT-EE--EAPELWHIVEDMAMVAQIPMPRVFIIDDPSPNAFATGSS 116 (298)
T ss_pred hhHHHHHHHHHHHHHHHHHHhhHHHHHHhCCCEECC-hh--hhHHHHHHHHHHHHHcCCCCCcEEEecCCCCceEEeccC
Confidence 355677788899999999999999999999999987 33 3456999999999999999999999999 9999999999
Q ss_pred CCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CC--CCCCCC
Q psy5221 98 KNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YD--DSNYNS 174 (436)
Q Consensus 98 ~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~--~~~~~~ 174 (436)
+.++.|++|+||++.++++|++||+|||+||++++|+.++++... +..++.++..++.. ..+++. +. ......
T Consensus 117 ~~~~~v~vt~gLl~~l~~~El~aVlAHElgHi~~~d~~~~~~~~~-~~~~~~~l~~~~~~---~~~~~~~~~~~~~~~~~ 192 (298)
T PRK04897 117 PKNAAVAVTTGLLAIMNREELEGVIGHEISHIRNYDIRLSTIAVA-LASAITLLSDIAGR---MMWWGGGSRRRDDDRDG 192 (298)
T ss_pred CCCcEEEeehHHHhhCCHHHHHHHHHHHHHHHhcCCHHHHHHHHH-HHHHHHHHHHHHHH---HHHhccccccccccccc
Confidence 999999999999999999999999999999999999988776643 33333332222111 111111 01 111111
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC-ChhhhhhhcccCc
Q psy5221 175 NYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND-LPKSIAILGINNN 253 (436)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~-~~~~~~~~~~~~~ 253 (436)
+...+..++...+...+..+...++.+++||.+||+||++|++++++|+++++||.|+...+.... .....+++++.+|
T Consensus 193 ~~~~~~~~~~~l~~~i~~~~~~~ll~~~~SR~rE~~AD~~A~~lt~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~i~~p 272 (298)
T PRK04897 193 GGLQIILLIVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQPMKHPVDDASAALYISDP 272 (298)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhccccccccChHHHHhccCCC
Confidence 101111122222222333445567789999999999999999999999999999999998754322 2234456677777
Q ss_pred ccchhhhhhhccCCChHHHHHHHHh
Q psy5221 254 FKKKSFLKLFATHPSFEKRIAILSL 278 (436)
Q Consensus 254 ~~~~~~~~~~~tHP~~~~RI~~L~~ 278 (436)
..+..+..||+|||++.+||++|++
T Consensus 273 ~~~~~~~~lfsTHP~~~eRI~~L~~ 297 (298)
T PRK04897 273 LKKKGLSKLFDTHPPIEERIERLKN 297 (298)
T ss_pred cccchHHHHHcCCcCHHHHHHHHHc
Confidence 5444568899999999999999985
No 4
>PRK02870 heat shock protein HtpX; Provisional
Probab=100.00 E-value=1.8e-40 Score=320.44 Aligned_cols=248 Identities=27% Similarity=0.411 Sum_probs=186.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCC---cchhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEe
Q psy5221 20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSN---FTEDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFAT 94 (436)
Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~---~~~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~ 94 (436)
.++++.+++++++.++.||++.+++.+.++++++. |.+ .++++|++.++++|.++|+| .|+||++++ .+|||++
T Consensus 71 ~~~~~~~~~~~~~~~~~y~~~~~~~l~~~~a~~I~-~~~~~p~~~~~L~~~ve~La~~ag~p~~p~V~vi~~~~~NAFA~ 149 (336)
T PRK02870 71 TATLIMSLVAVISILVTFQNFDKIMLSGTEYKEIT-PENALSLQERQLYNVVEELLVAAGLRFMPKVYIIDAPYMNAFAS 149 (336)
T ss_pred hHHHHHHHHHHHHHHHHHHhhHHHHHHHcCCEEcC-CCCCCchhhHHHHHHHHHHHHHcCCCCCCeEEEEcCCCCceEEe
Confidence 45666778899999999999999999999999997 322 23567999999999999999 899999999 9999999
Q ss_pred CCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCC
Q psy5221 95 GAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNS 174 (436)
Q Consensus 95 G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~ 174 (436)
|.+++++.|++++||++.++++|++||||||+||++|+|+..+.+...+. .+..++.. ... ..++|..++ .+.
T Consensus 150 G~~~~~~~Ivvt~GLL~~L~~dEL~aVlAHELgHik~~di~~~~~~~~l~-~~~~~~~~---~~~-~~~~g~~~~-~~~- 222 (336)
T PRK02870 150 GYSEKSAMVAITTGLLEKLDRDELQAVMAHELSHIRHGDIRLTLCVGVLS-NIMLIVAD---FLF-YSFMGNRRN-SGA- 222 (336)
T ss_pred cCCCCCcEEEEehHHhhhCCHHHHHHHHHHHHHHHHcccHHHHHHHHHHH-HHHHHHHH---HHH-HHHhcCCcc-ccc-
Confidence 99887888999999999999999999999999999999997765543322 22222211 111 112231111 111
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC---------Chhhh
Q psy5221 175 NYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND---------LPKSI 245 (436)
Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~---------~~~~~ 245 (436)
. ... ....++..++.++..++.+++||.+||+||++|++++++|+++++||.|++..+.... ..+..
T Consensus 223 ~-~~~---~~~l~l~~~~~~~~~ll~~~iSR~rEy~AD~~Aa~ltg~p~aLasAL~KL~~~~~~~~~~~~~~~~~~~~~~ 298 (336)
T PRK02870 223 N-RAR---MIILILRYVLPILTVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDHAQNDEQYAYKHTDHESTR 298 (336)
T ss_pred c-hhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhccccCccccccccccCChhh
Confidence 1 001 1112333344455667889999999999999999999999999999999998764433 12344
Q ss_pred hhhcccCcccc--hhhhhhhccCCChHHHHHHHHhc
Q psy5221 246 AILGINNNFKK--KSFLKLFATHPSFEKRIAILSLL 279 (436)
Q Consensus 246 ~~~~~~~~~~~--~~~~~~~~tHP~~~~RI~~L~~~ 279 (436)
+++++.+|... ....++|+|||++++||++|+..
T Consensus 299 a~~~i~~p~~~~~~~~~~LfsTHPp~e~RI~rL~~~ 334 (336)
T PRK02870 299 RAAYLFDPAGISPGSLSDAFSTHPSIENRLAALGGK 334 (336)
T ss_pred hhhhccCCcccccccHhHHHcCCCCHHHHHHHHhhc
Confidence 56778777531 24678999999999999999864
No 5
>PRK03072 heat shock protein HtpX; Provisional
Probab=100.00 E-value=2.2e-40 Score=316.77 Aligned_cols=248 Identities=24% Similarity=0.325 Sum_probs=187.9
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC
Q psy5221 21 LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN 99 (436)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~ 99 (436)
.+++++++++++.+++++++++++.+.+++++++ |.+ +|+|++.++++|+++|+|.|++|++|+ .+|||++|..++
T Consensus 32 ~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~-~~~--~p~L~~~v~~la~~~g~p~p~vyv~~~~~~NAFa~G~~~~ 108 (288)
T PRK03072 32 GLGIAVLIAVGMNAYVYWNSDKLALRAMHAQPVS-EVQ--APAMYRIVRELSTAARQPMPRLYISPTAAPNAFATGRNPR 108 (288)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECC-hhh--hHHHHHHHHHHHHHcCCCCCCEEEecCCCCceEEecCCCC
Confidence 4566678888999999999999999999999986 544 345999999999999999999999999 999999998777
Q ss_pred ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhH
Q psy5221 100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHK 179 (436)
Q Consensus 100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~ 179 (436)
+..|++|+||++.++++|++||||||+||++++|+.++.+...+. .++.++.....++ ..++. ...... ...+
T Consensus 109 ~~~v~vt~gLl~~l~~~El~aVlAHElgHi~~~d~~~~~~~~~~~-~~i~~l~~~~~~~---~~~~~-~~~~~~--~~~~ 181 (288)
T PRK03072 109 NAAVCCTEGILQILNERELRGVLGHELSHVYNRDILISSVAGALA-SVITYLANMAMFA---GMFGG-RRDNDG--PNPL 181 (288)
T ss_pred CcEEEecHHHHHhCCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH-HHHHHHHHHHHHH---HHhcc-cccccc--chHH
Confidence 788999999999999999999999999999999999877654322 2222222111111 11111 111111 1111
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCCh-----hhhhhhcccCcc
Q psy5221 180 IYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLP-----KSIAILGINNNF 254 (436)
Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~-----~~~~~~~~~~~~ 254 (436)
+.+ ++..+..+...++.+.+||.+||+||++|++++++|+++++||.|+...+.....+ +...++++.+|.
T Consensus 182 ~~~----~~~l~~~~~~~~~~~~~SR~rE~~AD~~A~~l~~~p~~La~AL~KL~~~~~~~~~~~~~~~~~~~~~~i~~p~ 257 (288)
T PRK03072 182 ALL----LVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGGVQAAPLPPEPQLASQAHLMIANPF 257 (288)
T ss_pred HHH----HHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHhCCHHHHHHHHHHHHhccccCCCCccccchhhhhhhhcCcc
Confidence 111 22223334566778999999999999999999999999999999999876433322 245667777775
Q ss_pred cchhhhhhhccCCChHHHHHHHHhcccc
Q psy5221 255 KKKSFLKLFATHPSFEKRIAILSLLKLS 282 (436)
Q Consensus 255 ~~~~~~~~~~tHP~~~~RI~~L~~~~~~ 282 (436)
.+..+..+++|||++.+||++|+++..+
T Consensus 258 ~~~~~~~l~sTHP~~~eRI~~L~~~~~~ 285 (288)
T PRK03072 258 RAGGIGRLFSTHPPMADRIARLEQMAGR 285 (288)
T ss_pred cchHHHHHHcCCcCHHHHHHHHHHHhhh
Confidence 4446889999999999999999987665
No 6
>PRK02391 heat shock protein HtpX; Provisional
Probab=100.00 E-value=2.1e-40 Score=317.24 Aligned_cols=250 Identities=25% Similarity=0.377 Sum_probs=185.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCC
Q psy5221 20 ILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFK 98 (436)
Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~ 98 (436)
..+++++++++++.++.++++++++.+.+++++++ |.++ |+|++.++++|+++|+|.|++|++++ .+|||++|.++
T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--p~L~~~v~~la~~~~~~~p~v~v~~~~~~NAfa~G~~~ 113 (296)
T PRK02391 37 VSLVLIVVIAGGFLLAQYFFSDKLALWSMGARIVS-EDEY--PELHAMVERLCALADLPKPRVAVADSDVPNAFATGRSP 113 (296)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHcCCEECC-hhhC--HHHHHHHHHHHHHcCCCCCcEEEEeCCCCceEEecCCC
Confidence 34666777888889999999999999999999887 5444 34999999999999999999999999 99999999998
Q ss_pred CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchh
Q psy5221 99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNH 178 (436)
Q Consensus 99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~ 178 (436)
++++|++|+||++.+|++|++||+|||+||++++|+.++.+... +..+..++. .+. .+++........ . .
T Consensus 114 ~~~~V~vt~gLl~~L~~~El~aVlaHElgHi~~~di~~~~i~~~-~~~~~~~l~---~~~---~~~~~~~~~~~~-~-~- 183 (296)
T PRK02391 114 KNAVVCVTTGLMRRLDPDELEAVLAHELSHVKNRDVAVMTIASF-LSTIAFLIV---RWG---FYFGGFGGRGGG-G-G- 183 (296)
T ss_pred CCcEEEecHHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHH-HHHHHHHHH---HHH---HHhccccCCCCc-c-c-
Confidence 89999999999999999999999999999999999988766532 222222211 111 111110000000 0 0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCC-----hhhhhhhcccCc
Q psy5221 179 KIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL-----PKSIAILGINNN 253 (436)
Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~-----~~~~~~~~~~~~ 253 (436)
...++...++..+..++..++.+.+||.+||+||++|++++|+|+++++||.|+.......+. ......+++.++
T Consensus 184 ~~~~~~~~~~~~~~~~~~~~l~~~~SR~rE~~AD~~Aa~ltg~p~~LasAL~KL~~~~~~~~~~~~~~~~~~~~~~i~p~ 263 (296)
T PRK02391 184 GGGILVVILVSLVVWAISFLLIRALSRYREFAADRGAAIITGRPSALASALMKISGRMDRVPTEDLREAEGMNAFFIIPA 263 (296)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhCCHHHHHHHHHHHHcccccCCchhhcccHHHhhhcccCC
Confidence 111222233334455556677899999999999999999999999999999999976422211 122234445444
Q ss_pred ccchhhhhhhccCCChHHHHHHHHhcccc
Q psy5221 254 FKKKSFLKLFATHPSFEKRIAILSLLKLS 282 (436)
Q Consensus 254 ~~~~~~~~~~~tHP~~~~RI~~L~~~~~~ 282 (436)
..+..+..|++|||++.+||++|++++.+
T Consensus 264 ~~~~~~~~l~sTHP~~~eRI~~L~~~~~~ 292 (296)
T PRK02391 264 LSGGSLGRLFSTHPPLEKRIAQLEKLERE 292 (296)
T ss_pred CCcchHHHHhcCCcCHHHHHHHHHHHHHh
Confidence 23345788999999999999999987655
No 7
>PRK03982 heat shock protein HtpX; Provisional
Probab=100.00 E-value=3.8e-39 Score=309.55 Aligned_cols=251 Identities=28% Similarity=0.380 Sum_probs=184.2
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC
Q psy5221 21 LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN 99 (436)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~ 99 (436)
++++.+++++++.++.+++++++..+.+++++++ +.+ +|+|++.++++|+++|++.|++|++|+ .+|||++|++++
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~l~-~~~--~p~L~~~v~~la~~~g~~~p~v~v~~~~~~NAfa~G~~~~ 106 (288)
T PRK03982 30 GPIIAILLALIPNLISYYYSDKIVLASYNARIVS-EEE--APELYRIVERLAERANIPKPKVAIVPTQTPNAFATGRDPK 106 (288)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHhcCCEECC-hhh--hHHHHHHHHHHHHHcCCCCCeEEEEeCCCcceEEeccCCC
Confidence 3445566677778889999999999999999887 333 455999999999999999999999999 999999999988
Q ss_pred ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCCCCCCCchh
Q psy5221 100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNS-YDDSNYNSNYNH 178 (436)
Q Consensus 100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~-~~~~~~~~~~~~ 178 (436)
+++|++++||++.+|+||++||+|||+||++++|+.++.+...+. ..+.++..++ ....+++. +.+.... + .
T Consensus 107 ~~~V~vt~gLl~~l~~~El~AVlAHElgHi~~~h~~~~~~~~~~~-~~~~~l~~~~---~~~~~~~~~~~~~~~~-~-~- 179 (288)
T PRK03982 107 HAVVAVTEGILNLLNEDELEGVIAHELTHIKNRDTLIQTIAATLA-GAIMYLAQWL---SWGLWFGGGGRDDRNG-G-N- 179 (288)
T ss_pred CeEEEeehHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH-HHHHHHHHHH---HHHHHhcccCcccccc-c-h-
Confidence 888999999999999999999999999999999998877664322 2222222111 11122221 1111111 0 1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCC---hhhhhhhcccCccc
Q psy5221 179 KIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL---PKSIAILGINNNFK 255 (436)
Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~---~~~~~~~~~~~~~~ 255 (436)
++...+...+..+...++.+.+||.+||+||++|++++++|+++++||.|++..+...+. ....+++++.+|..
T Consensus 180 ---~~~~~~~~~~~~~~~~l~~~~~SR~~E~~AD~~A~~~~~~p~~l~~aL~kL~~~~~~~~~~~~~~~~~~~~~~~p~~ 256 (288)
T PRK03982 180 ---PIGSLLLIILAPIAATLIQFAISRQREFSADEGGARLTGNPLALANALQKLEKGVRYIPLKNGNPATAHMFIINPFR 256 (288)
T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhhhccCCCCCCCHHHHhHhhcCCCC
Confidence 111112223333445677899999999999999999999999999999999876432221 12234455555544
Q ss_pred chhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221 256 KKSFLKLFATHPSFEKRIAILSLLKLSLY 284 (436)
Q Consensus 256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~~ 284 (436)
+..+..+++|||++.+||++|++.++++.
T Consensus 257 ~~~~~~l~sTHP~~~eRI~~l~~~~~~~~ 285 (288)
T PRK03982 257 GQFLANLFSTHPPTEERIERLLEMAQEMG 285 (288)
T ss_pred CchhhHHhCCCcCHHHHHHHHHHHHHhcc
Confidence 44578999999999999999998877654
No 8
>PRK01345 heat shock protein HtpX; Provisional
Probab=100.00 E-value=5.2e-39 Score=310.49 Aligned_cols=248 Identities=23% Similarity=0.307 Sum_probs=184.7
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCc
Q psy5221 22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNS 100 (436)
Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~ 100 (436)
+++++++++++.++.+|.+++++.+.+++++++ |.+. |+|++.++++|+++|++.|+||++|+ .+|||++|.++.+
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-~~~~--p~L~~~v~~La~~agi~~p~v~vid~~~~NAFa~G~~~~~ 106 (317)
T PRK01345 30 MMIALVIAAGMNLFSYWNSDKMVLRMYGAQEVD-ERSA--PELYRMVRDLARRAGLPMPKVYIIDNPQPNAFATGRNPEN 106 (317)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHcCCeECC-cccC--HHHHHHHHHHHHHcCCCCCcEEEEcCCCcceEEecCCCCC
Confidence 556677888889999999999999999999887 4443 34999999999999999999999999 9999999998888
Q ss_pred cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHH
Q psy5221 101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180 (436)
Q Consensus 101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~ 180 (436)
+.|++|+||++.+++||++||||||+||++++|+.++.+...+...+ ..+ ..+. .+++..... ..++ ..
T Consensus 107 ~~V~vt~gLL~~L~~dEL~aVlAHElgHi~~~d~~~~~l~~~~~~~~-~~l----~~~~--~~~~~~~~~--~~~~--~~ 175 (317)
T PRK01345 107 AAVAATTGLLQRLSPEEVAGVMAHELAHVKNRDTLTMTITATLAGAI-SML----ANFA--FFFGGNREN--NNGP--LG 175 (317)
T ss_pred eEEEechHHHhhCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-HHH----HHHH--HHhcCCccc--ccch--HH
Confidence 89999999999999999999999999999999998876654322111 111 1110 112111111 1111 11
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccC-----CChhhhhhhcccCccc
Q psy5221 181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVN-----DLPKSIAILGINNNFK 255 (436)
Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~-----~~~~~~~~~~~~~~~~ 255 (436)
++...+...+..+...++.+.+||.+||+||++|++++++|++++++|.|++...... ..+....++++.++..
T Consensus 176 -~i~~~~~~~~~~~~~~l~~~~~SR~rE~~AD~~A~~ltg~p~~L~~AL~KL~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (317)
T PRK01345 176 -LVGTLAAMIVAPLAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERGAHGVPNEEAERNPATAHMFIINPLS 254 (317)
T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhhhccCcccccccChHHHHHHhcCCcc
Confidence 1111122223334445677899999999999999999999999999999999764221 1123445666666654
Q ss_pred chhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221 256 KKSFLKLFATHPSFEKRIAILSLLKLSLY 284 (436)
Q Consensus 256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~~ 284 (436)
......+|+|||++.+||++|+++++++.
T Consensus 255 ~~~~~~lfsTHP~~~eRI~~L~~~~~~~~ 283 (317)
T PRK01345 255 GEGMDNLFSTHPATENRIAALQRMAGEMG 283 (317)
T ss_pred ccchhHHhcCCcChHHHHHHHHHHHHhcC
Confidence 44567899999999999999999887764
No 9
>PRK03001 M48 family peptidase; Provisional
Probab=100.00 E-value=5.8e-38 Score=300.64 Aligned_cols=246 Identities=24% Similarity=0.361 Sum_probs=179.0
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCc
Q psy5221 22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNS 100 (436)
Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~ 100 (436)
+++++++++++.++.++++++++.+.++++++++ . +.|.|++.++++|+++|+|.|++|++|+ .+|||++|.++.+
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~l~~-~--~~p~L~~~v~~la~~~g~~~p~v~v~~~~~~NAfa~G~~~~~ 106 (283)
T PRK03001 30 MLIALLFALGMNFFSYWFSDKMVLKMYNAQEVDE-N--TAPQFYRMVRELAQRAGLPMPKVYLINEDQPNAFATGRNPEH 106 (283)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHcCCEECCc-c--ccHHHHHHHHHHHHHcCCCCCeEEEecCCCcceEEecCCCCC
Confidence 4445667778889999999999999999987662 2 2345999999999999999999999999 9999999998877
Q ss_pred cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHH
Q psy5221 101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180 (436)
Q Consensus 101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~ 180 (436)
+.|++++||++.+|++|++||||||+||++++|+.++.+...+...+ .. +..+. .+++.... +.. ....+.
T Consensus 107 ~~Ivvt~gLl~~l~~~El~aVlAHElgHi~~~h~~~~~~~~~~~~~~-~~----~~~~~--~~~~~~~~-~~~-~~~~~~ 177 (283)
T PRK03001 107 AAVAATTGILRVLSEREIRGVMAHELAHVKHRDILISTISATMAGAI-SA----LANFA--MFFGGRDE-NGR-PVNPIA 177 (283)
T ss_pred eEEEecHHHHhhCCHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHH-HH----HHHHH--HHhcCCCc-ccc-ccchHH
Confidence 88999999999999999999999999999999998876543221111 11 11111 11221011 111 001111
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCC-----ChhhhhhhcccCccc
Q psy5221 181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND-----LPKSIAILGINNNFK 255 (436)
Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~-----~~~~~~~~~~~~~~~ 255 (436)
.+ ++..+......++.+.+||.+||+||++|++++++|+++++||.|+...+...+ ......++++.+|..
T Consensus 178 ~~----~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~~p~~l~~AL~Kl~~~~~~~p~~~~~~~~~~~~~~~~~p~~ 253 (283)
T PRK03001 178 GI----AVAILAPLAASLIQMAISRAREFEADRGGARISGDPQALASALDKIHRYASGIPFQAAEAHPATAQMMIINPLS 253 (283)
T ss_pred HH----HHHHHHHHHHHHHHHHHhHHHHHHHhHHHHHHhCCHHHHHHHHHHHHhhhccCCcccccCCHHHHHHHhcCCCC
Confidence 11 112222334556778999999999999999999999999999999998764321 111223455555543
Q ss_pred chhhhhhhccCCChHHHHHHHHhccccc
Q psy5221 256 KKSFLKLFATHPSFEKRIAILSLLKLSL 283 (436)
Q Consensus 256 ~~~~~~~~~tHP~~~~RI~~L~~~~~~~ 283 (436)
+..+..+++|||++.+||++|++.+.+.
T Consensus 254 ~~~~~~l~~THP~~~eRI~~l~~~~~~~ 281 (283)
T PRK03001 254 GGGLANLFSTHPSTEERIARLMAMARTG 281 (283)
T ss_pred cchHHHHHcCCcCHHHHHHHHHHHHHhc
Confidence 3457899999999999999999876653
No 10
>PRK11597 heat shock chaperone IbpB; Provisional
Probab=100.00 E-value=8.9e-33 Score=233.92 Aligned_cols=139 Identities=41% Similarity=0.741 Sum_probs=121.6
Q ss_pred ccccccccCccCcchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc
Q psy5221 291 MTVSAFAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT 370 (436)
Q Consensus 291 m~~~~~~p~~~~~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~ 370 (436)
|+...++||++...+||+|++.|.+.. ....+|++||+++++++|+|.++|||++||||+|++++|.|+|+|+++.++
T Consensus 1 ~~~~~~~~~~~~~~~~d~l~~~~~~~~--~~~~~P~vdI~e~~~~~y~v~adlPGv~kedi~V~v~~~~LtI~ge~~~~~ 78 (142)
T PRK11597 1 MRNYDLSPLLRQWIGFDKLANALQNAG--ESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLEGTRLTVKGTPEQPE 78 (142)
T ss_pred CCccccchhhcccccHHHHHHHhcccC--ccCCCCcEEEEEcCCCEEEEEEEeCCCCHHHeEEEEECCEEEEEEEEcccc
Confidence 455678999999999999998776532 245689999995467899999999999999999999999999999987766
Q ss_pred CCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCC
Q psy5221 371 QQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVPEALKPRKIYINTTKN 431 (436)
Q Consensus 371 ~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~ 431 (436)
++.+|+++||.+|+|+|+|.||++||.-+|+|+||||+|++||..+++.++|+|+|+..++
T Consensus 79 ~~~~~~~~Er~~g~F~R~f~LP~~vd~~~A~~~nGVL~I~lPK~~~~~~~~rkI~I~~~~~ 139 (142)
T PRK11597 79 KEVKWLHQGLVNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEPEAIAPQRIAISERPA 139 (142)
T ss_pred CCCcEEEEEEeCcEEEEEEECCCCcccCcCEEcCCEEEEEEeccCccccCCcEEEECCccc
Confidence 7789999999999999999999999977999999999999999655777899999986553
No 11
>PRK10743 heat shock protein IbpA; Provisional
Probab=100.00 E-value=5.2e-32 Score=228.91 Aligned_cols=136 Identities=49% Similarity=0.862 Sum_probs=118.8
Q ss_pred cccccccCccCcchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC
Q psy5221 292 TVSAFAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ 371 (436)
Q Consensus 292 ~~~~~~p~~~~~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~ 371 (436)
+..+++|++++..+||+|++.|..........+|++||+++++++|+|.++|||++||||+|++++|.|+|+|+++.+.+
T Consensus 2 ~~~~~~~~~~~~~~~d~lf~~~~~~~~~~~~~~p~~di~ee~~~~~~v~aelPGv~kedi~V~v~~~~LtI~ge~~~~~~ 81 (137)
T PRK10743 2 RNFDLSPLYRSAIGFDRLFNLLENNQSQSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITAQDNLLVVKGAHADEQK 81 (137)
T ss_pred CccccChhhhcccCHHHHhhhhhhhhhcccCCCCcEEEEEcCCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEECcccc
Confidence 34578888888899999988887643333455699999954899999999999999999999999999999999887777
Q ss_pred CCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEecCCCCCCCCeEeeec
Q psy5221 372 QRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVPEALKPRKIYIN 427 (436)
Q Consensus 372 ~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~ 427 (436)
+.+|+++||.+|+|+|+|.||++||.-+|+|+||||+|++||..+++.++|+|+|+
T Consensus 82 ~~~~~~~Er~~g~F~R~~~LP~~Vd~~~A~~~dGVL~I~lPK~~~~~~~~r~I~I~ 137 (137)
T PRK10743 82 ERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVIPEAKKPRRIEIN 137 (137)
T ss_pred CCcEEEEEEECCEEEEEEECCCCcccCcCEEeCCEEEEEEeCCCccccCCeEEeeC
Confidence 78899999999999999999999996679999999999999965677789999985
No 12
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=99.97 E-value=5.5e-30 Score=249.19 Aligned_cols=240 Identities=30% Similarity=0.458 Sum_probs=174.0
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCC---CcchhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEeCC
Q psy5221 22 TISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPS---NFTEDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFATGA 96 (436)
Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~---~~~~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~G~ 96 (436)
+++..+++....++.+++++.+..+.++...+..+. ...+.++...+.+++.+.+.+ .|+++++++ .+|||++|.
T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~v~i~~~~~~NAFa~g~ 135 (302)
T COG0501 56 LLIALLLAFAALLISLLFSKALVLKSLGALTLSEPILLAPRLYAVLLLKVAELARQAGIPHMPEVYILETPQPNAFALGG 135 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhceeecccccccchHHHHHHHHHHHHHHHHCCCCCCCeeEEecCCCccceecCC
Confidence 557777777888899999999999999886665332 222344556999999999999 899999999 999999998
Q ss_pred CCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCc
Q psy5221 97 FKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNY 176 (436)
Q Consensus 97 ~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~ 176 (436)
++.++.|++++||++.+|+||++||||||+||++++|+.++.............+. .........+ .
T Consensus 136 ~~~~~~V~vt~gLl~~l~~dEl~aVlaHElgHi~~rd~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-----~----- 202 (302)
T COG0501 136 GPKNGRVVVTTGLLDLLNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASAFVLLA---TLALAAGLLG-----E----- 202 (302)
T ss_pred CCCCeeEEecHHHHhhCCHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHH---HHHHHHHhhc-----c-----
Confidence 87778899999999999999999999999999999999984444333333222221 1111110000 0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccc
Q psy5221 177 NHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKK 256 (436)
Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~ 256 (436)
.......+........+..+.+.+||.+|++||++|+++++ |+.++++|.|+...+... .......++..+
T Consensus 203 ---~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~l~~-~~~l~~aL~kl~~~~~~~-----~~~~~~~~~~~~ 273 (302)
T COG0501 203 ---AALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQKLARLSGRA-----NSKAFIASGFSG 273 (302)
T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhchhhccC-hHHHHHHHHHHHhhhccc-----chhhhhcCcccc
Confidence 11222334444555667788999999999999999999999 799999999999865320 011111122111
Q ss_pred hhhhhhhccCCChHHHHHHHHhccccc
Q psy5221 257 KSFLKLFATHPSFEKRIAILSLLKLSL 283 (436)
Q Consensus 257 ~~~~~~~~tHP~~~~RI~~L~~~~~~~ 283 (436)
.....+++|||++.+||++|++.....
T Consensus 274 ~~~~~~~stHP~~~~Ri~~L~~~~~~~ 300 (302)
T COG0501 274 GRLQALFSTHPPLAERIAALRQLALTV 300 (302)
T ss_pred cchHHHHhcCCCHHHHHHHHHHHHHhh
Confidence 223689999999999999999876543
No 13
>KOG2719|consensus
Probab=99.97 E-value=9.2e-30 Score=246.48 Aligned_cols=224 Identities=17% Similarity=0.218 Sum_probs=170.3
Q ss_pred HHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCeEEEECC-----CccEEEeCCCCCccEEEEc
Q psy5221 32 GSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPEVAIYNG-----EINAFATGAFKNSSLIAIS 106 (436)
Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~v~v~~~-----~~NA~~~G~~~~~~~v~it 106 (436)
+.++...+.+-......+.. .|.+++. |++.++++|++.|+|..+++++++ ..||+.+|++..+|+++.+
T Consensus 192 ~~l~l~ti~p~~i~P~f~K~---~PLe~g~--l~~~Ie~la~s~gfp~~k~~vi~~s~rs~hsNAyfyG~~~~KRIvIyD 266 (428)
T KOG2719|consen 192 LSLLLLTIYPGFIAPLFGKF---TPLEEGD--LKEKIERLADSVGFPLSKYRVIDGSKRSSHSNAYFYGLCKNKRIVIYD 266 (428)
T ss_pred HHHHHHHHHHHhhhhhhcCC---CCCCCCc--hHHHHHHHHHhcCCCceEEEEEecCCCCCCCCeeeeeccccceEEEeh
Confidence 33444444555555555554 4787777 999999999999999999999994 9999999999999999999
Q ss_pred HHHH--hh-cCHHHHHHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hhhccCCCCCCCCCchhHHH
Q psy5221 107 ANLL--ES-MTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDC--FLLNSYDDSNYNSNYNHKIY 181 (436)
Q Consensus 107 ~gLl--~~-l~~~El~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~g~~~~~~~~~~~~~~~~ 181 (436)
+ |+ +. +++||+.||+|||+||++++|+.++.++..+...+.+++ ++.+..+ .+...|... ..+..++.
T Consensus 267 t-Ll~~~~~~~~eel~AVl~HELGHW~~~H~~K~~ii~~~~l~l~~ll---F~~~~~~~~ly~a~Gf~~---~~P~~ig~ 339 (428)
T KOG2719|consen 267 T-LLLEEEHLNNEELVAVLAHELGHWKLNHVLKNIIIMQIHLFLEFLL---FGFLYRNPKLYAAFGFID---EQPSLIGF 339 (428)
T ss_pred h-hhhhhhccccHHHHHHHHHHhhHHHHhhHHHHHHHHHHHHHHHHHH---HHHHHcCcchheeecCCC---CCcchhHH
Confidence 8 55 33 489999999999999999999999888755433333222 2222221 111111211 12344566
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhh
Q psy5221 182 GLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLK 261 (436)
Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (436)
++.+.....+...+....++..||+.||+||++|.++ |..+.+.+||.|+..++.+...+++ .|+.
T Consensus 340 livf~~~l~py~~l~~~~~n~~sR~fEyqAD~fA~kl-GYg~~L~~AL~KL~~dnlsf~~~D~-------------LYs~ 405 (428)
T KOG2719|consen 340 LIVFQFVLAPYRALLNFLMNLISRRFEYQADAFAKKL-GYGKDLRQALIKLFVDNLSFPVSDP-------------LYSA 405 (428)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHc-CCchhHHHHHHHHhhhhcCCCCCcH-------------HHHH
Confidence 6555566666777888899999999999999999999 7779999999999999877666664 4999
Q ss_pred hhccCCChHHHHHHHHhccc
Q psy5221 262 LFATHPSFEKRIAILSLLKL 281 (436)
Q Consensus 262 ~~~tHP~~~~RI~~L~~~~~ 281 (436)
|..|||++.|||++++....
T Consensus 406 ~~~~HPtvleRl~~l~~~~~ 425 (428)
T KOG2719|consen 406 WHHLHPTVLERLDALDYLSE 425 (428)
T ss_pred HHhcCchHHHHHHHHHHHHh
Confidence 99999999999999987654
No 14
>COG0071 IbpA Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=99.95 E-value=4.1e-27 Score=203.48 Aligned_cols=124 Identities=35% Similarity=0.659 Sum_probs=109.0
Q ss_pred cchHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecc--cCCCceEEEee
Q psy5221 303 TIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQAD--TQQRNFLHRGI 380 (436)
Q Consensus 303 ~~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~--~~~~~~~~~e~ 380 (436)
..+||++++.+.+.......++|++||++ +++.|+|.++||||+|+||+|+++++.|+|+|+++.+ .++..++++|+
T Consensus 19 ~~~fd~~~~~~~~~~~~~~~~~P~vdi~e-~~~~~~I~~elPG~~kedI~I~~~~~~l~I~g~~~~~~~~~~~~~~~~e~ 97 (146)
T COG0071 19 SPGFDRLFREFGNLPESRPTGTPPVDIEE-TDDEYRITAELPGVDKEDIEITVEGNTLTIRGEREEEEEEEEEGYLRRER 97 (146)
T ss_pred ccchhhhhhhhhcccccccCCCCcEEEEE-cCCEEEEEEEcCCCChHHeEEEEECCEEEEEEEecccccccCCceEEEEE
Confidence 35788888888765433335789999995 8899999999999999999999999999999999863 56789999999
Q ss_pred eeeeEEEEEECCCccc--eEeEEeeCcEEEEEEecCCCCCCCCeEeeec
Q psy5221 381 ATRNFEHRFQLADHVH--VVAAFFDQGILNIDLIREVPEALKPRKIYIN 427 (436)
Q Consensus 381 ~~~~f~r~~~lP~~v~--~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~ 427 (436)
.+|+|+|+|.||+.|+ .++|+|+||+|+|++||..++..++++|+|+
T Consensus 98 ~~~~f~r~~~Lp~~v~~~~~~A~~~nGvL~I~lpk~~~~~~~~~~i~I~ 146 (146)
T COG0071 98 AYGEFERTFRLPEKVDPEVIKAKYKNGLLTVTLPKAEPEEKKPKRIEIE 146 (146)
T ss_pred EeeeEEEEEECcccccccceeeEeeCcEEEEEEeccccccccCceeecC
Confidence 9999999999999999 5999999999999999976666678999885
No 15
>PF01435 Peptidase_M48: Peptidase family M48 This is family M48 in the peptidase classification. ; InterPro: IPR001915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M48 (Ste24 endopeptidase family, clan M-); members of both subfamily are represented. The members of this set of proteins are mostly described as probable protease htpX homologue (3.4.24 from EC) or CAAX prenyl protease 1, which proteolytically removes the C-terminal three residues of farnesylated proteins. They are integral membrane proteins associated with the endoplasmic reticulum and Golgi, binding one zinc ion per subunit. In Saccharomyces cerevisiae (Baker's yeast) Ste24p is required for the first NH2-terminal proteolytic processing event within the a-factor precursor, which takes place after COOH-terminal CAAX modification is complete. The Ste24p contains multiple predicted membrane spans, a zinc metalloprotease motif (HEXXH), and a COOH-terminal ER retrieval signal (KKXX). The HEXXH protease motif is critical for Ste24p activity, since Ste24p fails to function when conserved residues within this motif are mutated. The Ste24p homologues occur in a diverse group of organisms, including Escherichia coli, Schizosaccharomyces pombe (Fission yeast), Haemophilus influenzae, and Homo sapiens (Human), which indicates that the gene is highly conserved throughout evolution. Ste24p and the proteins related to it define a subfamily of proteins that are likely to function as intracellular, membrane-associated zinc metalloproteases []. HtpX is a zinc-dependent endoprotease member of the membrane-localized proteolytic system in E. coli, which participates in the proteolytic quality control of membrane proteins in conjunction with FtsH, a membrane-bound and ATP-dependent protease. Biochemical characterisation revealed that HtpX undergoes self-degradation upon cell disruption or membrane solubilization. It can also degraded casein and cleaves solubilized membrane proteins, for example, SecY []. Expression of HtpX in the plasma membrane is under the control of CpxR, with the metalloproteinase active site of HtpX located on the cytosolic side of the membrane. This suggests a potential role for HtpX in the response to mis-folded proteins [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis, 0016020 membrane; PDB: 3CQB_A 3C37_B.
Probab=99.94 E-value=5.9e-27 Score=218.22 Aligned_cols=185 Identities=30% Similarity=0.466 Sum_probs=95.5
Q ss_pred chhHHHHHHHHHHHhc--CCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221 60 TEDWLLKTVNKISKNA--NISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 60 ~~~~L~~~l~~l~~~~--g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~ 136 (436)
+.++|++.++++|+++ +.+.+++|++++ .+|||++|.++ ++.|+++.||++.++++|++||||||+||++++|..+
T Consensus 29 ~~~~L~~~v~~l~~~~~~~~~~~~v~v~~~~~~NA~~~g~~~-~~~I~v~~~ll~~~~~~el~aVlaHElgH~~~~h~~~ 107 (226)
T PF01435_consen 29 EDPELRRIVEELARRAGLGIPPPRVYVIDSPSPNAFATGGGP-RKRIVVTSGLLESLSEDELAAVLAHELGHIKHRHILK 107 (226)
T ss_dssp HHHHHHHHHHHHHHHHHCTSS--EEEEE--SSEEEEEETTTC---EEEEEHHHHHHSSHHHHHHHHHHHHHHHHTTHCCC
T ss_pred HHHHHHHHHHHHHHHhcCCCCCCeEEEEcCCCCcEEEEccCC-CcEEEEeChhhhcccHHHHHHHHHHHHHHHHcCCcch
Confidence 3456999999999999 888889999999 99999999887 6779999999999999999999999999999999876
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhh
Q psy5221 137 MLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSA 216 (436)
Q Consensus 137 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~ 216 (436)
......+...+..+....+.... .... ... .......++..+..++.+.+||.+|++||++|+
T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~--------------~~~--~~~~~~~~~~~~~~~~~~~~sr~~E~~AD~~a~ 170 (226)
T PF01435_consen 108 SLLISLLLSILFFALLALLIGSM-SLFS--------------AFG--FIDILGILIAFLFQLLTNAFSRRQEYEADRYAA 170 (226)
T ss_dssp CCCHHH-HHHHHHHHHT-----H-HHHH--------------HHH------------HHSTT------HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhhhh-hhhh--------------hhh--hcccchhhHHHHHHHhhcchhHHHHHHHHHHHH
Confidence 53332222111111110000000 0000 000 000000111134456779999999999999999
Q ss_pred hhcCCchHH----------HHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHH
Q psy5221 217 KLLGTPDPM----------ISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAI 275 (436)
Q Consensus 217 ~~~g~~~~l----------~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~ 275 (436)
+++|++..+ .++|.++...+..... ....+..+++|||++.+||++
T Consensus 171 ~~~~~~~~l~~a~~~~~~~~~~l~~l~~~~~~~~~-------------~~~~~~~~~~tHP~~~~Ri~~ 226 (226)
T PF01435_consen 171 RLGGDPALLARALYKPAAAISALEKLAEANSMRPD-------------SDWRYSSLFSTHPSTEERIAA 226 (226)
T ss_dssp HH------HHHTTS-TTHHHHHHHHHT------------------------------------HHHHHH
T ss_pred HhcCCcHHHHHhCCCHHHHHHHHHHHHHHhccccC-------------CccccchhcCCCcCHHHHhCc
Confidence 998885554 4455555442111111 123467899999999999985
No 16
>cd06470 ACD_IbpA-B_like Alpha-crystallin domain (ACD) found in Escherichia coli inclusion body-associated proteins IbpA and IbpB, and similar proteins. IbpA and IbpB are 16 kDa small heat shock proteins (sHsps). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. IbpA and IbpB are produced during high-level production of various heterologous proteins, specifically human prorenin, renin and bovine insulin-like growth factor 2 (bIGF-2), and are strongly associated with inclusion bodies containing these heterologous proteins. IbpA and IbpB work as an integrated system to stabilize thermally aggregated proteins in a disaggregation competent state. The chaperone activity of IbpB is also significantly elevated as the temperature increases from normal to heat shock. The high temperature results in the disassociation of 2-3-MDa IbpB oligomers into smaller approximately 6
Probab=99.91 E-value=1.3e-23 Score=166.17 Aligned_cols=89 Identities=56% Similarity=0.960 Sum_probs=82.3
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc-CCCceEEEeeeeeeEEEEEECCCccceEeEEee
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT-QQRNFLHRGIATRNFEHRFQLADHVHVVAAFFD 403 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~-~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~ 403 (436)
|++||+++++++|+|.++|||++||||+|+++++.|+|+|+++.+. ++.+|+++|+.+|+|+|+|.||++++..+|+|+
T Consensus 1 p~~di~e~~~~~~~v~~~lPG~~kedi~v~~~~~~L~I~g~~~~~~~~~~~~~~~e~~~g~f~R~~~LP~~vd~~~A~~~ 80 (90)
T cd06470 1 PPYNIEKTGENNYRITLAVAGFSEDDLEIEVENNQLTVTGKKADEENEEREYLHRGIAKRAFERSFNLADHVKVKGAELE 80 (90)
T ss_pred CCeeeEEcCCCeEEEEEECCCCCHHHeEEEEECCEEEEEEEEcccccCCCcEEEEEEeceEEEEEEECCCCceECeeEEe
Confidence 7899995345899999999999999999999999999999988764 567899999999999999999999998899999
Q ss_pred CcEEEEEEec
Q psy5221 404 QGILNIDLIR 413 (436)
Q Consensus 404 ~GvL~i~~pk 413 (436)
||+|+|++|+
T Consensus 81 ~GvL~I~l~~ 90 (90)
T cd06470 81 NGLLTIDLER 90 (90)
T ss_pred CCEEEEEEEC
Confidence 9999999986
No 17
>cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st
Probab=99.89 E-value=1.4e-22 Score=161.18 Aligned_cols=87 Identities=20% Similarity=0.450 Sum_probs=79.5
Q ss_pred ceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEEecc--cCCCceEEEeeeeeeEEEEEECCCccc--eEeE
Q psy5221 326 PYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRKQAD--TQQRNFLHRGIATRNFEHRFQLADHVH--VVAA 400 (436)
Q Consensus 326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~~~~--~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A 400 (436)
++|++ |++++|+|.++|||++||||+|+++++ .|+|+|+++.+ .++.+++++|+.+|+|+|+|.||++++ .++|
T Consensus 1 ~~dv~-E~~~~~~i~~~lPGv~~edi~i~v~~~~~L~I~g~~~~~~~~~~~~~~~~e~~~g~f~r~i~LP~~v~~~~i~A 79 (92)
T cd06472 1 RVDWK-ETPEAHVFKADVPGVKKEDVKVEVEDGRVLRISGERKKEEEKKGDDWHRVERSSGRFVRRFRLPENADADEVKA 79 (92)
T ss_pred CccEE-EcCCeEEEEEECCCCChHhEEEEEeCCCEEEEEEEecccccccCCCEEEEEEeccEEEEEEECCCCCCHHHCEE
Confidence 47999 599999999999999999999999975 99999998754 346789999999999999999999998 4999
Q ss_pred EeeCcEEEEEEec
Q psy5221 401 FFDQGILNIDLIR 413 (436)
Q Consensus 401 ~~~~GvL~i~~pk 413 (436)
+|+||+|+|++||
T Consensus 80 ~~~nGvL~I~lPK 92 (92)
T cd06472 80 FLENGVLTVTVPK 92 (92)
T ss_pred EEECCEEEEEecC
Confidence 9999999999998
No 18
>PF00011 HSP20: Hsp20/alpha crystallin family This prints entry is a subset of the Pfam entry.; InterPro: IPR002068 Prokaryotic and eukaryotic organisms respond to heat shock or other environmental stress by inducing the synthesis of proteins collectively known as heat-shock proteins (hsp) []. Amongst them is a family of proteins with an average molecular weight of 20 Kd, known as the hsp20 proteins []. These seem to act as chaperones that can protect other proteins against heat-induced denaturation and aggregation. Hsp20 proteins seem to form large heterooligomeric aggregates. Structurally, this family is characterised by the presence of a conserved C-terminal domain of about 100 residues.; PDB: 2BOL_B 3N3E_B 2H50_P 2H53_F 2BYU_L 1GME_D 3VQM_J 3VQK_E 3VQL_A 3AAC_A ....
Probab=99.87 E-value=1.4e-21 Score=158.88 Aligned_cols=99 Identities=24% Similarity=0.566 Sum_probs=81.9
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG 405 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G 405 (436)
||. |++++|.|.++|||+++|||+|+++++.|+|+|+++....+..+.++|+.+++|.|+|.||+++| .++|.|+||
T Consensus 1 di~-e~~~~~~i~~~lpG~~~edi~I~~~~~~L~I~g~~~~~~~~~~~~~~~~~~~~f~r~~~lP~~vd~~~i~a~~~~G 79 (102)
T PF00011_consen 1 DIK-EDEDEYIIKVDLPGFDKEDIKIKVDDNKLVISGKRKEEEEDDRYYRSERRYGSFERSIRLPEDVDPDKIKASYENG 79 (102)
T ss_dssp EEE-ESSSEEEEEEE-TTS-GGGEEEEEETTEEEEEEEEEGEECTTCEEEE-S-SEEEEEEEE-STTB-GGG-EEEETTS
T ss_pred CeE-ECCCEEEEEEECCCCChHHEEEEEecCccceeceeeeeeeeeeeeecccccceEEEEEcCCCcCCcceEEEEecCC
Confidence 787 59999999999999999999999999999999999965667788889999999999999999999 499999999
Q ss_pred EEEEEEecCCCCCC-CCeEeeec
Q psy5221 406 ILNIDLIREVPEAL-KPRKIYIN 427 (436)
Q Consensus 406 vL~i~~pk~~~~~~-~~~~I~I~ 427 (436)
+|+|++||..+.+. .+++|+|+
T Consensus 80 vL~I~~pk~~~~~~~~~~~I~I~ 102 (102)
T PF00011_consen 80 VLTITIPKKEEEEDSQPKRIPIK 102 (102)
T ss_dssp EEEEEEEBSSSCTTSSSCEE-ET
T ss_pred EEEEEEEccccccCCCCeEEEeC
Confidence 99999999755443 68999986
No 19
>cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18. Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me
Probab=99.87 E-value=1e-21 Score=156.70 Aligned_cols=87 Identities=21% Similarity=0.489 Sum_probs=79.1
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEeccc----CCCceEEEeeeeeeEEEEEECCCccc--eE
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADT----QQRNFLHRGIATRNFEHRFQLADHVH--VV 398 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~----~~~~~~~~e~~~~~f~r~~~lP~~v~--~~ 398 (436)
|++||+ |++++|+|.++|||++|+||+|+++++.|+|+|+++... .+.+++++|+.+|+|+|+|.|| +++ .+
T Consensus 1 ~~~di~-e~~~~~~i~~~lPGv~~edi~v~~~~~~L~I~g~~~~~~~~~~~~~~~~~~e~~~g~f~r~~~lp-~v~~~~i 78 (93)
T cd06471 1 MKTDIK-ETDDEYIVEADLPGFKKEDIKLDYKDGYLTISAKRDESKDEKDKKGNYIRRERYYGSFSRSFYLP-NVDEEEI 78 (93)
T ss_pred CceeEE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEccccccccccCCEEEEeeeccEEEEEEECC-CCCHHHC
Confidence 479999 599999999999999999999999999999999997532 2357999999999999999999 677 49
Q ss_pred eEEeeCcEEEEEEec
Q psy5221 399 AAFFDQGILNIDLIR 413 (436)
Q Consensus 399 ~A~~~~GvL~i~~pk 413 (436)
+|+|+||+|+|++||
T Consensus 79 ~A~~~dGvL~I~lPK 93 (93)
T cd06471 79 KAKYENGVLKITLPK 93 (93)
T ss_pred EEEEECCEEEEEEcC
Confidence 999999999999998
No 20
>cd06497 ACD_alphaA-crystallin_HspB4 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaA-crystallin (HspB4, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 does not belong to this group. Mutations inHspB4 have been associated with Autosomal Dominant Congenital Cataract (ADCC). The chaperone-like functions of HspB4 are considered important for maintaining lens transparency and preventing cataract.
Probab=99.86 E-value=2.2e-21 Score=151.40 Aligned_cols=80 Identities=20% Similarity=0.495 Sum_probs=72.1
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eC
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQ 404 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~ 404 (436)
+|+ |++++|.|.++||||+||||+|++++|.|+|+|+++++.++.+|.++| |+|+|.||++|| .++|+| +|
T Consensus 4 ~v~-e~~~~~~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~~~e-----f~R~~~LP~~Vd~~~i~A~~~~d 77 (86)
T cd06497 4 EVR-SDRDKFTIYLDVKHFSPEDLTVKVLDDYVEIHGKHSERQDDHGYISRE-----FHRRYRLPSNVDQSAITCSLSAD 77 (86)
T ss_pred eEE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEcceeCCCCEEEEE-----EEEEEECCCCCChHHeEEEeCCC
Confidence 677 599999999999999999999999999999999986655566676654 999999999999 499999 89
Q ss_pred cEEEEEEec
Q psy5221 405 GILNIDLIR 413 (436)
Q Consensus 405 GvL~i~~pk 413 (436)
|+|+|++||
T Consensus 78 GvL~I~~PK 86 (86)
T cd06497 78 GMLTFSGPK 86 (86)
T ss_pred CEEEEEecC
Confidence 999999998
No 21
>cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. HspB5's functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its ol
Probab=99.84 E-value=1.5e-20 Score=145.88 Aligned_cols=79 Identities=20% Similarity=0.533 Sum_probs=69.9
Q ss_pred EEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCc
Q psy5221 329 IELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQG 405 (436)
Q Consensus 329 i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~G 405 (436)
|. +++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.+|+++| |+|+|.||++|| .++|+| +||
T Consensus 2 ~~-~~~~~~~v~~dlpG~~~edI~V~v~~~~L~I~g~~~~~~~~~~~~~~e-----f~R~~~LP~~vd~~~i~A~~~~dG 75 (83)
T cd06478 2 VR-LDKDRFSVNLDVKHFSPEELSVKVLGDFVEIHGKHEERQDEHGFISRE-----FHRRYRLPPGVDPAAITSSLSADG 75 (83)
T ss_pred ee-ecCceEEEEEECCCCCHHHeEEEEECCEEEEEEEEceEcCCCCEEEEE-----EEEEEECCCCcChHHeEEEECCCC
Confidence 44 589999999999999999999999999999999986555556666554 999999999999 499999 699
Q ss_pred EEEEEEec
Q psy5221 406 ILNIDLIR 413 (436)
Q Consensus 406 vL~i~~pk 413 (436)
+|+|++||
T Consensus 76 vL~I~~PK 83 (83)
T cd06478 76 VLTISGPR 83 (83)
T ss_pred EEEEEecC
Confidence 99999998
No 22
>cd06479 ACD_HspB7_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB7, also known as cardiovascular small heat shock protein (cvHsp), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB7 is a 25-kDa protein, preferentially expressed in heart and skeletal muscle. It binds the cytoskeleton protein alpha-filamin (also known as actin-binding protein 280). The expression of HspB7 is increased during rat muscle aging. Its expression is also modulated in obesity implicating this protein in this and related metabolic disorders. As the human gene encoding HspB7 is mapped to chromosome 1p36.23-p34.3 it is a positional candidate for several dystrophies and myopathies.
Probab=99.84 E-value=1.2e-20 Score=144.76 Aligned_cols=77 Identities=22% Similarity=0.454 Sum_probs=69.5
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eC
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQ 404 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~ 404 (436)
||. |++++|.|.++||||+||||+|++++|.|+|+|+++.+.+ ..+|+|+|+|.||++|| .++|.| +|
T Consensus 2 ~v~-e~~~~~~v~~dlpG~~pedi~V~v~~~~L~I~ger~~~~~--------~~~g~F~R~~~LP~~vd~e~v~A~l~~~ 72 (81)
T cd06479 2 NVK-TLGDTYQFAVDVSDFSPEDIIVTTSNNQIEVHAEKLASDG--------TVMNTFTHKCQLPEDVDPTSVSSSLGED 72 (81)
T ss_pred Ccc-CcCCeEEEEEECCCCCHHHeEEEEECCEEEEEEEEeccCC--------CEEEEEEEEEECCCCcCHHHeEEEecCC
Confidence 677 5999999999999999999999999999999999875431 24899999999999999 499997 99
Q ss_pred cEEEEEEec
Q psy5221 405 GILNIDLIR 413 (436)
Q Consensus 405 GvL~i~~pk 413 (436)
|+|+|+++|
T Consensus 73 GvL~I~~~~ 81 (81)
T cd06479 73 GTLTIKARR 81 (81)
T ss_pred CEEEEEecC
Confidence 999999986
No 23
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. Its functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=99.84 E-value=2.2e-20 Score=145.05 Aligned_cols=77 Identities=17% Similarity=0.526 Sum_probs=68.9
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN 408 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~ 408 (436)
+++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.+|++ ++|+|+|.||++|| .++|+|+ ||+|+
T Consensus 4 ~~~~~~~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~~-----~eF~R~~~LP~~vd~~~i~A~~~~dGvL~ 78 (84)
T cd06498 4 LEKDKFSVNLDVKHFSPEELKVKVLGDFIEIHGKHEERQDEHGFIS-----REFQRKYRIPADVDPLTITSSLSPDGVLT 78 (84)
T ss_pred eCCceEEEEEECCCCCHHHeEEEEECCEEEEEEEEcceeCCCCEEE-----EEEEEEEECCCCCChHHcEEEeCCCCEEE
Confidence 4789999999999999999999999999999999876555566654 35999999999999 4999995 99999
Q ss_pred EEEec
Q psy5221 409 IDLIR 413 (436)
Q Consensus 409 i~~pk 413 (436)
|++||
T Consensus 79 I~lPk 83 (84)
T cd06498 79 VCGPR 83 (84)
T ss_pred EEEeC
Confidence 99998
No 24
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.83 E-value=1.8e-19 Score=176.16 Aligned_cols=179 Identities=22% Similarity=0.196 Sum_probs=121.0
Q ss_pred HHHHHHHHHHhcCCC--CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHHHHHHHHHHHhhhcChHHHHHH
Q psy5221 64 LLKTVNKISKNANIS--TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEIKAIIAHEISHISNGDMVTMLL 139 (436)
Q Consensus 64 L~~~l~~l~~~~g~~--~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El~aVLAHElgHi~~~h~~~~~~ 139 (436)
+.+.-+.+++.++.+ +-+++++++ ..|||++. ++.|++++||+-.. ||+|+++|||||+||+..+|..+..-
T Consensus 76 v~~~g~rL~~~a~~~~~~f~f~lV~d~~iNAFA~~----Gg~v~vntGLll~ae~esElagViAHEigHv~qrH~aR~~e 151 (484)
T COG4783 76 VNSLGQRLAAAADLVKTPFTFFLVNDDSINAFATP----GGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLARSME 151 (484)
T ss_pred HHHHHHHHHHhcCCCCCCeEEEEecCCccchhhcC----CceEEEehHHHHhcCCHHHHHHHHHHHHHHHhhhhHHHHHH
Confidence 445555666677776 447889999 99999993 34588999999988 99999999999999999999877553
Q ss_pred HHH--HHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhh
Q psy5221 140 IQG--VINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAK 217 (436)
Q Consensus 140 ~~~--~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~ 217 (436)
-+. ...++...+.+++... . ....+. .+++.- . .....-...|||.+|.+||++|+.
T Consensus 152 ~~~r~~~~~i~~ml~gi~aa~-----a---~~~ag~---a~iag~----~------a~~~~g~L~~sR~~E~eADr~Gi~ 210 (484)
T COG4783 152 QQQRAAPMAIAGMLLGILAAL-----A---GADAGM---AGIAGA----L------AGAAQGQLNFSRQNEQEADRIGIT 210 (484)
T ss_pred HHhhhchhHHHHHHHHHHHHH-----h---CccccH---HHHHHH----H------HHhhhhhhhcchhhHHHHHHHHHH
Confidence 311 0111111111111111 1 111111 111110 0 011112577999999999999998
Q ss_pred hc----CCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHHHhcccccc
Q psy5221 218 LL----GTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSLY 284 (436)
Q Consensus 218 ~~----g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L~~~~~~~~ 284 (436)
.+ .+|.+|.++++|+....... .....|+.|||-+.+||..+++++.+++
T Consensus 211 ~L~raGydp~gM~~ff~rl~~~~~~~-----------------~~~p~yl~THPlp~~RIa~lr~ra~q~p 264 (484)
T COG4783 211 TLVRAGYDPQGMPEFFERLADQLRYG-----------------GQPPEYLLTHPLPEERIADLRNRAEQSP 264 (484)
T ss_pred HHHHcCCCchhHHHHHHHHHHHHhcC-----------------CCCChHHhcCCCchhHHHHHHHHHHhCC
Confidence 76 58999999999998643111 1235689999999999999998888875
No 25
>cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)] is selectively up-regulated in skeletal muscles from myotonic dystrophy patients.
Probab=99.82 E-value=7.4e-20 Score=141.48 Aligned_cols=77 Identities=23% Similarity=0.507 Sum_probs=67.7
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN 408 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~ 408 (436)
.++++|.|.++||||+||||+|++++|.|+|+|+++.+.++.++. .++|+|+|.||++|| .++|.|+ ||+|+
T Consensus 4 ~~~d~y~v~~dlpG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~-----~~eF~R~~~LP~~vd~~~v~A~~~~dGvL~ 78 (83)
T cd06476 4 SEDDKYQVFLDVCHFTPDEITVRTVDNLLEVSARHPQRMDRHGFV-----SREFTRTYILPMDVDPLLVRASLSHDGILC 78 (83)
T ss_pred ccCCeEEEEEEcCCCCHHHeEEEEECCEEEEEEEEcceecCCCEE-----EEEEEEEEECCCCCChhhEEEEecCCCEEE
Confidence 368899999999999999999999999999999986554444454 456999999999999 4999996 99999
Q ss_pred EEEec
Q psy5221 409 IDLIR 413 (436)
Q Consensus 409 i~~pk 413 (436)
|++||
T Consensus 79 I~~Pr 83 (83)
T cd06476 79 IQAPR 83 (83)
T ss_pred EEecC
Confidence 99997
No 26
>cd06475 ACD_HspB1_like Alpha crystallin domain (ACD) found in mammalian small (s)heat shock protein (Hsp)-27 (also denoted HspB1 in human) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Hsp27 shows enhanced synthesis in response to stress. It is a molecular chaperone which interacts with a large number of different proteins. It is found in many types of human cells including breast, uterus, cervix, platelets and cancer cells. Hsp27 has diverse cellular functions including, chaperoning, regulation of actin polymerization, keratinocyte differentiation, regulation of inflammatory pathways in keratinocytes, and protection from oxidative stress through modulating glutathione levels. It is also a subunit of AUF1-containing protein complexes. It has been linked to several transduction pathways regulating cellular functions including differentiat
Probab=99.82 E-value=8.3e-20 Score=142.53 Aligned_cols=80 Identities=20% Similarity=0.511 Sum_probs=70.8
Q ss_pred eeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-
Q psy5221 327 YNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD- 403 (436)
Q Consensus 327 ~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~- 403 (436)
.||+ |++++|.|.++|||++||||+|+++++.|+|+|+++.+.++..+. .++|+|+|+||++|| .++|.|+
T Consensus 3 ~~i~-e~~~~~~v~~dlPG~~~edi~V~v~~~~L~I~g~~~~~~~~~~~~-----~~~f~R~f~LP~~vd~~~v~A~~~~ 76 (86)
T cd06475 3 SEIR-QTADRWKVSLDVNHFAPEELVVKTKDGVVEITGKHEEKQDEHGFV-----SRCFTRKYTLPPGVDPTAVTSSLSP 76 (86)
T ss_pred ceEE-EcCCeEEEEEECCCCCHHHEEEEEECCEEEEEEEECcCcCCCCEE-----EEEEEEEEECCCCCCHHHcEEEECC
Confidence 4898 599999999999999999999999999999999987654444442 458999999999999 4999997
Q ss_pred CcEEEEEEe
Q psy5221 404 QGILNIDLI 412 (436)
Q Consensus 404 ~GvL~i~~p 412 (436)
||+|+|++|
T Consensus 77 dGvL~I~lP 85 (86)
T cd06475 77 DGILTVEAP 85 (86)
T ss_pred CCeEEEEec
Confidence 999999998
No 27
>cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues. In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8).
Probab=99.81 E-value=2.2e-19 Score=138.30 Aligned_cols=76 Identities=24% Similarity=0.461 Sum_probs=67.5
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCcEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQGILN 408 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~GvL~ 408 (436)
|++++|+|.++||||+||||+|+++++.|+|+|+++.+.++.++. .++|+|+|.||++|+ .++|+| +||||+
T Consensus 4 e~~~~~~v~~dlpG~~~edI~V~v~~~~L~I~ge~~~~~~~~~~~-----~r~F~R~~~LP~~Vd~~~v~A~~~~dGvL~ 78 (83)
T cd06477 4 EGKPMFQILLDVVQFRPEDIIIQVFEGWLLIKGQHGVRMDEHGFI-----SRSFTRQYQLPDGVEHKDLSAMLCHDGILV 78 (83)
T ss_pred cCCceEEEEEEcCCCCHHHeEEEEECCEEEEEEEEccccCCCCEE-----EEEEEEEEECCCCcchheEEEEEcCCCEEE
Confidence 589999999999999999999999999999999987755445553 349999999999999 499997 899999
Q ss_pred EEEe
Q psy5221 409 IDLI 412 (436)
Q Consensus 409 i~~p 412 (436)
|+.|
T Consensus 79 I~~~ 82 (83)
T cd06477 79 VETK 82 (83)
T ss_pred EEec
Confidence 9976
No 28
>cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role.
Probab=99.81 E-value=2.2e-19 Score=139.52 Aligned_cols=79 Identities=13% Similarity=0.282 Sum_probs=69.4
Q ss_pred cCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc-EEEE
Q psy5221 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG-ILNI 409 (436)
Q Consensus 333 ~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G-vL~i 409 (436)
++++|+|.+||||++||||+|++++|.|+|+|+++.+.++.. .+||.+|+|+|+|.||++|| .++|+|+|| +|+|
T Consensus 6 ~~~~~~v~adlPG~~kedI~V~v~~~~L~I~ger~~~~e~~~--~~er~~g~F~R~f~LP~~Vd~d~i~A~~~~~~~l~i 83 (87)
T cd06482 6 DSSNVLASVDVCGFEPDQVKVKVKDGKVQVSAERENRYDCLG--SKKYSYMNICKEFSLPPGVDEKDVTYSYGLGSVVKI 83 (87)
T ss_pred cCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEecccccCC--ccEEEEEEEEEEEECCCCcChHHcEEEEcCCCEEEE
Confidence 578999999999999999999999999999999976543222 24789999999999999999 499999987 9999
Q ss_pred EEec
Q psy5221 410 DLIR 413 (436)
Q Consensus 410 ~~pk 413 (436)
.-|.
T Consensus 84 ~~~~ 87 (87)
T cd06482 84 ETPC 87 (87)
T ss_pred eeCC
Confidence 9874
No 29
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9 interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=99.80 E-value=3.1e-19 Score=139.66 Aligned_cols=79 Identities=23% Similarity=0.462 Sum_probs=69.2
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC--CCceEEEeeeeeeEEEEEECCCccc--eEeEEe-eCcE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ--QRNFLHRGIATRNFEHRFQLADHVH--VVAAFF-DQGI 406 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~--~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~-~~Gv 406 (436)
|.+++|.|.++||||+||||+|++++|.|+|+|+++.+.+ ..+|. +.+|+|+|+|.||++|| .++|.| +||+
T Consensus 4 ~~~d~~~v~~dlpG~~~edI~V~v~~~~L~I~g~~~~~~~~~~~~~~---~~~~~F~R~~~LP~~Vd~~~i~A~~~~dGv 80 (87)
T cd06481 4 DGKEGFSLKLDVRGFSPEDLSVRVDGRKLVVTGKREKKNEDEKGSFS---YEYQEFVREAQLPEHVDPEAVTCSLSPSGH 80 (87)
T ss_pred CccceEEEEEECCCCChHHeEEEEECCEEEEEEEEeeecccCCCcEE---EEeeEEEEEEECCCCcChHHeEEEeCCCce
Confidence 4678999999999999999999999999999999875432 24443 35899999999999999 499999 9999
Q ss_pred EEEEEec
Q psy5221 407 LNIDLIR 413 (436)
Q Consensus 407 L~i~~pk 413 (436)
|+|++|+
T Consensus 81 L~I~~P~ 87 (87)
T cd06481 81 LHIRAPR 87 (87)
T ss_pred EEEEcCC
Confidence 9999996
No 30
>cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=99.77 E-value=2e-18 Score=134.42 Aligned_cols=75 Identities=24% Similarity=0.610 Sum_probs=66.9
Q ss_pred CceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC-cEEEEE
Q psy5221 334 ENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ-GILNID 410 (436)
Q Consensus 334 ~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~-GvL~i~ 410 (436)
+++|.|.++||||+||||+|+++++.|+|+|+++.+.++. ++.+++|+|+|.||++||. ++|+|+| |+|+|+
T Consensus 6 ~~~~~v~~dlpG~~~edI~v~v~~~~L~I~g~~~~~~~~~-----~~~~~~f~r~~~LP~~vd~~~i~A~~~~~GvL~I~ 80 (83)
T cd06526 6 DEKFQVTLDVKGFKPEELKVKVSDNKLVVEGKHEEREDEH-----GYVSREFTRRYQLPEGVDPDSVTSSLSSDGVLTIE 80 (83)
T ss_pred CeeEEEEEECCCCCHHHcEEEEECCEEEEEEEEeeeccCC-----CEEEEEEEEEEECCCCCChHHeEEEeCCCcEEEEE
Confidence 4599999999999999999999999999999987654422 3457899999999999994 9999998 999999
Q ss_pred Eec
Q psy5221 411 LIR 413 (436)
Q Consensus 411 ~pk 413 (436)
+||
T Consensus 81 ~Pk 83 (83)
T cd06526 81 APK 83 (83)
T ss_pred ecC
Confidence 997
No 31
>cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=99.74 E-value=1.9e-17 Score=130.52 Aligned_cols=85 Identities=25% Similarity=0.585 Sum_probs=76.9
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccC-CCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ-QRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ 404 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~-~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~ 404 (436)
|++ |++++|.+.++|||++++||+|+++++.|.|+|+++.... ...+...++.+|.|+|+|.||..+|. ++|.|+|
T Consensus 1 ~i~-e~~~~~~i~~~lpg~~~~~i~V~v~~~~l~I~g~~~~~~~~~~~~~~~~~~~~~f~r~~~LP~~vd~~~i~a~~~~ 79 (88)
T cd06464 1 DVY-ETDDAYVVEADLPGFKKEDIKVEVEDGVLTISGEREEEEEEEENYLRRERSYGSFSRSFRLPEDVDPDKIKASLEN 79 (88)
T ss_pred CcE-EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEecccccCCcEEEEEEeCcEEEEEEECCCCcCHHHcEEEEeC
Confidence 466 4889999999999999999999999999999999987643 33678888999999999999999994 9999999
Q ss_pred cEEEEEEec
Q psy5221 405 GILNIDLIR 413 (436)
Q Consensus 405 GvL~i~~pk 413 (436)
|+|+|++||
T Consensus 80 G~L~I~~pk 88 (88)
T cd06464 80 GVLTITLPK 88 (88)
T ss_pred CEEEEEEcC
Confidence 999999997
No 32
>KOG2661|consensus
Probab=99.73 E-value=3.8e-17 Score=151.48 Aligned_cols=187 Identities=16% Similarity=0.140 Sum_probs=123.4
Q ss_pred hCCCEEecCCCCcchhHHHHHHHHHHHhcC----CC--CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHH
Q psy5221 47 SMNLMVIKNPSNFTEDWLLKTVNKISKNAN----IS--TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEI 118 (436)
Q Consensus 47 ~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g----~~--~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El 118 (436)
.++.+.++ +..+..-.+.+++.++.+..+ +. .-++.|+++ .+|||+.+ ++. |++-+|+|..+ +++++
T Consensus 201 eyq~~mLp-~~hp~sl~V~~vlk~iIea~~~~~slsgIkWeihVVndPipNAFvLP---gGK-vfVFtgiLn~ck~ddgl 275 (424)
T KOG2661|consen 201 EYQNDMLP-EKHPRSLAVKEVLKHIIEANKDVPSLSGIKWEIHVVNDPIPNAFVLP---GGK-VFVFTGILNSCKDDDGL 275 (424)
T ss_pred HhcCccCC-cCCchhhHHHHHHHHHHHHhccCCcccCceeEEEEecCCCCceeecc---CCe-EEEEechhhcccChHHH
Confidence 35555555 333444446677777777666 33 237889999 99999995 345 44555799999 99999
Q ss_pred HHHHHHHHHhhhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHH
Q psy5221 119 KAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASI 198 (436)
Q Consensus 119 ~aVLAHElgHi~~~h~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (436)
++|||||+||-..||.....-..-++. ++..+++.+.... ...- -.....
T Consensus 276 AtvLgHE~aHaVarH~AEki~k~~~~s----iLgLvlyt~~~a~------------------------~~n~--~Ll~~f 325 (424)
T KOG2661|consen 276 ATVLGHEIAHAVARHAAEKIGKVHLLS----ILGLVLYTMIWAI------------------------CPND--KLLEYF 325 (424)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhhHHHH----HHHHHHHHHHhhc------------------------cchH--HHHHHH
Confidence 999999999999999877543322211 1111111111000 0000 011224
Q ss_pred HHHHHhhHHHHHhcHhhhhh----cCCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHH
Q psy5221 199 VVAWFSRNREFRADHGSAKL----LGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIA 274 (436)
Q Consensus 199 ~~~~~SR~~E~~AD~~a~~~----~g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~ 274 (436)
+.+.+||.+|.|||.+|.-+ |.+|++....+++|.....+... .....|.||||+..+||+
T Consensus 326 lrlPfSRKMEtEADyIGLlLma~Acfdpras~tvwErM~~~egqmg~---------------~~~~eflSTHPSskkRie 390 (424)
T KOG2661|consen 326 LRLPFSRKMETEADYIGLLLMAKACFDPRASSTVWERMEFVEGQMGQ---------------PKMPEFLSTHPSSKKRIE 390 (424)
T ss_pred hcCcchhhhhhhhhHHHHHHHHHhhcCcccchHHHHHHHHhhhhcCC---------------CCCchhhhcCCCccchhH
Confidence 56889999999999998755 46889999999998775422211 124678999999999999
Q ss_pred HHHhccccc
Q psy5221 275 ILSLLKLSL 283 (436)
Q Consensus 275 ~L~~~~~~~ 283 (436)
.+.++-.+.
T Consensus 391 ~~~~lLpqA 399 (424)
T KOG2661|consen 391 YLDRLLPQA 399 (424)
T ss_pred HHHHhchHH
Confidence 998765554
No 33
>COG4784 Putative Zn-dependent protease [General function prediction only]
Probab=99.71 E-value=1.7e-16 Score=147.76 Aligned_cols=178 Identities=24% Similarity=0.302 Sum_probs=123.1
Q ss_pred HHHHHHHHHHHhcCCC------CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhc-CHHHHHHHHHHHHHhhhcChH
Q psy5221 63 WLLKTVNKISKNANIS------TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESM-TKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 63 ~L~~~l~~l~~~~g~~------~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l-~~~El~aVLAHElgHi~~~h~ 134 (436)
.|.+++.++..++-.. ..++.|+|+ .+|||+.+ +++++++.||+..+ |+.|++|||+||+||+..+|.
T Consensus 65 Kler~Vari~g~lt~~S~~p~q~YriTilnSP~INAFALP----GGYlYitRGLlAland~sEvAAVl~HEmgHVtAnHg 140 (479)
T COG4784 65 KLERMVARIVGALTAVSENPQQTYRITILNSPNINAFALP----GGYLYITRGLLALANDSSEVAAVLAHEMGHVTANHG 140 (479)
T ss_pred HHHHHHHHHHhHhhhhccCCCceEEEEEecCCCccccccC----CceEEEehhHHHHcCCHHHHHHHHHhhhhheecchh
Confidence 3777777777765432 236789999 99999995 56799999999999 889999999999999999998
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHh
Q psy5221 135 VTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHG 214 (436)
Q Consensus 135 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~ 214 (436)
....-.. .-..+...+....+. ....+. .......+-...|||.+|.+||.+
T Consensus 141 i~rQ~~e----~a~~ia~rvva~vl~-------~~~agk-----------------~A~~rGklrla~fsRnqELqAD~i 192 (479)
T COG4784 141 IQRQQRE----AAEVIASRVVAEVLG-------SDAAGK-----------------QALIRGKLRLAQFSRNQELQADAI 192 (479)
T ss_pred HHHHHHH----HHHHHHHHHHHHHhC-------Ccchhh-----------------HHHhhhhHHHhhhccchhhhhhhh
Confidence 7654432 111222222222111 111111 111223345688999999999999
Q ss_pred hhhhc----CCchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHHHhccccc
Q psy5221 215 SAKLL----GTPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSL 283 (436)
Q Consensus 215 a~~~~----g~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L~~~~~~~ 283 (436)
++++. .+|-++++.|..|+.+..-.. .++.... -..+++|||.+.+||+.....+.+.
T Consensus 193 G~~~lgeAGYDP~A~~rfl~sm~ay~~F~s----------~~g~adq-sldfl~sHPntpqRiqla~~hARq~ 254 (479)
T COG4784 193 GIKMLGEAGYDPYAAARFLQSMAAYTDFRS----------VSGAADQ-SLDFLASHPNTPQRIQLARRHARQF 254 (479)
T ss_pred hHHHHHhcCCChHHHHHHHHHHHhhhhhcc----------cCcchhh-hcchhhcCCCChHHHHHHHHHHHhh
Confidence 99987 478999999999988753211 1111011 2568999999999999888766654
No 34
>cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II.
Probab=99.69 E-value=1.9e-16 Score=123.78 Aligned_cols=77 Identities=18% Similarity=0.459 Sum_probs=68.2
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee-CcEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD-QGILN 408 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~-~GvL~ 408 (436)
+++++|.|.+++.||+||||+|++.++.|+|+|+++.+.++.++. .++|+|+|.||++|| .++|.+. ||+|+
T Consensus 12 ~~~~~f~v~ldv~gF~pEDL~Vkv~~~~L~V~Gkh~~~~~e~g~~-----~r~F~R~~~LP~~Vd~~~v~s~l~~dGvL~ 86 (91)
T cd06480 12 NSSEPWKVCVNVHSFKPEELTVKTKDGFVEVSGKHEEQQKEGGIV-----SKNFTKKIQLPPEVDPVTVFASLSPEGLLI 86 (91)
T ss_pred CCCCcEEEEEEeCCCCHHHcEEEEECCEEEEEEEECcccCCCCEE-----EEEEEEEEECCCCCCchhEEEEeCCCCeEE
Confidence 467899999999999999999999999999999998765444443 589999999999999 3999998 99999
Q ss_pred EEEec
Q psy5221 409 IDLIR 413 (436)
Q Consensus 409 i~~pk 413 (436)
|++|.
T Consensus 87 IeaP~ 91 (91)
T cd06480 87 IEAPQ 91 (91)
T ss_pred EEcCC
Confidence 99983
No 35
>KOG0710|consensus
Probab=99.60 E-value=1.7e-15 Score=136.74 Aligned_cols=106 Identities=21% Similarity=0.476 Sum_probs=90.8
Q ss_pred CCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEEeccc----CCCceEEEeeeeeeEEEEEECCCcc
Q psy5221 321 QQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRKQADT----QQRNFLHRGIATRNFEHRFQLADHV 395 (436)
Q Consensus 321 ~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~~~~~----~~~~~~~~e~~~~~f~r~~~lP~~v 395 (436)
....++.||. |++++|++.+++||++++|++|+++++ +|+|+|+++.+. +...+...|+..|.|.|+|.||+++
T Consensus 81 ~~~~~~~~v~-e~~~~~~~~~~~Pgl~ke~iKv~~~~~~~l~isGe~~~e~e~~~~~~~~~~~E~~~g~F~r~~~lPenv 159 (196)
T KOG0710|consen 81 SEARVPWDVK-ESPDAHEFKVDLPGLKKEDIKVEVEDEKVLTISGERKKEEEESGSGKKWKRVERKLGKFKRRFELPENV 159 (196)
T ss_pred ccccCCcccc-cCCCceEEEeeCCCCCchhceEEeccCcEEEEecccccccccccCCccceeehhcccceEeeecCCccc
Confidence 3456788998 599999999999999999999999988 799999988763 3456788899999999999999999
Q ss_pred ce--EeEEeeCcEEEEEEecCCC--CCCCCeEeeec
Q psy5221 396 HV--VAAFFDQGILNIDLIREVP--EALKPRKIYIN 427 (436)
Q Consensus 396 ~~--~~A~~~~GvL~i~~pk~~~--~~~~~~~I~I~ 427 (436)
+. |+|.|+||||+|++||..+ +..+.+.|+|+
T Consensus 160 ~~d~ikA~~~nGVL~VvvpK~~~~~~~~~v~~i~i~ 195 (196)
T KOG0710|consen 160 DVDEIKAEMENGVLTVVVPKLEPLLKKPKVRQIAIS 195 (196)
T ss_pred cHHHHHHHhhCCeEEEEEecccccccCCccceeecc
Confidence 83 9999999999999999766 34555666654
No 36
>KOG3591|consensus
Probab=99.58 E-value=1.5e-14 Score=127.02 Aligned_cols=101 Identities=21% Similarity=0.494 Sum_probs=84.9
Q ss_pred ceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe-
Q psy5221 326 PYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF- 402 (436)
Q Consensus 326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~- 402 (436)
..++. .++++|.|.+|+..|+||||+|++.|+.|.|+|++.+..++.++. .++|.|+|.||++||. |++.+
T Consensus 64 ~~~~~-~~~~~F~V~lDV~~F~PeEl~Vk~~~~~l~V~gkHeer~d~~G~v-----~R~F~R~y~LP~~vdp~~V~S~LS 137 (173)
T KOG3591|consen 64 ASEIV-NDKDKFEVNLDVHQFKPEELKVKTDDNTLEVEGKHEEKEDEHGYV-----SRSFVRKYLLPEDVDPTSVTSTLS 137 (173)
T ss_pred ccccc-cCCCcEEEEEEcccCcccceEEEeCCCEEEEEeeeccccCCCCeE-----EEEEEEEecCCCCCChhheEEeeC
Confidence 45665 588999999999999999999999999999999998887777776 6689999999999993 77777
Q ss_pred eCcEEEEEEecCCCCCCCCeEeeecCCCCC
Q psy5221 403 DQGILNIDLIREVPEALKPRKIYINTTKNT 432 (436)
Q Consensus 403 ~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~ 432 (436)
.||+|+|+.||..+.....|.|+|+..+..
T Consensus 138 ~dGvLtI~ap~~~~~~~~er~ipI~~~~~~ 167 (173)
T KOG3591|consen 138 SDGVLTIEAPKPPPKQDNERSIPIEQVGPS 167 (173)
T ss_pred CCceEEEEccCCCCcCccceEEeEeecCcc
Confidence 589999999995443335899999865443
No 37
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=99.47 E-value=2.8e-12 Score=124.55 Aligned_cols=154 Identities=19% Similarity=0.303 Sum_probs=110.4
Q ss_pred HHHHHHHHHHhcCCCC-CeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHH
Q psy5221 64 LLKTVNKISKNANIST-PEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQG 142 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~-p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~ 142 (436)
+.+.+++++.+.|++. .++++.+.....+++|+. ++.|++..++++.++++|+.+|++||+.|++++|.....+...
T Consensus 143 ~~~~l~~~~~~~~~~~~~~i~~s~~i~sP~~~G~~--~p~I~lP~~~~~~~~~~el~~il~HEl~Hikr~D~~~~~l~~l 220 (299)
T PF05569_consen 143 LQALLEECKEELGIKRPIRIRVSSGISSPFVFGFL--RPVIVLPESLLEDLSEEELRAILLHELAHIKRRDLLWKLLAEL 220 (299)
T ss_pred HHHHHHHHHHHhCCCCceEEEEcCCCCCCeeecCc--ceEEEecCccccccCHHHHHHHHHHHHHHHHCCChHHHHHHHH
Confidence 7899999999999874 455554546678899976 5679999999999999999999999999999999877544321
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhc--C
Q psy5221 143 VINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLL--G 220 (436)
Q Consensus 143 ~~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~--g 220 (436)
+ . .+.| +..++.++...+.+.+|+.||+.+++.+ +
T Consensus 221 ~--~-------~l~W----------------------------------fnP~~~~~~~~~~~~~E~~cD~~vl~~l~~~ 257 (299)
T PF05569_consen 221 L--C-------ALHW----------------------------------FNPLVWLLRRRIRRDRELACDEAVLRNLGKE 257 (299)
T ss_pred H--H-------HHHH----------------------------------hhHHHHHHHHHHHHHHHHhhhHHHHHhcCch
Confidence 0 0 1111 1122335668889999999999999994 4
Q ss_pred CchHHHHHHHHhhcccccCCChhhhhhhcccCcccchhhhhhhccCCChHHHHHHH
Q psy5221 221 TPDPMISALRHLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276 (436)
Q Consensus 221 ~~~~l~~aL~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tHP~~~~RI~~L 276 (436)
+..+.+++|.++.........+. ....+..+.-.+.+||..|
T Consensus 258 ~~~~Y~~~Ll~~~~~~~~~~~~~--------------~~~~~~~~~~~lk~RI~~I 299 (299)
T PF05569_consen 258 ERKAYAETLLKVAKRSQQFKRPP--------------LASSFAFSKSQLKRRIKMI 299 (299)
T ss_pred hHHHHHHHHHHHHHhhcCCCcch--------------hhhhccCChHHHHHHHHhC
Confidence 55788999999987753311111 1122334677788888753
No 38
>cd00298 ACD_sHsps_p23-like This domain family includes the alpha-crystallin domain (ACD) of alpha-crystallin-type small heat shock proteins (sHsps) and a similar domain found in p23-like proteins. sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is this ACD. sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps. p23 is a cochaperone of the Hsp90 chaperoning pathway. It binds Hsp90 and participates in the folding of a number of Hsp90 clients including the progesterone receptor. p23 also has a passive chaperoning activity. p23 in addition may act as the cytosolic prostaglandin E2 synthase. Included in this family is the p23-like C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1) and the p23-like domains of human butyrate-induced transcript 1 (hB-ind
Probab=99.29 E-value=2.5e-11 Score=92.80 Aligned_cols=76 Identities=25% Similarity=0.554 Sum_probs=68.5
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCcEEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQGILNI 409 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~GvL~i 409 (436)
++++.|.+.+++||+.+++++|+++++.|.|+|++....+ .+...+.|.+.+.||..++. ++|.+.+|+|+|
T Consensus 3 q~~~~v~i~i~~~~~~~~~i~v~~~~~~l~v~~~~~~~~~------~~~~~~~~~~~~~L~~~i~~~~~~~~~~~~~l~i 76 (80)
T cd00298 3 QTDDEVVVTVDLPGVKKEDIKVEVEDNVLTISGKREEEEE------RERSYGEFERSFELPEDVDPEKSKASLENGVLEI 76 (80)
T ss_pred EcCCEEEEEEECCCCCHHHeEEEEECCEEEEEEEEcCCCc------ceEeeeeEEEEEECCCCcCHHHCEEEEECCEEEE
Confidence 4788999999999999999999999999999999875443 45567899999999999994 999999999999
Q ss_pred EEec
Q psy5221 410 DLIR 413 (436)
Q Consensus 410 ~~pk 413 (436)
++||
T Consensus 77 ~l~K 80 (80)
T cd00298 77 TLPK 80 (80)
T ss_pred EEcC
Confidence 9997
No 39
>cd06469 p23_DYX1C1_like p23_like domain found in proteins similar to dyslexia susceptibility 1 (DYX1) candidate 1 (C1) protein, DYX1C1. The human gene encoding this protein is a positional candidate gene for developmental dyslexia (DD), it is located on 15q21.3 by the DYX1 DD susceptibility locus (15q15-21). Independent association studies have reported conflicting results. However, association of short-term memory, which plays a role in DD, with a variant within the DYX1C1 gene has been reported. Most proteins belonging to this group contain a C-terminal tetratricopeptide repeat (TPR) protein binding region.
Probab=98.93 E-value=7.4e-09 Score=79.43 Aligned_cols=65 Identities=17% Similarity=0.320 Sum_probs=59.3
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCcEEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQGILNI 409 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~GvL~i 409 (436)
++++.+.|.+++||++++|++|+++++.|.|+| ..|.+.+.||..++ ..+|++++|.|.|
T Consensus 3 Qt~~~v~i~i~~p~v~~~~v~v~~~~~~l~i~~------------------~~~~~~~~l~~~I~~e~~~~~~~~~~l~i 64 (78)
T cd06469 3 QTDEDVKISVPLKGVKTSKVDIFCSDLYLKVNF------------------PPYLFELDLAAPIDDEKSSAKIGNGVLVF 64 (78)
T ss_pred ccCCEEEEEEEeCCCccccceEEEecCEEEEcC------------------CCEEEEEeCcccccccccEEEEeCCEEEE
Confidence 378899999999999999999999999999986 14788999999999 4999999999999
Q ss_pred EEecC
Q psy5221 410 DLIRE 414 (436)
Q Consensus 410 ~~pk~ 414 (436)
++||.
T Consensus 65 ~L~K~ 69 (78)
T cd06469 65 TLVKK 69 (78)
T ss_pred EEEeC
Confidence 99994
No 40
>COG4219 MecR1 Antirepressor regulating drug resistance, predicted signal transduction N-terminal membrane component [Transcription / Signal transduction mechanisms]
Probab=98.88 E-value=3.6e-07 Score=86.09 Aligned_cols=125 Identities=18% Similarity=0.241 Sum_probs=86.6
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHHHHHHH
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGV 143 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~~~~~~ 143 (436)
..+.+..+..+.....+.+..-+...+-.++|.. +..|++-+-+.+.++++|++-|++||++|+++||...+.+..
T Consensus 138 ~~~~~~~~~~~~~~k~i~ir~s~~i~~P~v~gl~--kp~IvlP~d~~~r~~~ee~~yIilHEl~Hlk~gD~i~n~i~~-- 213 (337)
T COG4219 138 KRKIVTILKNHQYKKHILIRKSKAIDGPMVFGLV--KPCIVLPADFVERLTDEELKYIILHELSHLKRGDAIINLIVV-- 213 (337)
T ss_pred HHHHHHHHHHhhhccCeeEeecccCCCceeeccC--cceEEccHHHHhhcCHHhhhhhHhHHHhhhhcccHHHHHHHH--
Confidence 5778888888888776554444436677778865 556888888999999999999999999999999987644431
Q ss_pred HHHHHHHHHHHHHHHHHhhhhccCCCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcHhhhhhcCCc-
Q psy5221 144 INTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTP- 222 (436)
Q Consensus 144 ~~~~~~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~SR~~E~~AD~~a~~~~g~~- 222 (436)
.+. ++.| + ..+..+....+--.+|..||+.+.... +|
T Consensus 214 ------~~~-~l~W------f----------------------------NP~v~l~~~~~~~D~E~aCDa~vL~~~-~~~ 251 (337)
T COG4219 214 ------VLG-VLFW------F----------------------------NPLVHLGKRKIRIDQEIACDAAVLARI-NPE 251 (337)
T ss_pred ------HHh-HHhh------c----------------------------ChHHHHHHHHHHhhHHHHhhHHHHhcc-ChH
Confidence 111 1111 1 112234556777889999999999874 33
Q ss_pred --hHHHHHHHHhhc
Q psy5221 223 --DPMISALRHLSN 234 (436)
Q Consensus 223 --~~l~~aL~kl~~ 234 (436)
...++++.++..
T Consensus 252 err~YaEsil~~l~ 265 (337)
T COG4219 252 ERRTYAESILKLLL 265 (337)
T ss_pred HHHHHHHHHHHHHh
Confidence 445566666444
No 41
>PF05455 GvpH: GvpH; InterPro: IPR008633 This family consists of archaeal GvpH proteins which are thought to be involved in gas vesicle synthesis [].
Probab=98.76 E-value=6.3e-08 Score=83.99 Aligned_cols=74 Identities=19% Similarity=0.333 Sum_probs=57.7
Q ss_pred CCCceeEEEecCc-eEEEEEecCCCCCCc-eEEEEECC--EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc-e
Q psy5221 323 SYPPYNIELVDEN-KYRITMAIAGFNRSD-LEIETERD--TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH-V 397 (436)
Q Consensus 323 ~~p~~~i~~e~~~-~~~v~~~lpG~~~ed-i~v~~~~~--~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~-~ 397 (436)
..+-+++.+ .++ .++|.+||||++++| |+|.++.+ .|+|+. .+.+.+++.||...- .
T Consensus 90 ~~~~vdtre-~dDge~~VvAdLPGVs~dd~idV~l~~d~~~L~i~~-----------------~~~~~krv~L~~~~~e~ 151 (177)
T PF05455_consen 90 ESIHVDTRE-RDDGELVVVADLPGVSDDDAIDVTLDDDEGALTIRV-----------------GEKYLKRVALPWPDPEI 151 (177)
T ss_pred ceeeeeeEe-cCCCcEEEEEeCCCCCcccceeeEeecCCceEEEec-----------------CCceEeeEecCCCccce
Confidence 346678874 455 799999999999998 99999854 666652 223668899997633 5
Q ss_pred EeEEeeCcEEEEEEecC
Q psy5221 398 VAAFFDQGILNIDLIRE 414 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~ 414 (436)
+++.|+||||+|++-|.
T Consensus 152 ~~~t~nNgILEIri~~~ 168 (177)
T PF05455_consen 152 TSATFNNGILEIRIRRT 168 (177)
T ss_pred eeEEEeCceEEEEEeec
Confidence 89999999999999883
No 42
>cd06463 p23_like Proteins containing this p23_like domain include p23 and its Saccharomyces cerevisiae (Sc) homolog Sba1. Both are co-chaperones for the heat shock protein (Hsp) 90. p23 binds Hsp90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis. Both p23 and Sba1p can regulate telomerase activity. This group includes domains similar to the C-terminal CHORD-SGT1 (CS) domain of suppressor of G2 allele of Skp1 (Sgt1). Sgt1 interacts with multiple protein complexes and has the features of a co-chaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain. Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants. This group also includes the p23_like domains of
Probab=98.50 E-value=9.8e-07 Score=68.14 Aligned_cols=70 Identities=11% Similarity=0.192 Sum_probs=61.3
Q ss_pred ecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCcEEEE
Q psy5221 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQGILNI 409 (436)
Q Consensus 332 e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~GvL~i 409 (436)
++++.+.|.+.+||..+++++|.++++.|+|++... + .+.|...+.|+..++. .++++++|.|.|
T Consensus 3 Q~~~~v~i~v~~~~~~~~~~~v~~~~~~l~i~~~~~---~----------~~~~~~~~~L~~~I~~~~s~~~~~~~~l~i 69 (84)
T cd06463 3 QTLDEVTITIPLKDVTKKDVKVEFTPKSLTVSVKGG---G----------GKEYLLEGELFGPIDPEESKWTVEDRKIEI 69 (84)
T ss_pred ccccEEEEEEEcCCCCccceEEEEecCEEEEEeeCC---C----------CCceEEeeEccCccchhhcEEEEeCCEEEE
Confidence 378899999999999999999999999999998753 1 2457788899999994 999999999999
Q ss_pred EEecC
Q psy5221 410 DLIRE 414 (436)
Q Consensus 410 ~~pk~ 414 (436)
+++|.
T Consensus 70 ~L~K~ 74 (84)
T cd06463 70 TLKKK 74 (84)
T ss_pred EEEEC
Confidence 99995
No 43
>cd06466 p23_CS_SGT1_like p23_like domain similar to the C-terminal CHORD-SGT1 (CS) domain of Sgt1 (suppressor of G2 allele of Skp1). Sgt1 interacts with multiple protein complexes and has the features of a cochaperone. Human (h) Sgt1 interacts with both Hsp70 and Hsp90, and has been shown to bind Hsp90 through its CS domain. Saccharomyces cerevisiae (Sc) Sgt1 is a subunit of both core kinetochore and SCF (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is required for pathogen resistance in plants. ScSgt1 is needed for the G1/S and G2/M cell-cycle transitions, and for assembly of the core kinetochore complex (CBF3) via activation of Ctf13, the F-box protein. Binding of Hsp82 (a yeast Hsp90 homologue) to ScSgt1, promotes the binding of Sgt1 to Skp1 and of Skp1 to Ctf13. Some proteins in this group have an SGT1-specific (SGS) domain at the extreme C-terminus. The ScSgt1-SGS domain binds adenylate cyclase. The hSgt1-SGS domain interacts with some S100 family proteins, and studies sug
Probab=98.12 E-value=1.4e-05 Score=61.97 Aligned_cols=73 Identities=10% Similarity=0.151 Sum_probs=61.5
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCc
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQG 405 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~G 405 (436)
|.++ +++.+.|.+.+||+.++|++|.++++.|.|++... + .+.|...+.|+..|+. .++.+.+|
T Consensus 1 dW~Q-t~~~v~i~v~~~~~~~~~v~v~~~~~~l~i~~~~~---~----------~~~~~~~~~L~~~I~~~~s~~~~~~~ 66 (84)
T cd06466 1 DWYQ-TDTSVTVTIYAKNVDKEDVKVEFNEQSLSVSIILP---G----------GSEYQLELDLFGPIDPEQSKVSVLPT 66 (84)
T ss_pred Cccc-cCCEEEEEEEECCCCHHHCEEEEecCEEEEEEECC---C----------CCeEEEecccccccCchhcEEEEeCe
Confidence 3454 88999999999999999999999999999986642 0 2357778889999884 88999999
Q ss_pred EEEEEEecC
Q psy5221 406 ILNIDLIRE 414 (436)
Q Consensus 406 vL~i~~pk~ 414 (436)
.+.|+++|.
T Consensus 67 ~vei~L~K~ 75 (84)
T cd06466 67 KVEITLKKA 75 (84)
T ss_pred EEEEEEEcC
Confidence 999999994
No 44
>PF04969 CS: CS domain; InterPro: IPR017447 The function of the CS domain is unknown. The CS domain is sometimes found C-terminal to the CHORD domain (IPR007051 from INTERPRO) in metazoan proteins, but occurs separately from the CHORD domain in plants. This association is thought to be indicative of an functional interaction between CS and CHORD domains [].; PDB: 1WGV_A 2KMW_A 2O30_B 1WH0_A 1EJF_A 2RH0_B 1RL1_A 2CR0_A 1WFI_A 2XCM_D ....
Probab=97.67 E-value=0.001 Score=50.48 Aligned_cols=75 Identities=11% Similarity=0.258 Sum_probs=59.9
Q ss_pred CceeEEEecCceEEEEEecCCCC--CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeE
Q psy5221 325 PPYNIELVDENKYRITMAIAGFN--RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAA 400 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~--~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A 400 (436)
|.++.++ +++.+.|.+.+++.. ++|++|+++++.+.++...... ..|.-...|...|+. .+.
T Consensus 1 ~~y~W~Q-t~~~V~v~i~~~~~~~~~~dv~v~~~~~~l~v~~~~~~~-------------~~~~~~~~L~~~I~~~~s~~ 66 (79)
T PF04969_consen 1 PRYDWYQ-TDDEVTVTIPVKPVDISKEDVKVDFTDTSLSVSIKSGDG-------------KEYLLEGELFGEIDPDESTW 66 (79)
T ss_dssp SSEEEEE-ESSEEEEEEE-TTTTSSGGGEEEEEETTEEEEEEEETTS-------------CEEEEEEEBSS-BECCCEEE
T ss_pred CCeEEEE-CCCEEEEEEEEcCCCCChHHeEEEEEeeEEEEEEEccCC-------------ceEEEEEEEeeeEcchhcEE
Confidence 6788886 899999999996664 9999999999999999654211 245566789999884 899
Q ss_pred EeeCcEEEEEEec
Q psy5221 401 FFDQGILNIDLIR 413 (436)
Q Consensus 401 ~~~~GvL~i~~pk 413 (436)
.++++.|.|+|.|
T Consensus 67 ~~~~~~i~i~L~K 79 (79)
T PF04969_consen 67 KVKDNKIEITLKK 79 (79)
T ss_dssp EEETTEEEEEEEB
T ss_pred EEECCEEEEEEEC
Confidence 9999999999987
No 45
>PF04228 Zn_peptidase: Putative neutral zinc metallopeptidase; InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=97.58 E-value=0.001 Score=63.76 Aligned_cols=67 Identities=15% Similarity=0.296 Sum_probs=44.8
Q ss_pred HHHHHHHHHHhcCCC--CCeEEEECC-----------CccEEEeCCCCCccEEEEcHHHHhhcC-------HHHHHHHHH
Q psy5221 64 LLKTVNKISKNANIS--TPEVAIYNG-----------EINAFATGAFKNSSLIAISANLLESMT-------KLEIKAIIA 123 (436)
Q Consensus 64 L~~~l~~l~~~~g~~--~p~v~v~~~-----------~~NA~~~G~~~~~~~v~it~gLl~~l~-------~~El~aVLA 123 (436)
+.+......+++|.+ .|++++++. ..|+|= +|....|+++..+++.|. +--...|||
T Consensus 99 le~~W~~~~~~~g~~y~~P~lv~~~~~~~t~CG~a~s~~gpFY---Cp~D~tIYlD~~f~~~L~~~~ga~G~~a~ayVlA 175 (292)
T PF04228_consen 99 LEDVWTPQFPQAGLPYRPPKLVLFSGSVQTGCGTASSATGPFY---CPADQTIYLDLSFFDELQQRFGASGDFAQAYVLA 175 (292)
T ss_pred HHHHHHHHHHHcCCCCCCCEEEEeCCCCCCCCCCCcCCCCCEe---CCCCCEEEechHHHHHHHHHhCCccHHHHHHHHH
Confidence 334445666677876 688888776 123332 466778999988776552 234577999
Q ss_pred HHHHhhhcCh
Q psy5221 124 HEISHISNGD 133 (436)
Q Consensus 124 HElgHi~~~h 133 (436)
||+||..++.
T Consensus 176 HEyGHHVQ~l 185 (292)
T PF04228_consen 176 HEYGHHVQNL 185 (292)
T ss_pred HHHHHHHHHH
Confidence 9999987754
No 46
>PF06114 DUF955: Domain of unknown function (DUF955); InterPro: IPR010359 This is a family of bacterial and viral proteins with undetermined function. A conserved H-E-X-X-H motif is suggestive of a catalytic active site and shows similarity to IPR001915 from INTERPRO.; PDB: 3DTE_A 3DTK_A 3DTI_A.
Probab=97.37 E-value=0.0016 Score=53.50 Aligned_cols=33 Identities=21% Similarity=0.277 Sum_probs=26.7
Q ss_pred ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221 100 SSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~ 136 (436)
..+|+++. .+++.+..+++|||+||+..+|...
T Consensus 28 ~~~I~in~----~~~~~~~~f~laHELgH~~~~~~~~ 60 (122)
T PF06114_consen 28 NPIIFINS----NLSPERQRFTLAHELGHILLHHGDE 60 (122)
T ss_dssp TTEEEEES----SS-HHHHHHHHHHHHHHHHHHH-HH
T ss_pred CCEEEECC----CCCHHHHHHHHHHHHHHHHhhhccc
Confidence 67788888 3789999999999999999988643
No 47
>cd06465 p23_hB-ind1_like p23_like domain found in human (h) butyrate-induced transcript 1 (B-ind1) and similar proteins. hB-ind1 participates in signaling by the small GTPase Rac1. It binds to Rac1 and enhances different Rac1 effects including activation of nuclear factor (NF) kappaB and activation of c-Jun N-terminal kinase (JNK). hB-ind1 also plays a part in the RNA replication and particle production of Hepatitis C virus (HCV) through its interaction with heat shock protein Hsp90, HCV nonstructural protein 5A (NS5A), and the immunophilin FKBP8. hB-ind1 is upregulated in the outer layer of Chinese hamster V79 cells grown as multicell spheroids, versus in the same cells grown as monolayers. This group includes the Saccharomyces cerevisiae Sba1, a co-chaperone of the Hsp90. Sba1 has been shown to be is required for telomere length maintenance, and may modulate telomerase DNA-binding activity.
Probab=97.27 E-value=0.0037 Score=50.90 Aligned_cols=76 Identities=12% Similarity=0.190 Sum_probs=61.4
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF 402 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~ 402 (436)
|+++-++ +++.+.|.+.+||+ ++++|++..+.|.|++.... ++ ..|.-.+.|...|+. .+.++
T Consensus 1 p~~~W~Q-t~~~V~i~i~~~~~--~~~~V~~~~~~l~v~~~~~~--~~----------~~y~~~~~L~~~I~pe~s~~~v 65 (108)
T cd06465 1 PPVLWAQ-RSDVVYLTIELPDA--KDPKIKLEPTSLSFKAKGGG--GG----------KKYEFDLEFYKEIDPEESKYKV 65 (108)
T ss_pred CceeeeE-CCCEEEEEEEeCCC--CCcEEEEECCEEEEEEEcCC--CC----------eeEEEEeEhhhhccccccEEEe
Confidence 6889997 89999999999998 88999999999999985321 11 124556688888883 78888
Q ss_pred eCcEEEEEEecCC
Q psy5221 403 DQGILNIDLIREV 415 (436)
Q Consensus 403 ~~GvL~i~~pk~~ 415 (436)
.++.+.|+|.|..
T Consensus 66 ~~~kveI~L~K~~ 78 (108)
T cd06465 66 TGRQIEFVLRKKE 78 (108)
T ss_pred cCCeEEEEEEECC
Confidence 8999999999953
No 48
>PF10463 Peptidase_U49: Peptidase U49; InterPro: IPR019504 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This entry contains peptidases belonging to MEROPS peptidase family U49 (Lit peptidase, clan U-). The Lit peptidase from Escherichia coli functions in bacterial cell death in response to infection by Enterobacteria phage T4. Following binding of Gol peptide to domains II and III of elongation factor Tu, the Lit peptidase cleaves domain I of the elongation factor. This prevents binding of guanine nucleotides, shuts down translation and leads to cell death.
Probab=97.16 E-value=0.00056 Score=61.79 Aligned_cols=20 Identities=30% Similarity=0.429 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHhhhcChHH
Q psy5221 116 LEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 116 ~El~aVLAHElgHi~~~h~~ 135 (436)
--+..+++||++|+..+|..
T Consensus 99 ~A~~fil~HE~~Hv~~~h~~ 118 (206)
T PF10463_consen 99 CAIAFILLHELAHVVLGHEG 118 (206)
T ss_pred HHHHHHHHHHHHHHHHcCcc
Confidence 46789999999999999964
No 49
>PF08190 PIH1: pre-RNA processing PIH1/Nop17
Probab=97.06 E-value=0.0021 Score=63.20 Aligned_cols=64 Identities=20% Similarity=0.374 Sum_probs=54.9
Q ss_pred cCceEEEEEecCCC-CCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEee--CcEE
Q psy5221 333 DENKYRITMAIAGF-NRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFD--QGIL 407 (436)
Q Consensus 333 ~~~~~~v~~~lpG~-~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~--~GvL 407 (436)
..+.++|+++|||+ +..+|+++|.++.|.|+.... .|+-.+.||..|| ..+|.|. .+.|
T Consensus 259 ~p~~lvv~i~LP~~~s~~~i~LdV~~~~l~l~~~~~----------------~y~L~l~LP~~V~~~~~~Akf~~~~~~L 322 (328)
T PF08190_consen 259 SPEELVVEIELPGVESASDIDLDVSEDRLSLSSPKP----------------KYRLDLPLPYPVDEDNGKAKFDKKTKTL 322 (328)
T ss_pred CCceEEEEEECCCcCccceeEEEEeCCEEEEEeCCC----------------ceEEEccCCCcccCCCceEEEccCCCEE
Confidence 36789999999999 899999999999999984431 3667799999999 4999996 5999
Q ss_pred EEEEe
Q psy5221 408 NIDLI 412 (436)
Q Consensus 408 ~i~~p 412 (436)
+||+|
T Consensus 323 ~vtlp 327 (328)
T PF08190_consen 323 TVTLP 327 (328)
T ss_pred EEEEE
Confidence 99998
No 50
>cd06489 p23_CS_hSgt1_like p23_like domain similar to the C-terminal CS (CHORD-SGT1) domain of human (h) Sgt1 and related proteins. hSgt1 is a co-chaperone which has been shown to be elevated in HEp-2 cells as a result of stress conditions such as heat shock. It interacts with the heat shock proteins (HSPs) Hsp70 and Hsp90, and it expression pattern is synchronized with these two Hsps. The interaction with HSP90 has been shown to involve the hSgt1_CS domain, and appears to be required for correct kinetochore assembly and efficient cell division. Some proteins in this subgroup contain a tetratricopeptide repeat (TPR) HSP-binding domain N-terminal to this CS domain, and most proteins in this subgroup contain a Sgt1-specific (SGS) domain C-terminal to the CS domain. The SGS domain interacts with some S100 family proteins. Studies suggest that S100A6 modulates in a Ca2+ dependent manner the interactions of hSgt1 with Hsp90 and Hsp70. The yeast Sgt1 CS domain is not found in this subgroup.
Probab=96.90 E-value=0.0078 Score=46.51 Aligned_cols=73 Identities=16% Similarity=0.205 Sum_probs=56.0
Q ss_pred eEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeCc
Q psy5221 328 NIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQG 405 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~G 405 (436)
|.++ +++.+.|.+.++|+.+++++|+++++.|.+++... ++. .|.-.+.|...|+. .+.+..++
T Consensus 1 dW~Q-~~~~V~iti~~k~~~~~~~~v~~~~~~l~~~~~~~---~~~----------~y~~~~~L~~~I~p~~s~~~v~~~ 66 (84)
T cd06489 1 DWYQ-TESQVVITILIKNVKPEDVSVEFEKRELSATVKLP---SGN----------DYSLKLHLLHPIVPEQSSYKILST 66 (84)
T ss_pred Cccc-cCCEEEEEEEECCCCHHHCEEEEeCCEEEEEEECC---CCC----------cEEEeeecCceecchhcEEEEeCc
Confidence 3455 78889999999999999999999999999997652 111 24445677777663 56666688
Q ss_pred EEEEEEecC
Q psy5221 406 ILNIDLIRE 414 (436)
Q Consensus 406 vL~i~~pk~ 414 (436)
-+.|++.|.
T Consensus 67 kiei~L~K~ 75 (84)
T cd06489 67 KIEIKLKKT 75 (84)
T ss_pred EEEEEEEcC
Confidence 899999994
No 51
>PRK09672 phage exclusion protein Lit; Provisional
Probab=96.74 E-value=0.0026 Score=59.94 Aligned_cols=21 Identities=24% Similarity=0.298 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHhhhcChHH
Q psy5221 115 KLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~~h~~ 135 (436)
-..++.|+.||++|+..+|..
T Consensus 162 l~A~a~i~~HEiaHv~~~h~~ 182 (305)
T PRK09672 162 LCALAWILLHEIAHVEFQHSS 182 (305)
T ss_pred HHHHHHHHHHHHHHHHhcccc
Confidence 346889999999999999963
No 52
>cd06467 p23_NUDC_like p23_like domain of NUD (nuclear distribution) C and similar proteins. Aspergillus nidulas (An) NUDC is needed for nuclear movement. AnNUDC is localized at the hyphal cortex, and binds NUDF at spindle pole bodies (SPBs) and in the cytoplasm at different stages in the cell cycle. At the SPBs it is part of the dynein molecular motor/NUDF complex that regulates microtubule dynamics. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its ext
Probab=96.63 E-value=0.018 Score=44.39 Aligned_cols=69 Identities=16% Similarity=0.214 Sum_probs=52.2
Q ss_pred EEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC-
Q psy5221 329 IELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ- 404 (436)
Q Consensus 329 i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~- 404 (436)
-++ +++.+.|.+.+| +++++|++|++.++.|.|+.+. ..+ .-.-.|...|+. ..-.+++
T Consensus 3 W~Q-t~~~V~i~i~~~~~~~~~dv~v~~~~~~l~v~~~~------~~~----------~l~~~L~~~I~~~~s~w~~~~~ 65 (85)
T cd06467 3 WTQ-TLDEVTVTIPLPEGTKSKDVKVEITPKHLKVGVKG------GEP----------LLDGELYAKVKVDESTWTLEDG 65 (85)
T ss_pred EEe-eCCEEEEEEECCCCCcceeEEEEEEcCEEEEEECC------CCc----------eEcCcccCceeEcCCEEEEeCC
Confidence 454 788999999997 7899999999999999998542 011 111246777774 5557788
Q ss_pred cEEEEEEecC
Q psy5221 405 GILNIDLIRE 414 (436)
Q Consensus 405 GvL~i~~pk~ 414 (436)
..+.|+++|.
T Consensus 66 ~~v~i~L~K~ 75 (85)
T cd06467 66 KLLEITLEKR 75 (85)
T ss_pred CEEEEEEEEC
Confidence 9999999994
No 53
>cd06493 p23_NUDCD1_like p23_NUDCD1: p23-like NUD (nuclear distribution) C-like domain found in human NUD (nuclear distribution) C domain-containing protein 1, NUDCD1 (also known as CML66), and similar proteins. NUDCD1/CML66 is a broadly immunogenic tumor associated antigen, which is highly expressed in a variety of solid tumors and in leukemias. In normal tissues high expression of NUDCD1/CML66 is limited to testis and heart.
Probab=96.35 E-value=0.038 Score=42.77 Aligned_cols=70 Identities=16% Similarity=0.248 Sum_probs=51.2
Q ss_pred eEEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221 328 NIELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ 404 (436)
Q Consensus 328 ~i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~ 404 (436)
+-++ +++.+.|.+.+| |++++|++|++..+.|.++... +.. + -.-.|...|+. .+-..++
T Consensus 2 ~W~Q-t~~~V~v~i~~p~~~~~~dv~v~~~~~~l~v~~~~-----~~~----------~-~~g~L~~~I~~d~Stw~i~~ 64 (85)
T cd06493 2 YWQQ-TEEDLTLTIRLPEDTTKEDIRIKFLPDHISIALKD-----QAP----------L-LEGKLYSSIDHESSTWIIKE 64 (85)
T ss_pred ccEE-eCCEEEEEEECCCCCChhhEEEEEecCEEEEEeCC-----CCe----------E-EeCcccCcccccCcEEEEeC
Confidence 4455 888999999996 9999999999999999997421 001 1 12256777774 5555666
Q ss_pred c-EEEEEEecC
Q psy5221 405 G-ILNIDLIRE 414 (436)
Q Consensus 405 G-vL~i~~pk~ 414 (436)
| .|.|++.|.
T Consensus 65 ~~~l~i~L~K~ 75 (85)
T cd06493 65 NKSLEVSLIKK 75 (85)
T ss_pred CCEEEEEEEEC
Confidence 6 699999994
No 54
>PF01863 DUF45: Protein of unknown function DUF45; InterPro: IPR002725 Members of this family are found in some archaebacteria, as well as Helicobacter pylori. The proteins are 190-240 amino acids long, with the C terminus being the most conserved region, containing three conserved histidines.
Probab=96.34 E-value=0.008 Score=54.80 Aligned_cols=68 Identities=18% Similarity=0.301 Sum_probs=53.0
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~ 135 (436)
+.+.+...++..|++++++.+-+... ..|.++.++.|.++.-|+ .++++-+..|+.||++|+.+.+.-
T Consensus 114 l~~~~~~~~~~~~~~~~~i~ir~~ks---rWGsc~~~~~I~ln~~L~-~~P~~~idYVvvHEL~Hl~~~nHs 181 (205)
T PF01863_consen 114 LPERLKKYAKKLGLPPPKIKIRDMKS---RWGSCSSKGNITLNWRLV-MAPPEVIDYVVVHELCHLRHPNHS 181 (205)
T ss_pred HHHHHHHHHHHcCCCcceEEEeehhh---ccccCCCCCcEEeecccc-cCCccHHHHHHHHHHHHhccCCCC
Confidence 66777888999999999988877511 256556677798998444 479999999999999999875543
No 55
>cd06488 p23_melusin_like p23_like domain similar to the C-terminal (tail) domain of vertebrate Melusin and related proteins. Melusin's tail domain interacts with the cytoplasmic domain of beta1-A and beta1-D isoforms of beta1 integrin, it does not bind other integrin beta subunits. Melusin is a muscle-specific protein expressed in skeletal and cardiac muscles but not in smooth muscle or other tissues. It is needed for heart hypertrophy following mechanical overload. The integrin-binding portion of this domain appears to be sequestered in the full length melusin protein, Ca2+ may modulate the protein's conformation exposing this binding site. This group includes Chordc1, also known as Chp-1, which is conserved from vertebrates to humans. Mammalian Chordc1 interacts with the heat shock protein (HSP) Hsp90 and is implicated in circadian and/or homeostatic mechanisms in the brain. The N-terminal portions of proteins belonging to this group contain two cysteine and histidine rich domain (C
Probab=96.30 E-value=0.054 Score=42.16 Aligned_cols=74 Identities=12% Similarity=0.091 Sum_probs=58.9
Q ss_pred eeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEeeC
Q psy5221 327 YNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFFDQ 404 (436)
Q Consensus 327 ~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~~~ 404 (436)
+|.++ +++.+.|.+-+.|++++++++.++++.|+++...+. + ..|.-.+.|-..|+. .+-....
T Consensus 3 ~dW~Q-s~~~V~ItI~~k~~~~~~~~v~~~~~~l~v~~~~~~---~----------~~y~~~l~L~~~I~~~~s~~~v~~ 68 (87)
T cd06488 3 HDWHQ-TGSHVVVSVYAKNSNPELSVVEANSTVLTIHIVFEG---N----------KEFQLDIELWGVIDVEKSSVNMLP 68 (87)
T ss_pred ccEee-CCCEEEEEEEECcCCccceEEEecCCEEEEEEECCC---C----------ceEEEEeeccceEChhHcEEEecC
Confidence 57886 889999999999999999999999999998755421 1 125566777777774 6666788
Q ss_pred cEEEEEEecC
Q psy5221 405 GILNIDLIRE 414 (436)
Q Consensus 405 GvL~i~~pk~ 414 (436)
+-+.|++.|.
T Consensus 69 ~kvei~L~K~ 78 (87)
T cd06488 69 TKVEIKLRKA 78 (87)
T ss_pred cEEEEEEEeC
Confidence 9999999994
No 56
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=96.22 E-value=0.68 Score=42.33 Aligned_cols=65 Identities=15% Similarity=0.205 Sum_probs=44.3
Q ss_pred HHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221 65 LKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~ 134 (436)
.+.-+.+.++.|+.--+|..+++ --+ =..|..+.|-++..-.+.-| =--.||-|||.||-.+...
T Consensus 40 ae~Ar~iL~~~gl~~V~Ve~~~G~LtD----HYdP~~k~vrLS~~vy~~~S-iaAvaVAAHEvGHAiQ~a~ 105 (222)
T PF04298_consen 40 AEVARHILDRNGLSDVRVERVPGELTD----HYDPRNKVVRLSEDVYNGRS-IAAVAVAAHEVGHAIQHAE 105 (222)
T ss_pred HHHHHHHHHHCCCCCeeEEEeCCCCCC----CcCCCCCEEEeCCccCCCCC-HHHHHHHHHHHhHHHhccc
Confidence 57888999999998656666654 101 12366777888875444322 2456899999999998774
No 57
>cd06494 p23_NUDCD2_like p23-like NUD (nuclear distribution) C-like found in human NUDC domain-containing protein 2 (NUDCD2) and similar proteins. Little is known about the function of the proteins in this subgroup.
Probab=95.86 E-value=0.12 Score=40.74 Aligned_cols=74 Identities=12% Similarity=0.115 Sum_probs=54.8
Q ss_pred CCCceeEEEecCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEe
Q psy5221 323 SYPPYNIELVDENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVA 399 (436)
Q Consensus 323 ~~p~~~i~~e~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~ 399 (436)
.++.++-++ +.+.+.|.+.+| |++++|++|++..+.|.|.-+- . +.-.| .|+..|+ ...
T Consensus 4 ~~~~y~W~Q-T~~eV~v~i~lp~~~~~kdv~V~i~~~~l~V~~~g------~-----~~l~G------~L~~~I~~dest 65 (93)
T cd06494 4 KTPWGCWYQ-TMDEVFIEVNVPPGTRAKDVKCKLGSRDISLAVKG------Q-----EVLKG------KLFDSVVADECT 65 (93)
T ss_pred cCCCcEEEe-EcCEEEEEEECCCCCceeeEEEEEEcCEEEEEECC------E-----EEEcC------cccCccCcccCE
Confidence 456788887 889999999888 8999999999999999997411 0 11133 3455555 356
Q ss_pred EEeeCcE-EEEEEecC
Q psy5221 400 AFFDQGI-LNIDLIRE 414 (436)
Q Consensus 400 A~~~~Gv-L~i~~pk~ 414 (436)
-.+++|- |.|+|.|.
T Consensus 66 Wtled~k~l~I~L~K~ 81 (93)
T cd06494 66 WTLEDRKLIRIVLTKS 81 (93)
T ss_pred EEEECCcEEEEEEEeC
Confidence 6677775 89999994
No 58
>PRK04351 hypothetical protein; Provisional
Probab=95.86 E-value=0.043 Score=47.24 Aligned_cols=62 Identities=24% Similarity=0.295 Sum_probs=47.1
Q ss_pred HHHHHHHHHHh-cCCCCCeEEEECC----CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhh
Q psy5221 64 LLKTVNKISKN-ANISTPEVAIYNG----EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHI 129 (436)
Q Consensus 64 L~~~l~~l~~~-~g~~~p~v~v~~~----~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi 129 (436)
|.+.+++++.. .+.+.+.-..++. ..-++.. ....|-++..+++..+++++..||+||++|+
T Consensus 6 l~~l~~~~s~~~F~~~f~~~v~~n~RlrttgG~~~l----~~~~I~lnp~ll~~~~~~~l~~vv~HElcH~ 72 (149)
T PRK04351 6 LQRLVEEISLEYFGKPFRHQAYFNKRLRTTGGRYLL----KDHHIEFNPKMLEEYGLEELIGIIKHELCHY 72 (149)
T ss_pred HHHHHHHHHHHHhCCCCCcEEEEeccchhhhheeec----CCCeEEeCHHHHhhccHHHHHhhHHHHHHHH
Confidence 99999999986 5766554455555 2233322 2456889998999999999999999999997
No 59
>cd06468 p23_CacyBP p23_like domain found in proteins similar to Calcyclin-Binding Protein(CacyBP)/Siah-1-interacting protein (SIP). CacyBP/SIP interacts with S100A6 (calcyclin), with some other members of the S100 family, with tubulin, and with Siah-1 and Skp-1. The latter two are components of the ubiquitin ligase that regulates beta-catenin degradation. The beta-catenin gene is an oncogene participating in tumorigenesis in many different cancers. Overexpression of CacyBP/SIP, in part through its effect on the expression of beta-catenin, inhibits the proliferation, tumorigenicity, and invasion of gastric cancer cells. CacyBP/SIP is abundant in neurons and neuroblastoma NB2a cells. An extensive re-organization of microtubules accompanies the differentiation of NB2a cells. CacyBP/SIP may contribute to NB2a cell differentiation through binding to and increasing the oligomerization of tubulin. CacyBP/SIP is also implicated in differentiation of erythroid cells, rat neonatal cardiomyocytes
Probab=95.70 E-value=0.14 Score=40.08 Aligned_cols=75 Identities=11% Similarity=0.286 Sum_probs=56.3
Q ss_pred ceeEEEecCceEEEEEecCCCCC---CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEE-ECCCccce--Ee
Q psy5221 326 PYNIELVDENKYRITMAIAGFNR---SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRF-QLADHVHV--VA 399 (436)
Q Consensus 326 ~~~i~~e~~~~~~v~~~lpG~~~---edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~-~lP~~v~~--~~ 399 (436)
.++-++ +++.+.|.+.+|+..+ +|++|+++++.|.|++... ++. .|.-.+ +|-..|+. .+
T Consensus 3 ~y~W~Q-t~~~V~i~i~~~~~~~~~~~~v~v~~~~~~l~v~~~~~---~~~----------~~~~~~~~L~~~I~~e~s~ 68 (92)
T cd06468 3 KYAWDQ-SDKFVKIYITLKGVHQLPKENIQVEFTERSFELKVHDL---NGK----------NYRFTINRLLKKIDPEKSS 68 (92)
T ss_pred eeeeec-CCCEEEEEEEccCCCcCCcccEEEEecCCEEEEEEECC---CCc----------EEEEEehHhhCccCccccE
Confidence 466775 8899999999999987 9999999999999987421 111 122233 36677773 67
Q ss_pred EEeeCcEEEEEEecC
Q psy5221 400 AFFDQGILNIDLIRE 414 (436)
Q Consensus 400 A~~~~GvL~i~~pk~ 414 (436)
....++-+.|++.|.
T Consensus 69 ~~~~~~ki~i~L~K~ 83 (92)
T cd06468 69 FKVKTDRIVITLAKK 83 (92)
T ss_pred EEEeCCEEEEEEEeC
Confidence 777889999999994
No 60
>COG1451 Predicted metal-dependent hydrolase [General function prediction only]
Probab=95.61 E-value=0.044 Score=50.65 Aligned_cols=69 Identities=14% Similarity=0.187 Sum_probs=50.0
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~ 136 (436)
+...+...+...|.+...+.+-+. --..|.+...+.|.++. .+..++++.+..|++||++|++..+.-.
T Consensus 125 l~~~~~~~~~~l~~~~~~~~ik~~---k~~WGScs~~~~i~~~~-~l~~~p~~~i~YVvvHELaHLke~nHs~ 193 (223)
T COG1451 125 LEIRLKEYAKKLGVPPRAIKLKNM---KRRWGSCSKAGEIRFNW-RLVMAPEEVIDYVVVHELAHLKEKNHSK 193 (223)
T ss_pred HHHHHHHHHHHhCCCccceeeeec---cceeeeecCCCcEEeeh-hhhcCCHHHHHHHHHHHHHHHhhhhccH
Confidence 556677778888888777777653 11233333333677887 7777899999999999999999977544
No 61
>COG2321 Predicted metalloprotease [General function prediction only]
Probab=95.12 E-value=0.17 Score=47.22 Aligned_cols=67 Identities=18% Similarity=0.343 Sum_probs=44.1
Q ss_pred HHHHHHHhcCCC--CCeEEEECC------CccEEEeC--CCCCccEEEEcHHHHhhcC-------HHHHHHHHHHHHHhh
Q psy5221 67 TVNKISKNANIS--TPEVAIYNG------EINAFATG--AFKNSSLIAISANLLESMT-------KLEIKAIIAHEISHI 129 (436)
Q Consensus 67 ~l~~l~~~~g~~--~p~v~v~~~------~~NA~~~G--~~~~~~~v~it~gLl~~l~-------~~El~aVLAHElgHi 129 (436)
....+-+..|.. .|++.++.. ..+--+.| .+|..+.++++..+++.|. +=.-+-|+|||.||.
T Consensus 99 ~W~~if~~~~~~Y~~Ptlvlf~~~v~t~CG~assasGPFYCP~D~kvYlDlsFf~~m~~~fga~GdfAqaYViAHEVGHH 178 (295)
T COG2321 99 TWMQIFQESGRTYQKPTLVLFSGQVRTGCGFASSASGPFYCPADTKVYLDLSFFDEMKTKFGASGDFAQAYVIAHEVGHH 178 (295)
T ss_pred HHHHHHHHhcccccCCeEEEecCccccCcCCCCcCCCCeecCCCceEEEehhHHHHHHHHhcCCccHHHHHHHHhhhhHH
Confidence 334455555543 678877765 11222232 2577888999999998874 335678999999998
Q ss_pred hcCh
Q psy5221 130 SNGD 133 (436)
Q Consensus 130 ~~~h 133 (436)
.++-
T Consensus 179 VQnl 182 (295)
T COG2321 179 VQNL 182 (295)
T ss_pred HHHH
Confidence 8754
No 62
>PF13699 DUF4157: Domain of unknown function (DUF4157)
Probab=94.92 E-value=0.038 Score=42.19 Aligned_cols=60 Identities=20% Similarity=0.342 Sum_probs=40.4
Q ss_pred HHHHHHHhcCCCCCeEEEECC----------CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcC
Q psy5221 67 TVNKISKNANISTPEVAIYNG----------EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNG 132 (436)
Q Consensus 67 ~l~~l~~~~g~~~p~v~v~~~----------~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~ 132 (436)
.-.++....|.+..+|.|-.+ ...||+.| ..|++..|-++ .+..+=.++||||++|+.+.
T Consensus 6 ~r~~~e~~~G~dl~~Vrvh~~~~a~~~~~~~~A~A~T~G-----~~I~f~~g~~~-~~s~~~~~llaHEl~Hv~Qq 75 (79)
T PF13699_consen 6 IRSRLERAFGADLSDVRVHTGPAASRAAAALGARAFTVG-----NDIYFAPGKYN-PDSPEGRALLAHELAHVVQQ 75 (79)
T ss_pred HHHHHHHHhCCCccceEEEeCCchhhhhhccCCeEEEEC-----CEEEEcCCCcC-CCCCCcchhHhHHHHHHHhh
Confidence 445666677888777777543 67788886 23777554332 24445678999999999874
No 63
>PF10263 SprT-like: SprT-like family; InterPro: IPR006640 This is a family of uncharacterised bacterial proteins which includes Escherichia coli SprT (P39902 from SWISSPROT). SprT is described as a regulator of bolA gene in stationary phase []. The majority of members contain the metallopeptidase zinc binding signature which has a HExxH motif, however there is no evidence for them being metallopeptidases.
Probab=93.89 E-value=0.36 Score=41.88 Aligned_cols=70 Identities=16% Similarity=0.137 Sum_probs=47.3
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECC-CccEEEeCCCCC-ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKN-SSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~-~~NA~~~G~~~~-~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
+++.+++..=.-.++.+.|.+-.. .-.+-..-..+. ...|.++..+++..+++++..+|.|||.|.....
T Consensus 4 ~f~~~n~~~F~~~L~~~~i~~~~~~~~~~G~~~~~~~~~~~I~ls~~~~~~~~~~~~~~tL~HEm~H~~~~~ 75 (157)
T PF10263_consen 4 LFDRFNEEYFGGKLPPIPITWSKRMKRTAGRCRYKRRSPCEIRLSPKLLDRNPEEELIDTLLHEMAHAAAYV 75 (157)
T ss_pred HHHHHHHHHhCCCCCCeEEEEECCCCCceEEEEECCCCceEEEECHHHHHhhHHHHHHHHHHHHHHHHHhhh
Confidence 677777777767777655555523 222222211111 2369999999998899999999999999987743
No 64
>smart00731 SprT SprT homologues. Predicted to have roles in transcription elongation. Contains a conserved HExxH motif, indicating a metalloprotease function.
Probab=93.60 E-value=0.18 Score=43.43 Aligned_cols=66 Identities=18% Similarity=0.259 Sum_probs=43.2
Q ss_pred HHHHHHHHHHh-c--CCCCCeEEEECCCccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 64 LLKTVNKISKN-A--NISTPEVAIYNGEINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 64 L~~~l~~l~~~-~--g~~~p~v~v~~~~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
|.+.++++-.+ . .++.|.|..-...-.+ .|... ....|.++..|++..+++++..||.|||+|...
T Consensus 3 L~~~~~~~n~~~F~~~l~~~~i~w~~r~~~~--~G~~~~~~~~I~ln~~l~~~~~~~~l~~~l~HEm~H~~~ 72 (146)
T smart00731 3 LTQRLEDASLRVFGRKLPHPKVVWNKRLRKT--GGRCLLKSAEIRLNPKLLTENGRDRLRETLLHELCHAAL 72 (146)
T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEehhhhhh--hHHhhcCCCEEEeCHHHHhhccHHHHHhhHHHHHHHHHH
Confidence 66666666532 3 3455544333321111 22222 367799999898888999999999999999986
No 65
>cd00237 p23 p23 binds heat shock protein (Hsp)90 and participates in the folding of a number of Hsp90 clients, including the progesterone receptor. p23 also has a passive chaperoning activity and in addition may participate in prostaglandin synthesis.
Probab=93.58 E-value=1.4 Score=35.55 Aligned_cols=74 Identities=11% Similarity=0.056 Sum_probs=51.5
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF 402 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~ 402 (436)
|+++-++ +++.+.|.+++|+ .+|++|+++++.|+++|... ++..| .-.+.|=..|+. .+-..
T Consensus 2 p~v~WaQ-r~~~V~ltI~v~d--~~d~~v~l~~~~l~f~~~~~---~g~~y----------~~~l~l~~~I~pe~Sk~~v 65 (106)
T cd00237 2 AKTLWYD-RRDYVFIEFCVED--SKDVKVDFEKSKLTFSCLNG---DNVKI----------YNEIELYDRVDPNDSKHKR 65 (106)
T ss_pred CcceeeE-CCCEEEEEEEeCC--CCCcEEEEecCEEEEEEECC---CCcEE----------EEEEEeecccCcccCeEEe
Confidence 6788886 8889999999999 58999999999999998432 22222 223445455553 33333
Q ss_pred eCcEEEEEEecC
Q psy5221 403 DQGILNIDLIRE 414 (436)
Q Consensus 403 ~~GvL~i~~pk~ 414 (436)
..--+.|.+.|.
T Consensus 66 ~~r~ve~~L~K~ 77 (106)
T cd00237 66 TDRSILCCLRKG 77 (106)
T ss_pred CCceEEEEEEeC
Confidence 455677788884
No 66
>PF10026 DUF2268: Predicted Zn-dependent protease (DUF2268); InterPro: IPR018728 This domain, found in various hypothetical bacterial proteins, as well as predicted zinc dependent proteases, has no known function.
Probab=93.29 E-value=0.23 Score=44.96 Aligned_cols=69 Identities=20% Similarity=0.125 Sum_probs=43.4
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECC-Ccc------EEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNG-EIN------AFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~-~~N------A~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
+.+.+.++.+....+--+|++... ..| --..|.+.....|++.- +-...+.+++.+++|||+-|..+..
T Consensus 5 i~~~~~~~~~~~~~~~i~v~i~p~~~~~~~~~~~~g~~g~~~~~~~i~l~~-~~~~~~~~~l~~~iaHE~hH~~r~~ 80 (195)
T PF10026_consen 5 IEEALEKSIELLPGPDIPVFIFPADPENPFLIPELGGKGGGAIPGYIFLFL-LPNDYSLEELPALIAHEYHHNCRYE 80 (195)
T ss_pred HHHHHHHHHHHcCCCCCCEEEEeccCCCcccccccCcccccCCCCEEEEEe-cCCcccHHHHHHHHHHHHHHHHHHh
Confidence 567777777777776555665554 112 11112222234465654 4555688999999999999997644
No 67
>PF13203 DUF2201_N: Putative metallopeptidase domain
Probab=92.83 E-value=0.24 Score=47.88 Aligned_cols=38 Identities=18% Similarity=0.135 Sum_probs=34.1
Q ss_pred cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221 101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTML 138 (436)
Q Consensus 101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~ 138 (436)
..|+++...+..++.+|+.+|++||+-|+..+|..+..
T Consensus 43 ~~l~~nP~~~~~l~~~~~~~~l~HevlH~~~~H~~r~~ 80 (292)
T PF13203_consen 43 RRLYYNPEFLESLSPEERVGLLLHEVLHCLLRHPWRRG 80 (292)
T ss_pred cEEEECcHHHhcCCHHHHHHHHHHHHHHHHccchhhhc
Confidence 36999999999999999999999999999999986643
No 68
>PF04450 BSP: Peptidase of plants and bacteria; InterPro: IPR007541 These basic secretory proteins (BSPs) are believed to be part of the plants defence mechanism against pathogens [].
Probab=92.56 E-value=0.28 Score=44.72 Aligned_cols=70 Identities=21% Similarity=0.212 Sum_probs=49.8
Q ss_pred HHHHHHHHHHhcCCC------CCeEE-EECC-CccEEEeCCCCCccEEEEcHHHHhhcC-----HHHHHHHHHHHHHhhh
Q psy5221 64 LLKTVNKISKNANIS------TPEVA-IYNG-EINAFATGAFKNSSLIAISANLLESMT-----KLEIKAIIAHEISHIS 130 (436)
Q Consensus 64 L~~~l~~l~~~~g~~------~p~v~-v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~-----~~El~aVLAHElgHi~ 130 (436)
|.+.+..+.+.+-.+ ...|. ++++ ..=|++.| .+....|.++...++... ..|+..||-||+.|+.
T Consensus 30 L~~a~~~V~~~ly~~~~~~~~v~~Vt~~~~~~~gVA~t~g-d~~~~~I~~S~~~i~~~~~~~~~~~Ei~Gvl~HE~~H~~ 108 (205)
T PF04450_consen 30 LRDASRFVWRLLYQSPADRKPVRSVTLILDDMDGVAYTSG-DDDHKEIHFSARYIAKYPADGDVRDEIIGVLYHEMVHCW 108 (205)
T ss_pred HHHHHHHHHHHhCCCCCCCCcccEEEEEEECCCeeEEEec-CCCccEEEEeHHHHhhcccccchHHHHHHHHHHHHHHHh
Confidence 555555555544332 23444 4555 77788888 455677999999999885 3599999999999999
Q ss_pred cChH
Q psy5221 131 NGDM 134 (436)
Q Consensus 131 ~~h~ 134 (436)
+.+.
T Consensus 109 Q~~~ 112 (205)
T PF04450_consen 109 QWDG 112 (205)
T ss_pred hcCC
Confidence 9775
No 69
>COG2738 Predicted Zn-dependent protease [General function prediction only]
Probab=91.36 E-value=2.6 Score=37.46 Aligned_cols=70 Identities=13% Similarity=0.091 Sum_probs=45.5
Q ss_pred HHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221 65 LKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVTML 138 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~~~ 138 (436)
.+.-+.+.+..|+-.-.|-.+++.. +--..|++++|.++..-+..-+- .-.||-|||.||-.+.+..+..
T Consensus 43 aevAr~iLd~nGl~dV~Ve~v~G~L---TDHYDP~~kvvrLSe~~y~g~Si-a~~aVAAHEVGHAiQd~~~Y~~ 112 (226)
T COG2738 43 AEVARMILDENGLYDVPVEEVPGTL---TDHYDPRRKVVRLSEANYYGPSI-AAIAVAAHEVGHAIQDQEDYAF 112 (226)
T ss_pred HHHHHHHHhhcCCccceeeeecCCc---ccccChhhheeeccccccCCccH-HHHHHHHHHhhHHHhhhcccHH
Confidence 4677788888998644455555410 11124778888888755444333 4468999999999887755433
No 70
>cd06495 p23_NUDCD3_like p23-like NUD (nuclear distribution) C-like domain found in human NUDC domain-containing protein 3 (NUDCD3) and similar proteins. Little is known about the function of the proteins in this subgroup.
Probab=90.41 E-value=4.5 Score=32.44 Aligned_cols=77 Identities=8% Similarity=0.226 Sum_probs=53.0
Q ss_pred CCceeEEEecCceEEEEEecC-CC-CCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEe
Q psy5221 324 YPPYNIELVDENKYRITMAIA-GF-NRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVA 399 (436)
Q Consensus 324 ~p~~~i~~e~~~~~~v~~~lp-G~-~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~ 399 (436)
.+.|.-.+ +.+.+.|.+.+| |. +..|++|++..+.|.|.-+.. .++..++ .| .|+..|+ -..
T Consensus 4 ~e~Y~WtQ-Tl~eV~V~i~lp~~~~~~kdv~v~i~~~~l~v~~~~~--~~~~~~i-----~G------~L~~~V~~des~ 69 (102)
T cd06495 4 RENYTWSQ-DYTDVEVRVPVPKDVVKGRQVSVDLQSSSIRVSVRDG--GGEKVLM-----EG------EFTHKINTENSL 69 (102)
T ss_pred CCceEEEe-ECCeEEEEEECCCCCccceEEEEEEEcCEEEEEEecC--CCCceEE-----eC------cccCcccCccce
Confidence 45666776 888999999999 53 578999999999999874310 0001111 33 3455555 366
Q ss_pred EEeeCcE-EEEEEecC
Q psy5221 400 AFFDQGI-LNIDLIRE 414 (436)
Q Consensus 400 A~~~~Gv-L~i~~pk~ 414 (436)
-.++||- |.|+|-|.
T Consensus 70 Wtled~~~l~I~L~K~ 85 (102)
T cd06495 70 WSLEPGKCVLLSLSKC 85 (102)
T ss_pred EEEeCCCEEEEEEEEC
Confidence 6788864 89999994
No 71
>COG2856 Predicted Zn peptidase [Amino acid transport and metabolism]
Probab=90.29 E-value=0.31 Score=44.63 Aligned_cols=31 Identities=23% Similarity=0.243 Sum_probs=24.6
Q ss_pred CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 99 NSSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 99 ~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
.++.|+++. ..+++.-.+++|||+||+..+.
T Consensus 57 ~~~~I~iN~----n~~~~r~rFtlAHELGH~llH~ 87 (213)
T COG2856 57 EKPVIYINA----NNSLERKRFTLAHELGHALLHT 87 (213)
T ss_pred cCceEEEeC----CCCHHHHHHHHHHHHhHHHhcc
Confidence 345677776 4489999999999999988754
No 72
>KOG1309|consensus
Probab=90.27 E-value=1.1 Score=39.28 Aligned_cols=42 Identities=19% Similarity=0.167 Sum_probs=37.8
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEe
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQ 367 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~ 367 (436)
+..|.|+ +++..+|.+-.+|+.++|++|.+.++.|.|.-+.+
T Consensus 4 ~r~DwyQ-t~~~vvIti~~k~v~~~~v~v~~s~~~l~~~~~~~ 45 (196)
T KOG1309|consen 4 IRHDWYQ-TETSVVITIFAKNVPKEDVNVEISENTLSIVIQLP 45 (196)
T ss_pred ccceeec-CCceEEEEEEecCCCccceeEEeecceEEEEEecC
Confidence 5689996 99999999999999999999999999999886654
No 73
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=90.26 E-value=1.4 Score=43.87 Aligned_cols=77 Identities=8% Similarity=0.051 Sum_probs=58.2
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce--EeEEe
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV--VAAFF 402 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~--~~A~~ 402 (436)
+.+|.|+ +++.+.|.+-+.|+.+++++|++.++.|.|+-... .+. .|.-.+.|-..|+. .+.+.
T Consensus 157 ~r~dWyQ-s~~~V~i~i~~k~~~~~~~~v~~~~~~l~v~~~~~---~~~----------~y~~~~~L~~~I~p~~s~~~v 222 (356)
T PLN03088 157 YRHEFYQ-KPEEVVVTVFAKGVPAENVNVDFGEQILSVVIEVP---GED----------AYHLQPRLFGKIIPDKCKYEV 222 (356)
T ss_pred cccceee-cCCEEEEEEEecCCChHHcEEEeecCEEEEEEecC---CCc----------ceeecccccccccccccEEEE
Confidence 5688997 89999999999999999999999999999986432 111 24444677777773 55555
Q ss_pred eCcEEEEEEecCC
Q psy5221 403 DQGILNIDLIREV 415 (436)
Q Consensus 403 ~~GvL~i~~pk~~ 415 (436)
..--+.|+|.|..
T Consensus 223 ~~~Kiei~l~K~~ 235 (356)
T PLN03088 223 LSTKIEIRLAKAE 235 (356)
T ss_pred ecceEEEEEecCC
Confidence 5668899998853
No 74
>COG4900 Predicted metallopeptidase [General function prediction only]
Probab=90.14 E-value=0.77 Score=36.75 Aligned_cols=67 Identities=16% Similarity=0.184 Sum_probs=44.4
Q ss_pred HHHHHHHHHHhcCCC---CCeEEEECC---CccEEE------------eCCCCCccEEEEcHHHHhhcCHHHHHHHHHHH
Q psy5221 64 LLKTVNKISKNANIS---TPEVAIYNG---EINAFA------------TGAFKNSSLIAISANLLESMTKLEIKAIIAHE 125 (436)
Q Consensus 64 L~~~l~~l~~~~g~~---~p~v~v~~~---~~NA~~------------~G~~~~~~~v~it~gLl~~l~~~El~aVLAHE 125 (436)
..+.++.+..++|+. ..+|+++-+ ..-|++ .|..|.--+-+++. =++.|+.+|-.-|++||
T Consensus 9 ve~~~~~~V~~lgLdyi~~~rv~vVys~gS~~~A~ARIwg~pki~~e~lglnP~YviEl~se-kF~rLs~~ekvKviiHE 87 (133)
T COG4900 9 VEADIKNAVVRLGLDYIFQVRVVVVYSPGSHSKAVARIWGIPKIFQEVLGLNPVYVIELLSE-KFKRLSCAEKVKVIIHE 87 (133)
T ss_pred HHHHHHHHHHHhCcceeeeeeEEEEECCCCcceehhhhhcccHHHHHHhCCCCeeeeeeehh-hcCCCChHHHHHHHHHH
Confidence 457788888889987 456666654 223322 24444322223333 57788999999999999
Q ss_pred HHhhhc
Q psy5221 126 ISHISN 131 (436)
Q Consensus 126 lgHi~~ 131 (436)
+.||-.
T Consensus 88 llHIP~ 93 (133)
T COG4900 88 LLHIPA 93 (133)
T ss_pred HhcCcc
Confidence 999865
No 75
>cd06492 p23_mNUDC_like p23-like NUD (nuclear distribution) C-like domain of mammalian(m) NUDC and similar proteins. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I). mNUDC is important for cell proliferation both in normal and tumor tissues. Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its extracellular domain, and promoting cell proliferation and differentiation.
Probab=89.30 E-value=5.1 Score=31.06 Aligned_cols=66 Identities=11% Similarity=0.279 Sum_probs=46.3
Q ss_pred cCceEEEEEecC---CCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc--eEeEEeeCc-E
Q psy5221 333 DENKYRITMAIA---GFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH--VVAAFFDQG-I 406 (436)
Q Consensus 333 ~~~~~~v~~~lp---G~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~--~~~A~~~~G-v 406 (436)
+.++..|.+.+| |+++.|++|++..+.|.|.-+. ...+ -.| .|...|+ ...-.+++| .
T Consensus 6 T~~ev~v~v~l~~~~~~~~kdv~v~i~~~~l~v~~~g------~~~~----i~G------~L~~~V~~des~Wtled~~~ 69 (87)
T cd06492 6 TLSEVELKVPFKVSFRLKGKDVVVDIQRKHLKVGLKG------QPPI----IDG------ELYNEVKVEESSWLIEDGKV 69 (87)
T ss_pred ecCEEEEEEECCCCCCccceEEEEEEecCEEEEEECC------CceE----EeC------cccCcccccccEEEEeCCCE
Confidence 667789999986 3889999999999999986321 1111 123 3445555 366667786 8
Q ss_pred EEEEEecC
Q psy5221 407 LNIDLIRE 414 (436)
Q Consensus 407 L~i~~pk~ 414 (436)
|.|++-|.
T Consensus 70 l~i~L~K~ 77 (87)
T cd06492 70 VTVNLEKI 77 (87)
T ss_pred EEEEEEEC
Confidence 99999994
No 76
>cd06490 p23_NCB5OR p23_like domain found in NAD(P)H cytochrome b5 (NCB5) oxidoreductase (OR) and similar proteins. NCB5OR is widely expressed in human organs and tissues and is localized in the ER (endoplasmic reticulum). It appears to play a critical role in maintaining viable pancreatic beta cells. Mice homozygous for a targeted knockout (KO) of the gene encoding NCB5OR develop an early-onset nonautoimmune diabetes phenotype with a non-inflammatory beta-cell deficiency. The role of NCB5OR in beta cells may be in maintaining or regulating their redox status. Proteins in this group in addition contain an N-terminal cytochrome b5 domain and a C-terminal cytochrome b5 oxidoreductase domain. The gene encoding NCB5OR has been considered as a positional candidate for type II diabetes and other diabetes subtypes related to B-cell dysfunction, however variation in its coding region does not appear not to be a major contributor to the pathogenesis of these diseases.
Probab=88.96 E-value=6 Score=30.60 Aligned_cols=73 Identities=11% Similarity=0.189 Sum_probs=49.0
Q ss_pred eeEEEecCceEEEEEecCC--CCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccce-EeEEee
Q psy5221 327 YNIELVDENKYRITMAIAG--FNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHV-VAAFFD 403 (436)
Q Consensus 327 ~~i~~e~~~~~~v~~~lpG--~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~-~~A~~~ 403 (436)
+|.|+ +++.++|.+-.++ .+++++.+..+++.|+|+-... + ..|...+.|=..++. .+.++.
T Consensus 1 ~DWyQ-t~~~Vtitiy~K~~~~~~~~v~v~~~~~~l~v~~~~~----~----------~~~~~~~~L~~~I~~~~~~~~~ 65 (87)
T cd06490 1 YDWFQ-TDSEVTIVVYTKSKGNPADIVIVDDQQRELRVEIILG----D----------KSYLLHLDLSNEVQWPCEVRIS 65 (87)
T ss_pred CCceE-CCCEEEEEEEEcccCCCCccEEEECCCCEEEEEEECC----C----------ceEEEeeeccccCCCCcEEEEc
Confidence 47787 8999999999885 4555666666777888874332 1 124555666666662 234444
Q ss_pred --CcEEEEEEecC
Q psy5221 404 --QGILNIDLIRE 414 (436)
Q Consensus 404 --~GvL~i~~pk~ 414 (436)
-|.++|+++|.
T Consensus 66 ~~~~KVEI~L~K~ 78 (87)
T cd06490 66 TETGKIELVLKKK 78 (87)
T ss_pred ccCceEEEEEEcC
Confidence 78999999994
No 77
>PF02031 Peptidase_M7: Streptomyces extracellular neutral proteinase (M7) family; InterPro: IPR000013 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M7 (snapalysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. With a molecular weight of around 16kDa, Streptomyces extracellular neutral protease is one of the smallest known proteases []; it is capable of hydrolysing milk proteins []. The enzyme is synthesised as a proenzyme with a signal peptide, a propeptide and an active domain that contains the conserved HEXXH motif characteristic of metalloproteases. Although family M7 shows active site sequence similarity to other members, it differs in one major respect: the third zinc ligand appears to be an aspartate residue rather than the usual histidine.; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0005576 extracellular region; PDB: 1C7K_A 1KUH_A.
Probab=88.46 E-value=1.4 Score=36.46 Aligned_cols=39 Identities=26% Similarity=0.217 Sum_probs=23.3
Q ss_pred CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
..-|.+.|.+ +.+|.++..--+..+.- -|.+||+||+..
T Consensus 52 ~sya~~~g~G--~G~I~l~~~~~qgy~~~---RIaaHE~GHiLG 90 (132)
T PF02031_consen 52 GSYASTDGLG--SGYIFLDYQQNQGYNST---RIAAHELGHILG 90 (132)
T ss_dssp --EEEE-SSS---EEEEEEHHHHHHS-HH---HHHHHHHHHHHT
T ss_pred CcccccCCCC--cEEEEechHHhhCCccc---eeeeehhccccC
Confidence 4446666654 67799987544444432 299999999865
No 78
>PRK04860 hypothetical protein; Provisional
Probab=88.36 E-value=1.8 Score=37.76 Aligned_cols=66 Identities=18% Similarity=0.191 Sum_probs=41.4
Q ss_pred HHHHHHHHHHhcCCC--CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 64 LLKTVNKISKNANIS--TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 64 L~~~l~~l~~~~g~~--~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
+...++.-....|-+ .|.+..-.-.-.|-.+.. ...-|-++..++..-+++++..||.||++|+.-
T Consensus 9 ~~~~~~~a~~~f~~~f~~p~~~f~~R~rtaG~~~l--~~~~I~~Np~ll~~~~~~~l~~~v~HEl~H~~~ 76 (160)
T PRK04860 9 LRECLAQANLYFKRTFPEPKVSYTQRGTSAGTAWL--QSNEIRLNPVLLLENQQAFIDEVVPHELAHLLV 76 (160)
T ss_pred HHHHHHHHHHHhCCCCCCCEEEEeecchhhcchhH--hcCCeeeCHHHHhhCcHHHHHhHHHHHHHHHHH
Confidence 344444444455655 455543222222433322 234488898898888999999999999999854
No 79
>COG3091 SprT Zn-dependent metalloprotease, SprT family [General function prediction only]
Probab=87.14 E-value=1.6 Score=37.14 Aligned_cols=66 Identities=21% Similarity=0.188 Sum_probs=44.5
Q ss_pred HHHHHHHHHHHhcCCC--CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhh
Q psy5221 63 WLLKTVNKISKNANIS--TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHIS 130 (436)
Q Consensus 63 ~L~~~l~~l~~~~g~~--~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~ 130 (436)
.|.+.++++.....-+ .|+++.-.-.-+|-..-+ . +.-|-++.-++....++.+.-|+.||++|+.
T Consensus 6 ~L~~~~~~as~~~~r~~~~p~~~~n~Rg~taG~ayL-~-~~~I~lNP~ll~en~~~f~~~vV~HELaHl~ 73 (156)
T COG3091 6 KLQQCVEQASLKFFRKFFRPKASYNQRGRTAGGAYL-L-KSEIRLNPKLLEENGEDFIEQVVPHELAHLH 73 (156)
T ss_pred HHHHHHHHHHHHhcCCCCCcceehhhhhhhcchhhc-c-ccccccCHHHHHHccHHHHHHHHHHHHHHHH
Confidence 4788888887776543 555543222333322212 2 2257888889999999999999999999974
No 80
>PF13349 DUF4097: Domain of unknown function (DUF4097)
Probab=84.22 E-value=11 Score=32.63 Aligned_cols=81 Identities=17% Similarity=0.210 Sum_probs=48.2
Q ss_pred CCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCcc--ceEeEE
Q psy5221 324 YPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHV--HVVAAF 401 (436)
Q Consensus 324 ~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v--~~~~A~ 401 (436)
...+.|...+++.+.+ +. +.+.++++.++++|.|+.+.....-...+..... ...-+-.+.||++. +.++..
T Consensus 65 ~~~V~I~~~~~~~i~v--~~---~~k~~~~~~~~~~L~I~~~~~~~~~~~~~~~~~~-~~~~~i~I~lP~~~~l~~i~i~ 138 (166)
T PF13349_consen 65 NGDVEIKPSDDDKIKV--EY---NGKKPEISVEGGTLTIKSKDRESFFFKGFNFNNS-DNKSKITIYLPKDYKLDKIDIK 138 (166)
T ss_pred ceeEEEEEcCCccEEE--EE---cCcEEEEEEcCCEEEEEEecccccccceEEEccc-CCCcEEEEEECCCCceeEEEEE
Confidence 3557776533444444 44 3227999999999999977221111111111111 23455668999986 468888
Q ss_pred eeCcEEEEE
Q psy5221 402 FDQGILNID 410 (436)
Q Consensus 402 ~~~GvL~i~ 410 (436)
..+|-+++.
T Consensus 139 ~~~G~i~i~ 147 (166)
T PF13349_consen 139 TSSGDITIE 147 (166)
T ss_pred eccccEEEE
Confidence 889966654
No 81
>PF12388 Peptidase_M57: Dual-action HEIGH metallo-peptidase; InterPro: IPR024653 This entry represents the metallopeptidases M10, M27 and M57. The catalytic triad for proteases in this entry is HE-H-H, which in many members is in the sequence motif HEIGH [].
Probab=82.87 E-value=1.1 Score=40.81 Aligned_cols=34 Identities=26% Similarity=0.313 Sum_probs=25.2
Q ss_pred ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc-ChHH
Q psy5221 100 SSLIAISANLLESMTKLEIKAIIAHEISHISN-GDMV 135 (436)
Q Consensus 100 ~~~v~it~gLl~~l~~~El~aVLAHElgHi~~-~h~~ 135 (436)
.+.|.+. + ++..+.+..+.||+|||||... ||.-
T Consensus 117 ~~~I~I~-~-~~~~~~~~~~hvi~HEiGH~IGfRHTD 151 (211)
T PF12388_consen 117 YKFIQIY-G-LSNYSVNVIEHVITHEIGHCIGFRHTD 151 (211)
T ss_pred CceEEEE-e-cCCCchhHHHHHHHHHhhhhccccccC
Confidence 4456663 2 4666888999999999999765 6654
No 82
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=80.48 E-value=0.45 Score=48.93 Aligned_cols=55 Identities=22% Similarity=0.215 Sum_probs=37.5
Q ss_pred hcCCCCCe-EEEECC-CccEEEeCCCCCccEEEEcHHHHhhc--CH--------HHHHHHHHHHHHhhhc
Q psy5221 74 NANISTPE-VAIYNG-EINAFATGAFKNSSLIAISANLLESM--TK--------LEIKAIIAHEISHISN 131 (436)
Q Consensus 74 ~~g~~~p~-v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l--~~--------~El~aVLAHElgHi~~ 131 (436)
+.|-|+.+ -..++. .+|||-- |.+..|++-.++|+.- ++ .-+-|||||||||---
T Consensus 434 K~~kPVDr~eW~M~pq~VNAYYn---p~~N~IVFPAAILQ~PFfd~ea~~a~NYGgIGaVIgHEI~HgFD 500 (654)
T COG3590 434 KIGKPVDRDEWEMPPQTVNAYYN---PQKNEIVFPAAILQAPFFDPEADSAANYGGIGAVIGHEIGHGFD 500 (654)
T ss_pred HhCCCCchhhcCCCHHHhhhhcC---CCCceEeeeHHhcCCCCCCCCcchhhcccCccceehhhhccccc
Confidence 44555432 233344 8899866 6677899999898752 22 2488999999999754
No 83
>COG3864 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=79.75 E-value=2.3 Score=40.64 Aligned_cols=73 Identities=16% Similarity=0.165 Sum_probs=48.2
Q ss_pred HHHHHHHHHHhcCCCCC------eEEEECC--CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221 64 LLKTVNKISKNANISTP------EVAIYNG--EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p------~v~v~~~--~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~ 135 (436)
+...+.++.+...+... .+.++.. .+|+-.+-.+...-.++++.-.+..++.+++.|+|-||+-|+...|+.
T Consensus 7 ~~k~flr~~~es~f~l~~~~~ffa~~Li~~k~i~s~~~~~a~t~~~~~y~NPei~~~~p~~~~~aLl~HEV~Hi~l~Hi~ 86 (396)
T COG3864 7 FRKKFLRFIEESNFDLMNEDNFFALFLIQIKDIASACETKASTSYFTMYFNPEIFLNCPISEMKALLKHEVYHIMLNHIK 86 (396)
T ss_pred HHHHHHHHhccCCceeeccchhhhhhhhhhccchhHHHhhccCCceEEEeCHHHHccCCHHHHHHHHHHHHHHHHHHHHH
Confidence 55666666665554321 1122333 334433322233556888888999999999999999999999999985
Q ss_pred H
Q psy5221 136 T 136 (436)
Q Consensus 136 ~ 136 (436)
+
T Consensus 87 r 87 (396)
T COG3864 87 R 87 (396)
T ss_pred h
Confidence 4
No 84
>COG3824 Predicted Zn-dependent protease [General function prediction only]
Probab=77.89 E-value=2.1 Score=34.90 Aligned_cols=36 Identities=19% Similarity=0.350 Sum_probs=25.0
Q ss_pred CCCCCccEEEEcHHHHhhcCH------HHHHHHHHHHHHhhhc
Q psy5221 95 GAFKNSSLIAISANLLESMTK------LEIKAIIAHEISHISN 131 (436)
Q Consensus 95 G~~~~~~~v~it~gLl~~l~~------~El~aVLAHElgHi~~ 131 (436)
|-+| .++-+....+++.+.+ +++.-|+-||+||.-.
T Consensus 81 G~~P-~rItlYRrailDywae~eetlgd~vthvliHEIgHhFG 122 (136)
T COG3824 81 GELP-DRITLYRRALLDYWAENEETLGDQVTHVLIHEIGHHFG 122 (136)
T ss_pred CCCC-ceeeeeHHHHHHHHhhhhhhHhhHhhhhhhhhhhhhcC
Confidence 4345 4556667778888744 4567889999999754
No 85
>COG4324 Predicted aminopeptidase [General function prediction only]
Probab=76.00 E-value=2.7 Score=39.18 Aligned_cols=33 Identities=27% Similarity=0.396 Sum_probs=26.1
Q ss_pred EEcHHHHhhc---CHHHHHHHHHHHHHhhh---cChHHH
Q psy5221 104 AISANLLESM---TKLEIKAIIAHEISHIS---NGDMVT 136 (436)
Q Consensus 104 ~it~gLl~~l---~~~El~aVLAHElgHi~---~~h~~~ 136 (436)
.++++||+.| ++.+|+.+|-||++|.+ ++|+..
T Consensus 180 wFdDPlLstmlr~dd~~lA~LIFHELAHQk~Y~~~DtAF 218 (376)
T COG4324 180 WFDDPLLSTMLRQDDTYLASLIFHELAHQKIYVNNDTAF 218 (376)
T ss_pred ccccHHHHHHhcCChHHHHHHHHHHHhhheEeecCcchH
Confidence 3566777776 89999999999999976 466644
No 86
>cd04279 ZnMc_MMP_like_1 Zinc-dependent metalloprotease; MMP_like sub-family 1. A group of bacterial, archaeal, and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=75.66 E-value=1.7 Score=37.54 Aligned_cols=17 Identities=29% Similarity=0.446 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy5221 115 KLEIKAIIAHEISHISN 131 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~ 131 (436)
..++.+|+.|||||...
T Consensus 101 ~~~~~~~~~HEiGHaLG 117 (156)
T cd04279 101 AENLQAIALHELGHALG 117 (156)
T ss_pred chHHHHHHHHHhhhhhc
Confidence 56899999999999876
No 87
>PF01421 Reprolysin: Reprolysin (M12B) family zinc metalloprotease This Prosite motif covers only the active site.; InterPro: IPR001590 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12B (adamalysin family, clan (MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The adamalysins are zinc dependent endopeptidases found in snake venom. There are some mammalian proteins such as P78325 from SWISSPROT, and fertilin Q28472 from SWISSPROT. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes. CD156 (also called ADAM8 (3.4.24 from EC) or MS2 human) has been implicated in extravasation of leukocytes. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2E3X_A 2W15_A 2W14_A 2W13_A 2W12_A 1ND1_A 3K7L_A 2DW2_A 2DW0_B 2DW1_A ....
Probab=75.30 E-value=2.1 Score=38.74 Aligned_cols=40 Identities=30% Similarity=0.310 Sum_probs=23.2
Q ss_pred cEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 90 NAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 90 NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
-|+..|.+..+.-+.+.. ...-+...++.++|||+||.-.
T Consensus 105 ~a~~~~~C~~~~s~~i~~--~~~~~~~~~a~~~AHelGH~lG 144 (199)
T PF01421_consen 105 LAYVGGMCSPSRSCGIVE--DHSRSGLSFAVIIAHELGHNLG 144 (199)
T ss_dssp EE-TT-TTSTTTSEEEEE---SSSSHHHHHHHHHHHHHHHTT
T ss_pred eEeCCCCCCcCCCCcEee--eccchhHHHHHHHHHHHHHhcC
Confidence 355555554433343333 1124577889999999999754
No 88
>cd04269 ZnMc_adamalysin_II_like Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=74.64 E-value=2.6 Score=37.88 Aligned_cols=17 Identities=29% Similarity=0.362 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy5221 115 KLEIKAIIAHEISHISN 131 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~ 131 (436)
....+.++|||+||.-.
T Consensus 128 ~~~~a~~~AHElGH~lG 144 (194)
T cd04269 128 LLLFAVTMAHELGHNLG 144 (194)
T ss_pred hHHHHHHHHHHHHhhcC
Confidence 45778999999999874
No 89
>PF14891 Peptidase_M91: Effector protein
Probab=74.58 E-value=8 Score=34.22 Aligned_cols=67 Identities=13% Similarity=0.198 Sum_probs=35.3
Q ss_pred HHHHHHHHHhcCCCCCeEEEECC-CccE-EEeCCC-----CCccEEEEcHHHHh------------hcCHHHHHHHHHHH
Q psy5221 65 LKTVNKISKNANISTPEVAIYNG-EINA-FATGAF-----KNSSLIAISANLLE------------SMTKLEIKAIIAHE 125 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p~v~v~~~-~~NA-~~~G~~-----~~~~~v~it~gLl~------------~l~~~El~aVLAHE 125 (436)
+++|+.|.+.......+|+|... ..|. ...+.. .....|.++..... .-....=..+|+||
T Consensus 31 ~~mL~~l~~~~~~~~~~vtI~~~~~~~~~~~~~~~~~~~~G~~~~I~~np~~~~~~~~~~~~~~~~~~~~~~p~v~L~HE 110 (174)
T PF14891_consen 31 QQMLRELEKRSDSGEKKVTISEIDSDNGPYSPGSNSASQLGSDATINWNPSKYSIGDGTEGSPINGGWSPRPPFVVLYHE 110 (174)
T ss_pred HHHHHHHHhhccCCCCCeEEecCCCCCCccccCccccccCCCceEEEECCCcccccCCCCcccccccchhHHHHHHHHHH
Confidence 56777776655544556666655 2121 111100 12344555553000 01244567999999
Q ss_pred HHhhhc
Q psy5221 126 ISHISN 131 (436)
Q Consensus 126 lgHi~~ 131 (436)
|+|..+
T Consensus 111 L~HA~~ 116 (174)
T PF14891_consen 111 LIHAYD 116 (174)
T ss_pred HHHHHH
Confidence 999876
No 90
>PF08325 WLM: WLM domain; InterPro: IPR013536 The WLM (WSS1-like metalloprotease) domain is a globular domain related to the zincin-like superfamily of Zn-dependent peptidase. Since the WLM domain contains all known active site residues of zincins, it is predicted to be a catalytically active peptidase domain. The WLM domain is a eukaryotic domain represented in plants, fungi, Plasmodium, and kinetoplastids. By contrast, it is absent in animals, Cryptosporidium, and Microsporidia, suggesting that it has been lost on multiple occasions during the evolution of eukaryotes. The WLM domain is found either in stand-alone form or in association with other domains such as the RanBP2 zinc finger , the ubiquitin domain, or the PUB/PUG domain. This domain could function as a specific de-SUMOylating domain of distinct protein complexes in the nucleus and the cytoplasm []. It has been suggested to form a segregated alpha/beta structure with eight helices and five strands. Proteins containign this domain include yeast WSS1 (a weak suppressor of the Ub-related protein SMT3), and various putative metalloproteases from plant and fungal species.
Probab=74.47 E-value=7.8 Score=34.75 Aligned_cols=23 Identities=22% Similarity=0.286 Sum_probs=19.6
Q ss_pred cCHHHHHHHHHHHHHhhhcChHH
Q psy5221 113 MTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 113 l~~~El~aVLAHElgHi~~~h~~ 135 (436)
++-+++..|+-|||+|+.++..-
T Consensus 77 l~~~~i~~t~lHELaH~~~~~H~ 99 (186)
T PF08325_consen 77 LPYETILGTMLHELAHNVHGPHD 99 (186)
T ss_pred eeHHHHHHHHHHHHHhcccCCcc
Confidence 37799999999999999986543
No 91
>PF00413 Peptidase_M10: Matrixin This Prosite motif covers only the active site.; InterPro: IPR001818 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M10 (clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Sequences having this domain are extracellular metalloproteases, such as collagenase and stromelysin, which degrade the extracellular matrix, are known as matrixins. They are zinc-dependent, calcium-activated proteases synthesised as inactive precursors (zymogens), which are proteolytically cleaved to yield the active enzyme [, ]. All matrixins and related proteins possess 2 domains: an N-terminal domain, and a zinc-binding active site domain. The N-terminal domain peptide, cleaved during the activation step, includes a conserved PRCGVPDV octapeptide, known as the cysteine switch, whose Cys residue chelates the active site zinc atom, rendering the enzyme inactive [, ]. The active enzyme degrades components of the extracellular matrix, playing a role in the initial steps of tissue remodelling during morphogenesis, wound healing, angiogenesis and tumour invasion [, ].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0031012 extracellular matrix; PDB: 1Q3A_C 3V96_B 1HV5_D 1CXV_A 1SRP_A 1FBL_A 1ZVX_A 1JH1_A 1I76_A 2OY4_A ....
Probab=74.28 E-value=2 Score=36.81 Aligned_cols=18 Identities=22% Similarity=0.455 Sum_probs=15.0
Q ss_pred CHHHHHHHHHHHHHhhhc
Q psy5221 114 TKLEIKAIIAHEISHISN 131 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~ 131 (436)
+..++..|+.|||||...
T Consensus 101 ~~~~~~~v~~HEiGHaLG 118 (154)
T PF00413_consen 101 SGNDLQSVAIHEIGHALG 118 (154)
T ss_dssp SSEEHHHHHHHHHHHHTT
T ss_pred hhhhhhhhhhhccccccC
Confidence 445799999999999854
No 92
>cd04268 ZnMc_MMP_like Zinc-dependent metalloprotease, MMP_like subfamily. This group contains matrix metalloproteinases (MMPs), serralysins, and the astacin_like family of proteases.
Probab=73.19 E-value=2.1 Score=37.20 Aligned_cols=18 Identities=17% Similarity=0.366 Sum_probs=15.2
Q ss_pred CHHHHHHHHHHHHHhhhc
Q psy5221 114 TKLEIKAIIAHEISHISN 131 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~ 131 (436)
+.+.+.++++|||||...
T Consensus 90 ~~~~~~~~~~HEiGHaLG 107 (165)
T cd04268 90 SGARLRNTAEHELGHALG 107 (165)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 446799999999999765
No 93
>cd04270 ZnMc_TACE_like Zinc-dependent metalloprotease; TACE_like subfamily. TACE, the tumor-necrosis factor-alpha converting enzyme, releases soluble TNF-alpha from transmembrane pro-TNF-alpha.
Probab=72.87 E-value=2.3 Score=39.90 Aligned_cols=18 Identities=28% Similarity=0.329 Sum_probs=14.9
Q ss_pred CHHHHHHHHHHHHHhhhc
Q psy5221 114 TKLEIKAIIAHEISHISN 131 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~ 131 (436)
.....+.++||||||--.
T Consensus 163 ~~~~~a~t~AHElGHnlG 180 (244)
T cd04270 163 PTKESDLVTAHELGHNFG 180 (244)
T ss_pred chhHHHHHHHHHHHHhcC
Confidence 345688999999999876
No 94
>COG1164 Oligoendopeptidase F [Amino acid transport and metabolism]
Probab=70.89 E-value=5.9 Score=42.32 Aligned_cols=50 Identities=20% Similarity=0.304 Sum_probs=34.2
Q ss_pred CeEEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221 80 PEVAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 80 p~v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~ 134 (436)
.+++--++ .+-||+.|..+ ....|+.+- ...+ .=..+|+||+||..|.-.
T Consensus 345 iD~~~~~gKrsGaYs~~~~~~~~p~IlmN~--~gt~---~dV~TLaHElGHs~Hs~~ 396 (598)
T COG1164 345 IDVYPRKGKRSGAYSIGFYKGDHPFILMNY--DGTL---RDVFTLAHELGHSVHSYF 396 (598)
T ss_pred eeccCCCCCCCCcccCCCCCCCCCeEEEeC--CCch---hHHHHHHHHccHHHHHHH
Confidence 34444445 89999999877 677776653 2222 225789999999988643
No 95
>PF13485 Peptidase_MA_2: Peptidase MA superfamily
Probab=70.19 E-value=3.1 Score=33.89 Aligned_cols=23 Identities=22% Similarity=0.331 Sum_probs=18.9
Q ss_pred hcCHHHHHHHHHHHHHhhhcChH
Q psy5221 112 SMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 112 ~l~~~El~aVLAHElgHi~~~h~ 134 (436)
..+.+.+..||+||++|....+.
T Consensus 19 ~~~~~~~~~~l~HE~~H~~~~~~ 41 (128)
T PF13485_consen 19 GSDEDWLDRVLAHELAHQWFGNY 41 (128)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHH
Confidence 35778889999999999987654
No 96
>PF09768 Peptidase_M76: Peptidase M76 family; InterPro: IPR019165 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. Mitochondrial inner membrane protease ATP23 has two roles in the assembly of mitochondrial ATPase. Firstly, it acts as a protease that removes the N-terminal 10 residues of mitochondrial ATPase CF(0) subunit 6 (ATP6) at the intermembrane space side. Secondly, it is involved in the correct assembly of the membrane-embedded ATPase CF(0) particle, probably mediating association of ATP6 with the subunit 9 ring [, ].; GO: 0004222 metalloendopeptidase activity
Probab=69.19 E-value=15 Score=32.44 Aligned_cols=63 Identities=8% Similarity=0.138 Sum_probs=37.3
Q ss_pred HHHHHHHHHHhcCCC--CCeEEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 64 LLKTVNKISKNANIS--TPEVAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 64 L~~~l~~l~~~~g~~--~p~v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
++-+++.+. ++|-+ .++...... ..+ ...|+.| .+++++...- .-+..++.-+|+|||-|..-
T Consensus 18 v~fl~~~~~-~~gc~~~~~~~i~c~~C~~~-~~Ggf~p~~~~I~lC~N~---~~~~~~l~~~l~HELIHayD 84 (173)
T PF09768_consen 18 VRFLMEALK-KLGCPPVPPRHIKCEPCDSS-VSGGFDPSKKGIVLCQNR---IRSQGHLEDTLTHELIHAYD 84 (173)
T ss_pred HHHHHHHHH-HcCCCCCCCCCeEEEECcCC-CcCCccCCCCCEEEeeCC---CCCHHHHHHHHHHHHHHHHH
Confidence 444444444 45543 233333333 333 4456666 5666665541 24999999999999999764
No 97
>PF13688 Reprolysin_5: Metallo-peptidase family M12; PDB: 2FV5_B 3EWJ_A 3KME_A 3L0T_B 1BKC_E 3G42_D 2I47_D 2FV9_B 3LEA_A 1ZXC_B ....
Probab=68.61 E-value=2.5 Score=37.98 Aligned_cols=18 Identities=28% Similarity=0.255 Sum_probs=14.7
Q ss_pred CHHHHHHHHHHHHHhhhc
Q psy5221 114 TKLEIKAIIAHEISHISN 131 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~ 131 (436)
+...-.-++|||+||.-.
T Consensus 138 ~~~~~~~~~AHEiGH~lG 155 (196)
T PF13688_consen 138 PTYNGAITFAHEIGHNLG 155 (196)
T ss_dssp -HHHHHHHHHHHHHHHTT
T ss_pred CCCceehhhHHhHHHhcC
Confidence 467888999999999765
No 98
>PF14247 DUF4344: Domain of unknown function (DUF4344)
Probab=67.64 E-value=19 Score=33.10 Aligned_cols=23 Identities=17% Similarity=0.051 Sum_probs=18.1
Q ss_pred hhhccCCChHHHHHHHHhccccc
Q psy5221 261 KLFATHPSFEKRIAILSLLKLSL 283 (436)
Q Consensus 261 ~~~~tHP~~~~RI~~L~~~~~~~ 283 (436)
.+++.|..-.+|-..+..+.-..
T Consensus 160 ~~~deHsld~QR~yn~lCl~yGs 182 (220)
T PF14247_consen 160 PFWDEHSLDLQRYYNLLCLMYGS 182 (220)
T ss_pred cccccCCchHHHHHhhHHhhccC
Confidence 47889999999999988765443
No 99
>PRK06926 flagellar motor protein MotP; Reviewed
Probab=67.22 E-value=1.1e+02 Score=29.26 Aligned_cols=30 Identities=13% Similarity=0.158 Sum_probs=24.6
Q ss_pred HHhhcCHHHHHHHHHHHHHhhhcChHHHHH
Q psy5221 109 LLESMTKLEIKAIIAHEISHISNGDMVTML 138 (436)
Q Consensus 109 Ll~~l~~~El~aVLAHElgHi~~~h~~~~~ 138 (436)
+.+..++++++.+|-+|+-....+|....-
T Consensus 120 vvDG~~~~~i~~iLe~ei~~~~~r~~~~~~ 149 (271)
T PRK06926 120 AIDGWEPETIRDIMMAEIAAMEERHRKGRR 149 (271)
T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 344579999999999999999999975433
No 100
>PRK00446 cyaY frataxin-like protein; Provisional
Probab=66.98 E-value=7.1 Score=31.49 Aligned_cols=31 Identities=19% Similarity=0.312 Sum_probs=22.7
Q ss_pred EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221 398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI 435 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~ 435 (436)
+.+.+.+|||+|+++. ..+|-|+..++.+||
T Consensus 29 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI 59 (105)
T PRK00446 29 IDCERNGGVLTLTFEN-------GSKIIINRQEPLHEL 59 (105)
T ss_pred eeeeccCCEEEEEECC-------CCEEEEeCCCchhhe
Confidence 7788899999999986 255666666665553
No 101
>PF13582 Reprolysin_3: Metallo-peptidase family M12B Reprolysin-like; PDB: 3P24_C.
Probab=66.66 E-value=3 Score=34.32 Aligned_cols=13 Identities=38% Similarity=0.419 Sum_probs=10.8
Q ss_pred HHHHHHHHHhhhc
Q psy5221 119 KAIIAHEISHISN 131 (436)
Q Consensus 119 ~aVLAHElgHi~~ 131 (436)
..+++||+||.-.
T Consensus 108 ~~~~~HEiGH~lG 120 (124)
T PF13582_consen 108 VDTFAHEIGHNLG 120 (124)
T ss_dssp TTHHHHHHHHHTT
T ss_pred ceEeeehhhHhcC
Confidence 3899999999753
No 102
>TIGR00181 pepF oligoendopeptidase F. This family represents the oligoendopeptidase F clade of the family of larger M3 or thimet (for thiol-dependent metallopeptidase) oligopeptidase family. Lactococcus lactis PepF hydrolyzed peptides of 7 and 17 amino acids with fairly broad specificity. The homolog of lactococcal PepF in group B Streptococcus was named PepB (PubMed:8757883), with the name difference reflecting a difference in species of origin rather activity; substrate profiles were quite similar. Differences in substrate specificity should be expected in other species. The gene is duplicated in Lactococcus lactis on the plasmid that bears it. A shortened second copy is found in Bacillus subtilis.
Probab=65.88 E-value=4.4 Score=43.28 Aligned_cols=42 Identities=21% Similarity=0.311 Sum_probs=29.0
Q ss_pred CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221 88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~ 134 (436)
...||++|..+..++|+.+- -..-.=...||||+||-.|.-.
T Consensus 353 ~~Ga~~~~~~~~~p~il~N~-----~~~~~dv~TLaHElGHa~H~~~ 394 (591)
T TIGR00181 353 RSGAYSIGGYKVKPYILMNW-----DGTLNSVFTLAHELGHSMHSYF 394 (591)
T ss_pred CCCcccCCCCCCCCeEEEec-----CCCcchHHHHHHHhhhHHHHHH
Confidence 77899999877677775542 1112235689999999976543
No 103
>PRK00523 hypothetical protein; Provisional
Probab=65.24 E-value=25 Score=26.03 Aligned_cols=45 Identities=20% Similarity=0.201 Sum_probs=26.0
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221 26 IIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST 79 (436)
Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~ 79 (436)
++..+++.+..+|++.+.....++..| |.++ +.++.+....|-++
T Consensus 12 i~~li~G~~~Gffiark~~~k~l~~NP---pine------~mir~M~~QMGqKP 56 (72)
T PRK00523 12 IPLLIVGGIIGYFVSKKMFKKQIRENP---PITE------NMIRAMYMQMGRKP 56 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCc---CCCH------HHHHHHHHHhCCCc
Confidence 333444556667777777776666543 3332 46666666666653
No 104
>PRK13267 archaemetzincin-like protein; Reviewed
Probab=64.93 E-value=15 Score=32.76 Aligned_cols=67 Identities=16% Similarity=0.128 Sum_probs=32.7
Q ss_pred HHHHHHHHHhcCCCC-CeEEEECC----CccEEEeCCCCC-ccEEEEcHHHHhhc----------CHHHHHHHHHHHHHh
Q psy5221 65 LKTVNKISKNANIST-PEVAIYNG----EINAFATGAFKN-SSLIAISANLLESM----------TKLEIKAIIAHEISH 128 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~-p~v~v~~~----~~NA~~~G~~~~-~~~v~it~gLl~~l----------~~~El~aVLAHElgH 128 (436)
..+++.+......+. .-+.|.+. ....|+.|.... .+.-+++..-++.- -.+-+.-...||+||
T Consensus 56 ~~iL~~l~~~~~~~~~~vl~vt~~DLy~~~~nfvfG~a~~~~~~aVvS~~Rl~~~~~~~~~~~~l~~~R~~k~~~HElGH 135 (179)
T PRK13267 56 EKFLPLLSRIGRFNGDKNIGITDCDLYYRGLNFVFGLAYPNLRGAVISTYRLRPEFYGNKPDSELFEERVRKEVTHELGH 135 (179)
T ss_pred HHHHHHHHhhCCCCCceEEEEEccccCCCCCCceeEEeecCCCEEEEEccccChhhccccchHHHHHHHHHHHHHHHHHH
Confidence 345555544433232 23444444 344578886432 22233333222211 123345558999999
Q ss_pred hhc
Q psy5221 129 ISN 131 (436)
Q Consensus 129 i~~ 131 (436)
.-.
T Consensus 136 ~lG 138 (179)
T PRK13267 136 TLG 138 (179)
T ss_pred HcC
Confidence 854
No 105
>PF13574 Reprolysin_2: Metallo-peptidase family M12B Reprolysin-like; PDB: 1KAP_P 1JIW_P 1AKL_A 1OM7_A 1OM8_A 1O0T_A 1OM6_A 1H71_P 1O0Q_A 1OMJ_A ....
Probab=64.14 E-value=3.6 Score=36.34 Aligned_cols=14 Identities=29% Similarity=0.266 Sum_probs=11.2
Q ss_pred HHHHHHHHHHhhhc
Q psy5221 118 IKAIIAHEISHISN 131 (436)
Q Consensus 118 l~aVLAHElgHi~~ 131 (436)
-.-++|||+||.-.
T Consensus 111 ~~~~~aHElGH~lG 124 (173)
T PF13574_consen 111 GIDTFAHELGHQLG 124 (173)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred eeeeehhhhHhhcC
Confidence 34569999999876
No 106
>TIGR03421 FeS_CyaY iron donor protein CyaY. Members of this protein family are the iron-sulfur cluster (FeS) metabolism protein CyaY, a homolog of eukaryotic frataxin. ISC is one of several bacterial systems for FeS assembly; we find by Partial Phylogenetic Profiling vs. the ISC system that CyaY most like work with the ISC system for FeS cluster biosynthesis. A study of of cyaY mutants in Salmonella enterica bears this out. Although the trusted cutoff is set low enough to include eukaryotic frataxin sequences, a narrower, exception-type model (TIGR03421) identifies identifies members of that specific set.
Probab=64.14 E-value=7.9 Score=31.03 Aligned_cols=31 Identities=19% Similarity=0.295 Sum_probs=22.6
Q ss_pred EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221 398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI 435 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~ 435 (436)
+.+.+.+|||+|+++. ..+|-|+..++.+||
T Consensus 27 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI 57 (102)
T TIGR03421 27 IDCERAGGVLTLTFEN-------GSQIIINKQEPLHQI 57 (102)
T ss_pred eeeecCCCEEEEEECC-------CCEEEEeCCchhhhh
Confidence 7788889999999985 356666666655543
No 107
>PF01447 Peptidase_M4: Thermolysin metallopeptidase, catalytic domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification. ; InterPro: IPR013856 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases that belong to the MEROPS peptidase family M4 (thermolysin family, clan MA(E)). The protein fold of the peptidase domain of thermolysin, is the type example for members of the clan MA. The thermolysin family is composed only of secreted eubacterial endopeptidases. The zinc-binding residues are H-142, H-146 and E-166, with E-143 acting as the catalytic residue. Thermolysin also contains 4 calcium-binding sites, which contribute to its unusual thermostability. The family also includes enzymes from a number of pathogens, including Legionella and Listeria, and the protein pseudolysin, all with a substrate specificity for an aromatic residue in the P1' position. Three-dimensional structure analysis has shown that the enzymes undergo a hinge-bend motion during catalysis. Pseudolysin has a broader specificity, acting on large molecules such as elastin and collagen, possibly due to its wider active site cleft []. This entry represents a domain found in peptidase M4 family members.; GO: 0004222 metalloendopeptidase activity; PDB: 3NQX_A 3NQZ_B 3NQY_B 1BQB_A 1U4G_A 1EZM_A 3DBK_A 1ESP_A 1NPC_A 1LND_E ....
Probab=62.11 E-value=18 Score=31.17 Aligned_cols=62 Identities=18% Similarity=0.213 Sum_probs=28.9
Q ss_pred HHHHHHHHHHhcCCC---CCeEEEE--CC-CccEEEeCCCCCccEEEEcHHHHhhcCHHH-HHHHHHHHHHhhh
Q psy5221 64 LLKTVNKISKNANIS---TPEVAIY--NG-EINAFATGAFKNSSLIAISANLLESMTKLE-IKAIIAHEISHIS 130 (436)
Q Consensus 64 L~~~l~~l~~~~g~~---~p~v~v~--~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~E-l~aVLAHElgHi~ 130 (436)
.++...+.-.+-++. .+-+-.+ .. ..|||--| .. +++..|--..+.+=. -.=|+||||+|-.
T Consensus 79 v~d~y~~~~gr~siD~~G~~~~~~Vhyg~~~~NAfW~g----~~-m~yGdG~~~~f~~~~~~lDVvaHEltHGV 147 (150)
T PF01447_consen 79 VYDYYKNVFGRNSIDGNGMPIISRVHYGKNYNNAFWNG----SQ-MVYGDGDGQIFKPFASSLDVVAHELTHGV 147 (150)
T ss_dssp HHHHHHHHHSS-STTSS-S-EEEEESESSSTT-EEE-S----SS-EEEE---SSSBS-GGG-HHHHHHHHHHHH
T ss_pred HHHHHHHHHCCCCcCCCCcEEEEEEeCCCCccCccccC----CE-EEEECCCCcccccCccccceeeecccccc
Confidence 344455555555554 2222223 22 78999876 22 666765443332211 1349999999953
No 108
>cd04278 ZnMc_MMP Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Probab=61.75 E-value=4 Score=35.29 Aligned_cols=17 Identities=29% Similarity=0.454 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy5221 115 KLEIKAIIAHEISHISN 131 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~ 131 (436)
...+..|+.|||||...
T Consensus 104 ~~~~~~~~~HEiGHaLG 120 (157)
T cd04278 104 GTDLFSVAAHEIGHALG 120 (157)
T ss_pred cchHHHHHHHHhccccc
Confidence 35699999999999865
No 109
>PF05572 Peptidase_M43: Pregnancy-associated plasma protein-A; InterPro: IPR008754 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase M43 (cytophagalysin family, clan MA(M)), subfamily M43. The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. The type example of this family is the pregnancy-associated plasma protein A (PAPP-A), which cleaves insulin-like growth factor (IGF) binding protein-4 (IGFBP-4), causing a dramatic reduction in its affinity for IGF-I and -II. Through this mechanism, PAPP-A is a regulator of IGF bioactivity in several systems, including the Homo sapiens ovary and the cardiovascular system [, , , ].; PDB: 3LUN_A 3LUM_B 2J83_A 2CKI_A.
Probab=61.73 E-value=4.3 Score=35.17 Aligned_cols=21 Identities=14% Similarity=0.281 Sum_probs=14.5
Q ss_pred CHHHHHHHHHHHHHhhhc-ChH
Q psy5221 114 TKLEIKAIIAHEISHISN-GDM 134 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~-~h~ 134 (436)
+.....-+|+||+||+-. .|+
T Consensus 65 ~~~~~g~TltHEvGH~LGL~Ht 86 (154)
T PF05572_consen 65 SQYNFGKTLTHEVGHWLGLYHT 86 (154)
T ss_dssp TTS-SSHHHHHHHHHHTT---T
T ss_pred Cccccccchhhhhhhhhccccc
Confidence 445567899999999976 554
No 110
>cd00503 Frataxin Frataxin is a nuclear-encoded mitochondrial protein implicated in Friedreich's ataxia (FRDA), an human autosomal recessive neurodegenerative disease; Frataxin is found in eukaryotes and in purple bacteria; lack of frataxin causes iron to accumulate in the mitochondrial matrix suggesting that frataxin is involved in mitochondrial iron homeostasis and possibly in iron transport; the domain has an alpha-beta fold consisting of two helices flanking an antiparallel beta sheet.
Probab=61.41 E-value=9.4 Score=30.76 Aligned_cols=31 Identities=19% Similarity=0.265 Sum_probs=23.1
Q ss_pred EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221 398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI 435 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~ 435 (436)
+.+.+.+|||+|+++. ..+|-|+..++.+||
T Consensus 30 ~D~e~~~gVLti~f~~-------~~~~VINkQ~p~~QI 60 (105)
T cd00503 30 IDVETQGGVLTLTFGN-------GSTIVINRQEPLRQI 60 (105)
T ss_pred EeeeccCCEEEEEECC-------CCEEEEeCCchhhhh
Confidence 7778889999999984 356667766665554
No 111
>PF01433 Peptidase_M1: Peptidase family M1 This is family M1 in the peptidase classification.; InterPro: IPR014782 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M1 (clan MA(E)), the type example being aminopeptidase N from Homo sapiens (Human). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Membrane alanine aminopeptidase (3.4.11.2 from EC) is part of the HEXXH+E group; it consists entirely of aminopeptidases, spread across a wide variety of species []. Functional studies show that CD13/APN catalyzes the removal of single amino acids from the amino terminus of small peptides and probably plays a role in their final digestion; one family member (leukotriene-A4 hydrolase) is known to hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator []. This hydrolase has been shown to have aminopeptidase activity [], and the zinc ligands of the M1 family were identified by site-directed mutagenesis on this enzyme [] CD13 participates in trimming peptides bound to MHC class II molecules [] and cleaves MIP-1 chemokine, which alters target cell specificity from basophils to eosinophils []. CD13 acts as a receptor for specific strains of RNA viruses (coronaviruses) which cause a relatively large percentage of upper respiratory trace infections. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding; PDB: 2XQ0_A 2XPY_A 2XPZ_A 3SE6_B 3EBH_A 3EBG_A 3T8V_A 3Q44_A 3Q43_A 3EBI_A ....
Probab=59.67 E-value=34 Score=34.13 Aligned_cols=66 Identities=23% Similarity=0.277 Sum_probs=37.3
Q ss_pred HHHHHHHHHHhcCCCC--C--eEEEECC-CccEEEeCCCCCccEEEEcH-HHHhh------cCHHHHHHHHHHHHHhhhc
Q psy5221 64 LLKTVNKISKNANIST--P--EVAIYNG-EINAFATGAFKNSSLIAISA-NLLES------MTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~--p--~v~v~~~-~~NA~~~G~~~~~~~v~it~-gLl~~------l~~~El~aVLAHElgHi~~ 131 (436)
..+.+.-+.+..|.+. + ++..+++ ..++.-. -+.|.+.. .++.. -+..++..++|||++|..-
T Consensus 234 ~~~~l~~~~~~~g~~yp~~k~~~v~~p~~~~~~me~-----~g~i~~~~~~l~~~~~~~~~~~~~~~~~~iahElahqWf 308 (390)
T PF01433_consen 234 APKALEYYEEYFGIPYPFKKLDIVAVPDFPFGGMEN-----WGLITYRESYLLYDPDISTIGDKQEIASLIAHELAHQWF 308 (390)
T ss_dssp HHHHHHHHHHHHTS--SSSEEEEEEEST-SSSEE-------TTEEEEEGGGTS-STTTS-HHHHHHHHHHHHHHHHTTTB
T ss_pred hHHHHHHHHhhccccceecceeEEEEeccccccccc-----cccccccccccccCcccccchhhhhhHHHHHHHHHHHHh
Confidence 4456666777788753 3 3455555 3344322 22344443 23321 1245789999999999999
Q ss_pred ChH
Q psy5221 132 GDM 134 (436)
Q Consensus 132 ~h~ 134 (436)
|+.
T Consensus 309 Gn~ 311 (390)
T PF01433_consen 309 GNL 311 (390)
T ss_dssp TTT
T ss_pred ccC
Confidence 885
No 112
>PF10023 DUF2265: Predicted aminopeptidase (DUF2265); InterPro: IPR014553 This group represents a predicted aminopeptidase.
Probab=59.58 E-value=4.9 Score=39.26 Aligned_cols=32 Identities=31% Similarity=0.477 Sum_probs=24.3
Q ss_pred cHHHHhhc---CHHHHHHHHHHHHHhhh---cChHHHH
Q psy5221 106 SANLLESM---TKLEIKAIIAHEISHIS---NGDMVTM 137 (436)
Q Consensus 106 t~gLl~~l---~~~El~aVLAHElgHi~---~~h~~~~ 137 (436)
+++++..+ ++.+|+.+|=||++|-. .+|+..+
T Consensus 150 ~DPlLSt~l~~~~~~LA~LIfHELaHq~~Yv~~dt~FN 187 (337)
T PF10023_consen 150 DDPLLSTMLRYPDGELARLIFHELAHQTLYVKGDTAFN 187 (337)
T ss_pred CCcccccccCCCchHHHHHHHHHHhhceeecCCCchhh
Confidence 44566655 89999999999999954 5776543
No 113
>cd04267 ZnMc_ADAM_like Zinc-dependent metalloprotease, ADAM_like or reprolysin_like subgroup. The adamalysin_like or ADAM family of metalloproteases contains proteolytic domains from snake venoms, proteases from the mammalian reproductive tract, and the tumor necrosis factor alpha convertase, TACE. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
Probab=59.17 E-value=3.8 Score=36.71 Aligned_cols=17 Identities=29% Similarity=0.348 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy5221 115 KLEIKAIIAHEISHISN 131 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~ 131 (436)
......++|||+||.-.
T Consensus 130 ~~~~~~~~aHElGH~lG 146 (192)
T cd04267 130 TLLTALTMAHELGHNLG 146 (192)
T ss_pred ceeehhhhhhhHHhhcC
Confidence 45678899999999875
No 114
>cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o
Probab=58.36 E-value=82 Score=31.84 Aligned_cols=66 Identities=9% Similarity=-0.008 Sum_probs=40.4
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
=.+...++++.+|....+-+ +|..++.|+.|+++... .+++- .+.-+--.=..-++||+||-.+-.
T Consensus 109 Q~~~~~~~~~~~g~df~~gr-iD~s~hpF~~~~~~~dv--RItt~-y~~~d~~~~l~t~iHE~GHalye~ 174 (396)
T cd06460 109 QEALGRELLEALGFDFDRGR-LDVSAHPFTGGLGPGDV--RITTR-YDENDFRSALFSTIHETGHALYEQ 174 (396)
T ss_pred HHHHHHHHHHHhCCcccCCe-eecCCCCCCCCCCCCCc--eEEee-eCCcchHHHHHHHHHHhhHHHHHh
Confidence 34556788999999865433 44455679998866543 33330 111112233446899999988866
No 115
>PRK05518 rpl6p 50S ribosomal protein L6P; Reviewed
Probab=58.27 E-value=38 Score=30.19 Aligned_cols=45 Identities=18% Similarity=0.252 Sum_probs=32.6
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
|++++|+++++.++|+|.+ |+.++.|.-| .++...+||-+.|+..
T Consensus 13 P~~V~v~i~~~~v~VkGp~----------------G~L~~~~~~~----~v~i~~~~~~i~v~~~ 57 (180)
T PRK05518 13 PEGVTVEIEGLVVTVKGPK----------------GELTRDFWYP----GVTISVEDGKVVIETE 57 (180)
T ss_pred CCCCEEEEECCEEEEECCC----------------eEEEEEecCC----cEEEEEECCEEEEEEC
Confidence 6899999999999999885 5566555432 2444568887777754
No 116
>TIGR03422 mito_frataxin frataxin. Frataxin is a mitochondrial protein, mutation of which leads to the disease Friedreich's ataxia. Its orthologs are widely distributed in the bacteria, associated with the ISC system for iron-sulfur cluster assembly, and designated CyaY. This exception-type model allows those examples of frataxin per se that score above the trusted cutoff to the CyaY equivalog-type model (TIGR03421) to be named appropriately.
Probab=56.43 E-value=9.4 Score=30.29 Aligned_cols=29 Identities=17% Similarity=0.312 Sum_probs=20.4
Q ss_pred eEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCC
Q psy5221 399 AAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDR 434 (436)
Q Consensus 399 ~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~ 434 (436)
.+.+.+|||+|+++. ..++-|+..++.+|
T Consensus 30 D~e~~~gVLti~~~~-------~~~~VINkQ~p~~Q 58 (97)
T TIGR03422 30 DVEYSSGVLTLELPS-------VGTYVINKQPPNKQ 58 (97)
T ss_pred ccccCCCEEEEEECC-------CCEEEEeCCChhhH
Confidence 777889999999975 24555655555444
No 117
>TIGR02414 pepN_proteo aminopeptidase N, Escherichia coli type. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, EC 3.4.11.2, after the Escherichia coli enzyme, suggesting a similar activity profile. This family consists of all aminopeptidases closely related to E. coli PepN and presumed to have similar (not identical) function. Nearly all are found in Proteobacteria, but members are found also in Cyanobacteria, plants, and apicomplexan parasites. This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (TIGR02412), from the membrane bound aminopeptidase N family in animals, etc.
Probab=55.98 E-value=9.5 Score=42.60 Aligned_cols=20 Identities=40% Similarity=0.507 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHhhhcChHH
Q psy5221 116 LEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 116 ~El~aVLAHElgHi~~~h~~ 135 (436)
..+..|+|||++|-..|+..
T Consensus 281 ~~i~~VIaHElaHqWfGNlV 300 (863)
T TIGR02414 281 ERIESVIAHEYFHNWTGNRV 300 (863)
T ss_pred HHHHHHHHHHHHHHHhccee
Confidence 34789999999999988854
No 118
>TIGR03653 arch_L6P archaeal ribosomal protein L6P. Members of this protein family are the archaeal ribosomal protein L6P. The top-scoring proteins not selected by this model are eukaryotic cytosolic ribosomal protein L9. Bacterial ribosomal protein L6 scores lower and is described by a distinct model.
Probab=55.52 E-value=48 Score=29.23 Aligned_cols=45 Identities=11% Similarity=0.198 Sum_probs=32.2
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
|++++|+++++.++|+|.+ |+.++.|. |.. ++...+++-|.|+.+
T Consensus 7 P~~V~v~i~~~~i~vkGp~----------------G~L~~~~~-~~~---v~i~~~~~~i~v~~~ 51 (170)
T TIGR03653 7 PEGVSVTIEGNIVTVKGPK----------------GEVTRELW-YPG---IEISVEDGKVVIETD 51 (170)
T ss_pred CCCCEEEEeCCEEEEECCC----------------eEEEEEEe-CCc---EEEEEeCCEEEEEeC
Confidence 6899999999999999885 55665553 333 344557887777754
No 119
>PF13583 Reprolysin_4: Metallo-peptidase family M12B Reprolysin-like
Probab=55.31 E-value=5.7 Score=36.21 Aligned_cols=12 Identities=42% Similarity=0.504 Sum_probs=10.1
Q ss_pred HHHHHHHHhhhc
Q psy5221 120 AIIAHEISHISN 131 (436)
Q Consensus 120 aVLAHElgHi~~ 131 (436)
-++|||+||.-.
T Consensus 139 ~~~aHEiGH~lG 150 (206)
T PF13583_consen 139 QTFAHEIGHNLG 150 (206)
T ss_pred hHHHHHHHHHhc
Confidence 459999999876
No 120
>cd04271 ZnMc_ADAM_fungal Zinc-dependent metalloprotease, ADAM_fungal subgroup. The adamalysin_like or ADAM (A Disintegrin And Metalloprotease) family of metalloproteases are integral membrane proteases acting on a variety of extracellular targets. They are involved in shedding soluble peptides or proteins from the cell surface. This subfamily contains fungal ADAMs, whose precise function has yet to be determined.
Probab=55.14 E-value=4.4 Score=37.62 Aligned_cols=12 Identities=42% Similarity=0.648 Sum_probs=10.5
Q ss_pred HHHHHHHHhhhc
Q psy5221 120 AIIAHEISHISN 131 (436)
Q Consensus 120 aVLAHElgHi~~ 131 (436)
.++||||||.-.
T Consensus 147 ~t~AHElGHnLG 158 (228)
T cd04271 147 QVFAHEIGHTFG 158 (228)
T ss_pred eehhhhhhhhcC
Confidence 599999999874
No 121
>PF01491 Frataxin_Cyay: Frataxin-like domain; InterPro: IPR002908 The eukaryotic proteins in this entry include frataxin, the protein that is mutated in Friedreich's ataxia [], and related sequences. Friedreich's ataxia is a progressive neurodegenerative disorder caused by loss of function mutations in the gene encoding frataxin (FRDA). Frataxin mRNA is predominantly expressed in tissues with a high metabolic rate (including liver, kidney, brown fat and heart). Mouse and yeast frataxin homologues contain a potential N-terminal mitochondrial targeting sequence, and human frataxin has been observed to co-localise with a mitochondrial protein. Furthermore, disruption of the yeast gene has been shown to result in mitochondrial dysfunction. Friedreich's ataxia is thus believed to be a mitochondrial disease caused by a mutation in the nuclear genome (specifically, expansion of an intronic GAA triplet repeat) [, , ]. The bacterial proteins in this entry are iron-sulphur cluster (FeS) metabolism CyaY proteins hmologous to eukaryotic frataxin. Partial Phylogenetic Profiling [] suggests that CyaY most likely functions as part of the ISC system for FeS cluster biosynthesis, and is supported by expermimental data in some species [, ]. ; PDB: 1EW4_A 2P1X_A 1SOY_A 2EFF_A 3T3T_B 3S4M_A 3T3K_A 3S5D_A 1LY7_A 3T3X_B ....
Probab=55.06 E-value=14 Score=29.89 Aligned_cols=31 Identities=16% Similarity=0.191 Sum_probs=22.2
Q ss_pred eEeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCC
Q psy5221 397 VVAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDR 434 (436)
Q Consensus 397 ~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~ 434 (436)
-+.+.+.+|||+|++|. ..++-|+..++.+|
T Consensus 31 d~d~e~~~gVLti~~~~-------~~~~VINkQ~p~~Q 61 (109)
T PF01491_consen 31 DIDVERSGGVLTIEFPD-------GGQYVINKQPPNRQ 61 (109)
T ss_dssp TEEEEEETTEEEEEETT-------SEEEEEEEECCCTE
T ss_pred ceEEEccCCEEEEEECC-------CCEEEEeCCCHHHH
Confidence 37899999999999975 25555655555444
No 122
>PF01431 Peptidase_M13: Peptidase family M13 This is family M13 in the peptidase classification. ; InterPro: IPR018497 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M13 (neprilysin family, clan MA(E)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk [, ]. The family includes eukaryotic and prokaryotic oligopeptidases, as well as some of the proteins responsible for the molecular basis of the blood group antigens e.g. Kell []. Neprilysin (3.4.24.11 from EC), is another member of this group, it is variously known as common acute lymphoblastic leukemia antigen (CALLA), enkephalinase (gp100) and neutral endopeptidase metalloendopeptidase (NEP). It is a plasma membrane-bound mammalian enzyme that is able to digest biologically-active peptides, including enkephalins []. The zinc ligands of neprilysin are known and are analogous to those in thermolysin, a related peptidase [, ]. Neprilysins, like thermolysin, are inhibited by phosphoramidon, which appears to selectively inhibit this family in mammals. The enzymes are all oligopeptidases, digesting oligo- and polypeptides, but not proteins []. Neprilysin consists of a short cytoplasmic domain, a membrane-spanning region and a large extracellular domain. The cytoplasmic domain contains a conformationally-restrained octapeptide, which is thought to act as a stop transfer sequence that prevents proteolysis and secretion [, ].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QPJ_A 1R1I_A 1R1J_A 1Y8J_A 1R1H_A 1DMT_A 2YB9_A 3DWB_A 3ZUK_A.
Probab=54.82 E-value=9.3 Score=34.57 Aligned_cols=35 Identities=26% Similarity=0.325 Sum_probs=23.8
Q ss_pred ccEEEEcHHHHhh----------cCHHHHHHHHHHHHHhhhcChH
Q psy5221 100 SSLIAISANLLES----------MTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 100 ~~~v~it~gLl~~----------l~~~El~aVLAHElgHi~~~h~ 134 (436)
...|++..|++.. ++=.-+-+||||||+|.-..+.
T Consensus 8 ~N~i~ip~~~l~~P~f~~~~p~~~~yg~lG~ilahel~hafd~~g 52 (206)
T PF01431_consen 8 FNSIVIPAGILQPPFFDPNYPPALNYGGLGFILAHELMHAFDPEG 52 (206)
T ss_dssp TTEEEEEGGGSSTTT--TTS-HHHHHHTHHHHHHHHHHHCTSTTG
T ss_pred cCEEEecHHHhCCccCCCCCCHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 4446666666553 2335688999999999876654
No 123
>cd04272 ZnMc_salivary_gland_MPs Zinc-dependent metalloprotease, salivary_gland_MPs. Metalloproteases secreted by the salivary glands of arthropods.
Probab=54.71 E-value=7.2 Score=35.86 Aligned_cols=15 Identities=20% Similarity=0.268 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHhhhc
Q psy5221 117 EIKAIIAHEISHISN 131 (436)
Q Consensus 117 El~aVLAHElgHi~~ 131 (436)
....++|||+||.-.
T Consensus 144 ~~~~~~AHElGH~lG 158 (220)
T cd04272 144 YGVYTMTHELAHLLG 158 (220)
T ss_pred ccHHHHHHHHHHHhC
Confidence 458999999999874
No 124
>cd00203 ZnMc Zinc-dependent metalloprotease. This super-family of metalloproteases contains two major branches, the astacin-like proteases and the adamalysin/reprolysin-like proteases. Both branches have wide phylogenetic distribution, and contain sub-families, which are involved in vertebrate development and disease.
Probab=54.58 E-value=5.5 Score=34.52 Aligned_cols=17 Identities=35% Similarity=0.366 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHhhhc
Q psy5221 115 KLEIKAIIAHEISHISN 131 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~ 131 (436)
......+++||+||...
T Consensus 93 ~~~~~~~~~HElGH~LG 109 (167)
T cd00203 93 TKEGAQTIAHELGHALG 109 (167)
T ss_pred cccchhhHHHHHHHHhC
Confidence 45789999999999875
No 125
>TIGR02412 pepN_strep_liv aminopeptidase N, Streptomyces lividans type. This family is a subset of the members of the zinc metallopeptidase family M1 (pfam01433), with a single member characterized in Streptomyces lividans 66 and designated aminopeptidase N. The spectrum of activity may differ somewhat from the aminopeptidase N clade of E. coli and most other Proteobacteria, well separated phylogenetically within the M1 family. The M1 family also includes leukotriene A-4 hydrolase/aminopeptidase (with a bifunctional active site).
Probab=53.03 E-value=32 Score=38.46 Aligned_cols=68 Identities=25% Similarity=0.410 Sum_probs=38.0
Q ss_pred HHHHHHHHHhcCCCCC----eEEEECCCccEEEeCCCCCccEEEEcHHHHhh--cC---HHHHHHHHHHHHHhhhcChHH
Q psy5221 65 LKTVNKISKNANISTP----EVAIYNGEINAFATGAFKNSSLIAISANLLES--MT---KLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p----~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~--l~---~~El~aVLAHElgHi~~~h~~ 135 (436)
.+.++-+.+..|+|.| ++..+++ |..|.-.+-+.|.+...++.. .+ .+.+..|+|||++|...|+..
T Consensus 229 ~~~l~~~e~~fg~pYP~~k~d~V~vP~----f~~GaMEn~Glit~~e~~l~~~~~~~~~~~~~~~viaHElAHqWFGnlV 304 (831)
T TIGR02412 229 RQGLAFFHRKFGYPYPFKKYDQIFVPE----FNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLV 304 (831)
T ss_pred HHHHHHHHHHhCCCCCcccCCEEEcCC----CCCCcccccceeeechhhccCCcCCHHHHHHHHHHHHHHHHHHHhCCEe
Confidence 4455666677887633 3333443 111211222345555544421 12 335778999999999999865
Q ss_pred H
Q psy5221 136 T 136 (436)
Q Consensus 136 ~ 136 (436)
.
T Consensus 305 T 305 (831)
T TIGR02412 305 T 305 (831)
T ss_pred c
Confidence 4
No 126
>PF06262 DUF1025: Possibl zinc metallo-peptidase; InterPro: IPR010428 This is a family of bacterial protein with undetermined function.; PDB: 3E11_A.
Probab=52.51 E-value=25 Score=27.85 Aligned_cols=36 Identities=19% Similarity=0.235 Sum_probs=24.7
Q ss_pred CccEEEEcHHHHhhc-CHHH----HHHHHHHHHHhhhcChH
Q psy5221 99 NSSLIAISANLLESM-TKLE----IKAIIAHEISHISNGDM 134 (436)
Q Consensus 99 ~~~~v~it~gLl~~l-~~~E----l~aVLAHElgHi~~~h~ 134 (436)
..+++++-..+...+ +.+| +.-++-||+||+-..+-
T Consensus 49 p~rI~lyR~pl~~~~~~~~eL~~~I~~tlvhEiah~fG~~~ 89 (97)
T PF06262_consen 49 PDRIVLYRRPLERRARSREELAELIRDTLVHEIAHHFGISD 89 (97)
T ss_dssp -EEEEEEHHHHHHT-SSHHHHHHHHHHHHHHHHHHHTT--H
T ss_pred CCEEEEehHHHHHHhCCHHHHHHHHHHHHHHHHHHHcCCCH
Confidence 467888888888877 6554 56688899999877553
No 127
>PF04972 BON: BON domain; InterPro: IPR007055 The BON domain is typically ~60 residues long and has an alpha/beta predicted fold. There is a conserved glycine residue and several hydrophobic regions. This pattern of conservation is more suggestive of a binding or structural function rather than a catalytic function. Most proteobacteria seem to possess one or two BON-containing proteins, typically of the OsmY-type proteins; outside of this group the distribution is more disparate. The OsmY protein is an Escherichia coli 20 kDa outer membrane or periplasmic protein that is expressed in response to a variety of stress conditions, in particular, helping to provide protection against osmotic shock. One hypothesis is that OsmY prevents shrinkage of the cytoplasmic compartment by contacting the phospholipid interfaces surrounding the periplasmic space. The domain architecture of two BON domains alone suggests that these domains contact the surfaces of phospholipids, with each domain contacting a membrane [].; PDB: 2L26_A 2KGS_A 2KSM_A.
Probab=52.14 E-value=30 Score=24.56 Aligned_cols=26 Identities=12% Similarity=0.319 Sum_probs=20.8
Q ss_pred CCCCCCceEEEEECCEEEEEEEEecc
Q psy5221 344 AGFNRSDLEIETERDTLRITGRKQAD 369 (436)
Q Consensus 344 pG~~~edi~v~~~~~~l~i~g~~~~~ 369 (436)
|+++..+|+|.+.++.++++|.-...
T Consensus 12 ~~~~~~~i~v~v~~g~v~L~G~v~s~ 37 (64)
T PF04972_consen 12 PWLPDSNISVSVENGVVTLSGEVPSQ 37 (64)
T ss_dssp -CTT-TTEEEEEECTEEEEEEEESSC
T ss_pred cccCCCeEEEEEECCEEEEEeeCcHH
Confidence 46777799999999999999997543
No 128
>cd06477 ACD_HspB3_Like Alpha crystallin domain (ACD) found in mammalian HspB3, also known as heat-shock protein 27-like protein (HSPL27, 17-kDa) and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB3 is expressed in adult skeletal muscle, smooth muscle, and heart, and in several other fetal tissues. In muscle cells HspB3 forms an oligomeric 150 kDa complex with myotonic dystrophy protein kinase-binding protein (MKBP/ HspB2), this complex may comprise one of two independent muscle-cell specific chaperone systems. The expression of HspB3 is induced during muscle differentiation controlled by the myogenic factor MyoD. HspB3 may also interact with Hsp22 (HspB8).
Probab=52.13 E-value=28 Score=26.69 Aligned_cols=30 Identities=13% Similarity=0.275 Sum_probs=27.1
Q ss_pred eEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221 336 KYRITMAIA-GFNRSDLEIET-ERDTLRITGR 365 (436)
Q Consensus 336 ~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~ 365 (436)
.|.=.+.|| +++++.|+=+. +++.|+|.|+
T Consensus 51 ~F~R~~~LP~~Vd~~~v~A~~~~dGvL~I~~~ 82 (83)
T cd06477 51 SFTRQYQLPDGVEHKDLSAMLCHDGILVVETK 82 (83)
T ss_pred EEEEEEECCCCcchheEEEEEcCCCEEEEEec
Confidence 788899999 99999999997 7999999975
No 129
>PTZ00027 60S ribosomal protein L6; Provisional
Probab=52.07 E-value=48 Score=29.78 Aligned_cols=48 Identities=8% Similarity=0.185 Sum_probs=34.0
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR 413 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk 413 (436)
|++++|+++++.++|+|.+ |+.++.|. ..-..++...+||.|.|+-+.
T Consensus 13 P~~V~V~i~~~~v~VkGp~----------------G~L~~~~~--~~~~~i~i~~~~~~i~v~~~~ 60 (190)
T PTZ00027 13 PEGVTVTVKSRKVTVTGKY----------------GELTRSFR--HLPVDIKLSKDGKYIKVEMWF 60 (190)
T ss_pred CCCCEEEEECCEEEEECCC----------------ceEEEEec--CCCceEEEEeCCCEEEEEeCC
Confidence 7999999999999999874 55665443 211245566788877777543
No 130
>PRK05498 rplF 50S ribosomal protein L6; Validated
Probab=51.96 E-value=41 Score=29.88 Aligned_cols=44 Identities=9% Similarity=0.234 Sum_probs=32.2
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
|++++|+++++.++|+|.+ |+.++.| |..++ ...+|+-|.|+..
T Consensus 12 P~~V~v~~~~~~v~vkGp~----------------G~l~~~~--~~~v~---i~~~~~~i~v~~~ 55 (178)
T PRK05498 12 PAGVEVTINGNVVTVKGPK----------------GELSRTL--NPDVT---VKVEDNEITVTRP 55 (178)
T ss_pred CCCCEEEEECCEEEEECCC----------------EEEEEEc--CCCeE---EEEECCEEEEEcC
Confidence 6999999999999999874 5666666 54433 3457787777654
No 131
>TIGR03654 L6_bact ribosomal protein L6, bacterial type.
Probab=51.03 E-value=54 Score=29.04 Aligned_cols=45 Identities=13% Similarity=0.299 Sum_probs=32.8
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR 413 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk 413 (436)
|++++|+++++.++|+|.+ |+.++.| |..+ +...+|+.|.|+.+.
T Consensus 11 P~~V~v~~~~~~v~v~Gp~----------------G~l~~~l--~~~i---~i~~~~~~i~v~~~~ 55 (175)
T TIGR03654 11 PAGVEVTIDGNVVTVKGPK----------------GELSRTL--HPGV---TVKVEDGQLTVSRPN 55 (175)
T ss_pred CCCcEEEEeCCEEEEEcCC----------------eEEEEEc--CCCe---EEEEECCEEEEEecC
Confidence 6999999999999999874 5666666 5443 345577877777543
No 132
>PRK01844 hypothetical protein; Provisional
Probab=50.57 E-value=59 Score=24.16 Aligned_cols=47 Identities=9% Similarity=0.121 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221 24 SSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST 79 (436)
Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~ 79 (436)
+.++..+++++..+|++.+.....++..| |.++ +.++.+....|-++
T Consensus 9 l~I~~li~G~~~Gff~ark~~~k~lk~NP---pine------~mir~Mm~QMGqkP 55 (72)
T PRK01844 9 VGVVALVAGVALGFFIARKYMMNYLQKNP---PINE------QMLKMMMMQMGQKP 55 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCCH------HHHHHHHHHhCCCc
Confidence 33334444556667777777766665553 3332 46666666666543
No 133
>PF12315 DUF3633: Protein of unknown function (DUF3633); InterPro: IPR022087 This domain family is found in bacteria and eukaryotes, and is approximately 210 amino acids in length. The family is found in association with PF00412 from PFAM.
Probab=50.52 E-value=16 Score=33.09 Aligned_cols=20 Identities=30% Similarity=0.429 Sum_probs=17.0
Q ss_pred hcCHHHHHHHHHHHHHhhhc
Q psy5221 112 SMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 112 ~l~~~El~aVLAHElgHi~~ 131 (436)
.|+.--..+|||||++|..-
T Consensus 87 GLPrll~gsiLAHE~mHa~L 106 (212)
T PF12315_consen 87 GLPRLLTGSILAHELMHAWL 106 (212)
T ss_pred CCCHHHHhhHHHHHHHHHHh
Confidence 46778889999999999865
No 134
>KOG3314|consensus
Probab=50.10 E-value=70 Score=27.84 Aligned_cols=68 Identities=10% Similarity=0.092 Sum_probs=38.9
Q ss_pred HHHHHHHHHhcCCCCCeEEEECCCccEEEe-CCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221 65 LKTVNKISKNANISTPEVAIYNGEINAFAT-GAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~-G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~ 135 (436)
-..+-+..++.|-...+-.+.-..-|+.+. |+-|.+++++....| .+.+++.-|++|||-|.--.+-.
T Consensus 40 Vkfl~~am~k~~c~~~d~~isc~~C~~~~~GGy~~~~gIvlCqN~l---~~q~h~n~vv~HElIH~fDd~r~ 108 (194)
T KOG3314|consen 40 VKFLMEAMEKSGCRVGDNFISCVVCTGPVAGGYTPGRGIVLCQNRL---TIQDHVNQVVIHELIHAFDDCRA 108 (194)
T ss_pred HHHHHHHHHHcCCCccCCceEEeeCCCCccCCccCCCceEEecccc---chHHHHHHHHHHHHHHHHHhhhh
Confidence 344455555666443222222112233333 455667766655522 37899999999999998766543
No 135
>COG1965 CyaY Protein implicated in iron transport, frataxin homolog [Inorganic ion transport and metabolism]
Probab=48.55 E-value=14 Score=29.54 Aligned_cols=31 Identities=19% Similarity=0.347 Sum_probs=23.4
Q ss_pred EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221 398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI 435 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~ 435 (436)
+.+.+.+|||+|+++. .-+|-|+..+|.+||
T Consensus 30 ~D~d~qg~VlTl~f~n-------gs~iiINkQ~P~~qi 60 (106)
T COG1965 30 IDCEIQGGVLTLTFDN-------GSQIIINKQEPLQQI 60 (106)
T ss_pred cceecCCCEEEEEECC-------CcEEEEeCCChHHHH
Confidence 6677889999999986 256777777765553
No 136
>TIGR02289 M3_not_pepF oligoendopeptidase, M3 family. This family consists of probable oligoendopeptidases in the M3 family, related to lactococcal PepF and group B streptococcal PepB (TIGR00181) but in a distinct clade with considerable sequence differences. The likely substrate is small peptides and not whole proteins, as with PepF, but members are not characterized and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.
Probab=48.43 E-value=13 Score=39.29 Aligned_cols=65 Identities=20% Similarity=0.196 Sum_probs=37.1
Q ss_pred HHHHHHHHHHhcCCCCCe------------EEEECC-CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhh
Q psy5221 64 LLKTVNKISKNANISTPE------------VAIYNG-EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHI 129 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~------------v~v~~~-~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi 129 (436)
+.+.+.++...+|...-+ ++--++ ...||++++.+ ..++|+.+- -. +-++ ..+|+||+||.
T Consensus 274 ~~~~~~~~~~~l~~~~~e~~~~~~~~~~iD~~~r~gK~~Gayc~~~~~~~~P~I~~Nf--~~--t~~d-v~TL~HElGHa 348 (549)
T TIGR02289 274 LLEKAEKMYKELSLEFYEFFNFMREKNLLDLESRKGKAAGGYCTYLPKYKAPFIFSNF--NG--TSGD-IDVLTHEAGHA 348 (549)
T ss_pred HHHHHHHHhcCCCcHHHHHHHHHHhcCccccCCCCCCCCCcccCCCCCCCCcEEEEeC--CC--ChhH-HHHHHHHhhHH
Confidence 555666666655554222 122223 78899999765 345554331 11 3333 45799999999
Q ss_pred hcCh
Q psy5221 130 SNGD 133 (436)
Q Consensus 130 ~~~h 133 (436)
.|..
T Consensus 349 ~H~~ 352 (549)
T TIGR02289 349 FHVY 352 (549)
T ss_pred HHHH
Confidence 8754
No 137
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.89 E-value=54 Score=24.14 Aligned_cols=40 Identities=15% Similarity=0.164 Sum_probs=23.6
Q ss_pred HHHHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221 31 TGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST 79 (436)
Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~ 79 (436)
++.+..+|++.+...+.++-.| |.++ +.++.+..+.|-++
T Consensus 16 ~G~~~G~fiark~~~k~lk~NP---pine------~~iR~M~~qmGqKp 55 (71)
T COG3763 16 AGLIGGFFIARKQMKKQLKDNP---PINE------EMIRMMMAQMGQKP 55 (71)
T ss_pred HHHHHHHHHHHHHHHHHHhhCC---CCCH------HHHHHHHHHhCCCc
Confidence 3345557778888777766653 4443 35555555556543
No 138
>smart00235 ZnMc Zinc-dependent metalloprotease. Neutral zinc metallopeptidases. This alignment represents a subset of known subfamilies. Highest similarity occurs in the HExxH zinc-binding site/ active site.
Probab=47.65 E-value=9.1 Score=32.22 Aligned_cols=13 Identities=31% Similarity=0.555 Sum_probs=10.8
Q ss_pred HHHHHHHHHhhhc
Q psy5221 119 KAIIAHEISHISN 131 (436)
Q Consensus 119 ~aVLAHElgHi~~ 131 (436)
.+|++||+||...
T Consensus 87 ~~~~~HEigHaLG 99 (140)
T smart00235 87 TGVAAHELGHALG 99 (140)
T ss_pred cccHHHHHHHHhc
Confidence 3599999999864
No 139
>cd06471 ACD_LpsHSP_like Group of bacterial proteins containing an alpha crystallin domain (ACD) similar to Lactobacillus plantarum (Lp) small heat shock proteins (sHsp) HSP 18.5, HSP 18.55 and HSP 19.3. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Transcription of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is regulated by a variety of stresses including heat, cold and ethanol. Early growing L. plantarum cells contain elevated levels of these mRNAs which rapidly fall of as the cells enter stationary phase. Also belonging to this group is Bifidobacterium breve (Bb) HSP20 and Oenococcus oenis (syn. Leuconostoc oenos) (Oo) HSP18. Transcription of the gene encoding BbHSP20 is strongly induced following heat or osmotic shock, and that of the gene encoding OoHSP18 following heat, ethanol or acid shock. OoHSP18 is peripherally associated with the cytoplasmic me
Probab=46.96 E-value=29 Score=26.82 Aligned_cols=31 Identities=13% Similarity=0.277 Sum_probs=26.9
Q ss_pred CceEEEEEecCCCCCCceEEEEECCEEEEEE
Q psy5221 334 ENKYRITMAIAGFNRSDLEIETERDTLRITG 364 (436)
Q Consensus 334 ~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g 364 (436)
-..|.-.+.+|.+++++++-+.+++.|+|+-
T Consensus 61 ~g~f~r~~~lp~v~~~~i~A~~~dGvL~I~l 91 (93)
T cd06471 61 YGSFSRSFYLPNVDEEEIKAKYENGVLKITL 91 (93)
T ss_pred ccEEEEEEECCCCCHHHCEEEEECCEEEEEE
Confidence 3467777899999999999999999999974
No 140
>TIGR02411 leuko_A4_hydro leukotriene A-4 hydrolase/aminopeptidase. Members of this family represent a distinctive subset within the zinc metallopeptidase family M1 (pfam01433). The majority of the members of pfam01433 are aminopeptidases, but the sequences in this family for which the function is known are leukotriene A-4 hydrolase. A dual epoxide hydrolase and aminopeptidase activity at the same active site is indicated. The physiological substrate for aminopeptidase activity is not known.
Probab=46.61 E-value=26 Score=37.61 Aligned_cols=31 Identities=26% Similarity=0.414 Sum_probs=21.7
Q ss_pred EEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221 102 LIAISANLLESMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 102 ~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~ 134 (436)
+.+.+..++. .+.+...|+|||++|-.-|+.
T Consensus 265 ltf~~~~ll~--~d~s~~~viaHElAHqWfGNl 295 (601)
T TIGR02411 265 LTFATPTLIA--GDRSNVDVIAHELAHSWSGNL 295 (601)
T ss_pred ceeecccccc--CChhhhhhHHHHHHhhccCce
Confidence 4455554442 344567899999999999875
No 141
>PRK14015 pepN aminopeptidase N; Provisional
Probab=46.16 E-value=34 Score=38.40 Aligned_cols=20 Identities=45% Similarity=0.595 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHhhhcChHHH
Q psy5221 117 EIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 117 El~aVLAHElgHi~~~h~~~ 136 (436)
.+..|+|||++|-..|+...
T Consensus 295 ~i~~vIaHElaHqWFGNlVT 314 (875)
T PRK14015 295 RIESVIAHEYFHNWTGNRVT 314 (875)
T ss_pred HHHHHHHHHHHHHHHhCcce
Confidence 48899999999999888643
No 142
>cd04276 ZnMc_MMP_like_2 Zinc-dependent metalloprotease; MMP_like sub-family 2. A group of bacterial metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=44.02 E-value=16 Score=33.16 Aligned_cols=31 Identities=16% Similarity=0.333 Sum_probs=21.4
Q ss_pred cEEEEcHHHHhhc-------CHHHHHHHHHHHHHhhhc
Q psy5221 101 SLIAISANLLESM-------TKLEIKAIIAHEISHISN 131 (436)
Q Consensus 101 ~~v~it~gLl~~l-------~~~El~aVLAHElgHi~~ 131 (436)
..|.+..+.+... ..+-+..+++||+||...
T Consensus 92 a~V~l~~~~~~~~~~~~~~~~~~~~~~~~~he~gh~lG 129 (197)
T cd04276 92 ADVILYSGFLRQDQLWYEDLLAASLRYLLAHEVGHTLG 129 (197)
T ss_pred EEEEeCchhhccchhHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3456666665443 235588999999999765
No 143
>PF14730 DUF4468: Domain of unknown function (DUF4468) with TBP-like fold
Probab=43.98 E-value=1.1e+02 Score=23.65 Aligned_cols=17 Identities=6% Similarity=0.128 Sum_probs=14.9
Q ss_pred eEeEEeeCcEEEEEEec
Q psy5221 397 VVAAFFDQGILNIDLIR 413 (436)
Q Consensus 397 ~~~A~~~~GvL~i~~pk 413 (436)
.+++.++||-.++++-+
T Consensus 69 ~l~i~~kDgk~r~~~~~ 85 (91)
T PF14730_consen 69 TLIIDCKDGKYRLTITN 85 (91)
T ss_pred EEEEEEECCEEEEEEEE
Confidence 49999999999999865
No 144
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=43.95 E-value=15 Score=37.33 Aligned_cols=41 Identities=22% Similarity=0.259 Sum_probs=24.8
Q ss_pred CccEEEeCCCC-CccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 88 EINAFATGAFK-NSSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 88 ~~NA~~~G~~~-~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
..-||+.++.+ ...+|+.+ .-. +-++ ...|+||+||..|..
T Consensus 196 ~~gaf~~~~~~~~~p~i~~n--~~~--~~~~-v~tl~HE~GHa~h~~ 237 (427)
T cd06459 196 RSGAYCTGLPPGKHPFILMN--FNG--TLDD-VFTLAHELGHAFHSY 237 (427)
T ss_pred CCCeecCCCCCCCCCeEEec--CCC--Chhh-HHHHHHHhhHHHHHH
Confidence 66789998863 33334332 122 3333 567999999976644
No 145
>KOG1047|consensus
Probab=43.65 E-value=14 Score=38.40 Aligned_cols=52 Identities=27% Similarity=0.477 Sum_probs=33.0
Q ss_pred CCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHH
Q psy5221 79 TPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 79 ~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~ 135 (436)
.-++.|++. -|=+|.-.+-...++|..|+. .+.-+.-|||||++|-..|...
T Consensus 254 ryDllvlPp---SFP~gGMENPcltF~TpTlla--GDrsl~~vIaHEIAHSWtGNlV 305 (613)
T KOG1047|consen 254 RYDLLVLPP---SFPFGGMENPCLTFVTPTLLA--GDRSLVDVIAHEIAHSWTGNLV 305 (613)
T ss_pred cceEEEecC---CCCcccccCcceeeecchhhc--CCcchhhHHHHHhhhhhccccc
Confidence 346666664 112222233455667775654 5667899999999998877653
No 146
>PF02163 Peptidase_M50: Peptidase family M50; InterPro: IPR008915 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This entry contains metallopeptidases belonging to MEROPS peptidase family M50 (S2P protease family, clan MM). Members of the M50 metallopeptidase family include: mammalian sterol-regulatory element binding protein (SREBP) site 2 protease, Escherichia coli protease EcfE, stage IV sporulation protein FB and various hypothetical bacterial and eukaryotic homologues. A number of proteins are classified as non-peptidase homologues as they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3B4R_A 3ID4_A 3ID2_A 2ZPL_B 3ID1_A 2ZPM_A 3ID3_B 2HGA_A.
Probab=43.03 E-value=16 Score=32.65 Aligned_cols=14 Identities=29% Similarity=0.555 Sum_probs=11.5
Q ss_pred HHHHHHHHHHhhhc
Q psy5221 118 IKAIIAHEISHISN 131 (436)
Q Consensus 118 l~aVLAHElgHi~~ 131 (436)
+.+++.||+||..-
T Consensus 7 ~i~i~~HE~gH~~~ 20 (192)
T PF02163_consen 7 LISIVLHELGHALA 20 (192)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred cccccccccccccc
Confidence 46889999999864
No 147
>TIGR03793 TOMM_pelo TOMM propeptide domain. This model represents a domain that is conserved among a large number of putative thiazole/oxazole-modified microcins (TOMM). Oddly, most of this seqence region appears homologous to nitrile hydratase subunits. This family is expanded especially in Pelotomaculum thermopropionicum SI.
Probab=43.03 E-value=1.1e+02 Score=23.08 Aligned_cols=44 Identities=11% Similarity=0.205 Sum_probs=28.5
Q ss_pred HHHHHhcCCCCC---eEEEECC--CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHH
Q psy5221 69 NKISKNANISTP---EVAIYNG--EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIA 123 (436)
Q Consensus 69 ~~l~~~~g~~~p---~v~v~~~--~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLA 123 (436)
...++..|+..| ++.|+++ ..--++++..|. | .+|++||.+|-+
T Consensus 28 raaL~e~G~~~P~~~~i~VvE~t~~~~~lVlP~~P~--------~---~lse~~L~~vag 76 (77)
T TIGR03793 28 KEALEREGVQVPAEVEVKVVEESPTVLYLVLPVNPD--------I---ELTDEQLDAVAG 76 (77)
T ss_pred HHHHHHhCCCCCCceEEEEEEcCCCeEEEEecCCCC--------C---CCCHHHHHHhhC
Confidence 455566688754 6777776 444445654443 2 689999999864
No 148
>PTZ00179 60S ribosomal protein L9; Provisional
Probab=42.35 E-value=79 Score=28.41 Aligned_cols=47 Identities=6% Similarity=0.078 Sum_probs=32.0
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
|++++|+++++.++|+|.+ |+.++. +|..--.+....+|+.|.|+-+
T Consensus 12 P~~V~V~i~~~~ItVkGpk----------------G~Ls~~--~~~~~~~i~i~~~~~~I~v~~~ 58 (189)
T PTZ00179 12 PEDVTVSVKDRIVTVKGKR----------------GTLTKD--LRHLQLDFRVNKKNRTFTAVRW 58 (189)
T ss_pred CCCCEEEEeCCEEEEECCC----------------cEEEEE--cCCCCcEEEEEecCCEEEEEeC
Confidence 6999999999999999875 445543 3432112455667787777744
No 149
>cd04327 ZnMc_MMP_like_3 Zinc-dependent metalloprotease; MMP_like sub-family 3. A group of bacterial and fungal metalloproteinase domains similar to matrix metalloproteinases and astacin.
Probab=42.32 E-value=17 Score=32.88 Aligned_cols=16 Identities=25% Similarity=0.445 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHhhhc
Q psy5221 116 LEIKAIIAHEISHISN 131 (436)
Q Consensus 116 ~El~aVLAHElgHi~~ 131 (436)
.+..+++.||+||...
T Consensus 90 ~~~~~~i~HElgHaLG 105 (198)
T cd04327 90 PEFSRVVLHEFGHALG 105 (198)
T ss_pred hhHHHHHHHHHHHHhc
Confidence 4567899999999876
No 150
>cd04277 ZnMc_serralysin_like Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.
Probab=41.65 E-value=15 Score=32.57 Aligned_cols=34 Identities=18% Similarity=0.010 Sum_probs=21.0
Q ss_pred cEEEEcHHHHhhc--CHHHHHHHHHHHHHhhhc-ChH
Q psy5221 101 SLIAISANLLESM--TKLEIKAIIAHEISHISN-GDM 134 (436)
Q Consensus 101 ~~v~it~gLl~~l--~~~El~aVLAHElgHi~~-~h~ 134 (436)
..|.++....... .......++.||+||... +|.
T Consensus 94 g~i~~~~~~~~~~~~~g~~~~~t~~HEiGHaLGL~H~ 130 (186)
T cd04277 94 GDIWFNSSYDTNSDSPGSYGYQTIIHEIGHALGLEHP 130 (186)
T ss_pred ceeEEecCcccccCCCChhhHHHHHHHHHHHhcCCCC
Confidence 3455554322221 245679999999999866 443
No 151
>PRK01379 cyaY frataxin-like protein; Provisional
Probab=41.20 E-value=28 Score=27.89 Aligned_cols=30 Identities=23% Similarity=0.279 Sum_probs=20.0
Q ss_pred EeEEeeCcEEEEEEecCCCCCCCCeEeeecCCCCCCCC
Q psy5221 398 VAAFFDQGILNIDLIREVPEALKPRKIYINTTKNTDRI 435 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~~~~~~~ 435 (436)
+.+.+.+|||+|+.+. . .|-|+..++.+||
T Consensus 30 ~D~e~~~gVLtl~~~~-------g-t~VINkQ~p~~QI 59 (103)
T PRK01379 30 IDVDLQGDILNLDTDK-------G-IYVINKQSAAKEI 59 (103)
T ss_pred eeeeccCCEEEEEeCC-------c-EEEEeCCChhhhh
Confidence 6677789999998643 1 2666666665553
No 152
>CHL00140 rpl6 ribosomal protein L6; Validated
Probab=40.61 E-value=72 Score=28.31 Aligned_cols=44 Identities=9% Similarity=0.289 Sum_probs=30.7
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
|++++|+++++.++|+|.+ |+.++ .+|.. ++...+|+.+.|+.+
T Consensus 12 P~~V~v~i~~~~v~vkGp~----------------G~l~~--~~~~~---v~i~~~~~~i~v~~~ 55 (178)
T CHL00140 12 PDNVNVSIDDQIIKVKGPK----------------GTLSR--KIPDL---ITIEIQDNSLFVSKK 55 (178)
T ss_pred CCCCEEEEECCEEEEECCC----------------EEEEE--ECCCC---eEEEEeCCEEEEEcC
Confidence 6899999999999999875 44443 44553 334557777666654
No 153
>PF12992 DUF3876: Domain of unknown function, B. Theta Gene description (DUF3876); InterPro: IPR024452 This bacterial family of conserved proteins has no known function.
Probab=40.55 E-value=89 Score=24.68 Aligned_cols=41 Identities=20% Similarity=0.272 Sum_probs=32.9
Q ss_pred CCCCceeEEEecCceEEEEEecCCC-----CCCceEEEEECCEEEEE
Q psy5221 322 QSYPPYNIELVDENKYRITMAIAGF-----NRSDLEIETERDTLRIT 363 (436)
Q Consensus 322 ~~~p~~~i~~e~~~~~~v~~~lpG~-----~~edi~v~~~~~~l~i~ 363 (436)
..-|++.|++ +++.|.|.+--+.- +++.-.|.-+++.+-|.
T Consensus 23 ~~~P~v~I~r-~g~~Y~vti~~~~~~~~~~~p~tY~i~~~~g~~fI~ 68 (95)
T PF12992_consen 23 NGKPDVTIYR-NGGSYKVTITYRSGYTGRAKPETYPIQEEDGNLFIE 68 (95)
T ss_pred CCCCCEEEEE-CCCeEEEEEEEEcCcCCcccceEEEEEEeCCEEEEe
Confidence 4469999996 78889998877665 77888888888888876
No 154
>TIGR02290 M3_fam_3 oligoendopeptidase, pepF/M3 family. The M3 family of metallopeptidases contains several distinct clades. Oligoendopeptidase F as characterized in Lactococcus, the functionally equivalent oligoendopeptidase B of group B Streptococcus, and closely related sequences are described by TIGR00181. The present family is quite similar but forms a distinct clade, and a number of species have one member of each. A greater sequence difference separates members of TIGR02289, probable oligoendopeptidases of the M3 family that probably should not be designated PepF.
Probab=40.17 E-value=19 Score=38.47 Aligned_cols=41 Identities=20% Similarity=0.289 Sum_probs=25.5
Q ss_pred CccEEEeCCCCC-ccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcCh
Q psy5221 88 EINAFATGAFKN-SSLIAISANLLESMTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 88 ~~NA~~~G~~~~-~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h 133 (436)
...||+.|+.+. ..+|+.+- -. +-++ ...|+||+||-.|.-
T Consensus 349 ~~Ga~~~~~~~~~~p~i~~N~--~~--~~~~-v~TL~HE~GHa~H~~ 390 (587)
T TIGR02290 349 RGGAFCTGFPPSKEPRVLMNY--DG--SRRD-VSTLAHELGHAYHSE 390 (587)
T ss_pred CCCcccCCCCCCCCCEEEEec--CC--Cchh-HHHHHHHhhHHHHHH
Confidence 778999886553 24444431 11 2222 356999999999843
No 155
>PHA00527 hypothetical protein
Probab=39.85 E-value=69 Score=25.50 Aligned_cols=49 Identities=14% Similarity=0.177 Sum_probs=28.9
Q ss_pred HHHHHHHHHhcCCCCCeEE---------EECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 65 LKTVNKISKNANISTPEVA---------IYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p~v~---------v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
+...+++|.++|.+..+-- -..+ .-|.+++|.. -+ -++-|+||.+|...
T Consensus 28 ~~EWQ~~~~~LGVD~~~~~mla~~~~S~~~s~~~~~L~~~GVF-------NG-----------K~~T~~HECAH~AF 86 (129)
T PHA00527 28 KEEWQQACIHLGVDSGGNEMLAGATQSYCNTETGENLYLLGVF-------NG-----------KAATLVHECAHVAF 86 (129)
T ss_pred hHHHHHHHHHhCCCcCCchhhhccccccccccCCCeEEEEEEe-------cc-----------HHHHHHHHHHHHHH
Confidence 3567889999998753321 1111 4455555421 11 24679999999754
No 156
>KOG3624|consensus
Probab=39.38 E-value=43 Score=36.55 Aligned_cols=43 Identities=28% Similarity=0.421 Sum_probs=30.0
Q ss_pred CccEEEeCCCCCccEEEEcHHHHhh----------cCHHHHHHHHHHHHHhhhcCh
Q psy5221 88 EINAFATGAFKNSSLIAISANLLES----------MTKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 88 ~~NA~~~G~~~~~~~v~it~gLl~~----------l~~~El~aVLAHElgHi~~~h 133 (436)
..|||-. +.+..|.+--|+++. ++=.-+-+|||||++|---.+
T Consensus 481 ~~na~Y~---~~~N~i~~pa~ilq~P~f~~~~P~~~nyg~iG~vigHEl~H~FD~~ 533 (687)
T KOG3624|consen 481 QVNAFYS---PEKNEIVFPAGLLQPPFFDLSYPDYLNYGGIGFVIGHELTHGFDDQ 533 (687)
T ss_pred eeecccc---CCCceEEEehhcccCCCCCcccchhhhhHHHHHHHHHHHhhccccc
Confidence 7777655 334556677777764 455668899999999955433
No 157
>cd04273 ZnMc_ADAMTS_like Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Probab=39.20 E-value=5.9 Score=36.04 Aligned_cols=14 Identities=36% Similarity=0.513 Sum_probs=11.9
Q ss_pred HHHHHHHHHHhhhc
Q psy5221 118 IKAIIAHEISHISN 131 (436)
Q Consensus 118 l~aVLAHElgHi~~ 131 (436)
.+.++|||+||.-.
T Consensus 140 ~a~~~aHElGH~LG 153 (207)
T cd04273 140 SAFTIAHELGHVLG 153 (207)
T ss_pred eEEeeeeechhhcC
Confidence 57899999999873
No 158
>PF06861 BALF1: BALF1 protein; InterPro: IPR010677 Epstein-Barr virus (strain GD1) (HHV-4), a human tumour DNA virus and a prominent member of gamma-herpesviruses, encodes homologues of cellular antiapoptotic viral Bcl-2 proteins BALF1 and BHRF1. They protect the virus from apoptosis in its host cell during virus synthesis [, ]. The virus infects B lymphocytes to establish a latent infection and yield proliferating, growth-transformed B cells in vitro. Bcl-2 genes are essential for the initial evasion of apoptosis which allows it to establish a latent infection or cause cellular transformation, or both []. Bcl-2 family proteins can inhibit or induce programmed cell death in part by counteracting the activity of other BCL-2 family members. BALF1, inhibits the antiapoptotic activity of EBV BHRF1 and of KSBcl-2 in several transfected cell lines. BALF1 fails, however, to inhibit the cellular BCL-2 family member, BCL-x(L). Thus, BALF1 acts as a negative regulator of the survival function of BHRF1, similar to the counterbalance observed between cellular BCL-2 family members [].
Probab=39.04 E-value=22 Score=30.99 Aligned_cols=23 Identities=13% Similarity=0.218 Sum_probs=20.6
Q ss_pred cCHHHHHHHHHHHHHhhhcChHH
Q psy5221 113 MTKLEIKAIIAHEISHISNGDMV 135 (436)
Q Consensus 113 l~~~El~aVLAHElgHi~~~h~~ 135 (436)
.++.|-.++++|++||+..+|-.
T Consensus 127 ~~d~e~~s~v~~~lA~Fy~~~r~ 149 (182)
T PF06861_consen 127 LNDHENASLVSHALAHFYLRYRR 149 (182)
T ss_pred cCchHHHHHHHHHHHHHHHHHHH
Confidence 48999999999999999998843
No 159
>cd06161 S2P-M50_SpoIVFB SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB.
Probab=38.99 E-value=19 Score=32.80 Aligned_cols=15 Identities=20% Similarity=0.348 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHhhhc
Q psy5221 117 EIKAIIAHEISHISN 131 (436)
Q Consensus 117 El~aVLAHElgHi~~ 131 (436)
=+.+|+.||+||..-
T Consensus 37 l~~~v~iHElgH~~~ 51 (208)
T cd06161 37 LFLSVLLHELGHALV 51 (208)
T ss_pred HHHHHHHHHHHHHHH
Confidence 467899999999753
No 160
>cd06163 S2P-M50_PDZ_RseP-like RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. In Escherichia coli, the S2P homolog RseP is involved in the sigmaE pathway of extracytoplasmic stress responses. Also included in this group are such homologs as Bacillus subtilis YluC, Mycobacterium tuberculosis Rv2869c S2P, and Bordetella bronchiseptica HurP. Rv2869c S2P appears to have a role in the regulation of prokaryotic lipid biosynthesis and membrane composition and YluC of Bacillus has a role in transducing membrane stress. This group includes bacterial and eukaryotic S2P/M50s homologs with either one or two PDZ domains present. PDZ domains are believed to have a regulatory role. The RseP PDZ domain is required for the inhibitory reaction that prevents cleavage of its substrate, RseA.
Probab=38.58 E-value=18 Score=32.32 Aligned_cols=13 Identities=31% Similarity=0.445 Sum_probs=10.8
Q ss_pred HHHHHHHHHhhhc
Q psy5221 119 KAIIAHEISHISN 131 (436)
Q Consensus 119 ~aVLAHElgHi~~ 131 (436)
..|+-||+||+.-
T Consensus 10 ~~v~iHElGH~~~ 22 (182)
T cd06163 10 ILIFVHELGHFLV 22 (182)
T ss_pred HHHHHHHHHHHHH
Confidence 4689999999865
No 161
>cd05709 S2P-M50 Site-2 protease (S2P) class of zinc metalloproteases (MEROPS family M50) cleaves transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of this family use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. The domain core structure appears to contain at least three transmembrane helices with a catalytic zinc atom coordinated by three conserved residues contained within the consensus sequence HExxH, together with a conserved aspartate residue. The S2P/M50 family of RIP proteases is widely distributed; in eukaryotic cells, they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum (ER) stress responses. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of sterol regulatory element-bindin
Probab=38.41 E-value=20 Score=31.63 Aligned_cols=14 Identities=21% Similarity=0.432 Sum_probs=11.4
Q ss_pred HHHHHHHHHHhhhc
Q psy5221 118 IKAIIAHEISHISN 131 (436)
Q Consensus 118 l~aVLAHElgHi~~ 131 (436)
+.+|+.||+||..-
T Consensus 8 ~i~i~iHE~gH~~~ 21 (180)
T cd05709 8 LISVTVHELGHALV 21 (180)
T ss_pred HHHHHHHHHHHHHH
Confidence 45789999999863
No 162
>cd06472 ACD_ScHsp26_like Alpha crystallin domain (ACD) found in Saccharomyces cerevisiae (Sc) small heat shock protein (Hsp)26 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. ScHsp26 is temperature-regulated, it switches from an inactive to a chaperone-active form upon elevation in temperature. It associates into large 24-mers storage forms which upon heat shock disassociate into dimers. These dimers initiate the interaction with non-native substrate proteins and re-assemble into large globular assemblies having one monomer of substrate bound per dimer. This group also contains Arabidopsis thaliana (Ath) Hsp15.7, a peroxisomal matrix protein which can complement the morphological phenotype of S. cerevisiae mutants deficient in Hsps26. AthHsp15.7 is minimally expressed under normal conditions and is strongly induced by heat and oxidative st
Probab=37.09 E-value=52 Score=25.42 Aligned_cols=32 Identities=9% Similarity=0.194 Sum_probs=28.0
Q ss_pred cCceEEEEEecC-CCCCCceEEEEECCEEEEEE
Q psy5221 333 DENKYRITMAIA-GFNRSDLEIETERDTLRITG 364 (436)
Q Consensus 333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g 364 (436)
....|.-.+.|| ++++++|+-+.+++.|+|+-
T Consensus 58 ~~g~f~r~i~LP~~v~~~~i~A~~~nGvL~I~l 90 (92)
T cd06472 58 SSGRFVRRFRLPENADADEVKAFLENGVLTVTV 90 (92)
T ss_pred eccEEEEEEECCCCCCHHHCEEEEECCEEEEEe
Confidence 356899999999 68999999999999999973
No 163
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9 interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=37.00 E-value=41 Score=25.90 Aligned_cols=32 Identities=6% Similarity=0.091 Sum_probs=28.0
Q ss_pred CceEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221 334 ENKYRITMAIA-GFNRSDLEIET-ERDTLRITGR 365 (436)
Q Consensus 334 ~~~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~ 365 (436)
...|.=.+.|| +++++.|+=++ .++.|+|+.-
T Consensus 53 ~~~F~R~~~LP~~Vd~~~i~A~~~~dGvL~I~~P 86 (87)
T cd06481 53 YQEFVREAQLPEHVDPEAVTCSLSPSGHLHIRAP 86 (87)
T ss_pred eeEEEEEEECCCCcChHHeEEEeCCCceEEEEcC
Confidence 35799999998 79999999999 8999999853
No 164
>TIGR02421 QEGLA conserved hypothetical protein. Members of this family include a possible metal-binding motif HEXXXH and, nearby, a perfectly conserved motif QEGLA. All members belong to the Proteobacteria, including Agrobacterium tumefaciens and several species of Vibrio and Pseudomonas, and are found in only one copy per chromosome (Vibrio vulnificus, with two chromosomes, has two). The function is unknown.
Probab=36.80 E-value=75 Score=31.70 Aligned_cols=60 Identities=15% Similarity=0.244 Sum_probs=43.5
Q ss_pred hhHHHHHHHHHHHhcCCC-CCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHH
Q psy5221 61 EDWLLKTVNKISKNANIS-TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEIS 127 (436)
Q Consensus 61 ~~~L~~~l~~l~~~~g~~-~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElg 127 (436)
.....+.+++.++..+.. ..+|.+-++ ..+|.+.| + .|.+..+ ..+++.++.+++-||+|
T Consensus 136 A~~a~~~~~~~~~~y~~~~~~~V~~sd~l~a~a~v~~----~-~l~i~~~--a~fs~~~l~~L~~HEig 197 (366)
T TIGR02421 136 ATEAAEILQQRLEDYFGEETIRVTLSDDLPAGAMVSG----D-KLKLNSD--AMFSERDLEALIHHEIG 197 (366)
T ss_pred HHHHHHHHHHHHHHhCCCCceEEEECcchhHHHhccC----C-eEEECCC--CCcCHHHHHHHHHHhHH
Confidence 344788888888888875 335555566 67887664 2 3767762 34699999999999999
No 165
>cd06164 S2P-M50_SpoIVFB_CBS SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation. In this subgroup, SpoIVFB (sporulation protein, stage IV cell wall formation, F locus, promoter-distal B) contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domain. SpoIVFB is one of 4 proteins involved in endospore formation; the others are SpoIVFA (sporulation protein, stage IV cell wall formation, F locus, promoter-proximal A), BofA (bypass-of-forespore A), and SpoIVB (sporulation protein, stage IV cell wall formation, B locus). SpoIVFB is negatively regulated by SpoIVFA and BofA and activated by SpoIVB. It is thought that SpoIVFB, SpoIVFA, and BofA are located in the mother-cell membrane that surrounds the forespore and that SpoIVB is secreted from the forespore into the space between the two where it activates SpoIVFB. It has been proposed tha
Probab=36.58 E-value=22 Score=32.95 Aligned_cols=15 Identities=20% Similarity=0.364 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHhhhc
Q psy5221 117 EIKAIIAHEISHISN 131 (436)
Q Consensus 117 El~aVLAHElgHi~~ 131 (436)
=+.+|+.||+||..-
T Consensus 52 l~~~v~iHElgH~~~ 66 (227)
T cd06164 52 LFASVLLHELGHSLV 66 (227)
T ss_pred HHHHHHHHHHHHHHH
Confidence 457899999999753
No 166
>PF01432 Peptidase_M3: Peptidase family M3 This Prosite motif covers only the active site. This family belongs to family M3 of the peptidase classification.; InterPro: IPR001567 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M3 (clan MA(E)), subfamilies M3A and M3B. The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. The Thimet oligopeptidase family, is a large family of archaeal, bacterial and eukaryotic oligopeptidases that cleave medium sized peptides. The group contains: mitochondrial intermediate peptidase (3.4.24.59 from EC) Neurolysin, mitochondrial precursor, (3.4.24.16 from EC) Thimet oligopeptidase (3.4.24.15 from EC) Dipeptidyl carboxypeptidase (3.4.15.5 from EC) Oligopeptidase A (3.4.24.70 from EC) Oligoendopeptidase F ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 2QR4_B 3CE2_A 1Y79_1 2H1J_A 2H1N_A 2O36_A 1S4B_P 2O3E_A 1I1I_P.
Probab=36.12 E-value=26 Score=36.05 Aligned_cols=18 Identities=17% Similarity=0.420 Sum_probs=14.0
Q ss_pred cCHHHHHHHHHHHHHhhhc
Q psy5221 113 MTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 113 l~~~El~aVLAHElgHi~~ 131 (436)
++-+++ ..|+||+||..|
T Consensus 238 l~~~~v-~tLfHE~GHa~H 255 (458)
T PF01432_consen 238 LSHDDV-ETLFHEFGHAMH 255 (458)
T ss_dssp -SHHHH-HHHHHHHHHHHH
T ss_pred cChhhH-HHHHHHHhHHHH
Confidence 356676 789999999876
No 167
>cd06526 metazoan_ACD Alpha-crystallin domain (ACD) of metazoan alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=35.96 E-value=51 Score=24.91 Aligned_cols=31 Identities=10% Similarity=0.235 Sum_probs=27.0
Q ss_pred ceEEEEEecC-CCCCCceEEEEEC-CEEEEEEE
Q psy5221 335 NKYRITMAIA-GFNRSDLEIETER-DTLRITGR 365 (436)
Q Consensus 335 ~~~~v~~~lp-G~~~edi~v~~~~-~~l~i~g~ 365 (436)
..|.=.+.|| ++++++++-++.+ +.|+|+..
T Consensus 50 ~~f~r~~~LP~~vd~~~i~A~~~~~GvL~I~~P 82 (83)
T cd06526 50 REFTRRYQLPEGVDPDSVTSSLSSDGVLTIEAP 82 (83)
T ss_pred EEEEEEEECCCCCChHHeEEEeCCCcEEEEEec
Confidence 4788899998 7799999999998 99999864
No 168
>cd06464 ACD_sHsps-like Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps). sHsps are small stress induced proteins with monomeric masses between 12 -43 kDa, whose common feature is the Alpha-crystallin domain (ACD). sHsps are generally active as large oligomers consisting of multiple subunits, and are believed to be ATP-independent chaperones that prevent aggregation and are important in refolding in combination with other Hsps.
Probab=35.54 E-value=61 Score=24.17 Aligned_cols=33 Identities=12% Similarity=0.204 Sum_probs=28.6
Q ss_pred cCceEEEEEecC-CCCCCceEEEEECCEEEEEEE
Q psy5221 333 DENKYRITMAIA-GFNRSDLEIETERDTLRITGR 365 (436)
Q Consensus 333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~ 365 (436)
....|.-.+.|| ++++++++-+++++.|+|+..
T Consensus 54 ~~~~f~r~~~LP~~vd~~~i~a~~~~G~L~I~~p 87 (88)
T cd06464 54 SYGSFSRSFRLPEDVDPDKIKASLENGVLTITLP 87 (88)
T ss_pred eCcEEEEEEECCCCcCHHHcEEEEeCCEEEEEEc
Confidence 467899999999 678999999999999999853
No 169
>PF00011 HSP20: Hsp20/alpha crystallin family This prints entry is a subset of the Pfam entry.; InterPro: IPR002068 Prokaryotic and eukaryotic organisms respond to heat shock or other environmental stress by inducing the synthesis of proteins collectively known as heat-shock proteins (hsp) []. Amongst them is a family of proteins with an average molecular weight of 20 Kd, known as the hsp20 proteins []. These seem to act as chaperones that can protect other proteins against heat-induced denaturation and aggregation. Hsp20 proteins seem to form large heterooligomeric aggregates. Structurally, this family is characterised by the presence of a conserved C-terminal domain of about 100 residues.; PDB: 2BOL_B 3N3E_B 2H50_P 2H53_F 2BYU_L 1GME_D 3VQM_J 3VQK_E 3VQL_A 3AAC_A ....
Probab=34.54 E-value=56 Score=25.57 Aligned_cols=38 Identities=13% Similarity=0.241 Sum_probs=28.8
Q ss_pred cCceEEEEEecC-CCCCCceEEEEECCEEEEEEEEeccc
Q psy5221 333 DENKYRITMAIA-GFNRSDLEIETERDTLRITGRKQADT 370 (436)
Q Consensus 333 ~~~~~~v~~~lp-G~~~edi~v~~~~~~l~i~g~~~~~~ 370 (436)
....|.-.+.|| +++.+.|+-.+++|.|+|+..+....
T Consensus 53 ~~~~f~r~~~lP~~vd~~~i~a~~~~GvL~I~~pk~~~~ 91 (102)
T PF00011_consen 53 RYGSFERSIRLPEDVDPDKIKASYENGVLTITIPKKEEE 91 (102)
T ss_dssp -SEEEEEEEE-STTB-GGG-EEEETTSEEEEEEEBSSSC
T ss_pred ccceEEEEEcCCCcCCcceEEEEecCCEEEEEEEccccc
Confidence 345788899998 67899999999999999999887654
No 170
>PF13402 M60-like: Peptidase M60-like family; PDB: 4FCA_A.
Probab=33.90 E-value=1.9e+02 Score=27.66 Aligned_cols=64 Identities=9% Similarity=0.045 Sum_probs=30.5
Q ss_pred HHHHHHHHHhcCCCCC----------e-EEEECC---CccEEEeCCCCCccEEEEcHHHHhh-cCHHHH---HHHHHHHH
Q psy5221 65 LKTVNKISKNANISTP----------E-VAIYNG---EINAFATGAFKNSSLIAISANLLES-MTKLEI---KAIIAHEI 126 (436)
Q Consensus 65 ~~~l~~l~~~~g~~~p----------~-v~v~~~---~~NA~~~G~~~~~~~v~it~gLl~~-l~~~El---~aVLAHEl 126 (436)
.++++...+-+|++.. . -++.+. ...+++.| .++.+... -.+. ++.+.+ .=-++||+
T Consensus 153 D~ii~~~~el~Gl~~~~~~~~~~~~~~~r~v~~v~~~~g~m~a~g----~~i~~~~~-~~~~~l~~~~~~~~~WG~~HE~ 227 (307)
T PF13402_consen 153 DRIIDAEYELAGLDKSSPGPENNPMPNNRFVFDVQISAGYMHASG----YPIGFPPN-WMNELLNPNPLRKGGWGPWHEL 227 (307)
T ss_dssp HHHHHHHHHHTT-BSS--GGGB--S--EEEEEETT----SEEEET----TEEEEETT---HHHH-HHHHHHH-HHHHHHH
T ss_pred HHHHHHHHHHhCCCcccCCccccCcccceEEEeccccccceeecC----CcEEeeCc-HHhcccCHhHcCCCCeeehhhh
Confidence 3455555556777641 1 233333 24555555 23333333 3333 366655 44689999
Q ss_pred HhhhcCh
Q psy5221 127 SHISNGD 133 (436)
Q Consensus 127 gHi~~~h 133 (436)
||..+.-
T Consensus 228 GH~~Q~~ 234 (307)
T PF13402_consen 228 GHNHQQG 234 (307)
T ss_dssp HHHH-BG
T ss_pred hhhcCcc
Confidence 9998866
No 171
>PF08014 DUF1704: Domain of unknown function (DUF1704); InterPro: IPR012548 This family contains many hypothetical proteins.
Probab=33.62 E-value=1e+02 Score=30.67 Aligned_cols=63 Identities=17% Similarity=0.295 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHhc----CCCCCeEEEECC-CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHH-hhhcCh
Q psy5221 63 WLLKTVNKISKNA----NISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEIS-HISNGD 133 (436)
Q Consensus 63 ~L~~~l~~l~~~~----g~~~p~v~v~~~-~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElg-Hi~~~h 133 (436)
.+...+++.+++. +++ ..|.+-++ ..+|.+.+ ++ |.+..+ ..+++.++.+++-||+| |+....
T Consensus 113 ~~~~~~~~~~~~y~~~~~~~-~~V~~sddl~a~A~v~~----~~-l~I~~~--~~fs~~~l~~L~~HEigvH~lt~~ 181 (349)
T PF08014_consen 113 EAVSRLQERLKKYFGKEGFE-VKVELSDDLLARAMVSG----DR-LKINKN--AMFSERDLEALLHHEIGVHLLTTL 181 (349)
T ss_pred HHHHHHHHHHHHHhcccCce-EEEEEcCCcchhhcccC----Ce-eEEcCC--CCcCHHHHHHHHHHhhhhhhcccc
Confidence 3667777777776 443 35666666 78886664 22 666652 24599999999999996 555443
No 172
>KOG1046|consensus
Probab=33.36 E-value=79 Score=35.66 Aligned_cols=66 Identities=24% Similarity=0.373 Sum_probs=37.7
Q ss_pred HHHHHHHHhcCCC--CC--eEEEECC-CccEEEeCCCCCccEEEEcHH-HH-hh----c-CHHHHHHHHHHHHHhhhcCh
Q psy5221 66 KTVNKISKNANIS--TP--EVAIYNG-EINAFATGAFKNSSLIAISAN-LL-ES----M-TKLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 66 ~~l~~l~~~~g~~--~p--~v~v~~~-~~NA~~~G~~~~~~~v~it~g-Ll-~~----l-~~~El~aVLAHElgHi~~~h 133 (436)
+.++-..+-.|++ .| ++..+++ ...|+-- -+.|.+... |+ +. . +...++-|+|||+||..-|.
T Consensus 265 ~~L~~~e~~f~i~yPLpK~D~iavPdf~~GAMEN-----wGLvtyre~~lL~~~~~ss~~~k~~va~vIaHElAHQWFGN 339 (882)
T KOG1046|consen 265 KVLEFYEDYFGIPYPLPKLDLVAVPDFSAGAMEN-----WGLVTYRETALLYDPQTSSSSNKQRVAEVIAHELAHQWFGN 339 (882)
T ss_pred HHHHHHHHHhCCCCCCccccEEecCCccccchhc-----CcceeeeehhhccCCCcCcHHHHHHHHHHHHHHHHHHHhcC
Confidence 4555555666775 44 4455555 4444322 122333322 21 11 1 35669999999999999998
Q ss_pred HHH
Q psy5221 134 MVT 136 (436)
Q Consensus 134 ~~~ 136 (436)
...
T Consensus 340 LVT 342 (882)
T KOG1046|consen 340 LVT 342 (882)
T ss_pred ccc
Confidence 654
No 173
>PF00347 Ribosomal_L6: Ribosomal protein L6; InterPro: IPR020040 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. L6 is a protein from the large (50S) subunit. In Escherichia coli, it is located in the aminoacyl-tRNA binding site of the peptidyltransferase centre, and is known to bind directly to 23S rRNA. It belongs to a family of ribosomal proteins, including L6 from bacteria, cyanelles (structures that perform similar functions to chloroplasts, but have structural and biochemical characteristics of Cyanobacteria) and mitochondria; and L9 from mammals, Drosophila, plants and yeast. L6 contains two domains with almost identical folds, suggesting that is was derived by the duplication of an ancient RNA-binding protein gene. Analysis reveals several sites on the protein surface where interactions with other ribosome components may occur, the N terminus being involved in protein-protein interactions and the C terminus containing possible RNA-binding sites []. This entry represents the alpha-beta domain found duplicated in ribosomal L6 proteins. This domain consists of two beta-sheets and one alpha-helix packed around single core [].; GO: 0003735 structural constituent of ribosome, 0019843 rRNA binding, 0006412 translation, 0005840 ribosome; PDB: 2HGJ_H 2HGQ_H 2HGU_H 1S1I_H 3O5H_I 3O58_I 3J16_F 3IZS_F 2V47_H 2WDJ_H ....
Probab=33.28 E-value=97 Score=22.79 Aligned_cols=47 Identities=9% Similarity=0.222 Sum_probs=32.7
Q ss_pred CCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEec
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIR 413 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~pk 413 (436)
|+.++|+++++.++++|.+ |. .++.+|+.+ .++...+|+.+++....
T Consensus 2 P~gV~v~~~~~~i~v~G~~----------------g~--l~~~~~~~v-~v~~~~~~~~~~~~~~~ 48 (77)
T PF00347_consen 2 PEGVKVTIKGNIITVKGPK----------------GE--LSRPIPPGV-KVEIKVEDNKITVSVLS 48 (77)
T ss_dssp STTCEEEEETTEEEEESSS----------------SE--EEEEETTTE-EEEEEEETTSEEEEEEE
T ss_pred CCcEEEEEeCcEEEEECCC----------------Ee--EEEECCCCe-eEEEEcCCCceEEEECc
Confidence 6789999999999999764 22 446777762 23333668877776654
No 174
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=32.98 E-value=24 Score=34.48 Aligned_cols=19 Identities=21% Similarity=0.391 Sum_probs=16.2
Q ss_pred cCHHHHHHHHHHHHHhhhc
Q psy5221 113 MTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 113 l~~~El~aVLAHElgHi~~ 131 (436)
+.+-.+=+.++||+||...
T Consensus 191 ~p~~~~P~T~~HElAHq~G 209 (318)
T PF12725_consen 191 LPPYSLPFTICHELAHQLG 209 (318)
T ss_pred CCcccccHHHHHHHHHHhC
Confidence 4566788999999999987
No 175
>PF01954 DUF104: Protein of unknown function DUF104; InterPro: IPR008203 This family includes short archaebacterial proteins of unknown function. Archaeoglobus fulgidus has twelve copies of this protein, with several being clustered together in the genome.; PDB: 2NWT_A.
Probab=32.89 E-value=49 Score=23.70 Aligned_cols=17 Identities=24% Similarity=0.372 Sum_probs=12.0
Q ss_pred eEeEEeeCcEEEEEEec
Q psy5221 397 VVAAFFDQGILNIDLIR 413 (436)
Q Consensus 397 ~~~A~~~~GvL~i~~pk 413 (436)
.|+|.|+||+|+--=|-
T Consensus 4 ~I~aiYe~GvlkPl~~~ 20 (60)
T PF01954_consen 4 VIEAIYENGVLKPLEPV 20 (60)
T ss_dssp -EEEEEETTEEEECS--
T ss_pred eEEEEEECCEEEECCCC
Confidence 48999999999855443
No 176
>cd06160 S2P-M50_like_2 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with additional putative N- and C-terminal transmembrane spanning regions, relative to the core protein, and no PDZ domains.
Probab=32.56 E-value=29 Score=31.03 Aligned_cols=16 Identities=19% Similarity=0.220 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHhhhc
Q psy5221 116 LEIKAIIAHEISHISN 131 (436)
Q Consensus 116 ~El~aVLAHElgHi~~ 131 (436)
-=+..+..||+||..-
T Consensus 39 ~l~~~l~iHElgH~~~ 54 (183)
T cd06160 39 ALLAILGIHEMGHYLA 54 (183)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3466788999999753
No 177
>cd06478 ACD_HspB4-5-6 Alpha-crystallin domain found in alphaA-crystallin (HspB4), alphaB-crystallin (HspB5), and the small heat shock protein (sHsp) HspB6, also known as Hsp20. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. Only trace amounts of HspB4 are found in tissues other than the lens. HspB5 on the other hand is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. HspB5's functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its ol
Probab=32.30 E-value=59 Score=24.73 Aligned_cols=30 Identities=13% Similarity=0.335 Sum_probs=25.6
Q ss_pred eEEEEEecC-CCCCCceEEEE-ECCEEEEEEE
Q psy5221 336 KYRITMAIA-GFNRSDLEIET-ERDTLRITGR 365 (436)
Q Consensus 336 ~~~v~~~lp-G~~~edi~v~~-~~~~l~i~g~ 365 (436)
+|.=.+.|| +++++.|+=++ .+|.|+|+..
T Consensus 51 ef~R~~~LP~~vd~~~i~A~~~~dGvL~I~~P 82 (83)
T cd06478 51 EFHRRYRLPPGVDPAAITSSLSADGVLTISGP 82 (83)
T ss_pred EEEEEEECCCCcChHHeEEEECCCCEEEEEec
Confidence 388889998 69999999999 5999999853
No 178
>KOG3607|consensus
Probab=32.22 E-value=29 Score=37.84 Aligned_cols=42 Identities=19% Similarity=0.184 Sum_probs=26.0
Q ss_pred CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 88 EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 88 ~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
...||+.|+.......-+.. ...-+....+.|+||||||...
T Consensus 295 ~G~a~~~~mCs~~~s~gv~~--~~~~~~~~~a~v~AhelgH~lG 336 (716)
T KOG3607|consen 295 VGLAYFGGMCSPGHSGGVNK--FHSDILLAFAVVLAHELGHNLG 336 (716)
T ss_pred eceeecccccCcccccceee--cCcccchhHHHHHHHHHHhhcC
Confidence 55677777654332232222 1111467889999999999865
No 179
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=32.20 E-value=1.4e+02 Score=21.45 Aligned_cols=41 Identities=10% Similarity=0.191 Sum_probs=29.5
Q ss_pred ceeEEEecCceEEEEEecCCCCCCceEEEEECC-EEEEEEEE
Q psy5221 326 PYNIELVDENKYRITMAIAGFNRSDLEIETERD-TLRITGRK 366 (436)
Q Consensus 326 ~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~-~l~i~g~~ 366 (436)
|+.+..-....|.|++..||+.+..-+|.+..+ ...|+.+-
T Consensus 26 p~~~~~l~~G~~~v~v~~~Gy~~~~~~v~v~~~~~~~v~~~L 67 (71)
T PF08308_consen 26 PLTLKDLPPGEHTVTVEKPGYEPYTKTVTVKPGETTTVNVTL 67 (71)
T ss_pred cceeeecCCccEEEEEEECCCeeEEEEEEECCCCEEEEEEEE
Confidence 344542256789999999999988888888744 66666553
No 180
>PF14913 DPCD: DPCD protein family
Probab=31.72 E-value=2.1e+02 Score=25.59 Aligned_cols=73 Identities=8% Similarity=0.235 Sum_probs=50.1
Q ss_pred CceeEEEecCceEEEEEecCCCCCCceEEEEECC--EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCC--------c
Q psy5221 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERD--TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLAD--------H 394 (436)
Q Consensus 325 p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~--~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~--------~ 394 (436)
-|+-+.+++..+|+-++-===+.+|--+|+++++ .++|+-. .++ |.++|.+|+ +
T Consensus 86 nP~~~r~dTk~~fqWRIRNLPYP~dvYsVtvd~~~r~ivvRTt------NKK----------YyKk~~IPDl~R~~l~l~ 149 (194)
T PF14913_consen 86 NPIFVRRDTKTSFQWRIRNLPYPKDVYSVTVDEDERCIVVRTT------NKK----------YYKKFSIPDLDRCGLPLE 149 (194)
T ss_pred CCEEEEEcCccceEEEEccCCCCccceEEEEcCCCcEEEEECc------Ccc----------ceeEecCCcHHhhCCCcc
Confidence 4555556778888888753335678788888764 6888732 222 445666664 2
Q ss_pred cceEeEEeeCcEEEEEEec
Q psy5221 395 VHVVAAFFDQGILNIDLIR 413 (436)
Q Consensus 395 v~~~~A~~~~GvL~i~~pk 413 (436)
-+.++-...|..|.|+-.|
T Consensus 150 ~~~ls~~h~nNTLIIsYkK 168 (194)
T PF14913_consen 150 QSALSFAHQNNTLIISYKK 168 (194)
T ss_pred hhhceeeeecCeEEEEecC
Confidence 2248999999999999988
No 181
>KOG3413|consensus
Probab=31.16 E-value=51 Score=28.03 Aligned_cols=30 Identities=13% Similarity=0.350 Sum_probs=18.8
Q ss_pred EeEEeeCcEEEEEEecC----CCCCCCCeEeeec
Q psy5221 398 VAAFFDQGILNIDLIRE----VPEALKPRKIYIN 427 (436)
Q Consensus 398 ~~A~~~~GvL~i~~pk~----~~~~~~~~~I~I~ 427 (436)
-.+.|.||||+|.++-. ..+....++|=..
T Consensus 74 ~Dv~y~~GVLTl~lg~~GTYViNKQ~PnkQIWlS 107 (156)
T KOG3413|consen 74 FDVDYADGVLTLKLGSVGTYVINKQPPNKQIWLS 107 (156)
T ss_pred cccccccceEEEEecCceeEEecCCCCcceeeee
Confidence 45679999999999741 1233344555554
No 182
>cd06159 S2P-M50_PDZ_Arch Uncharacterized Archaeal homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group appears to be limited to Archaeal S2P/M50s homologs with additional putative N-terminal transmembrane spanning regions, relative to the core protein, and either one or two PDZ domains present.
Probab=30.95 E-value=29 Score=32.91 Aligned_cols=15 Identities=27% Similarity=0.372 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHhhhc
Q psy5221 117 EIKAIIAHEISHISN 131 (436)
Q Consensus 117 El~aVLAHElgHi~~ 131 (436)
=+.+|+.||+||..-
T Consensus 117 L~isv~iHElgHa~~ 131 (263)
T cd06159 117 LVVGVVVHELSHGIL 131 (263)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456799999999753
No 183
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. Its functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=30.62 E-value=72 Score=24.36 Aligned_cols=31 Identities=10% Similarity=0.249 Sum_probs=26.7
Q ss_pred eEEEEEecC-CCCCCceEEEEE-CCEEEEEEEE
Q psy5221 336 KYRITMAIA-GFNRSDLEIETE-RDTLRITGRK 366 (436)
Q Consensus 336 ~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g~~ 366 (436)
+|.=.+.|| +++++.|+=+.+ +|.|+|+..+
T Consensus 51 eF~R~~~LP~~vd~~~i~A~~~~dGvL~I~lPk 83 (84)
T cd06498 51 EFQRKYRIPADVDPLTITSSLSPDGVLTVCGPR 83 (84)
T ss_pred EEEEEEECCCCCChHHcEEEeCCCCEEEEEEeC
Confidence 488889998 799999999995 9999998654
No 184
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=30.12 E-value=13 Score=35.84 Aligned_cols=12 Identities=42% Similarity=0.590 Sum_probs=10.3
Q ss_pred HHHHHHHHHhhh
Q psy5221 119 KAIIAHEISHIS 130 (436)
Q Consensus 119 ~aVLAHElgHi~ 130 (436)
..|++||+||.-
T Consensus 166 igv~~HE~gH~l 177 (286)
T TIGR03296 166 VGVIAHELGHDL 177 (286)
T ss_pred eeeeehhhhccc
Confidence 699999999953
No 185
>PF13398 Peptidase_M50B: Peptidase M50B-like
Probab=30.00 E-value=32 Score=31.17 Aligned_cols=15 Identities=20% Similarity=0.437 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHhhh
Q psy5221 116 LEIKAIIAHEISHIS 130 (436)
Q Consensus 116 ~El~aVLAHElgHi~ 130 (436)
=.+..++.||+||..
T Consensus 20 ~~~l~t~~HE~gHal 34 (200)
T PF13398_consen 20 FRLLVTFVHELGHAL 34 (200)
T ss_pred HHHHHHHHHHHHHHH
Confidence 356679999999964
No 186
>PF09471 Peptidase_M64: IgA Peptidase M64; InterPro: IPR019026 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This is a family of highly selective metallo-endopeptidases belonging to the MEROPS peptidase family M64 (IgA peptidase, clan MA). The primary structure of the Clostridium ramosum IgA peptidase shows no significant overall similarity to any other known metallo-endopeptidase []. ; PDB: 3P1V_A 4DF9_D.
Probab=29.64 E-value=32 Score=32.64 Aligned_cols=17 Identities=24% Similarity=0.421 Sum_probs=13.3
Q ss_pred CHHHHHHHHHHHHHhhh
Q psy5221 114 TKLEIKAIIAHEISHIS 130 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~ 130 (436)
+......|+.||+||--
T Consensus 212 ~~~~~~~v~vHE~GHsf 228 (264)
T PF09471_consen 212 DNPSFKQVVVHEFGHSF 228 (264)
T ss_dssp TSTTHHHHHHHHHHHHT
T ss_pred Ccccccceeeeeccccc
Confidence 33478999999999943
No 187
>cd06480 ACD_HspB8_like Alpha-crystallin domain (ACD) found in mammalian 21.6 KDa small heat shock protein (sHsp) HspB8, also denoted as Hsp22 in humans, and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. A chaperone complex formed of HspB8 and Bag3 stimulates degradation of protein complexes by macroautophagy. HspB8 also forms complexes with Hsp27 (HspB1), MKBP (HspB2), HspB3, alphaB-crystallin (HspB5), Hsp20 (HspB6), and cvHsp (HspB7). These latter interactions may depend on phosphorylation of the respective partner sHsp. HspB8 may participate in the regulation of cell proliferation, cardiac hypertrophy, apoptosis, and carcinogenesis. Point mutations in HspB8 have been correlated with the development of several congenital neurological diseases, including Charcot Marie tooth disease and distal motor neuropathy type II.
Probab=29.57 E-value=1e+02 Score=24.10 Aligned_cols=30 Identities=7% Similarity=0.123 Sum_probs=26.6
Q ss_pred ceEEEEEecC-CCCCCceEEEEE-CCEEEEEE
Q psy5221 335 NKYRITMAIA-GFNRSDLEIETE-RDTLRITG 364 (436)
Q Consensus 335 ~~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g 364 (436)
.+|.=.+.|| +++.++|+=.+. ++.|+|++
T Consensus 58 r~F~R~~~LP~~Vd~~~v~s~l~~dGvL~Iea 89 (91)
T cd06480 58 KNFTKKIQLPPEVDPVTVFASLSPEGLLIIEA 89 (91)
T ss_pred EEEEEEEECCCCCCchhEEEEeCCCCeEEEEc
Confidence 5688899998 899999999998 88999985
No 188
>COG0097 RplF Ribosomal protein L6P/L9E [Translation, ribosomal structure and biogenesis]
Probab=29.47 E-value=1.7e+02 Score=25.93 Aligned_cols=47 Identities=19% Similarity=0.357 Sum_probs=31.6
Q ss_pred CCCCceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEeeCcEEEEEEe
Q psy5221 346 FNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLI 412 (436)
Q Consensus 346 ~~~edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~~~GvL~i~~p 412 (436)
..|++++|+++++.++++|.+ |+.+|.| +..+ ++-..+|+.+.+...
T Consensus 10 ~~P~gV~V~i~~~~v~vkGpk----------------GeL~~~~--~~~~--v~v~~~~~~~vv~~~ 56 (178)
T COG0097 10 VIPAGVTVSIEGQVVTVKGPK----------------GELTREF--HDNV--VKVEVEDNILVVRPV 56 (178)
T ss_pred ecCCCeEEEEeccEEEEECCC----------------cEEEEEe--cCcc--eEEEecCCEEEEeec
Confidence 348999999999999999875 3444333 3333 566666776655543
No 189
>COG4004 Uncharacterized protein conserved in archaea [Function unknown]
Probab=28.71 E-value=2.7e+02 Score=21.76 Aligned_cols=48 Identities=21% Similarity=0.369 Sum_probs=35.3
Q ss_pred chHHHHhhhhhhhhcccCCCCCceeEEEecCceEEEEEecCCCCCCceEEEEECCEEEEEEE
Q psy5221 304 IGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGR 365 (436)
Q Consensus 304 ~~~d~l~~~~~~~~~~~~~~~p~~~i~~e~~~~~~v~~~lpG~~~edi~v~~~~~~l~i~g~ 365 (436)
..-|++.+.++..+ +++.+ .+| .+....||+ +.|+|+.+++.|.|.+.
T Consensus 12 ~~~dri~~~l~e~g---------~~v~~-eGD--~ivas~pgi--s~ieik~E~kkL~v~t~ 59 (96)
T COG4004 12 PDPDRIMRGLSELG---------WTVSE-EGD--RIVASSPGI--SRIEIKPENKKLLVNTT 59 (96)
T ss_pred CCHHHHHHHHHHhC---------eeEee-ccc--EEEEecCCc--eEEEEecccceEEEecc
Confidence 34577777776643 46664 444 778899999 56889999999999873
No 190
>TIGR03750 conj_TIGR03750 conjugative transfer region protein, TIGR03750 family. Members of this protein family are found occasionally on plasmids. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=28.56 E-value=3.2e+02 Score=22.21 Aligned_cols=41 Identities=15% Similarity=0.217 Sum_probs=28.6
Q ss_pred HHHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCCCe
Q psy5221 33 SIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANISTPE 81 (436)
Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~p~ 81 (436)
.+...++..+.+.+..+.+ | +.++++.++....+.++....
T Consensus 59 ~~l~v~~gg~~l~rlKRGr----P----e~yl~r~l~~~~~~~~l~~~~ 99 (111)
T TIGR03750 59 PILVVLIGGKLLARLKRGK----P----EGYLYRKLEWKLARLGLGRHR 99 (111)
T ss_pred HHHHHHHhHHHHHHHHcCC----C----chHHHHHHHHHHHHcCCCCCC
Confidence 3455666777777777766 3 334999999999998875433
No 191
>cd06476 ACD_HspB2_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB2/heat shock 27kDa protein 2 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. HspB2 is preferentially and constitutively expressed in skeletal muscle and heart. HspB2 shows homooligomeric activity and forms aggregates in muscle cytosol. Although its expression is not induced by heat shock, it redistributes to the insoluble fraction in response to heat shock. In the mouse heart, HspB2 plays a role in maintaining energetic balance, by protecting cardiac energetics during ischemia/reperfusion, and allowing for increased work during acute inotropic challenge. hHspB2 [previously also known as myotonic dystrophy protein kinase (DMPK) binding protein (MKBP)] is selectively up-regulated in skeletal muscles from myotonic dystrophy patients.
Probab=28.31 E-value=1e+02 Score=23.51 Aligned_cols=30 Identities=10% Similarity=0.100 Sum_probs=25.7
Q ss_pred eEEEEEecC-CCCCCceEEEEE-CCEEEEEEE
Q psy5221 336 KYRITMAIA-GFNRSDLEIETE-RDTLRITGR 365 (436)
Q Consensus 336 ~~~v~~~lp-G~~~edi~v~~~-~~~l~i~g~ 365 (436)
+|.=.+.|| +++++.|+=+.. +|.|+|+..
T Consensus 51 eF~R~~~LP~~vd~~~v~A~~~~dGvL~I~~P 82 (83)
T cd06476 51 EFTRTYILPMDVDPLLVRASLSHDGILCIQAP 82 (83)
T ss_pred EEEEEEECCCCCChhhEEEEecCCCEEEEEec
Confidence 488889999 799999999996 999999853
No 192
>PF14521 Aspzincin_M35: Lysine-specific metallo-endopeptidase ; PDB: 2X3C_A 2X3A_A 2X3B_A 1GE7_B 1GE6_A 1GE5_A 1G12_A.
Probab=28.27 E-value=80 Score=27.03 Aligned_cols=40 Identities=18% Similarity=0.174 Sum_probs=24.1
Q ss_pred ccEEEeCCCCCccEEEEcHHHHhhc--CHHHHHHHHHHHHHhh
Q psy5221 89 INAFATGAFKNSSLIAISANLLESM--TKLEIKAIIAHEISHI 129 (436)
Q Consensus 89 ~NA~~~G~~~~~~~v~it~gLl~~l--~~~El~aVLAHElgHi 129 (436)
.-|++....+ ...|.+-......- ..+.-+++|.||+.|+
T Consensus 66 ~~a~~~~~~~-~~~IyLc~~F~~~p~~g~~Sk~~TLiHE~SHf 107 (148)
T PF14521_consen 66 TYAYVYPDSP-TYTIYLCPAFFSAPTTGKDSKEGTLIHEWSHF 107 (148)
T ss_dssp SSEEE-TTST--TEEEE-HHHHHS-SSSTT-HHHHHHHHHHHS
T ss_pred eEEEEECCCC-ceEEEEChhhcCCCCCCCCchHHHHHHhhhhh
Confidence 3444444322 24688888777742 4567799999999994
No 193
>cd06162 S2P-M50_PDZ_SREBP Sterol regulatory element-binding protein (SREBP) Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50A), regulates intramembrane proteolysis (RIP) of SREBP and is part of a signal transduction mechanism involved in sterol and lipid metabolism. In sterol-depleted mammalian cells, a two-step proteolytic process releases the N-terminal domains of SREBPs from membranes of the endoplasmic reticulum (ER). These domains translocate into the nucleus, where they activate genes of cholesterol and fatty acid biosynthesis. The first cleavage occurs at Site-1 within the ER lumen to generate an intermediate that is subsequently released from the membrane by cleavage at Site-2, which lies within the first transmembrane domain. It is the second proteolytic step that is carried out by the SREBP Site-2 protease (S2P) which is present in this CD family. This group appears to be limited to eumetazoan proteins and contains one PDZ domain.
Probab=28.07 E-value=35 Score=32.58 Aligned_cols=15 Identities=20% Similarity=0.233 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHhhhc
Q psy5221 117 EIKAIIAHEISHISN 131 (436)
Q Consensus 117 El~aVLAHElgHi~~ 131 (436)
=+.+|+.||+||..-
T Consensus 134 l~isvvvHElgHal~ 148 (277)
T cd06162 134 LLISGVVHEMGHGVA 148 (277)
T ss_pred HHHHHHHHHHHHHHH
Confidence 377899999999754
No 194
>KOG0175|consensus
Probab=25.80 E-value=1.4e+02 Score=28.03 Aligned_cols=58 Identities=12% Similarity=0.157 Sum_probs=48.7
Q ss_pred CCeEEEECC-----CccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChHHH
Q psy5221 79 TPEVAIYNG-----EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 79 ~p~v~v~~~-----~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~~~ 136 (436)
-|.+|.+|+ .-|.|..|.+....+=++++|.=-.|+++|-..+----+.|-.+||...
T Consensus 179 GP~lyYVDseG~Rl~G~~FSVGSGs~yAYGVLDsgYr~dls~eEA~~L~rrAI~hAThRDayS 241 (285)
T KOG0175|consen 179 GPGLYYVDSEGTRLSGDLFSVGSGSTYAYGVLDSGYRYDLSDEEAYDLARRAIYHATHRDAYS 241 (285)
T ss_pred CCceEEEcCCCCEecCceEeecCCCceeEEeeccCCCCCCCHHHHHHHHHHHHHHHHhccccc
Confidence 478999988 7899999998877777888888778999998888888888888888653
No 195
>cd02175 GH16_lichenase lichenase, member of glycosyl hydrolase family 16. Lichenase, also known as 1,3-1,4-beta-glucanase, is a member of glycosyl hydrolase family 16, that specifically cleaves 1,4-beta-D-glucosidic bonds in mixed-linked beta glucans that also contain 1,3-beta-D-glucosidic linkages. Natural substrates of beta-glucanase are beta-glucans from grain endosperm cell walls or lichenan from the Islandic moss, Cetraria islandica. This protein is found not only in bacteria but also in anaerobic fungi. This domain includes two seven-stranded antiparallel beta-sheets that are adjacent to one another forming a compact, jellyroll beta-sandwich structure.
Probab=25.72 E-value=2.7e+02 Score=25.18 Aligned_cols=53 Identities=23% Similarity=0.324 Sum_probs=31.2
Q ss_pred CCceEEEEECCEEEEEEEEecccC----CCceEEEe-eeeeeEEEEEECCCccceEeEEe
Q psy5221 348 RSDLEIETERDTLRITGRKQADTQ----QRNFLHRG-IATRNFEHRFQLADHVHVVAAFF 402 (436)
Q Consensus 348 ~edi~v~~~~~~l~i~g~~~~~~~----~~~~~~~e-~~~~~f~r~~~lP~~v~~~~A~~ 402 (436)
+++++ ++++.|.|+..++.... ......++ ..+|.|+-++++|...-...|-+
T Consensus 31 ~~nv~--v~~g~L~l~~~~~~~~~~~~tsg~i~S~~~f~yG~~ear~k~~~~~G~~~Afw 88 (212)
T cd02175 31 ADNVE--FSDGGLALTLTNDTYGEKPYACGEYRTRGFYGYGRYEVRMKPAKGSGVVSSFF 88 (212)
T ss_pred cccEE--EECCeEEEEEeCCcCCCCccccceEEECceEEeeEEEEEEEcCCCCeEEEEEE
Confidence 45544 44788888877654211 11121112 57999999999997533444444
No 196
>COG4823 AbiF Abortive infection bacteriophage resistance protein [Defense mechanisms]
Probab=25.54 E-value=45 Score=31.27 Aligned_cols=14 Identities=43% Similarity=0.608 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHh
Q psy5221 115 KLEIKAIIAHEISH 128 (436)
Q Consensus 115 ~~El~aVLAHElgH 128 (436)
+-+++.||||++||
T Consensus 89 Et~iKs~iAyllg~ 102 (299)
T COG4823 89 ETEIKSVIAYLLGH 102 (299)
T ss_pred HHHHHHHHHHHhcc
Confidence 45789999999999
No 197
>PHA02456 zinc metallopeptidase motif-containing protein
Probab=25.19 E-value=38 Score=27.28 Aligned_cols=30 Identities=20% Similarity=0.276 Sum_probs=18.5
Q ss_pred cEEEEcHHHHhhcCHHHHHHHHHHHHHhhhcChH
Q psy5221 101 SLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134 (436)
Q Consensus 101 ~~v~it~gLl~~l~~~El~aVLAHElgHi~~~h~ 134 (436)
..|.++.. ..+.--+--|+||++|+.+...
T Consensus 66 ~~i~IDP~----~~~KGC~~TL~HEL~H~WQ~Rs 95 (141)
T PHA02456 66 GWIEIDPD----YANKGCRDTLAHELNHAWQFRT 95 (141)
T ss_pred eEEEECCc----ccccchHHHHHHHHHHHHhhhc
Confidence 34555541 2233455678999999987553
No 198
>PRK11767 SpoVR family protein; Provisional
Probab=24.91 E-value=1.2e+02 Score=31.44 Aligned_cols=67 Identities=16% Similarity=0.314 Sum_probs=41.1
Q ss_pred HHHHHHHHHHhcCCC--CCeEEEECC--CccEEEe-CC------------------------CC-CccEEEEcHHHHhhc
Q psy5221 64 LLKTVNKISKNANIS--TPEVAIYNG--EINAFAT-GA------------------------FK-NSSLIAISANLLESM 113 (436)
Q Consensus 64 L~~~l~~l~~~~g~~--~p~v~v~~~--~~NA~~~-G~------------------------~~-~~~~v~it~gLl~~l 113 (436)
..+.|+++|+.+|+. +..+-|++. ...|++. |+ .. .--+|+-|.+-+.-|
T Consensus 20 ~~~~I~~iA~~~GLD~yp~~~EIi~~eqml~~~as~GmP~rY~HWsfGk~y~~~~~~Y~~gl~glaYEiVINSnPciAyL 99 (498)
T PRK11767 20 YLDEIERVAKEYGLDTYPNQIEVITAEQMMDAYSSVGMPINYRHWSFGKHFIETEQLYRRGQMGLAYEIVINSNPCIAYL 99 (498)
T ss_pred HHHHHHHHHHHcCCCCCCceEEEECHHHHHHHHHhcCCCCCCCcccccHHHHHHHHHHhcCCCCCceEEEecCchHHHHH
Confidence 467788899999997 345556665 4444443 21 11 123455555554444
Q ss_pred ----CHHHHHHHHHH-HHHhhh
Q psy5221 114 ----TKLEIKAIIAH-EISHIS 130 (436)
Q Consensus 114 ----~~~El~aVLAH-ElgHi~ 130 (436)
+.-.-+-|+|| ++||..
T Consensus 100 me~Ntl~~q~LViAHv~yGHnd 121 (498)
T PRK11767 100 MEENTMTMQALVIAHACYGHNS 121 (498)
T ss_pred hccCcHHHHHHHHHHHHHhhhh
Confidence 55567899999 699963
No 199
>cd06455 M3A_TOP Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasom
Probab=23.70 E-value=39 Score=35.02 Aligned_cols=18 Identities=22% Similarity=0.289 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHhhhcCh
Q psy5221 115 KLEIKAIIAHEISHISNGD 133 (436)
Q Consensus 115 ~~El~aVLAHElgHi~~~h 133 (436)
-+++ ..|+||+||..|.-
T Consensus 261 ~~~V-~TLfHEfGHalH~~ 278 (472)
T cd06455 261 HDEV-ETFFHEFGHVIHHL 278 (472)
T ss_pred HHHH-HHHHHHHHHHHHHH
Confidence 3554 56899999998743
No 200
>PF07873 YabP: YabP family; InterPro: IPR022476 Members of this protein family are the YabP and YqfC proteins of the bacterial sporulation program, as found in Bacillus subtilis, Clostridium tetani, and other spore-forming members of the Firmicutes. ; PDB: 2KYI_B 3IPF_B 2KS0_A.
Probab=23.63 E-value=50 Score=23.96 Aligned_cols=21 Identities=24% Similarity=0.621 Sum_probs=18.2
Q ss_pred CCCCceEEEEECCEEEEEEEE
Q psy5221 346 FNRSDLEIETERDTLRITGRK 366 (436)
Q Consensus 346 ~~~edi~v~~~~~~l~i~g~~ 366 (436)
++++.|.+....+.|.|+|+.
T Consensus 23 f~~~~I~l~t~~g~l~I~G~~ 43 (66)
T PF07873_consen 23 FDDEEIRLNTKKGKLTIKGEG 43 (66)
T ss_dssp EETTEEEEEETTEEEEEEEEE
T ss_pred ECCCEEEEEeCCEEEEEECce
Confidence 567889999999999999984
No 201
>COG4227 Antirestriction protein [DNA replication, recombination, and repair]
Probab=23.58 E-value=54 Score=30.84 Aligned_cols=23 Identities=17% Similarity=0.293 Sum_probs=19.2
Q ss_pred CHHHHHHHHHHHHHhhhcChHHH
Q psy5221 114 TKLEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 114 ~~~El~aVLAHElgHi~~~h~~~ 136 (436)
+..--.+.++||+||+..+....
T Consensus 199 da~~yyaTl~HElghwtgh~~rl 221 (316)
T COG4227 199 DAINYYATLLHELGHWTGHEARL 221 (316)
T ss_pred chHhHHHHHHHHhccccCchhhh
Confidence 67788999999999998876543
No 202
>PF10988 DUF2807: Protein of unknown function (DUF2807); InterPro: IPR021255 This bacterial family of proteins has no known function. ; PDB: 3JX8_A 3LJY_C 3LYC_A 3PET_A.
Probab=23.43 E-value=3.3e+02 Score=23.53 Aligned_cols=61 Identities=13% Similarity=0.245 Sum_probs=0.0
Q ss_pred EEEEECC-EEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCc-cceEeEEeeCcEEEEEEecCCCCCCCCeEeeecCC
Q psy5221 352 EIETERD-TLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADH-VHVVAAFFDQGILNIDLIREVPEALKPRKIYINTT 429 (436)
Q Consensus 352 ~v~~~~~-~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~-v~~~~A~~~~GvL~i~~pk~~~~~~~~~~I~I~~~ 429 (436)
+|++.++ .+.+. ....++-++.-|++ ++.++...+||.|.|...+. -...++-+|.|...
T Consensus 4 ~i~v~~~~~V~l~-----------------~g~~~~v~v~~~~~l~~~i~~~v~~g~L~I~~~~~-~~~~~~~~v~V~~~ 65 (181)
T PF10988_consen 4 KIEVSGNIEVELV-----------------QGDSPSVEVEADENLLDRIKVEVKDGTLKISYKKN-ISGSKPVKVRVTAP 65 (181)
T ss_dssp EEEEESSCEEEEE-----------------E-SS-EEEEEEEHHHHCCEEEEEETTEEEEEE-SC-CTCTSTEEEEEEES
T ss_pred EEEEcCCeEEEEE-----------------ECCCcEEEEEEChhhcceEEEEEECCEEEEEECCC-cCCCccEEEEEEcC
Q ss_pred C
Q psy5221 430 K 430 (436)
Q Consensus 430 ~ 430 (436)
.
T Consensus 66 ~ 66 (181)
T PF10988_consen 66 S 66 (181)
T ss_dssp -
T ss_pred c
No 203
>COG0308 PepN Aminopeptidase N [Amino acid transport and metabolism]
Probab=22.95 E-value=51 Score=37.04 Aligned_cols=21 Identities=38% Similarity=0.645 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHhhhcChHHH
Q psy5221 116 LEIKAIIAHEISHISNGDMVT 136 (436)
Q Consensus 116 ~El~aVLAHElgHi~~~h~~~ 136 (436)
+..+.|+|||++|-..|+...
T Consensus 305 ~~~~~viaHElaHqWfGnlVT 325 (859)
T COG0308 305 ENVEEVIAHELAHQWFGNLVT 325 (859)
T ss_pred HHHHHHHHHHHhhhcccceee
Confidence 567779999999999999654
No 204
>COG5504 Predicted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=22.92 E-value=67 Score=30.06 Aligned_cols=19 Identities=37% Similarity=0.427 Sum_probs=17.1
Q ss_pred cCHHHHHHHHHHHHHhhhc
Q psy5221 113 MTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 113 l~~~El~aVLAHElgHi~~ 131 (436)
.|..++.|++|||+-|..+
T Consensus 135 ~~~~~v~aliaHE~HH~~R 153 (280)
T COG5504 135 STITSVPALIAHEYHHNCR 153 (280)
T ss_pred CCccchHHHHHHHHHhhhe
Confidence 6888999999999999766
No 205
>PF10460 Peptidase_M30: Peptidase M30; InterPro: IPR019501 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This family contains metallopeptidases belonging to MEROPS peptidase family M30 (hyicolysin family, clan MA). Hyicolysin has a zinc ion which is liganded by two histidine and one glutamate residue.
Probab=22.85 E-value=99 Score=30.84 Aligned_cols=33 Identities=24% Similarity=0.195 Sum_probs=24.0
Q ss_pred ccEEEEcHHHHhhc---CHHHHHHHHHHHHHhhhcC
Q psy5221 100 SSLIAISANLLESM---TKLEIKAIIAHEISHISNG 132 (436)
Q Consensus 100 ~~~v~it~gLl~~l---~~~El~aVLAHElgHi~~~ 132 (436)
..++.++...+..- ..+...+.||||+=|..+-
T Consensus 118 ~e~~YiD~~~~~~~~~~~~~~~~sTlAHEfQHmInf 153 (366)
T PF10460_consen 118 SEYFYIDSETLYLGGNSGPDTVYSTLAHEFQHMINF 153 (366)
T ss_pred eeEEEEecHHhhccCCccHHHHHHHHHHHHHHHHHH
Confidence 45566666554432 3678999999999999874
No 206
>PRK11198 LysM domain/BON superfamily protein; Provisional
Probab=22.48 E-value=1.1e+02 Score=26.14 Aligned_cols=28 Identities=14% Similarity=0.420 Sum_probs=23.3
Q ss_pred ecCCCCCCceEEEEECCEEEEEEEEecc
Q psy5221 342 AIAGFNRSDLEIETERDTLRITGRKQAD 369 (436)
Q Consensus 342 ~lpG~~~edi~v~~~~~~l~i~g~~~~~ 369 (436)
.=.|....+++|.++++.++++|.-...
T Consensus 36 ~~~~~~~~~i~V~v~~G~v~l~G~v~s~ 63 (147)
T PRK11198 36 SKQGLGDADVNVQVEDGKATVSGDAASQ 63 (147)
T ss_pred HhcCCCcCCceEEEeCCEEEEEEEeCCH
Confidence 3357888899999999999999987643
No 207
>cd06158 S2P-M50_like_1 Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms. Members of the S2P/M50 family of RIP proteases use proteolytic activity within the membrane to transfer information across membranes to integrate gene expression with physiologic stresses occurring in another cellular compartment. In eukaryotic cells they regulate such processes as sterol and lipid metabolism, and endoplasmic reticulum stress responses. In prokaryotes they regulate such processes as sporulation, cell division, stress response, and cell differentiation. This group includes bacterial, eukaryotic, and Archaeal S2P/M50s homologs with a minimal core protein and no PDZ domains.
Probab=22.06 E-value=55 Score=29.10 Aligned_cols=13 Identities=38% Similarity=0.521 Sum_probs=11.0
Q ss_pred HHHHHHHHHHhhh
Q psy5221 118 IKAIIAHEISHIS 130 (436)
Q Consensus 118 l~aVLAHElgHi~ 130 (436)
+-++..||+||-.
T Consensus 9 ~~~i~~HE~aHa~ 21 (181)
T cd06158 9 LLAITLHEFAHAY 21 (181)
T ss_pred HHHHHHHHHHHHH
Confidence 4588999999976
No 208
>cd04283 ZnMc_hatching_enzyme Zinc-dependent metalloprotease, hatching enzyme-like subfamily. Hatching enzymes are secreted by teleost embryos to digest the egg envelope or chorion. In some teleosts, the hatching enzyme may be a system consisting of two evolutionary related metalloproteases, high choriolytic enzyme and low choriolytic enzyme (HCE and LCE), which may have different substrate specificities and cooperatively digest the chorion.
Probab=21.95 E-value=42 Score=29.97 Aligned_cols=13 Identities=46% Similarity=0.649 Sum_probs=11.3
Q ss_pred HHHHHHHHHhhhc
Q psy5221 119 KAIIAHEISHISN 131 (436)
Q Consensus 119 ~aVLAHElgHi~~ 131 (436)
.+++.||+||...
T Consensus 78 ~G~i~HEl~HaLG 90 (182)
T cd04283 78 KGIIQHELLHALG 90 (182)
T ss_pred cchHHHHHHHHhC
Confidence 5899999999765
No 209
>PF11694 DUF3290: Protein of unknown function (DUF3290); InterPro: IPR021707 This family of proteins with unknown function appears to be restricted to Firmicutes.
Probab=21.29 E-value=2.6e+02 Score=24.03 Aligned_cols=27 Identities=15% Similarity=0.311 Sum_probs=22.7
Q ss_pred hhHHHHHHHHHHHhcCCCCCeEEEECC
Q psy5221 61 EDWLLKTVNKISKNANISTPEVAIYNG 87 (436)
Q Consensus 61 ~~~L~~~l~~l~~~~g~~~p~v~v~~~ 87 (436)
....-..++.+|+..+.+..+|++=.+
T Consensus 78 ~~~~~~fi~~vA~~~~V~~~~v~VNst 104 (149)
T PF11694_consen 78 SSQMVHFIESVAKDLGVSKEEVYVNST 104 (149)
T ss_pred HHHHHHHHHHHHHHhCCChheEEEecc
Confidence 445788999999999999999988655
No 210
>PF01400 Astacin: Astacin (Peptidase family M12A) This Prosite motif covers only the active site.; InterPro: IPR001506 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M12, subfamily M12A (astacin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA and the predicted active site residues for members of this family and thermolysin occur in the motif HEXXH []. The astacin () family of metalloendopeptidases encompasses a range of proteins found in hydra to humans, in mature and developmental systems []. Their functions include activation of growth factors, degradation of polypeptides, and processing of extracellular proteins []. The proteins are synthesised with N-terminal signal and pro-enzyme sequences, and many contain multiple domains C-terminal to the protease domain. They are either secreted from cells, or are associated with the plasma membrane. The astacin molecule adopts a kidney shape, with a deep active-site cleft between its N- and C-terminal domains []. The zinc ion, which lies at the bottom of the cleft, exhibits a unique penta-coordinated mode of binding, involving 3 histidine residues, a tyrosine and a water molecule (which is also bound to the carboxylate side chain of Glu93) []. The N-terminal domain comprises 2 alpha-helices and a 5-stranded beta-sheet. The overall topology of this domain is shared by the archetypal zinc-endopeptidase thermolysin. Astacin protease domains also share common features with serralysins, matrix metalloendopeptidases, and snake venom proteases; they cleave peptide bonds in polypeptides such as insulin B chain and bradykinin, and in proteins such as casein and gelatin; and they have arylamidase activity [].; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3LQB_A 3EDH_A 3EDG_A 3EDI_A 1IAE_A 1IAB_A 1IAA_A 1AST_A 1IAC_A 1QJJ_A ....
Probab=21.25 E-value=56 Score=29.30 Aligned_cols=62 Identities=19% Similarity=0.171 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHhcCCC---CC-----eEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 63 WLLKTVNKISKNANIS---TP-----EVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 63 ~L~~~l~~l~~~~g~~---~p-----~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
.+++.++.+.+...+. .. -+.+.++....-.+|...+...|.++.+ + .-.+++.||+||...
T Consensus 23 ~I~~Am~~~e~~TcI~F~~~~~~~~~~i~~~~~~gC~S~vG~~~g~q~i~l~~~----c---~~~~~i~HEl~HaLG 92 (191)
T PF01400_consen 23 RIRKAMDEWEKNTCIRFVERTENEDDYISFSNGSGCWSYVGRQGGEQTINLGDG----C---FSVGTILHELGHALG 92 (191)
T ss_dssp HHHHHHHHHHHHSSEEEEE-SSSSSSEEEEESSSSEEEESS--SSEEEEEE-TT----C----SHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHhCCCEEEEECCCCCceEEEeecCccccchhhhcCcceeEEecce----e---CCccchHHHHHHHHh
Confidence 4777888888776664 11 1222222222233454444455555531 2 135799999999765
No 211
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=21.15 E-value=2.3e+02 Score=20.17 Aligned_cols=23 Identities=17% Similarity=0.273 Sum_probs=10.3
Q ss_pred CchHHHHHHHHHHHHHHHHHHHH
Q psy5221 16 LNIKILTISSIIFGFTGSIISLL 38 (436)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~ 38 (436)
..+..++++++++|+++.++..+
T Consensus 18 ~pl~l~il~~f~~G~llg~l~~~ 40 (68)
T PF06305_consen 18 LPLGLLILIAFLLGALLGWLLSL 40 (68)
T ss_pred chHHHHHHHHHHHHHHHHHHHHH
Confidence 33444445555555444443333
No 212
>cd04281 ZnMc_BMP1_TLD Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
Probab=21.02 E-value=44 Score=30.30 Aligned_cols=13 Identities=31% Similarity=0.639 Sum_probs=11.3
Q ss_pred HHHHHHHHHhhhc
Q psy5221 119 KAIIAHEISHISN 131 (436)
Q Consensus 119 ~aVLAHElgHi~~ 131 (436)
.+++.|||+|...
T Consensus 88 ~Gti~HEl~HaLG 100 (200)
T cd04281 88 FGIVVHELGHVIG 100 (200)
T ss_pred CchHHHHHHHHhc
Confidence 5899999999766
No 213
>COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism]
Probab=21.00 E-value=4.2e+02 Score=27.37 Aligned_cols=64 Identities=13% Similarity=0.027 Sum_probs=43.6
Q ss_pred HHHHHHHHHHhcCCCCCeEEEECCCccEEEeCCCCCccEEEEcHHHHhhcCHHHHHHHHHHHHHhhhc
Q psy5221 64 LLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131 (436)
Q Consensus 64 L~~~l~~l~~~~g~~~p~v~v~~~~~NA~~~G~~~~~~~v~it~gLl~~l~~~El~aVLAHElgHi~~ 131 (436)
-.+.-..+.+.+|++..+ -.+|..+--|++|.+++. |-+|+ =.+.-+.-.-..=+-||.||-..
T Consensus 209 q~~~~~~vl~~lGfdf~~-GRlD~S~HPF~~g~~~~D--VRITT-Ry~~~df~~aL~g~iHE~GHAlY 272 (497)
T COG2317 209 QEALGLRVLELLGFDFER-GRLDVSVHPFTTGLPIND--VRITT-RYNEQDFRSALFGTIHETGHALY 272 (497)
T ss_pred HHHHHHHHHHHhCCCccC-ccccCCCCCccCCCCCCc--eeEEe-ecCCccHHHHHHHHHHhhhhHHH
Confidence 556778888999998654 334447778999987544 66777 34444544444555699999765
No 214
>cd06482 ACD_HspB10 Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB10, also known as sperm outer dense fiber protein (ODFP), and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB10 occurs exclusively in the axoneme of sperm cells and may have a cytoskeletal role.
Probab=20.75 E-value=1.6e+02 Score=22.69 Aligned_cols=30 Identities=10% Similarity=0.192 Sum_probs=26.3
Q ss_pred ceEEEEEecC-CCCCCceEEEEECC-EEEEEE
Q psy5221 335 NKYRITMAIA-GFNRSDLEIETERD-TLRITG 364 (436)
Q Consensus 335 ~~~~v~~~lp-G~~~edi~v~~~~~-~l~i~g 364 (436)
..|.=.+.|| +++.+.|+=+.+++ .|+|.+
T Consensus 54 g~F~R~f~LP~~Vd~d~i~A~~~~~~~l~i~~ 85 (87)
T cd06482 54 MNICKEFSLPPGVDEKDVTYSYGLGSVVKIET 85 (87)
T ss_pred EEEEEEEECCCCcChHHcEEEEcCCCEEEEee
Confidence 4677889999 79999999999887 999985
No 215
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=20.51 E-value=2e+02 Score=20.93 Aligned_cols=37 Identities=14% Similarity=0.216 Sum_probs=19.4
Q ss_pred HHHHHhHHHHHHHhCCCEEecCCCCcchhHHHHHHHHHHHhcCCCC
Q psy5221 34 IISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNANIST 79 (436)
Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~L~~~l~~l~~~~g~~~ 79 (436)
+..+|++.+.....+...+ |.++ +.++.+....|-.+
T Consensus 12 ~~Gff~ar~~~~k~l~~NP---pine------~mir~M~~QMG~kp 48 (64)
T PF03672_consen 12 VIGFFIARKYMEKQLKENP---PINE------KMIRAMMMQMGRKP 48 (64)
T ss_pred HHHHHHHHHHHHHHHHHCC---CCCH------HHHHHHHHHhCCCc
Confidence 3444555555555444332 3332 46777777777654
No 216
>TIGR02856 spore_yqfC sporulation protein YqfC. This small protein, designated YqfC in Bacillus subtilis, is both restricted to and universal in sporulating species of the Firmcutes, such as Bacillus subtilis and Clostridium perfringens. It is part of the sigma(E)-controlled regulon, and its mutation leads to a sporulation defect.
Probab=20.51 E-value=71 Score=24.61 Aligned_cols=43 Identities=12% Similarity=0.334 Sum_probs=28.9
Q ss_pred CCCCceeEEEecCceEEEE--EecCCCCCCceEEEEECCEEEEEEEE
Q psy5221 322 QSYPPYNIELVDENKYRIT--MAIAGFNRSDLEIETERDTLRITGRK 366 (436)
Q Consensus 322 ~~~p~~~i~~e~~~~~~v~--~~lpG~~~edi~v~~~~~~l~i~g~~ 366 (436)
-..|.+.+. +.+...|+ -.+=-|+.+.|.++...+.|+|+|+.
T Consensus 17 ~~~p~itl~--gr~~~~Ien~k~I~~y~~~~I~l~t~~G~l~I~G~~ 61 (85)
T TIGR02856 17 LDLPRITLI--GNEHIYIENHRGLVVFSPEEVKLNSTNGKITIEGKN 61 (85)
T ss_pred cCCCEEEEE--CCcEEEEECccceEEECCCEEEEEcCceEEEEEccc
Confidence 345666664 34444443 12334578999999999999999984
No 217
>PF05309 TraE: TraE protein; InterPro: IPR007973 This family consists of several bacterial sex pilus assembly and synthesis proteins (TraE). Conjugal transfer of plasmids from donor to recipient cells is a complex process in which a cell-to-cell contact plays a key role. Many genes encoded by self-transmissible plasmids are required for various processes of conjugation, including pilus formation, stabilisation of mating pairs, conjugative DNA metabolism, surface exclusion and regulation of transfer gene expression []. The exact function of the TraE protein is unknown.; GO: 0000746 conjugation
Probab=20.37 E-value=1.8e+02 Score=25.85 Aligned_cols=48 Identities=15% Similarity=0.200 Sum_probs=25.9
Q ss_pred CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccc-eEeEEeeCcEEEEEE
Q psy5221 349 SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVH-VVAAFFDQGILNIDL 411 (436)
Q Consensus 349 edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~-~~~A~~~~GvL~i~~ 411 (436)
++++++-+++.+.|+|..+.-.+++.. -++... .++-.++||.|.++=
T Consensus 131 ~~i~~d~~~~~V~V~G~l~t~~g~~~~---------------~~~~~~y~~~~~~~~g~~~L~~ 179 (187)
T PF05309_consen 131 KSIEVDPETLTVFVTGTLKTWIGDKKV---------------SSEDKTYRLQFKYRNGRLWLKS 179 (187)
T ss_pred eEEEEecCCCEEEEEEEEEEEECCccc---------------cceeEEEEEEEEEeCCEEEEee
Confidence 455555667777778876432111110 011111 377788888887753
No 218
>TIGR02921 PEP_integral PEP-CTERM family integral membrane protein. Members of this protein family, found in three different species so far, have a PEP-CTERM sequence at the carboxyl-terminus (see model TIGR02595), but are unusual among PEP-CTERM proteins in having multiple predicted transmembrane segments. The function is unknown. It is proposed that a member of the EpsH family, to be designated exosortase (see TIGR02602), recognizes and cleaves PEP-CTERM proteins in a manner analogous to the cleavage of LPXTG proteins by sortase (see Haft, et al., 2006).
Probab=20.25 E-value=1.8e+02 Score=30.82 Aligned_cols=59 Identities=15% Similarity=0.175 Sum_probs=35.1
Q ss_pred CceEEEEECCEEEEEEEEecccCCCceEEEeeeeeeEEEEEECCCccceEeEEe-eCcEEE-----EEEecC
Q psy5221 349 SDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFF-DQGILN-----IDLIRE 414 (436)
Q Consensus 349 edi~v~~~~~~l~i~g~~~~~~~~~~~~~~e~~~~~f~r~~~lP~~v~~~~A~~-~~GvL~-----i~~pk~ 414 (436)
-+|+|+++++.-+.+-+-.-.+...+.+ -|.|+ |+||++.....--+ .||+|. +-+||.
T Consensus 419 h~VtVeIeg~iA~TEIEqTF~NPN~r~L-----EGEls--FPLPEgAtVTGyALdvdGkL~Daw~~VVVEKE 483 (952)
T TIGR02921 419 MAITVEEHGDNADIEIVETLENQTPENH-----EVFFH--FSLPEEAAITGLWLGDDAKDDDKFAFALAPRG 483 (952)
T ss_pred eeEEEEEECCeEEEEEEEEEECCCCCce-----eEEEE--ecCCCCCeeeeeeecCCccccccccceeccHH
Confidence 4677777777555544333222222222 45555 99999988544333 477887 888883
Done!