Query psy17672
Match_columns 319
No_of_seqs 311 out of 924
Neff 6.8
Searched_HMMs 46136
Date Fri Aug 16 20:01:06 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17672.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17672hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0012|consensus 100.0 4.6E-68 1E-72 496.7 22.2 317 1-317 23-372 (380)
2 PF09668 Asp_protease: Asparty 100.0 1.5E-42 3.2E-47 287.0 12.4 124 157-280 1-124 (124)
3 cd05480 NRIP_C NRIP_C; putativ 100.0 1.1E-34 2.3E-39 228.1 10.7 101 183-283 1-103 (103)
4 cd05479 RP_DDI RP_DDI; retrope 100.0 8.5E-31 1.8E-35 217.9 14.7 123 166-288 2-124 (124)
5 PF08284 RVP_2: Retroviral asp 99.9 4.1E-22 9E-27 168.2 13.1 113 179-291 20-132 (135)
6 TIGR02281 clan_AA_DTGA clan AA 99.8 1.7E-17 3.7E-22 137.6 13.6 110 176-288 7-119 (121)
7 cd05484 retropepsin_like_LTR_2 99.7 6.1E-17 1.3E-21 127.2 12.0 91 181-273 1-91 (91)
8 KOG0010|consensus 99.7 8.6E-17 1.9E-21 157.5 9.4 118 2-122 36-214 (493)
9 PF13650 Asp_protease_2: Aspar 99.6 6.2E-15 1.3E-19 113.7 11.6 88 183-271 1-90 (90)
10 cd05483 retropepsin_like_bacte 99.6 5E-15 1.1E-19 115.5 10.8 93 179-273 1-96 (96)
11 PF00077 RVP: Retroviral aspar 99.6 2E-14 4.3E-19 114.2 10.9 95 180-280 5-100 (100)
12 TIGR03698 clan_AA_DTGF clan AA 99.5 1.5E-13 3.2E-18 111.6 11.3 99 183-286 2-107 (107)
13 cd06095 RP_RTVL_H_like Retrope 99.5 1.2E-13 2.6E-18 107.6 9.5 85 183-273 1-86 (86)
14 TIGR00601 rad23 UV excision re 99.4 9.7E-13 2.1E-17 128.2 11.4 51 1-52 21-74 (378)
15 cd01815 BMSC_UbP_N Ubiquitin-l 99.4 3E-13 6.4E-18 102.5 4.8 52 1-53 21-75 (75)
16 cd01796 DDI1_N DNA damage indu 99.4 4.8E-13 1E-17 100.7 4.8 51 1-51 20-70 (71)
17 COG3577 Predicted aspartyl pro 99.4 3.8E-12 8.3E-17 112.8 10.8 99 177-276 102-202 (215)
18 cd01807 GDX_N ubiquitin-like d 99.4 7.7E-13 1.7E-17 100.1 5.1 52 1-53 21-72 (74)
19 cd01793 Fubi Fubi ubiquitin-li 99.4 7.3E-13 1.6E-17 100.3 5.0 52 1-53 19-70 (74)
20 cd01797 NIRF_N amino-terminal 99.3 1.8E-12 4E-17 99.3 5.2 52 1-53 23-74 (78)
21 cd01798 parkin_N amino-termina 99.3 2E-12 4.4E-17 96.7 4.9 51 1-52 19-69 (70)
22 cd00303 retropepsin_like Retro 99.3 3.7E-11 8.1E-16 88.7 10.9 89 183-273 1-92 (92)
23 cd01794 DC_UbP_C dendritic cel 99.3 3.5E-12 7.6E-17 95.8 4.8 51 1-52 19-69 (70)
24 cd01791 Ubl5 UBL5 ubiquitin-li 99.3 3.7E-12 8.1E-17 96.5 4.8 50 1-51 22-71 (73)
25 cd01810 ISG15_repeat2 ISG15 ub 99.3 4.5E-12 9.8E-17 95.9 4.9 52 1-53 19-70 (74)
26 KOG0011|consensus 99.3 1.8E-11 4E-16 115.0 10.0 50 2-52 22-73 (340)
27 PF00240 ubiquitin: Ubiquitin 99.3 9.3E-12 2E-16 92.4 6.3 52 1-53 16-67 (69)
28 cd01800 SF3a120_C Ubiquitin-li 99.2 1.3E-11 2.7E-16 94.0 4.9 52 1-53 18-69 (76)
29 cd01808 hPLIC_N Ubiquitin-like 99.2 1.6E-11 3.4E-16 92.1 4.9 51 1-52 20-70 (71)
30 PTZ00044 ubiquitin; Provisiona 99.2 1.9E-11 4.1E-16 92.6 4.8 52 1-53 21-72 (76)
31 cd01802 AN1_N ubiquitin-like d 99.2 2.1E-11 4.6E-16 98.4 5.0 52 1-53 48-99 (103)
32 cd01805 RAD23_N Ubiquitin-like 99.2 4E-11 8.6E-16 91.0 5.6 52 1-53 21-74 (77)
33 cd01799 Hoil1_N Ubiquitin-like 99.2 2.9E-11 6.4E-16 92.0 4.4 50 1-51 23-73 (75)
34 cd01790 Herp_N Homocysteine-re 99.2 3E-11 6.5E-16 92.6 4.3 51 1-52 24-78 (79)
35 cd01804 midnolin_N Ubiquitin-l 99.2 3.6E-11 7.8E-16 92.1 4.7 51 1-53 22-72 (78)
36 cd01806 Nedd8 Nebb8-like ubiq 99.1 6.1E-11 1.3E-15 89.4 5.3 52 1-53 21-72 (76)
37 PF13975 gag-asp_proteas: gag- 99.1 1.9E-10 4.1E-15 86.7 7.8 65 176-240 4-69 (72)
38 cd05481 retropepsin_like_LTR_1 99.1 4.2E-10 9.1E-15 89.1 10.1 86 184-271 2-91 (93)
39 cd01809 Scythe_N Ubiquitin-lik 99.1 7E-11 1.5E-15 88.1 5.1 51 1-52 21-71 (72)
40 cd01812 BAG1_N Ubiquitin-like 99.1 6E-11 1.3E-15 88.4 4.6 51 1-52 20-70 (71)
41 cd01803 Ubiquitin Ubiquitin. U 99.1 1.1E-10 2.4E-15 87.9 4.9 52 1-53 21-72 (76)
42 cd01792 ISG15_repeat1 ISG15 ub 99.1 8.5E-11 1.8E-15 90.3 4.1 52 1-53 23-76 (80)
43 PF12384 Peptidase_A2B: Ty3 tr 99.0 3.9E-09 8.5E-14 90.7 12.0 96 179-276 33-130 (177)
44 cd01813 UBP_N UBP ubiquitin pr 99.0 3.3E-10 7.2E-15 86.0 4.7 51 1-52 20-73 (74)
45 COG5550 Predicted aspartyl pro 98.9 1.6E-08 3.6E-13 83.0 11.6 105 180-289 10-120 (125)
46 cd01795 USP48_C USP ubiquitin- 98.9 1.9E-09 4.1E-14 85.2 5.1 53 1-53 25-77 (107)
47 cd01814 NTGP5 Ubiquitin-like N 98.8 3E-09 6.4E-14 86.2 4.5 54 1-55 26-92 (113)
48 smart00213 UBQ Ubiquitin homol 98.8 2.7E-09 5.9E-14 77.2 3.8 45 1-46 20-64 (64)
49 KOG0003|consensus 98.8 3.2E-10 6.9E-15 90.3 -1.7 52 1-53 21-72 (128)
50 KOG0005|consensus 98.8 1.8E-09 4E-14 77.2 1.9 49 2-51 22-70 (70)
51 cd01789 Alp11_N Ubiquitin-like 98.8 9.8E-09 2.1E-13 79.8 6.0 53 1-53 23-81 (84)
52 cd01763 Sumo Small ubiquitin-r 98.8 9.7E-09 2.1E-13 80.2 5.6 52 1-53 32-83 (87)
53 cd01769 UBL Ubiquitin-like dom 98.8 9.6E-09 2.1E-13 75.3 5.0 50 1-51 18-67 (69)
54 KOG0004|consensus 98.7 4.8E-09 1.1E-13 89.1 2.7 51 1-52 21-71 (156)
55 PF02160 Peptidase_A3: Caulifl 98.6 2.2E-07 4.8E-12 83.0 10.2 110 179-292 3-119 (201)
56 cd06094 RP_Saci_like RP_Saci_l 98.6 9.9E-08 2.1E-12 74.5 6.8 79 191-276 9-88 (89)
57 cd05482 HIV_retropepsin_like R 98.4 1.4E-06 3.1E-11 68.1 8.3 86 184-273 2-87 (87)
58 PF14560 Ubiquitin_2: Ubiquiti 98.4 5.7E-07 1.2E-11 70.0 5.4 53 1-53 24-83 (87)
59 PF05585 DUF1758: Putative pep 98.4 1.5E-06 3.3E-11 75.3 8.2 83 190-275 11-128 (164)
60 PF11976 Rad60-SLD: Ubiquitin- 98.3 9.3E-07 2E-11 66.0 5.0 49 2-51 22-71 (72)
61 KOG4248|consensus 98.2 1.8E-06 3.9E-11 91.6 5.1 51 1-53 23-73 (1143)
62 cd01801 Tsc13_N Ubiquitin-like 98.1 5.9E-06 1.3E-10 62.9 4.9 49 2-51 24-75 (77)
63 KOG0001|consensus 97.8 5.7E-05 1.2E-09 54.7 5.3 51 2-53 21-71 (75)
64 PF13881 Rad60-SLD_2: Ubiquiti 97.7 7E-05 1.5E-09 61.2 6.2 54 1-55 24-90 (111)
65 PF09280 XPC-binding: XPC-bind 97.7 8.5E-05 1.8E-09 53.9 5.0 46 77-122 4-55 (59)
66 cd00196 UBQ Ubiquitin-like pro 97.6 0.0001 2.3E-09 50.5 4.9 51 1-52 18-68 (69)
67 cd01788 ElonginB Ubiquitin-lik 97.5 0.0001 2.2E-09 59.9 4.3 51 2-53 23-80 (119)
68 PF11543 UN_NPL4: Nuclear pore 97.4 0.00016 3.6E-09 55.6 3.2 51 1-51 24-78 (80)
69 PLN02560 enoyl-CoA reductase 97.4 0.00025 5.5E-09 67.8 5.3 50 1-51 24-81 (308)
70 PF12382 Peptidase_A2E: Retrot 96.9 0.0042 9.1E-08 49.6 7.1 78 192-272 48-127 (137)
71 PF00026 Asp: Eukaryotic aspar 96.3 0.018 3.8E-07 54.0 8.2 89 181-271 2-114 (317)
72 PTZ00013 plasmepsin 4 (PM4); P 96.0 0.037 8.1E-07 55.8 9.5 91 179-271 137-251 (450)
73 cd05470 pepsin_retropepsin_lik 95.9 0.044 9.5E-07 43.4 7.8 84 183-269 1-109 (109)
74 KOG0006|consensus 95.8 0.0074 1.6E-07 57.3 3.0 49 2-51 25-73 (446)
75 PTZ00147 plasmepsin-1; Provisi 95.6 0.075 1.6E-06 53.6 9.7 92 178-271 137-252 (453)
76 KOG1872|consensus 95.4 0.022 4.9E-07 56.7 5.0 51 2-53 25-75 (473)
77 cd05476 pepsin_A_like_plant Ch 95.3 0.063 1.4E-06 49.7 7.5 91 185-290 155-262 (265)
78 PF08817 YukD: WXG100 protein 95.3 0.022 4.7E-07 43.4 3.7 49 2-51 24-79 (79)
79 COG4067 Uncharacterized protei 95.1 0.069 1.5E-06 45.9 6.4 101 184-284 30-156 (162)
80 cd06097 Aspergillopepsin_like 95.1 0.099 2.2E-06 48.6 8.2 89 182-271 2-114 (278)
81 cd05474 SAP_like SAPs, pepsin- 94.7 0.26 5.6E-06 45.9 9.9 75 180-271 2-80 (295)
82 cd05478 pepsin_A Pepsin A, asp 94.6 0.23 5E-06 47.1 9.5 92 178-271 8-122 (317)
83 cd05487 renin_like Renin stimu 94.4 0.35 7.5E-06 46.2 10.2 92 178-271 6-121 (326)
84 cd05476 pepsin_A_like_plant Ch 94.2 0.17 3.7E-06 46.7 7.5 76 181-271 2-88 (265)
85 KOG3206|consensus 94.1 0.13 2.7E-06 46.4 6.0 54 1-54 23-82 (234)
86 PF05618 Zn_protease: Putative 94.0 0.083 1.8E-06 44.8 4.5 45 241-285 87-133 (138)
87 cd05477 gastricsin Gastricsins 93.8 0.41 9E-06 45.4 9.5 90 179-271 2-115 (318)
88 KOG1769|consensus 93.8 0.15 3.2E-06 40.7 5.3 50 3-53 43-92 (99)
89 cd06098 phytepsin Phytepsin, a 93.8 0.42 9.2E-06 45.4 9.6 92 178-271 8-123 (317)
90 KOG4495|consensus 93.7 0.045 9.7E-07 43.3 2.1 41 2-43 23-65 (110)
91 cd05478 pepsin_A Pepsin A, asp 93.6 0.44 9.5E-06 45.2 9.3 102 184-289 193-316 (317)
92 PF10302 DUF2407: DUF2407 ubiq 93.6 0.074 1.6E-06 42.4 3.3 39 1-40 24-64 (97)
93 cd05488 Proteinase_A_fungi Fun 93.5 0.4 8.8E-06 45.6 8.8 91 178-271 8-122 (320)
94 cd05486 Cathespin_E Cathepsin 93.1 0.42 9.2E-06 45.3 8.2 88 181-271 1-112 (316)
95 cd05477 gastricsin Gastricsins 93.1 0.59 1.3E-05 44.3 9.2 102 185-290 188-317 (318)
96 cd05474 SAP_like SAPs, pepsin- 92.9 0.54 1.2E-05 43.7 8.6 96 192-290 179-294 (295)
97 cd05485 Cathepsin_D_like Cathe 92.9 0.52 1.1E-05 45.1 8.6 93 177-271 8-125 (329)
98 cd06096 Plasmepsin_5 Plasmepsi 92.8 0.33 7.2E-06 46.3 7.2 89 192-290 232-322 (326)
99 cd05490 Cathepsin_D2 Cathepsin 92.8 0.5 1.1E-05 44.9 8.3 92 178-271 4-120 (325)
100 PTZ00165 aspartyl protease; Pr 92.7 0.8 1.7E-05 46.6 10.0 99 171-271 109-237 (482)
101 cd06098 phytepsin Phytepsin, a 92.7 0.73 1.6E-05 43.8 9.2 91 185-289 197-316 (317)
102 cd05485 Cathepsin_D_like Cathe 92.5 1.1 2.4E-05 42.9 10.2 101 185-289 199-328 (329)
103 cd05473 beta_secretase_like Be 92.2 0.7 1.5E-05 44.8 8.6 89 180-271 3-113 (364)
104 KOG3493|consensus 92.0 0.044 9.6E-07 40.3 0.0 49 1-50 22-70 (73)
105 cd05472 cnd41_like Chloroplast 91.7 0.58 1.3E-05 43.9 7.2 79 181-271 2-89 (299)
106 cd05475 nucellin_like Nucellin 90.6 0.83 1.8E-05 42.4 7.1 87 185-290 166-270 (273)
107 cd05488 Proteinase_A_fungi Fun 90.6 2 4.3E-05 40.8 9.7 101 185-289 194-319 (320)
108 PF00026 Asp: Eukaryotic aspar 90.4 0.53 1.2E-05 43.9 5.6 98 184-290 186-316 (317)
109 cd06096 Plasmepsin_5 Plasmepsi 90.0 1.1 2.4E-05 42.7 7.5 92 180-272 3-139 (326)
110 cd06097 Aspergillopepsin_like 89.9 0.74 1.6E-05 42.7 6.1 84 185-289 186-277 (278)
111 cd05471 pepsin_like Pepsin-lik 89.8 1.5 3.2E-05 40.0 7.9 90 182-273 2-115 (283)
112 cd05472 cnd41_like Chloroplast 89.5 1.1 2.3E-05 42.1 6.8 27 264-290 270-296 (299)
113 cd01811 OASL_repeat1 2'-5' oli 89.5 0.77 1.7E-05 34.8 4.6 49 2-51 22-74 (80)
114 cd05471 pepsin_like Pepsin-lik 88.7 0.71 1.5E-05 42.1 5.0 80 191-289 202-282 (283)
115 cd05475 nucellin_like Nucellin 88.5 1.7 3.6E-05 40.4 7.3 84 180-271 2-102 (273)
116 PF00789 UBX: UBX domain; Int 88.4 1.3 2.8E-05 33.4 5.4 51 1-51 27-81 (82)
117 smart00727 STI1 Heat shock cha 87.8 0.64 1.4E-05 30.6 3.0 31 78-108 8-40 (41)
118 PF03539 Spuma_A9PTase: Spumav 87.0 4.5 9.7E-05 34.8 8.2 78 187-276 1-84 (163)
119 PF15044 CLU_N: Mitochondrial 85.9 1.5 3.1E-05 33.3 4.4 52 1-53 5-58 (76)
120 COG5417 Uncharacterized small 85.6 1.7 3.7E-05 33.0 4.4 31 19-50 50-80 (81)
121 KOG0013|consensus 85.5 1.1 2.4E-05 40.5 4.1 45 1-46 167-211 (231)
122 cd05486 Cathespin_E Cathepsin 85.4 5.8 0.00013 37.5 9.3 100 185-289 186-315 (316)
123 cd05489 xylanase_inhibitor_I_l 84.2 4.2 9.1E-05 39.7 7.9 26 265-290 335-360 (362)
124 cd05490 Cathepsin_D2 Cathepsin 83.9 4 8.7E-05 38.7 7.5 94 192-289 207-324 (325)
125 KOG0010|consensus 82.5 1.4 3E-05 44.5 3.8 91 84-184 148-251 (493)
126 PLN03146 aspartyl protease fam 82.2 3.3 7.1E-05 41.5 6.3 27 264-290 399-425 (431)
127 PTZ00147 plasmepsin-1; Provisi 82.0 11 0.00024 38.2 10.0 103 183-290 321-448 (453)
128 smart00166 UBX Domain present 81.9 3.2 7E-05 31.3 4.8 51 1-51 25-79 (80)
129 cd05487 renin_like Renin stimu 81.5 9 0.00019 36.4 8.9 101 184-289 194-324 (326)
130 PF11470 TUG-UBL1: GLUT4 regul 79.7 4.1 8.9E-05 30.1 4.5 48 2-50 18-65 (65)
131 cd01774 Faf1_like2_UBX Faf1 ik 79.4 4.5 9.8E-05 31.3 4.9 49 2-51 26-83 (85)
132 KOG1639|consensus 77.3 2.9 6.3E-05 38.9 3.8 50 2-52 24-78 (297)
133 PTZ00165 aspartyl protease; Pr 77.3 16 0.00034 37.3 9.5 28 263-290 418-445 (482)
134 cd01770 p47_UBX p47-like ubiqu 76.1 5.6 0.00012 30.3 4.6 48 1-49 25-75 (79)
135 cd01767 UBX UBX (ubiquitin reg 76.0 6.5 0.00014 29.4 4.9 48 2-51 24-75 (77)
136 cd05473 beta_secretase_like Be 73.2 7 0.00015 37.8 5.6 27 264-290 318-344 (364)
137 PF14836 Ubiquitin_3: Ubiquiti 72.7 11 0.00023 29.6 5.4 53 1-54 24-81 (88)
138 PF03419 Peptidase_U4: Sporula 72.7 55 0.0012 30.9 11.4 36 178-213 155-201 (293)
139 cd01772 SAKS1_UBX SAKS1-like U 72.2 9.4 0.0002 28.9 5.0 49 2-51 26-78 (79)
140 TIGR02854 spore_II_GA sigma-E 71.1 9.4 0.0002 36.2 5.8 36 179-214 157-203 (288)
141 PF14732 UAE_UbL: Ubiquitin/SU 69.2 4.2 9.1E-05 31.6 2.5 49 1-51 9-67 (87)
142 PF11620 GABP-alpha: GA-bindin 67.6 11 0.00023 29.5 4.3 50 2-52 14-63 (88)
143 PF11834 DUF3354: Domain of un 67.4 8 0.00017 28.9 3.5 43 2-51 27-69 (69)
144 PRK06437 hypothetical protein; 67.2 15 0.00034 26.9 5.0 34 13-52 29-62 (67)
145 PF14541 TAXi_C: Xylanase inhi 66.9 19 0.00041 30.5 6.4 29 261-289 132-160 (161)
146 COG5227 SMT3 Ubiquitin-like pr 66.0 8.7 0.00019 30.3 3.5 47 5-52 49-95 (103)
147 PTZ00013 plasmepsin 4 (PM4); P 65.9 30 0.00065 35.0 8.5 28 263-290 420-447 (450)
148 PF14453 ThiS-like: ThiS-like 64.6 15 0.00032 26.5 4.3 40 2-53 17-56 (57)
149 cd01771 Faf1_UBX Faf1 UBX doma 63.7 17 0.00038 27.7 4.9 49 2-51 26-78 (80)
150 cd00565 ThiS ThiaminS ubiquiti 63.6 16 0.00035 26.3 4.5 45 2-52 16-60 (65)
151 cd01773 Faf1_like1_UBX Faf1 ik 61.2 21 0.00046 27.5 5.0 50 2-52 27-80 (82)
152 TIGR01683 thiS thiamine biosyn 60.8 23 0.00049 25.4 4.9 45 2-52 15-59 (64)
153 KOG1339|consensus 60.4 42 0.0009 33.0 8.2 36 179-214 45-83 (398)
154 PRK12751 cpxP periplasmic stre 59.8 54 0.0012 28.6 7.8 27 135-161 88-114 (162)
155 PF12754 Blt1: Cell-cycle cont 59.0 3.1 6.8E-05 39.8 0.0 41 2-43 105-160 (309)
156 KOG4583|consensus 58.5 3 6.6E-05 40.3 -0.2 50 2-52 33-86 (391)
157 cd00754 MoaD Ubiquitin domain 58.0 26 0.00057 25.8 5.0 46 1-52 26-75 (80)
158 PRK06488 sulfur carrier protei 55.3 32 0.00068 24.7 4.9 45 2-52 16-60 (65)
159 PF11925 DUF3443: Protein of u 54.8 37 0.0008 33.5 6.5 93 178-270 21-147 (370)
160 TIGR02958 sec_mycoba_snm4 secr 54.8 11 0.00024 38.1 3.1 33 20-53 47-80 (452)
161 PRK05863 sulfur carrier protei 53.7 27 0.00059 25.2 4.3 43 6-52 18-60 (65)
162 PF14327 CSTF2_hinge: Hinge do 53.4 16 0.00034 28.2 3.1 27 83-109 37-63 (84)
163 smart00727 STI1 Heat shock cha 50.6 25 0.00054 22.8 3.3 8 91-98 10-17 (41)
164 PF14543 TAXi_N: Xylanase inhi 50.1 17 0.00038 31.1 3.2 25 181-205 1-27 (164)
165 PRK07440 hypothetical protein; 49.1 43 0.00093 24.7 4.8 38 13-52 28-65 (70)
166 PF02153 PDH: Prephenate dehyd 49.0 29 0.00063 32.0 4.8 30 89-122 209-239 (258)
167 PRK12750 cpxP periplasmic repr 47.9 1.9E+02 0.0041 25.3 10.0 28 133-160 93-120 (170)
168 PRK10363 cpxP periplasmic repr 47.4 1E+02 0.0022 27.0 7.5 21 141-162 122-142 (166)
169 PF10790 DUF2604: Protein of U 47.3 18 0.00038 26.8 2.3 40 14-54 32-72 (76)
170 PRK07696 sulfur carrier protei 47.3 49 0.0011 24.1 4.8 44 6-52 19-62 (67)
171 PRK07417 arogenate dehydrogena 46.9 49 0.0011 30.8 6.0 14 79-92 221-234 (279)
172 PF13801 Metal_resist: Heavy-m 46.5 1.4E+02 0.0029 23.1 9.2 43 115-157 56-98 (125)
173 PRK05659 sulfur carrier protei 45.2 57 0.0012 23.3 4.9 38 13-52 24-61 (66)
174 PF11069 DUF2870: Protein of u 43.9 74 0.0016 25.4 5.5 18 22-40 3-20 (98)
175 PHA00431 internal virion prote 43.5 1.1E+02 0.0023 32.4 8.0 82 80-161 81-179 (746)
176 PF09269 DUF1967: Domain of un 43.4 11 0.00024 27.9 0.8 16 35-50 47-62 (69)
177 TIGR03595 Obg_CgtA_exten Obg f 42.2 12 0.00027 27.7 0.9 16 35-50 47-62 (69)
178 COG0287 TyrA Prephenate dehydr 41.4 54 0.0012 31.0 5.4 22 88-109 221-243 (279)
179 PRK14806 bifunctional cyclohex 41.2 35 0.00076 36.3 4.5 47 77-123 226-277 (735)
180 PF07946 DUF1682: Protein of u 41.1 1.3E+02 0.0027 29.0 7.9 20 140-159 299-318 (321)
181 PRK07502 cyclohexadienyl dehyd 40.4 69 0.0015 30.2 5.9 13 80-92 233-245 (307)
182 PLN03146 aspartyl protease fam 39.9 75 0.0016 31.8 6.4 29 178-206 82-112 (431)
183 COG2104 ThiS Sulfur transfer p 38.3 88 0.0019 23.2 5.0 37 14-52 27-63 (68)
184 KOG3391|consensus 38.1 39 0.00085 28.7 3.3 27 27-53 110-136 (151)
185 PRK08364 sulfur carrier protei 37.8 1E+02 0.0022 22.6 5.3 41 2-52 25-65 (70)
186 PRK06083 sulfur carrier protei 37.8 75 0.0016 24.5 4.7 37 14-52 43-79 (84)
187 PF09280 XPC-binding: XPC-bind 37.6 41 0.00089 24.2 3.0 13 97-109 15-27 (59)
188 PF06487 SAP18: Sin3 associate 37.1 39 0.00084 28.0 3.2 28 24-51 92-119 (120)
189 KOG2561|consensus 35.5 23 0.0005 35.8 1.8 47 4-51 63-109 (568)
190 PF07319 DnaI_N: Primosomal pr 34.2 57 0.0012 25.5 3.6 29 81-109 21-49 (94)
191 PRK10455 periplasmic protein; 33.9 2.6E+02 0.0055 24.2 7.9 11 143-153 130-140 (161)
192 PF05823 Gp-FAR-1: Nematode fa 33.5 91 0.002 26.8 5.1 48 90-157 41-88 (154)
193 KOG3048|consensus 33.3 25 0.00053 30.1 1.4 27 177-204 69-95 (153)
194 PRK14011 prefoldin subunit alp 33.2 2.8E+02 0.0061 23.6 7.9 24 180-204 59-82 (144)
195 PRK08053 sulfur carrier protei 31.7 1.5E+02 0.0032 21.3 5.3 45 2-52 17-61 (66)
196 PRK08655 prephenate dehydrogen 31.6 1E+02 0.0022 31.0 5.7 31 88-122 217-247 (437)
197 TIGR01682 moaD molybdopterin c 30.9 1.6E+02 0.0034 21.8 5.5 45 2-52 28-75 (80)
198 KOG0011|consensus 30.8 1.2E+02 0.0025 29.6 5.7 46 77-122 219-270 (340)
199 PRK11199 tyrA bifunctional cho 30.3 84 0.0018 30.8 4.9 15 90-104 298-312 (374)
200 PF13019 Telomere_Sde2: Telome 29.7 1.2E+02 0.0026 26.5 5.1 50 2-51 26-80 (162)
201 PF14841 FliG_M: FliG middle d 29.3 2.5E+02 0.0053 21.1 7.8 26 135-160 46-71 (79)
202 TIGR01687 moaD_arch MoaD famil 29.1 2.2E+02 0.0047 21.4 6.1 47 2-52 26-83 (88)
203 PRK06545 prephenate dehydrogen 29.0 1.3E+02 0.0027 29.3 5.8 7 151-157 265-271 (359)
204 PF02597 ThiS: ThiS family; I 28.7 1E+02 0.0023 22.2 4.1 49 1-52 22-72 (77)
205 KOG4250|consensus 28.5 90 0.0019 33.4 4.8 49 1-51 335-385 (732)
206 cd06396 PB1_NBR1 The PB1 domai 27.4 84 0.0018 24.2 3.4 35 182-216 2-38 (81)
207 PRK06944 sulfur carrier protei 26.1 2.2E+02 0.0047 20.0 5.3 44 2-52 17-60 (65)
208 TIGR00985 3a0801s04tom mitocho 25.9 2.9E+02 0.0062 23.8 6.7 18 140-157 78-96 (148)
209 PRK11861 bifunctional prephena 25.6 1.5E+02 0.0033 31.4 6.1 46 77-122 160-210 (673)
210 KOG1464|consensus 25.1 1.2E+02 0.0026 29.2 4.6 55 88-158 298-353 (440)
211 PLN02799 Molybdopterin synthas 24.9 1.4E+02 0.003 22.2 4.2 46 1-52 29-77 (82)
212 cd06406 PB1_P67 A PB1 domain i 24.3 96 0.0021 23.9 3.2 26 2-27 22-47 (80)
213 PF13856 Gifsy-2: ATP-binding 24.2 80 0.0017 24.5 2.9 18 183-200 22-39 (95)
214 PF12436 USP7_ICP0_bdg: ICP0-b 23.7 77 0.0017 29.3 3.1 51 2-53 96-152 (249)
215 COG3159 Uncharacterized protei 23.3 2.1E+02 0.0045 26.2 5.6 27 79-109 4-30 (218)
216 PTZ00380 microtubule-associate 22.3 88 0.0019 26.0 2.8 52 1-53 51-105 (121)
217 PRK11130 moaD molybdopterin sy 20.5 3.5E+02 0.0076 20.1 5.7 45 1-51 26-75 (81)
No 1
>KOG0012|consensus
Probab=100.00 E-value=4.6e-68 Score=496.69 Aligned_cols=317 Identities=45% Similarity=0.734 Sum_probs=286.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCC-cccccccCCCCCcEEEEEccCCCCC----------CCccccCCCCc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLED-GSCLKQAGVRDGDIILVAMPGRGPT----------YHVAQSRHTAP 69 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~-~~tL~~~gI~~~d~l~l~~~~~~p~----------~~~s~~~~p~~ 69 (319)
+.+.+|++++..++|++.+...|+||++++.++ +.+|.++|+++||+|.+..+...|. .+|+++..|..
T Consensus 23 g~L~nl~aL~~~d~g~~~~~~~li~n~~~l~s~~s~~l~Q~g~~~~dsl~lr~ks~d~r~~v~~~~~~~~dFat~A~~~f 102 (380)
T KOG0012|consen 23 GELNNLAALCWKDTGIVYDPSDLIYNPRPLVSNESQGLTQIGLKDGDSLALRCKSSDPRPIVPIQVRLISDFATIAVPMF 102 (380)
T ss_pred cchhhHHHHHHHHhCcccchhhcccCCCccccchhhhhhhcccccceeEeccCCCCCCCccccccceehhcccccccccc
Confidence 467899999999999999999999999999764 6789999999999999987643321 13433322211
Q ss_pred -----------cccc---------cccC-CCChHHHHHHHhhCHHHHHHHHhcCchhHHHHhcCh-HHHHHHHHHHHHHH
Q psy17672 70 -----------HRSF---------MTAQ-LQDPAHVRDLLLACPDQLALLKQNNPRLSEALSTGK-DAFTKVFKEQLVER 127 (319)
Q Consensus 70 -----------~~~~---------~~~~-~~dpe~~r~~il~nP~~l~~L~~~nP~La~ai~~~~-~~f~~~l~~~~~~~ 127 (319)
++.+ .... .++|+.|||+++++|..+..+++.+|.|+.++..++ ++|...++..+..+
T Consensus 103 s~q~a~~~~gaq~~rg~~~~di~~a~~~~ldsp~~~Rq~~la~pf~L~~~r~~lp~La~~l~~g~~~k~~~~~~~~q~d~ 182 (380)
T KOG0012|consen 103 SSQRARQLQGAQRTRGRLQTDIPEASSLSLDSPATFRQALLAMPFFLHLDRAYLPPLAALLVLGDLEKFDRTLKEMQKDY 182 (380)
T ss_pred cchhccccccccccccccccccccccccCcCCHHHHHHHHhcCchhhhhchhhcCccchhhcccchhhhhhhhhhhcccc
Confidence 1110 1122 589999999999999999999999999999997777 99999999877777
Q ss_pred HHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHhHhcCCCCccccceEEEEEEEcCEEEEEEEcCCccccccCH
Q psy17672 128 EKRQKQKIKLMNAHPFDTHAQRLIAEEIRQKNVEANMEAAMEYNPETFGSVVMLYINCKVNGYPVKAFVDSGAQTTIMSA 207 (319)
Q Consensus 128 ~~~~~~~~~~l~~dP~~~e~q~~i~~~i~~~~~~~~~~~a~~~~pe~~~~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~ 207 (319)
.+++..+++++++||||+|+|+||+|.|+|++|+|||.+|+||+||.|+.|+||||+|+|||++||||||||||.|+||.
T Consensus 183 ~rr~~~~~rl~eanPfd~E~q~rIee~irq~~i~eq~~~ai~~~pe~f~~v~ML~iN~~ing~~VKAfVDsGaq~timS~ 262 (380)
T KOG0012|consen 183 QRRSVHQRRLLEANPFDLEAQRRIEEKIRQNAIDEQMSHAIEYHPEDFTQVTMLYINCEINGVPVKAFVDSGAQTTIMSA 262 (380)
T ss_pred chhhhhhHHHHhcCCcchhhhhhhhHHHHHHHHHHHHHHhhhcCccccccceEEEEEEEECCEEEEEEEcccchhhhhhH
Confidence 77778899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEecCCCCceeechhhhhhccEEEEccCCEEE
Q psy17672 208 KCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSILEEQPMDMLLGLDMLRRHECCIDLRKNVLR 287 (319)
Q Consensus 208 ~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~ 287 (319)
+||+||||.+++|+++.|++.|||+.|+.|+||.++++||+.+++|+|.|++...+|++||+|+|++|+||||+++|.|+
T Consensus 263 ~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~ftV~d~~~~d~llGLd~Lrr~~ccIdL~~~~L~ 342 (380)
T KOG0012|consen 263 ACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSFTVLDRRDMDLLLGLDMLRRHQCCIDLKTNVLR 342 (380)
T ss_pred HHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccceEEecCCCcchhhhHHHHHhccceeecccCeEE
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EccCCeeeecccCCCccccccccCCccccc
Q psy17672 288 IGTTGTETKFLPERELPSCARLTSASDEEE 317 (319)
Q Consensus 288 i~~~~~~vpfl~~~~~~~~~~~~~~~~~~~ 317 (319)
|+++++++||+...++|...+.++.++|+.
T Consensus 343 ig~~~teiPfl~~~~lp~~~~~~~~~~~~~ 372 (380)
T KOG0012|consen 343 IGNTETEIPFLPSNELPSHNKETGASPEPE 372 (380)
T ss_pred ecCCCccccccccccCCccccccccCCCcc
Confidence 999999999999999999999999987764
No 2
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=100.00 E-value=1.5e-42 Score=286.98 Aligned_cols=124 Identities=66% Similarity=1.144 Sum_probs=87.2
Q ss_pred HHHHHHHHHHhHhcCCCCccccceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE
Q psy17672 157 QKNVEANMEAAMEYNPETFGSVVMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII 236 (319)
Q Consensus 157 ~~~~~~~~~~a~~~~pe~~~~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~ 236 (319)
|+||+|||++|||++||+|++++||||+|+|||++++||||||||+|+||.+||+||||.+++|+|+.|++.|+|+.+++
T Consensus 1 q~~i~~~~~~a~e~~PE~f~~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~ 80 (124)
T PF09668_consen 1 QENIDENLENAMEHSPESFGQVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKIL 80 (124)
T ss_dssp -------------------------EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEE
T ss_pred ChhHHHHHHHHHHhCcHhhcCcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCcee
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eEEEEEEEEEcCEEEEeeEEEecCCCCceeechhhhhhccEEEE
Q psy17672 237 GRIHMVQVAIEKDFLTTSLSILEEQPMDMLLGLDMLRRHECCID 280 (319)
Q Consensus 237 g~v~~~~i~IG~~~~~~~f~Vl~~~~~d~ILG~D~L~~~~~~ID 280 (319)
|+||.++++||+.+++|+|.|+|....|+|||+|||++|+|+||
T Consensus 81 G~Ih~~~l~ig~~~~~~s~~Vle~~~~d~llGld~L~~~~c~ID 124 (124)
T PF09668_consen 81 GRIHSVQLKIGGLFFPCSFTVLEDQDVDLLLGLDMLKRHKCCID 124 (124)
T ss_dssp EEEEEEEEEETTEEEEEEEEEETTSSSSEEEEHHHHHHTT-EEE
T ss_pred EEEEEEEEEECCEEEEEEEEEeCCCCcceeeeHHHHHHhCcccC
Confidence 99999999999999999999999989999999999999999998
No 3
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=100.00 E-value=1.1e-34 Score=228.14 Aligned_cols=101 Identities=24% Similarity=0.488 Sum_probs=97.0
Q ss_pred EEEEEcCEEEEEEEcCCccccccCHHHHHHcCCcccccc-ceeeEeeccce-eEEEeEEEEEEEEEcCEEEEeeEEEecC
Q psy17672 183 INCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDT-RWAGVAKGVGV-QQIIGRIHMVQVAIEKDFLTTSLSILEE 260 (319)
Q Consensus 183 v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~-~~~g~a~Gvg~-~~~~g~v~~~~i~IG~~~~~~~f~Vl~~ 260 (319)
|+|++||++++||||||||+|+||++||+||||.++++. ++.|+|.|+|+ .+++|+||.++++||+.+++|+|+|+|.
T Consensus 1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~CSftVld~ 80 (103)
T cd05480 1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVECSAQVVDD 80 (103)
T ss_pred CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEeeEEEEEEcC
Confidence 579999999999999999999999999999999988776 88899999997 6999999999999999999999999999
Q ss_pred CCCceeechhhhhhccEEEEccC
Q psy17672 261 QPMDMLLGLDMLRRHECCIDLRK 283 (319)
Q Consensus 261 ~~~d~ILG~D~L~~~~~~ID~~~ 283 (319)
.++|++||+|+|++|+|+||+++
T Consensus 81 ~~~d~llGLdmLkrhqc~IdL~k 103 (103)
T cd05480 81 NEKNFSLGLQTLKSLKCVINLEK 103 (103)
T ss_pred CCcceEeeHHHHhhcceeeeccC
Confidence 99999999999999999999975
No 4
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.97 E-value=8.5e-31 Score=217.95 Aligned_cols=123 Identities=63% Similarity=1.084 Sum_probs=116.6
Q ss_pred HhHhcCCCCccccceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEE
Q psy17672 166 AAMEYNPETFGSVVMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVA 245 (319)
Q Consensus 166 ~a~~~~pe~~~~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~ 245 (319)
.|+|+.|+.|..++++|++++|||++++|||||||++|+||+++|+|+||....+.++.+.+.|+|+.++.|+++.++++
T Consensus 2 ~~~~~~~~~~~~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~ 81 (124)
T cd05479 2 NAMEHHPESFGKVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVK 81 (124)
T ss_pred chhhcCcchhceeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEE
Confidence 57999999999999999999999999999999999999999999999999977677777788888888899999999999
Q ss_pred EcCEEEEeeEEEecCCCCceeechhhhhhccEEEEccCCEEEE
Q psy17672 246 IEKDFLTTSLSILEEQPMDMLLGLDMLRRHECCIDLRKNVLRI 288 (319)
Q Consensus 246 IG~~~~~~~f~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i 288 (319)
||++.++++|.|++..++|+|||||||++++++|||++++|+|
T Consensus 82 i~~~~~~~~~~Vl~~~~~d~ILG~d~L~~~~~~ID~~~~~i~~ 124 (124)
T cd05479 82 IGNLFLPCSFTVLEDDDVDFLIGLDMLKRHQCVIDLKENVLRI 124 (124)
T ss_pred ECCEEeeeEEEEECCCCcCEEecHHHHHhCCeEEECCCCEEEC
Confidence 9999999999999998999999999999999999999999985
No 5
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=99.88 E-value=4.1e-22 Score=168.21 Aligned_cols=113 Identities=21% Similarity=0.274 Sum_probs=98.1
Q ss_pred ceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEe
Q psy17672 179 VMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSIL 258 (319)
Q Consensus 179 ~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl 258 (319)
.-+...+.|+++++.+||||||+|||||.++|+++|++.....+...+..+.|...+.+.++.+++++++..+..+|.|+
T Consensus 20 ~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~~~g~~~~~~~~~~~~~~~i~g~~~~~dl~vl 99 (135)
T PF08284_consen 20 DVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVSAPGGSINCEGVCPDVPLSIQGHEFVVDLLVL 99 (135)
T ss_pred CeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEecccccccccceeeeEEEEECCeEEEeeeEEe
Confidence 45778899999999999999999999999999999998664333332333334556678889999999999999999999
Q ss_pred cCCCCceeechhhhhhccEEEEccCCEEEEccC
Q psy17672 259 EEQPMDMLLGLDMLRRHECCIDLRKNVLRIGTT 291 (319)
Q Consensus 259 ~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 291 (319)
+..++|+|||||||++|++.|||.++++.|..+
T Consensus 100 ~l~~~DvILGm~WL~~~~~~IDw~~k~v~f~~p 132 (135)
T PF08284_consen 100 DLGGYDVILGMDWLKKHNPVIDWATKTVTFNSP 132 (135)
T ss_pred cccceeeEeccchHHhCCCEEEccCCEEEEeCC
Confidence 999999999999999999999999999999874
No 6
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.75 E-value=1.7e-17 Score=137.55 Aligned_cols=110 Identities=21% Similarity=0.233 Sum_probs=85.4
Q ss_pred cccceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEE-ee
Q psy17672 176 GSVVMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLT-TS 254 (319)
Q Consensus 176 ~~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~-~~ 254 (319)
.+..|++++++|||+++.|+|||||++|+||+++|+++|+...... +.....+.++......+....++||++.+. +.
T Consensus 7 ~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~-~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~ 85 (121)
T TIGR02281 7 DGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLG-YTVTVSTANGQIKAARVTLDRVAIGGIVVNDVD 85 (121)
T ss_pred cCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCC-ceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcE
Confidence 4678999999999999999999999999999999999999854322 122233332222345667889999999877 89
Q ss_pred EEEecCCC-Cceeechhhhhhcc-EEEEccCCEEEE
Q psy17672 255 LSILEEQP-MDMLLGLDMLRRHE-CCIDLRKNVLRI 288 (319)
Q Consensus 255 f~Vl~~~~-~d~ILG~D~L~~~~-~~ID~~~~~L~i 288 (319)
+.|++... .++|||||||+++. +.||- ++|.+
T Consensus 86 ~~v~~~~~~~~~LLGm~fL~~~~~~~~~~--~~l~l 119 (121)
T TIGR02281 86 AMVAEGGALSESLLGMSFLNRLSRFTVRG--GKLIL 119 (121)
T ss_pred EEEeCCCcCCceEcCHHHHhccccEEEEC--CEEEE
Confidence 99998753 58999999999997 77665 55544
No 7
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.72 E-value=6.1e-17 Score=127.16 Aligned_cols=91 Identities=24% Similarity=0.243 Sum_probs=76.9
Q ss_pred EEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEecC
Q psy17672 181 LYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSILEE 260 (319)
Q Consensus 181 l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~~ 260 (319)
.|+.++|||++++++|||||++|+||.+.+.++|++...+....-...+....++.|.+ .+++++|+..+..+|+|++.
T Consensus 1 ~~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~-~~~v~~~~~~~~~~~~v~~~ 79 (91)
T cd05484 1 KTVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQI-LVTVKYGGKTKVLTLYVVKN 79 (91)
T ss_pred CEEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEE-EEEEEECCEEEEEEEEEEEC
Confidence 37899999999999999999999999999999999874443333222333356889998 68999999999999999998
Q ss_pred CCCceeechhhhh
Q psy17672 261 QPMDMLLGLDMLR 273 (319)
Q Consensus 261 ~~~d~ILG~D~L~ 273 (319)
. ++.|||+|||.
T Consensus 80 ~-~~~lLG~~wl~ 91 (91)
T cd05484 80 E-GLNLLGRDWLD 91 (91)
T ss_pred C-CCCccChhhcC
Confidence 7 99999999984
No 8
>KOG0010|consensus
Probab=99.68 E-value=8.6e-17 Score=157.51 Aligned_cols=118 Identities=20% Similarity=0.376 Sum_probs=89.9
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEccCCCCCCC------ccccCCCCc------
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMPGRGPTYH------VAQSRHTAP------ 69 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~~~~p~~~------~s~~~~p~~------ 69 (319)
||.+||++|...+++|+++++|||.||+|+| .+||..|||+||.|||||++......+ .++...|+.
T Consensus 36 sV~qlKE~I~~~f~a~~dqlvLIfaGrILKD-~dTL~~~gI~Dg~TvHLVik~~~~~~~~~t~ap~~t~~~P~~~st~~~ 114 (493)
T KOG0010|consen 36 SVLQLKELIAQRFGAPPDQLVLIYAGRILKD-DDTLKQYGIQDGHTVHLVIKSQPRPTGTATSAPSSTASTPNNISTGRS 114 (493)
T ss_pred HHHHHHHHHHHhcCCChhHeeeeecCccccC-hhhHHHcCCCCCcEEEEEeccCCCCCCccccccccccCCCCCCCCccc
Confidence 7999999999999999999999999999998 689999999999999999764311111 000111100
Q ss_pred cc----------------c-------------c--------------------cccCCCChHHHHHHHhhCHHHHHHHHh
Q psy17672 70 HR----------------S-------------F--------------------MTAQLQDPAHVRDLLLACPDQLALLKQ 100 (319)
Q Consensus 70 ~~----------------~-------------~--------------------~~~~~~dpe~~r~~il~nP~~l~~L~~ 100 (319)
.. . + .+.+++||+.+|++|.+||+|++.| +
T Consensus 115 a~~~p~~l~~~~~~~g~~~~~~~~~~F~~l~~~~q~~~~snpe~~~~~m~nP~vq~ll~Npd~mrq~I~anPqmq~lm-~ 193 (493)
T KOG0010|consen 115 ASSNPFSLLTVGGFAGLSSLGLFAAMFGELQSQMQNQLLSNPEALRQMMENPIVQSLLNNPDLMRQLIMANPQMQDLM-Q 193 (493)
T ss_pred ccCCcccccccccccccccCCcchhhcccccccchhccccCHHHHHHhhhChHHHHHhcChHHHHHHHhcCHHHHHHH-h
Confidence 00 0 0 1246778899999999999988887 8
Q ss_pred cCchhHHHHhcChHHHHHHHHH
Q psy17672 101 NNPRLSEALSTGKDAFTKVFKE 122 (319)
Q Consensus 101 ~nP~La~ai~~~~~~f~~~l~~ 122 (319)
+|||+.+.+ ++|+-.++++..
T Consensus 194 ~npei~h~l-n~p~i~rQtle~ 214 (493)
T KOG0010|consen 194 RNPEIGHLL-NNPLILRQTLES 214 (493)
T ss_pred hCCcchhhh-cChHHHHHHHHh
Confidence 999997666 788888888765
No 9
>PF13650 Asp_protease_2: Aspartyl protease
Probab=99.62 E-value=6.2e-15 Score=113.75 Aligned_cols=88 Identities=24% Similarity=0.395 Sum_probs=69.9
Q ss_pred EEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEE-EeeEEEec-C
Q psy17672 183 INCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFL-TTSLSILE-E 260 (319)
Q Consensus 183 v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~-~~~f~Vl~-~ 260 (319)
|+++|||++++|+|||||+.++|++++|+++|+...... ......|.+...........+++||+..+ .+.+.|++ .
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~~~~~~v~~~~ 79 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKS-VPISVSGAGGSVTVYRGRVDSITIGGITLKNVPFLVVDLG 79 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCc-eeEEEEeCCCCEEEEEEEEEEEEECCEEEEeEEEEEECCC
Confidence 689999999999999999999999999999999855332 11234455433344455556899999887 68999999 6
Q ss_pred CCCceeechhh
Q psy17672 261 QPMDMLLGLDM 271 (319)
Q Consensus 261 ~~~d~ILG~D~ 271 (319)
..+|+|||+||
T Consensus 80 ~~~~~iLG~df 90 (90)
T PF13650_consen 80 DPIDGILGMDF 90 (90)
T ss_pred CCCEEEeCCcC
Confidence 67999999998
No 10
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.61 E-value=5e-15 Score=115.48 Aligned_cols=93 Identities=26% Similarity=0.386 Sum_probs=72.9
Q ss_pred ceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEE-eeEEE
Q psy17672 179 VMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLT-TSLSI 257 (319)
Q Consensus 179 ~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~-~~f~V 257 (319)
.+++++++|||++++|+|||||++|+|+.++|+++++........ ...+++...........+++||+..++ +.+.|
T Consensus 1 ~~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~--~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v 78 (96)
T cd05483 1 GHFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKV--TVQTANGRVRAARVRLDSLQIGGITLRNVPAVV 78 (96)
T ss_pred CcEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcE--EEEecCCCccceEEEcceEEECCcEEeccEEEE
Confidence 378999999999999999999999999999999999832222222 334443333334455678999998776 79999
Q ss_pred ecCCC--Cceeechhhhh
Q psy17672 258 LEEQP--MDMLLGLDMLR 273 (319)
Q Consensus 258 l~~~~--~d~ILG~D~L~ 273 (319)
++... .|+|||+|||+
T Consensus 79 ~d~~~~~~~gIlG~d~l~ 96 (96)
T cd05483 79 LPGDALGVDGLLGMDFLR 96 (96)
T ss_pred eCCcccCCceEeChHHhC
Confidence 99876 99999999985
No 11
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.57 E-value=2e-14 Score=114.19 Aligned_cols=95 Identities=28% Similarity=0.489 Sum_probs=77.8
Q ss_pred eEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeecccee-EEEeEEEEEEEEEcCEEEEeeEEEe
Q psy17672 180 MLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQ-QIIGRIHMVQVAIEKDFLTTSLSIL 258 (319)
Q Consensus 180 ~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~-~~~g~v~~~~i~IG~~~~~~~f~Vl 258 (319)
..++.++|||++++|||||||+.|+|+.+++.+.+.. ......+.|+|.. ...|.+ .+.+++++..+...|+|+
T Consensus 5 rp~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~----~~~~~~v~~~~g~~~~~~~~-~~~v~~~~~~~~~~~~v~ 79 (100)
T PF00077_consen 5 RPYITVKINGKKIKALLDTGADVSIISEKDWKKLGPP----PKTSITVRGAGGSSSILGST-TVEVKIGGKEFNHTFLVV 79 (100)
T ss_dssp SSEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE----EEEEEEEEETTEEEEEEEEE-EEEEEETTEEEEEEEEES
T ss_pred CceEEEeECCEEEEEEEecCCCcceeccccccccccc----ccCCceeccCCCcceeeeEE-EEEEEEECccceEEEEec
Confidence 3578999999999999999999999999998877665 1112245666643 555544 679999999999999999
Q ss_pred cCCCCceeechhhhhhccEEEE
Q psy17672 259 EEQPMDMLLGLDMLRRHECCID 280 (319)
Q Consensus 259 ~~~~~d~ILG~D~L~~~~~~ID 280 (319)
+...++ |||.|||+++++.|+
T Consensus 80 ~~~~~~-ILG~D~L~~~~~~i~ 100 (100)
T PF00077_consen 80 PDLPMN-ILGRDFLKKLNAVIN 100 (100)
T ss_dssp STCSSE-EEEHHHHTTTTCEEE
T ss_pred CCCCCC-EeChhHHHHcCCEEC
Confidence 987788 999999999999875
No 12
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.51 E-value=1.5e-13 Score=111.59 Aligned_cols=99 Identities=19% Similarity=0.297 Sum_probs=74.3
Q ss_pred EEEEEcC------EEEEEEEcCCccccc-cCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeE
Q psy17672 183 INCKVNG------YPVKAFVDSGAQTTI-MSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSL 255 (319)
Q Consensus 183 v~v~Ing------~~v~aLVDTGA~~si-Is~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f 255 (319)
+++.|.+ .+++|||||||+..+ |+.+.|+++|+......+ ...|.|. ...-.+....+.+|+....+.+
T Consensus 2 ~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~~~~-~~tA~G~---~~~~~v~~~~v~igg~~~~~~v 77 (107)
T TIGR03698 2 LDVELSNPKNPEFMEVRALVDTGFSGFLLVPPDIVNKLGLPELDQRR-VYLADGR---EVLTDVAKASIIINGLEIDAFV 77 (107)
T ss_pred EEEEEeCCCCCCceEEEEEEECCCCeEEecCHHHHHHcCCCcccCcE-EEecCCc---EEEEEEEEEEEEECCEEEEEEE
Confidence 4566644 489999999999997 999999999998754322 2234442 2233466789999998875555
Q ss_pred EEecCCCCceeechhhhhhccEEEEccCCEE
Q psy17672 256 SILEEQPMDMLLGLDMLRRHECCIDLRKNVL 286 (319)
Q Consensus 256 ~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L 286 (319)
.+.+..+ +.|||+.||.+++..|||++++|
T Consensus 78 ~~~~~~~-~~LLG~~~L~~l~l~id~~~~~~ 107 (107)
T TIGR03698 78 ESLGYVD-EPLLGTELLEGLGIVIDYRNQGL 107 (107)
T ss_pred EecCCCC-ccEecHHHHhhCCEEEehhhCcC
Confidence 5445434 89999999999999999999864
No 13
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=99.50 E-value=1.2e-13 Score=107.59 Aligned_cols=85 Identities=21% Similarity=0.310 Sum_probs=64.1
Q ss_pred EEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEE-EEEEcCEEEEeeEEEecCC
Q psy17672 183 INCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMV-QVAIEKDFLTTSLSILEEQ 261 (319)
Q Consensus 183 v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~-~i~IG~~~~~~~f~Vl~~~ 261 (319)
|++.|||+++.|||||||+.|+|+.+.|+++.+. . ......|+|.....-..... .+++|+..+...+.|.+.
T Consensus 1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~~~~---~--~~~~v~gagG~~~~~v~~~~~~v~vg~~~~~~~~~v~~~- 74 (86)
T cd06095 1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQELS---T--TSVLIRGVSGQSQQPVTTYRTLVDLGGHTVSHSFLVVPN- 74 (86)
T ss_pred CEEEECCEEEEEEEECCCCeEEECHHHhhhccCC---C--CcEEEEeCCCcccccEEEeeeEEEECCEEEEEEEEEEcC-
Confidence 5789999999999999999999999999998221 2 22256676644311111223 699999998888988875
Q ss_pred CCceeechhhhh
Q psy17672 262 PMDMLLGLDMLR 273 (319)
Q Consensus 262 ~~d~ILG~D~L~ 273 (319)
..+.|||||||+
T Consensus 75 ~~~~lLG~dfL~ 86 (86)
T cd06095 75 CPDPLLGRDLLS 86 (86)
T ss_pred CCCcEechhhcC
Confidence 369999999985
No 14
>TIGR00601 rad23 UV excision repair protein Rad23. All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=99.42 E-value=9.7e-13 Score=128.25 Aligned_cols=51 Identities=18% Similarity=0.346 Sum_probs=48.0
Q ss_pred CCHHHHHHHHHhhhC---CCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECG---IDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~g---ip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+||.+||.+|+.+.| +|+++|+|||+||+|+| ++||++|||++|++|+++.
T Consensus 21 ~TV~dLK~kI~~~~g~~~ip~~~QkLIy~GkiL~D-d~tL~dy~I~e~~~Ivvmv 74 (378)
T TIGR00601 21 ETVKELKEKIEAEQGKDAYPVAQQKLIYSGKILSD-DKTVREYKIKEKDFVVVMV 74 (378)
T ss_pred ChHHHHHHHHHHhhCCCCCChhHeEEEECCEECCC-CCcHHHcCCCCCCEEEEEe
Confidence 589999999999998 99999999999999998 6899999999999999884
No 15
>cd01815 BMSC_UbP_N Ubiquitin-like domain of BMSC-UbP. BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain.
Probab=99.40 E-value=3e-13 Score=102.49 Aligned_cols=52 Identities=15% Similarity=0.298 Sum_probs=47.5
Q ss_pred CCHHHHHHHHHhhh--CCC-CCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDEC--GID-ATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~--gip-~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||++|++++ |++ +++|+|||+||+|+| ++||++|||++|++|||+++
T Consensus 21 ~TV~~LK~kI~~~~~egi~~~dqQrLIy~GKiL~D-~~TL~dygI~~gstlhLv~~ 75 (75)
T cd01815 21 YQVSTLKQLIAAQLPDSLPDPELIDLIHCGRKLKD-DQTLDFYGIQSGSTIHILRK 75 (75)
T ss_pred CcHHHHHHHHHHhhccCCCChHHeEEEeCCcCCCC-CCcHHHcCCCCCCEEEEEeC
Confidence 58999999999996 575 999999999999998 68999999999999999863
No 16
>cd01796 DDI1_N DNA damage inducible protein 1 ubiquitin-like domain. DDI1_N DDI1 (DNA damage inducible protein 1) has an amino-terminal ubiquitin-like domain, an retroviral protease-like (RVP-like) domain, and a UBA (ubiquitin-associated) domain. This CD represents the amino-terminal ubiquitin-like domain of DDI1.
Probab=99.38 E-value=4.8e-13 Score=100.66 Aligned_cols=51 Identities=33% Similarity=0.755 Sum_probs=48.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+||++||++|++++|+|+++|+|+|+||+|.|+..+|++|||++|++|||.
T Consensus 20 ~TV~~lK~~I~~~~gip~~~q~Li~~Gk~L~D~~~~L~~~gi~~~~~l~l~ 70 (71)
T cd01796 20 LELENFKALCEAESGIPASQQQLIYNGRELVDNKRLLALYGVKDGDLVVLR 70 (71)
T ss_pred CCHHHHHHHHHHHhCCCHHHeEEEECCeEccCCcccHHHcCCCCCCEEEEe
Confidence 699999999999999999999999999999885578999999999999985
No 17
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.37 E-value=3.8e-12 Score=112.76 Aligned_cols=99 Identities=22% Similarity=0.213 Sum_probs=77.7
Q ss_pred ccceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEE-eeE
Q psy17672 177 SVVMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLT-TSL 255 (319)
Q Consensus 177 ~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~-~~f 255 (319)
...|+.+++.|||++|++||||||+...++.+.|+|+|+..-...-...+...-|.++ -..|....++||++.+. +..
T Consensus 102 ~~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~-AA~V~Ld~v~IG~I~~~nV~A 180 (215)
T COG3577 102 RDGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRAR-AAPVTLDRVQIGGIRVKNVDA 180 (215)
T ss_pred CCCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCccc-cceEEeeeEEEccEEEcCchh
Confidence 5679999999999999999999999999999999999998653322221222223221 23577889999998765 899
Q ss_pred EEecCCC-Cceeechhhhhhcc
Q psy17672 256 SILEEQP-MDMLLGLDMLRRHE 276 (319)
Q Consensus 256 ~Vl~~~~-~d~ILG~D~L~~~~ 276 (319)
.|+++.. -..+|||+||.+++
T Consensus 181 ~V~~~g~L~~sLLGMSfL~rL~ 202 (215)
T COG3577 181 MVAEDGALDESLLGMSFLNRLS 202 (215)
T ss_pred heecCCccchhhhhHHHHhhcc
Confidence 9998654 56999999999997
No 18
>cd01807 GDX_N ubiquitin-like domain of GDX. GDX contains an N-terminal ubiquitin-like domain as well as an uncharacterized c-terminal domain. The function of GDX is unknown.
Probab=99.36 E-value=7.7e-13 Score=100.09 Aligned_cols=52 Identities=13% Similarity=0.211 Sum_probs=49.6
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||+.|+++.|+|+++|+|+|+|++|.| +.||++|||++|++|+|+.+
T Consensus 21 ~tV~~lK~~i~~~~gi~~~~q~L~~~G~~L~d-~~~L~~~~i~~~~~l~l~~~ 72 (74)
T cd01807 21 ESVSTLKKLVSEHLNVPEEQQRLLFKGKALAD-DKRLSDYSIGPNAKLNLVVR 72 (74)
T ss_pred CcHHHHHHHHHHHHCCCHHHeEEEECCEECCC-CCCHHHCCCCCCCEEEEEEc
Confidence 58999999999999999999999999999998 68999999999999999875
No 19
>cd01793 Fubi Fubi ubiquitin-like protein. Fubi is a ubiquitin-like protein encoded by the fau gene which has an N-terminal ubiquitin-like domain (also referred to as FUBI) fused to the ribosomal protein S30. Fubi is thought to be a tumor suppressor protein and the FUBI domain may act as a substitute or an inhibitor of ubiquitin or one of ubiquitin's close relatives UCRP, FAT10, and Nedd8.
Probab=99.36 E-value=7.3e-13 Score=100.25 Aligned_cols=52 Identities=29% Similarity=0.287 Sum_probs=49.3
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||++|+++.|||+++|+|+|+||+|+| ++||++|||+++++|||+.+
T Consensus 19 ~tV~~lK~~i~~~~gip~~~q~Li~~Gk~L~D-~~tL~~~~i~~~~tl~l~~~ 70 (74)
T cd01793 19 ETVSDIKAHVAGLEGIDVEDQVLLLAGVPLED-DATLGQCGVEELCTLEVAGR 70 (74)
T ss_pred CcHHHHHHHHHhhhCCCHHHEEEEECCeECCC-CCCHHHcCCCCCCEEEEEEe
Confidence 58999999999999999999999999999998 69999999999999999864
No 20
>cd01797 NIRF_N amino-terminal ubiquitin-like domain of Np95 and NIRF. NIRF_N This CD represents the amino-terminal ubiquitin-like domain of a family of nuclear proteins that includes Np95 and NIRF (Np95/ICBP90-like RING finger) protein. Both Np95 and NIRF have a domain architecture consisting of a ubiquitin-like domain, a PHD finger, a YDG/SRA domain, Rb-binding motifs and a RING finger domain. Both Np95 and NIRF are ubiquitin ligases that ubiquitinate PCNP (PEST-containing nuclear proteins). While Np95 is capable of binding histones, NIRF is involved in cell cycle regulation.
Probab=99.32 E-value=1.8e-12 Score=99.35 Aligned_cols=52 Identities=25% Similarity=0.299 Sum_probs=49.7
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||++|++.+|||+++|+|+|+||+|+| +.||++|||++|++|+|+.+
T Consensus 23 ~TV~~lK~~i~~~~gi~~~~QrLi~~Gk~L~D-~~tL~~y~i~~~~~i~l~~~ 74 (78)
T cd01797 23 TKVEELREKIQELFNVEPECQRLFYRGKQMED-GHTLFDYNVGLNDIIQLLVR 74 (78)
T ss_pred CcHHHHHHHHHHHhCCCHHHeEEEeCCEECCC-CCCHHHcCCCCCCEEEEEEe
Confidence 58999999999999999999999999999998 68999999999999999875
No 21
>cd01798 parkin_N amino-terminal ubiquitin-like of parkin protein. parkin_N parkin protein is a RING-type E3 ubiquitin ligase with an amino-terminal ubiquitin-like (Ubl) domain and an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain. Naturally occurring mutations in parkin are thought to cause the disease AR_JP (autosomal-recessive juvenile parkinsonism). Parkin binds the Rpn10 subunit of 26S proteasomes through its Ubl domain.
Probab=99.31 E-value=2e-12 Score=96.67 Aligned_cols=51 Identities=14% Similarity=0.330 Sum_probs=48.8
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+||++||++|+++.|+|+++|+|+|+|++|.| +.||++|||++|++|||+.
T Consensus 19 ~tV~~lK~~i~~~~gi~~~~q~Li~~G~~L~d-~~~l~~~~i~~~stl~l~~ 69 (70)
T cd01798 19 TDIKQLKEVVAKRQGVPPDQLRVIFAGKELRN-TTTIQECDLGQQSILHAVR 69 (70)
T ss_pred ChHHHHHHHHHHHHCCCHHHeEEEECCeECCC-CCcHHHcCCCCCCEEEEEe
Confidence 58999999999999999999999999999998 6999999999999999985
No 22
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.29 E-value=3.7e-11 Score=88.66 Aligned_cols=89 Identities=33% Similarity=0.514 Sum_probs=68.4
Q ss_pred EEEEEcCEEEEEEEcCCccccccCHHHHHHcCC-ccccccceeeEeecc-c-eeEEEeEEEEEEEEEcCEEEEeeEEEec
Q psy17672 183 INCKVNGYPVKAFVDSGAQTTIMSAKCAERCNI-MRLIDTRWAGVAKGV-G-VQQIIGRIHMVQVAIEKDFLTTSLSILE 259 (319)
Q Consensus 183 v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl-~~~~~~~~~g~a~Gv-g-~~~~~g~v~~~~i~IG~~~~~~~f~Vl~ 259 (319)
+++.+||.++.+++||||++++|+.+++++.++ ......... ..+. | .....+.+..+.+.+++..+...|.|.+
T Consensus 1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 78 (92)
T cd00303 1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLK--VKGANGSSVKTLGVILPVTIGIGGKTFTVDFYVLD 78 (92)
T ss_pred CEEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceE--EEecCCCEeccCcEEEEEEEEeCCEEEEEEEEEEc
Confidence 468899999999999999999999999999987 322222222 2222 2 2233344456788999999999999999
Q ss_pred CCCCceeechhhhh
Q psy17672 260 EQPMDMLLGLDMLR 273 (319)
Q Consensus 260 ~~~~d~ILG~D~L~ 273 (319)
...+++|||+|||+
T Consensus 79 ~~~~~~ilG~~~l~ 92 (92)
T cd00303 79 LLSYDVILGRPWLE 92 (92)
T ss_pred CCCcCEEecccccC
Confidence 88999999999984
No 23
>cd01794 DC_UbP_C dendritic cell derived ubiquitin-like protein. DC_UbP (dendritic cell derived ubiquitin-like protein) is a ubiquitin-like protein from human dendritic cells that is expressed in the mitochondrion. The ubiquitin-like domain of this protein is found at the C-terminus and lacks the canonical gly-gly motif of ubiquitin required for ubiquitinization. DC_UbP is expressed in tumor cells but not in normal human adult tissue suggesting a role for DC_UbP in tumorogenesis.
Probab=99.28 E-value=3.5e-12 Score=95.80 Aligned_cols=51 Identities=20% Similarity=0.244 Sum_probs=48.4
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+||.+||++|++..|+|+++|+|+|+|++|+| +.||++|||++|++|||+.
T Consensus 19 ~TV~~lK~~I~~~~gi~~~~q~Li~~G~~L~D-~~~l~~~~i~~~~tv~~~~ 69 (70)
T cd01794 19 DTVGQLKKQLQAAEGVDPCCQRWFFSGKLLTD-KTRLQETKIQKDYVVQVIV 69 (70)
T ss_pred ChHHHHHHHHHHHhCCCHHHeEEEECCeECCC-CCCHHHcCCCCCCEEEEEe
Confidence 58999999999999999999999999999998 6899999999999999874
No 24
>cd01791 Ubl5 UBL5 ubiquitin-like modifier. UBL5 (also known as HUB1) is a ubiquitin-like modifier that is both widely expressed and highly phylogenetically conserved. At the C-terminal end of the ubiquitin-like fold of UBL5 is a di-tyrosine motif followed by a single variable residue instead of the characteristic di-glycine found in all other ubiquitin-like modifiers. ULB5 interacts with a cyclin-like kinase called CLK4 but not with other cyclin-like kinase family members.
Probab=99.28 E-value=3.7e-12 Score=96.47 Aligned_cols=50 Identities=10% Similarity=0.131 Sum_probs=47.4
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
.||++||++|++..|+|+++|+|+|.|++|+| ++||++|||++|++|||-
T Consensus 22 ~TV~~LK~~I~~~~~~~~~~qrLi~~Gk~L~D-~~tL~~ygi~~~stv~l~ 71 (73)
T cd01791 22 DTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKD-HISLGDYEIHDGMNLELY 71 (73)
T ss_pred CcHHHHHHHHHHHhCCChHHEEEEeCCcCCCC-CCCHHHcCCCCCCEEEEE
Confidence 48999999999999999999999999999998 689999999999999984
No 25
>cd01810 ISG15_repeat2 ISG15 ubiquitin-like protein, second repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains and becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.27 E-value=4.5e-12 Score=95.91 Aligned_cols=52 Identities=13% Similarity=0.249 Sum_probs=49.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||+.|++..|||+++|+|+|+|++|.| ++||++|||++|++|+|+.+
T Consensus 19 ~tV~~lK~~I~~~~gi~~~~q~L~~~G~~L~D-~~tL~~~~i~~~~tl~l~~~ 70 (74)
T cd01810 19 QTVATLKQQVSQRERVQADQFWLSFEGRPMED-EHPLGEYGLKPGCTVFMNLR 70 (74)
T ss_pred ChHHHHHHHHHHHhCCCHHHeEEEECCEECCC-CCCHHHcCCCCCCEEEEEEE
Confidence 58999999999999999999999999999998 68999999999999999864
No 26
>KOG0011|consensus
Probab=99.27 E-value=1.8e-11 Score=114.99 Aligned_cols=50 Identities=22% Similarity=0.314 Sum_probs=47.6
Q ss_pred CHHHHHHHHHhhhC--CCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECG--IDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~g--ip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.++|.+|+.+.| +|+++|+|||+||+|+| +.|+.+|+|+++++|+|+.
T Consensus 22 tV~evK~kIet~~g~dyP~~~QkLIy~GkiL~D-~~tv~Eykv~E~~fiVvMl 73 (340)
T KOG0011|consen 22 TVVEVKKKIETEKGPDYPAEQQKLIYSGKILKD-ETTVGEYKVKEKKFIVVML 73 (340)
T ss_pred hHHHHHHHHHhccCCCCchhhheeeecceeccC-CcchhhhccccCceEEEEE
Confidence 79999999999998 99999999999999998 6899999999999999984
No 27
>PF00240 ubiquitin: Ubiquitin family; InterPro: IPR000626 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin is a protein of 76 amino acid residues, found in all eukaryotic cells and whose sequence is extremely well conserved from protozoan to vertebrates. Ubiquitin acts through its post-translational attachment (ubiquitinylation) to other proteins, where these modifications alter the function, location or trafficking of the protein, or targets it for destruction by the 26S proteasome []. The terminal glycine in the C-terminal 4-residue tail of ubiquitin can form an isopeptide bond with a lysine residue in the target protein, or with a lysine in another ubiquitin molecule to form a ubiquitin chain that attaches itself to a target protein. Ubiquitin has seven lysine residues, any one of which can be used to link ubiquitin molecules together, resulting in different structures that alter the target protein in different ways. It appears that Lys(11)-, Lys(29) and Lys(48)-linked poly-ubiquitin chains target the protein to the proteasome for degradation, while mono-ubiquitinylated and Lys(6)- or Lys(63)-linked poly-ubiquitin chains signal reversible modifications in protein activity, location or trafficking []. For example, Lys(63)-linked poly-ubiquitinylation is known to be involved in DNA damage tolerance, inflammatory response, protein trafficking and signal transduction through kinase activation []. In addition, the length of the ubiquitin chain alters the fate of the target protein. Regulatory proteins such as transcription factors and histones are frequent targets of ubquitinylation [].; GO: 0005515 protein binding; PDB: 2DZI_A 2XEW_E 3NOB_E 2KWU_B 2Y5B_F 3PHD_G 2KWV_B 2KOX_A 2XK5_B 3NHE_B ....
Probab=99.27 E-value=9.3e-12 Score=92.40 Aligned_cols=52 Identities=27% Similarity=0.440 Sum_probs=49.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||.+||..|+.+.|+|++.|+|+|+|+.|.| +.||++|||++|++|+|+.+
T Consensus 16 ~tV~~lK~~i~~~~~~~~~~~~L~~~G~~L~d-~~tL~~~~i~~~~~I~l~~k 67 (69)
T PF00240_consen 16 DTVADLKQKIAEETGIPPEQQRLIYNGKELDD-DKTLSDYGIKDGSTIHLVIK 67 (69)
T ss_dssp SBHHHHHHHHHHHHTSTGGGEEEEETTEEEST-TSBTGGGTTSTTEEEEEEES
T ss_pred CCHHHhhhhcccccccccccceeeeeeecccC-cCcHHHcCCCCCCEEEEEEe
Confidence 48999999999999999999999999999976 79999999999999999865
No 28
>cd01800 SF3a120_C Ubiquitin-like domain of Mammalian splicing factor SF3a_120. SF3a120_C Mammalian splicing factor SF3a consists of three subunits of 60, 66, and 120 kDa and functions early during pre-mRNA splicing by converting the U2 snRNP to its active form. The 120kDa subunit (SF3a120) has a carboxy-terminal ubiquitin-like domain and two SWAP (suppressor-of-white-apricot) domains, referred to collectively as the SURP module, at its amino-terminus.
Probab=99.22 E-value=1.3e-11 Score=94.05 Aligned_cols=52 Identities=19% Similarity=0.302 Sum_probs=49.4
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||+.|+..+|||+++|+|+|+|+.|+| +.||++|||++|++|+|+.+
T Consensus 18 ~TV~~lK~~i~~~~gip~~~q~L~~~G~~L~d-~~tL~~~~i~~g~~l~v~~~ 69 (76)
T cd01800 18 DPVSVLKVKIHEETGMPAGKQKLQYEGIFIKD-SNSLAYYNLANGTIIHLQLK 69 (76)
T ss_pred CcHHHHHHHHHHHHCCCHHHEEEEECCEEcCC-CCcHHHcCCCCCCEEEEEEe
Confidence 58999999999999999999999999999998 68999999999999999865
No 29
>cd01808 hPLIC_N Ubiquitin-like domain of hPLIC-1 and hPLIC2. hPLIC-1 and hPLIC-2 (human homologs of the yeast ubiquitin-like Dsk2 protein) are type2 UBL's (ubiquitin-like) proteins that are thought to serve as adaptors that link the ubiquitination machinery to the proteasome. The hPLIC's have an N-terminal UBL domain that binds the S5a subunit of the proteasome and a C-terminal UBA (ubiquitin-associated) domain that binds a ubiquitylated protein.
Probab=99.21 E-value=1.6e-11 Score=92.12 Aligned_cols=51 Identities=20% Similarity=0.382 Sum_probs=48.3
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+||.+||+.|+...|+|+++|+|+|+|++|.| ++||++|||++|++|||+.
T Consensus 20 ~TV~~lK~~I~~~~~i~~~~~~Li~~Gk~L~d-~~tL~~~~i~~~stl~l~~ 70 (71)
T cd01808 20 ASVKDFKEAVSKKFKANQEQLVLIFAGKILKD-TDTLTQHNIKDGLTVHLVI 70 (71)
T ss_pred ChHHHHHHHHHHHhCCCHHHEEEEECCeEcCC-CCcHHHcCCCCCCEEEEEE
Confidence 58999999999999999999999999999998 6899999999999999975
No 30
>PTZ00044 ubiquitin; Provisional
Probab=99.20 E-value=1.9e-11 Score=92.58 Aligned_cols=52 Identities=25% Similarity=0.387 Sum_probs=49.3
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||++|+...|+|+++|+|+|+|++|.| +.+|++|||++|++|||+.+
T Consensus 21 ~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d-~~~l~~~~i~~~~~i~l~~~ 72 (76)
T PTZ00044 21 NTVQQVKMALQEKEGIDVKQIRLIYSGKQMSD-DLKLSDYKVVPGSTIHMVLQ 72 (76)
T ss_pred CcHHHHHHHHHHHHCCCHHHeEEEECCEEccC-CCcHHHcCCCCCCEEEEEEE
Confidence 58999999999999999999999999999997 68999999999999999864
No 31
>cd01802 AN1_N ubiquitin-like domain of AN1. AN1 (also known as ANUBL1 and RSD-7) is ubiquitin-like protein with a testis-specific expression in rats that has an N-terminal ubiquitin-like domain and a C-terminal zinc-binding domain. Unlike ubiquitin polyproteins and most ubiquitin fusion proteins, the N-terminal ubiquitin-like domain of An1 does not undergo proteolytic processing. The function of AN1 is unknown.
Probab=99.19 E-value=2.1e-11 Score=98.37 Aligned_cols=52 Identities=19% Similarity=0.254 Sum_probs=49.3
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||.+||++|+...|||+++|+|+|+|++|+| +.||++|||++|++|+|+.+
T Consensus 48 ~TV~~lK~kI~~~~gip~~~QrLi~~Gk~L~D-~~tL~dy~I~~~stL~l~~~ 99 (103)
T cd01802 48 ETVISVKAKIQRLEGIPVAQQHLIWNNMELED-EYCLNDYNISEGCTLKLVLA 99 (103)
T ss_pred CcHHHHHHHHHHHhCCChHHEEEEECCEECCC-CCcHHHcCCCCCCEEEEEEe
Confidence 58999999999999999999999999999998 68999999999999999864
No 32
>cd01805 RAD23_N Ubiquitin-like domain of RAD23. RAD23 belongs to a family of adaptor molecules having affinity for both the proteasome and ubiquitinylated proteins and thought to shuttle these ubiquitinylated proteins to the proteasome for destruction. RAD23 interacts with ubiquitin through its C-terminal ubiquitin-associated domains (UBA) and with the proteasome through its N-terminal ubiquitin-like domain (UBL).
Probab=99.17 E-value=4e-11 Score=90.96 Aligned_cols=52 Identities=19% Similarity=0.333 Sum_probs=48.9
Q ss_pred CCHHHHHHHHHhhhCC--CCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGI--DATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gi--p~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||.+||+.|++.+|+ |+++|+|+|+|++|.| +.||++|||++|++|+++.+
T Consensus 21 ~TV~~lK~~i~~~~~i~~~~~~q~L~~~G~~L~d-~~~L~~~~i~~~~~i~~~~~ 74 (77)
T cd01805 21 DTVAELKEKIEEEKGCDYPPEQQKLIYSGKILKD-DTTLEEYKIDEKDFVVVMVS 74 (77)
T ss_pred CcHHHHHHHHHHhhCCCCChhHeEEEECCEEccC-CCCHHHcCCCCCCEEEEEEe
Confidence 5899999999999999 9999999999999997 68999999999999998865
No 33
>cd01799 Hoil1_N Ubiquitin-like domain of HOIL1. HOIL1_N HOIL-1 (heme-oxidized IRP2 ubiquitin ligase-1) is an E3 ubiquitin-protein ligase that recognizes heme-oxidized IRP2 (iron regulatory protein2) and is thought to affect the turnover of oxidatively damaged proteins. Hoil-1 has an amino-terminal ubiquitin-like domain as well as an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain.
Probab=99.16 E-value=2.9e-11 Score=92.02 Aligned_cols=50 Identities=24% Similarity=0.294 Sum_probs=46.5
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCC-CCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVR-DGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~-~~d~l~l~ 51 (319)
+||++||++|+..+|||+++|+| |+|+.|.||++||++|||+ +||+|||-
T Consensus 23 ~TV~~lK~kI~~~~gip~~~QrL-~~G~~L~dD~~tL~~ygi~~~g~~~~l~ 73 (75)
T cd01799 23 MTVAQLKDKVFLDYGFPPAVQRW-VIGQRLARDQETLYSHGIRTNGDSAFLY 73 (75)
T ss_pred CcHHHHHHHHHHHHCcCHHHEEE-EcCCeeCCCcCCHHHcCCCCCCCEEEEE
Confidence 69999999999999999999999 9999997557899999999 88999984
No 34
>cd01790 Herp_N Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein. Herp (Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein) , is an integral membrane protein that is induced by the endoplasmic reticulum (ER) stress response pathway and is involved in improving the balance of folding capacity and protein loads in the ER. Herp has an N-terminal ubiquitin-like domain that is involved in Herp degradation, but is not necessary for its enhancement of amyloid beta-protein generation.
Probab=99.16 E-value=3e-11 Score=92.64 Aligned_cols=51 Identities=14% Similarity=0.075 Sum_probs=46.2
Q ss_pred CCHHHHHHHHHhhhC--CCCCCeEEEecCeeccCCcccccccC--CCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECG--IDATDMILLNNGKHLLEDGSCLKQAG--VRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~g--ip~~~q~Li~~Gk~L~d~~~tL~~~g--I~~~d~l~l~~ 52 (319)
+||.+||++|+...+ .|+++|+|||+||+|+| +.||++|+ +++|.+|||+.
T Consensus 24 ~TV~~lK~~i~~~~~~~~~~~~QrLIy~GKiLkD-~~tL~~~~~~~~~~~tiHLV~ 78 (79)
T cd01790 24 WTVGELKTHLSRVYPSKPLEQDQRLIYSGKLLPD-HLKLRDVLRKQDEYHMVHLVC 78 (79)
T ss_pred ChHHHHHHHHHHhcCCCCChhHeEEEEcCeeccc-hhhHHHHhhcccCCceEEEEe
Confidence 589999999999874 55899999999999998 69999997 99999999984
No 35
>cd01804 midnolin_N Ubiquitin-like domain of midnolin. midnolin_N Midnolin (midbrain nucleolar protein) is expressed in the nucleolus and is thought to regulate genes involved in neurogenesis. Midnolin contains an amino-terminal ubiquitin-like domain.
Probab=99.16 E-value=3.6e-11 Score=92.09 Aligned_cols=51 Identities=20% Similarity=0.291 Sum_probs=48.0
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||++||+.|+.+.|+|+++|+|+|.|++|.| + ||++|||++|++|+|+..
T Consensus 22 ~TV~~LK~~I~~~~~~~~~~qrL~~~Gk~L~d-~-~L~~~gi~~~~~i~l~~~ 72 (78)
T cd01804 22 ETVEGLKKRISQRLKVPKERLALLHRETRLSS-G-KLQDLGLGDGSKLTLVPT 72 (78)
T ss_pred CHHHHHHHHHHHHhCCChHHEEEEECCcCCCC-C-cHHHcCCCCCCEEEEEee
Confidence 48999999999999999999999999999998 5 899999999999999864
No 36
>cd01806 Nedd8 Nebb8-like ubiquitin protein. Nedd8 (also known as Rub1) has a single conserved ubiquitin-like domain that is part of a protein modification pathway similar to that of ubiquitin. Nedd8 modifies a family of molecular scaffold proteins called cullins that are responsible for assembling the ROC1/Rbx1 RING-based E3 ubiquitin ligases, of which several play a direct role in tumorigenesis.
Probab=99.14 E-value=6.1e-11 Score=89.36 Aligned_cols=52 Identities=15% Similarity=0.314 Sum_probs=49.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||++||++|+..+|+|++.|+|+|+|+.|.| +.||++|||++|++|||+.+
T Consensus 21 ~tv~~lK~~i~~~~g~~~~~qrL~~~g~~L~d-~~tl~~~~i~~g~~i~l~~~ 72 (76)
T cd01806 21 DKVERIKERVEEKEGIPPQQQRLIYSGKQMND-DKTAADYKLEGGSVLHLVLA 72 (76)
T ss_pred CCHHHHHHHHhHhhCCChhhEEEEECCeEccC-CCCHHHcCCCCCCEEEEEEE
Confidence 48999999999999999999999999999997 68999999999999999864
No 37
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=99.13 E-value=1.9e-10 Score=86.71 Aligned_cols=65 Identities=23% Similarity=0.252 Sum_probs=49.9
Q ss_pred cccceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCcccccc-ceeeEeeccceeEEEeEEE
Q psy17672 176 GSVVMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDT-RWAGVAKGVGVQQIIGRIH 240 (319)
Q Consensus 176 ~~~~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~-~~~g~a~Gvg~~~~~g~v~ 240 (319)
.....++++|.|||+.+.+||||||++||||.++|+|||++..... +......+.....+.|.++
T Consensus 4 ~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~~g~~~ 69 (72)
T PF13975_consen 4 PDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEIRGVAE 69 (72)
T ss_pred ccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCccccceEEE
Confidence 4567899999999999999999999999999999999999977655 3332333333445555554
No 38
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=99.13 E-value=4.2e-10 Score=89.11 Aligned_cols=86 Identities=19% Similarity=0.210 Sum_probs=71.0
Q ss_pred EEEEcC-EEEEEEEcCCccccccCHHHHHHcC---CccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEec
Q psy17672 184 NCKVNG-YPVKAFVDSGAQTTIMSAKCAERCN---IMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSILE 259 (319)
Q Consensus 184 ~v~Ing-~~v~aLVDTGA~~siIs~~~a~rlg---l~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~ 259 (319)
.++||| ++++++|||||+.|+|+.++++++| ...+..+...-.+.+.......|. ..+.+++++..+++.|+|++
T Consensus 2 ~~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~-~~~~v~~~~~~~~~~f~Vvd 80 (93)
T cd05481 2 DMKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGG-VKLKCRYRNPKYNLTFQVVK 80 (93)
T ss_pred ceEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEE-EEEEEEECCcEEEEEEEEEC
Confidence 578999 9999999999999999999999998 666666666545555556688887 46899999999999999999
Q ss_pred CCCCceeechhh
Q psy17672 260 EQPMDMLLGLDM 271 (319)
Q Consensus 260 ~~~~d~ILG~D~ 271 (319)
.. ..-|||++.
T Consensus 81 ~~-~~~lLG~~~ 91 (93)
T cd05481 81 EE-GPPLLGAKA 91 (93)
T ss_pred CC-CCceEcccc
Confidence 63 567888763
No 39
>cd01809 Scythe_N Ubiquitin-like domain of Scythe protein. Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
Probab=99.13 E-value=7e-11 Score=88.11 Aligned_cols=51 Identities=24% Similarity=0.356 Sum_probs=48.3
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||.+||+.|+..+|+|++.|+|+|+|+.|.| +.+|++|||++|++|||+.
T Consensus 21 ~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d-~~~L~~~~i~~~~~l~l~~ 71 (72)
T cd01809 21 ITVLDLKEKIAEEVGIPVEQQRLIYSGRVLKD-DETLSEYKVEDGHTIHLVK 71 (72)
T ss_pred CcHHHHHHHHHHHHCcCHHHeEEEECCEECCC-cCcHHHCCCCCCCEEEEEe
Confidence 48999999999999999999999999999998 6899999999999999874
No 40
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein. This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=99.12 E-value=6e-11 Score=88.42 Aligned_cols=51 Identities=20% Similarity=0.373 Sum_probs=48.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+||++||+.|+..+|+|++.|+|+|+|+.|.| +.+|++|||++|++|+|+.
T Consensus 20 ~tv~~lK~~i~~~~gi~~~~q~L~~~g~~l~d-~~~L~~~~i~~g~~l~v~~ 70 (71)
T cd01812 20 ATFGDLKKMLAPVTGVEPRDQKLIFKGKERDD-AETLDMSGVKDGSKVMLLE 70 (71)
T ss_pred CcHHHHHHHHHHhhCCChHHeEEeeCCcccCc-cCcHHHcCCCCCCEEEEec
Confidence 58999999999999999999999999999987 6899999999999999873
No 41
>cd01803 Ubiquitin Ubiquitin. Ubiquitin (includes Ubq/RPL40e and Ubq/RPS27a fusions as well as homopolymeric multiubiquitin protein chains)
Probab=99.09 E-value=1.1e-10 Score=87.91 Aligned_cols=52 Identities=21% Similarity=0.337 Sum_probs=49.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||++||+.|+..+|+|++.|+|+|+|+.|.| +.||++|||++|++|+|+.+
T Consensus 21 ~tV~~lK~~i~~~~g~~~~~q~L~~~g~~L~d-~~~L~~~~i~~~~~i~l~~~ 72 (76)
T cd01803 21 DTIENVKAKIQDKEGIPPDQQRLIFAGKQLED-GRTLSDYNIQKESTLHLVLR 72 (76)
T ss_pred CcHHHHHHHHHHHhCCCHHHeEEEECCEECCC-CCcHHHcCCCCCCEEEEEEE
Confidence 58999999999999999999999999999998 68999999999999999864
No 42
>cd01792 ISG15_repeat1 ISG15 ubiquitin-like protein, first repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains that becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.09 E-value=8.5e-11 Score=90.34 Aligned_cols=52 Identities=23% Similarity=0.386 Sum_probs=49.0
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEE--EecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMIL--LNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~L--i~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||++||..|+...|+|+++|+| +|+|++|.| +.||++|||++|++|+|+.+
T Consensus 23 ~TV~~lK~~I~~~~~i~~~~qrL~~~~~G~~L~D-~~tL~~~gi~~gs~l~l~~~ 76 (80)
T cd01792 23 MTVSELKQQIAQKIGVPAFQQRLAHLDSREVLQD-GVPLVSQGLGPGSTVLLVVQ 76 (80)
T ss_pred CcHHHHHHHHHHHhCCCHHHEEEEeccCCCCCCC-CCCHHHcCCCCCCEEEEEEE
Confidence 48999999999999999999999 899999998 68999999999999999875
No 43
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=99.02 E-value=3.9e-09 Score=90.67 Aligned_cols=96 Identities=16% Similarity=0.312 Sum_probs=78.1
Q ss_pred ceEEEEEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccc--eeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEE
Q psy17672 179 VMLYINCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTR--WAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLS 256 (319)
Q Consensus 179 ~~l~v~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~--~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~ 256 (319)
....+...|+|.++++|+||||-.|||+.+.+++|+|+...... ++|...+- ... ...+..+++.+++..++..++
T Consensus 33 ~T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~~-~~~-tsEAv~ld~~i~n~~i~i~aY 110 (177)
T PF12384_consen 33 KTAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSGE-SAT-TSEAVTLDFYIDNKLIDIAAY 110 (177)
T ss_pred cEEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccCC-ceE-EEEeEEEEEEECCeEEEEEEE
Confidence 45678899999999999999999999999999999999774433 44433331 122 233445689999999999999
Q ss_pred EecCCCCceeechhhhhhcc
Q psy17672 257 ILEEQPMDMLLGLDMLRRHE 276 (319)
Q Consensus 257 Vl~~~~~d~ILG~D~L~~~~ 276 (319)
|++.-+.|+|+|-++|++|-
T Consensus 111 V~d~m~~dlIIGnPiL~ryp 130 (177)
T PF12384_consen 111 VTDNMDHDLIIGNPILDRYP 130 (177)
T ss_pred EeccCCcceEeccHHHhhhH
Confidence 99999999999999999986
No 44
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=99.02 E-value=3.3e-10 Score=85.97 Aligned_cols=51 Identities=14% Similarity=0.145 Sum_probs=47.7
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEe---cCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLN---NGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~---~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||++||+.|++.+|||+++|+|+| .|++|.| +.+|++|||++|+.|+|+.
T Consensus 20 ~Tv~~lK~~i~~~tgvp~~~QKLi~~~~~Gk~l~D-~~~L~~~~i~~g~~i~lmG 73 (74)
T cd01813 20 DTVLDLKQFIKTLTGVLPERQKLLGLKVKGKPAED-DVKISALKLKPNTKIMMMG 73 (74)
T ss_pred CCHHHHHHHHHHHHCCCHHHEEEEeecccCCcCCC-CcCHHHcCCCCCCEEEEEe
Confidence 4899999999999999999999997 8999998 6899999999999999873
No 45
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.93 E-value=1.6e-08 Score=83.03 Aligned_cols=105 Identities=12% Similarity=0.204 Sum_probs=79.2
Q ss_pred eEEEEEEE----cCE-EEEEEEcCCcc-ccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEe
Q psy17672 180 MLYINCKV----NGY-PVKAFVDSGAQ-TTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTT 253 (319)
Q Consensus 180 ~l~v~v~I----ng~-~v~aLVDTGA~-~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~ 253 (319)
+.+|++.. +|- -..+|||||++ ..+++++.|+++|++.....+.. ....| ...-.|+.+.++|++.....
T Consensus 10 ~~~v~~~f~~~~~Gd~~~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~~~--~a~~~--~v~t~V~~~~iki~g~e~~~ 85 (125)
T COG5550 10 HVTVPVTFRLPGQGDFVYDELIDTGFTGYLVLPPQVAEKLGLPLFSTIRIV--LADGG--VVKTSVALATIKIDGVEKVA 85 (125)
T ss_pred ceeEEEEEEecCCCcEEeeeEEecCCceeEEeCHHHHHhcCCCccCChhhh--hhcCC--EEEEEEEEEEEEECCEEEEE
Confidence 44555554 333 34559999999 89999999999999977554432 22222 33446888999999988777
Q ss_pred eEEEecCCCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 254 SLSILEEQPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 254 ~f~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
...+.+....+ ++|++||+.++..+|++.++|+-.
T Consensus 86 ~Vl~s~~~~~~-liG~~~lk~l~~~vn~~~g~LEk~ 120 (125)
T COG5550 86 FVLASDNLPEP-LIGVNLLKLLGLVVNPKTGKLEKP 120 (125)
T ss_pred EEEccCCCccc-chhhhhhhhccEEEcCCcceEecc
Confidence 76677766666 999999999999999999999854
No 46
>cd01795 USP48_C USP ubiquitin-specific protease. The USP (ubiquitin-specific protease) family is one of at least seven deubiquitylating enzyme (DUB) families capable of deconjugating ubiquitin and ubiquitin-like adducts. While the USP's have a conserved catalytic core domain, they differ in their domain architectures. This subfamily, which includes USP31, and USP48, has a carboxy-terminal ubiquitin-like domain in addition to a DUSP (domain of ubiquitin-specific proteases) domain,
Probab=98.90 E-value=1.9e-09 Score=85.19 Aligned_cols=53 Identities=21% Similarity=0.286 Sum_probs=50.5
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||.+||.+|...+++|+.+|+|+|+|+.|.||.+||++|||..|+.|+|...
T Consensus 25 ~TVg~LK~lImQ~f~V~P~dQkL~~dG~~L~DDsrTLssyGv~sgSvl~Llid 77 (107)
T cd01795 25 QTLKELKIQIMHAFSVAPFDQNLSIDGKILSDDCATLGTLGVIPESVILLKAD 77 (107)
T ss_pred ccHHHHHHHHHHHhcCCcccceeeecCceeccCCccHHhcCCCCCCEEEEEec
Confidence 38999999999999999999999999999999899999999999999999874
No 47
>cd01814 NTGP5 Ubiquitin-like NTGP5 and ATGP4. NTGP5 and ATGP4 are plant-specific isoprenylated GTP-binding proteins with a single fold that resembles ubiquitin. The function of these proteins is unknown.
Probab=98.84 E-value=3e-09 Score=86.21 Aligned_cols=54 Identities=11% Similarity=0.064 Sum_probs=46.9
Q ss_pred CCHHHHHHHHHhhh-----CCC--CCCeEEEecCeeccCCcccccccC------CCCCcEEEEEccCC
Q psy17672 1 MDVEQFRLICQDEC-----GID--ATDMILLNNGKHLLEDGSCLKQAG------VRDGDIILVAMPGR 55 (319)
Q Consensus 1 ~tv~~lK~~i~~e~-----gip--~~~q~Li~~Gk~L~d~~~tL~~~g------I~~~d~l~l~~~~~ 55 (319)
.||++||++|+++. |+| +++|+|||+||+|+| ++||++|+ +....|+||+.+.+
T Consensus 26 dTV~~lKekI~~~~p~~ke~~P~~~~~qKLIysGKiLeD-~~TL~d~~~p~g~~~~~~~TmHvvlr~~ 92 (113)
T cd01814 26 TTVDFLKERVVSQWPKDKEVGPKTVNEVKLISAGKILEN-SKTVGECRSPVGDIAGGVITMHVVVQPP 92 (113)
T ss_pred hHHHHHHHHHHHhcccccccCCCCHHHeEEEeCCeecCC-CCcHHHhCCcccccCCCceEEEEEecCC
Confidence 48999999999665 455 999999999999998 69999999 87889999998643
No 48
>smart00213 UBQ Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression
Probab=98.84 E-value=2.7e-09 Score=77.15 Aligned_cols=45 Identities=22% Similarity=0.327 Sum_probs=42.6
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGD 46 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d 46 (319)
+||++||++|+..+|+|++.|+|+|+|+.|.| +.||++|||++|+
T Consensus 20 ~tv~~lk~~i~~~~~~~~~~~~L~~~g~~L~d-~~tL~~~~i~~~~ 64 (64)
T smart00213 20 DTVSELKEKIAELTGIPVEQQRLIYKGKVLED-DRTLADYNIQDGS 64 (64)
T ss_pred CcHHHHHHHHHHHHCCCHHHEEEEECCEECCC-CCCHHHcCCcCCC
Confidence 58999999999999999999999999999998 6899999999985
No 49
>KOG0003|consensus
Probab=98.82 E-value=3.2e-10 Score=90.31 Aligned_cols=52 Identities=19% Similarity=0.328 Sum_probs=49.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.||.++|++|...+|||+++|+|+|+||+|+| ..||++|||+..||||++.+
T Consensus 21 ~ti~~vKA~i~~~~Gi~~~~~~L~~~~k~LED-~~Tla~Y~i~~~~Tl~~~~r 72 (128)
T KOG0003|consen 21 DTIDNVKAKIQDKEGIPPDQQRLIFAGKQLED-GRTLADYNIQKESTLHLVLR 72 (128)
T ss_pred chHHHHHHHhccccCCCHHHHHHHhccccccc-CCcccccCccchhhhhhhHH
Confidence 48999999999999999999999999999998 79999999999999998853
No 50
>KOG0005|consensus
Probab=98.81 E-value=1.8e-09 Score=77.21 Aligned_cols=49 Identities=16% Similarity=0.319 Sum_probs=46.3
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|+.+|+.|+++.|||+.+|+|+|.||++.| +.|-++|++.-|++||++
T Consensus 22 kverIKErvEEkeGIPp~qqrli~~gkqm~D-D~tA~~Y~~~~GSVlHlv 70 (70)
T KOG0005|consen 22 KVERIKERVEEKEGIPPQQQRLIYAGKQMND-DKTAAHYNLLGGSVLHLV 70 (70)
T ss_pred HHHHHHHHhhhhcCCCchhhhhhhccccccc-cccHHHhhhccceeEeeC
Confidence 6889999999999999999999999999988 589999999999999974
No 51
>cd01789 Alp11_N Ubiquitin-like domain of Alp11 tubulin-folding cofactor B. Alp11, also known as tubulin-folding cofactor B, is one of at least three proteins required for the proper folding of tubulins prior to their incorporation into microtubules. These cofactors are necessary for the biogenesis of microtubules and for cell viability. Alp11 has three domains including an N-terminal ubiquitin-like domain (represented by this CD) which executes the essential function, a central coiled-coil domain necessary for maintenance of cellular alpha-tubulin levels, and a C-terminal CLIP-170 domain is required for efficient binding to alpha-tubulin.
Probab=98.80 E-value=9.8e-09 Score=79.76 Aligned_cols=53 Identities=26% Similarity=0.278 Sum_probs=48.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEE-EecCe-----eccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMIL-LNNGK-----HLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~L-i~~Gk-----~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||.+||++|+..+|+|+..|+| +|+|+ .|.||.++|++||+++|..|||+-.
T Consensus 23 ~Tv~~lK~kl~~~~G~~~~~mrL~l~~~~~~~~~~l~~d~~~L~~y~~~dg~~IhVvD~ 81 (84)
T cd01789 23 LTIAELKKKLELVVGTPASSMRLQLFDGDDKLVSKLDDDDALLGSYPVDDGCRIHVIDV 81 (84)
T ss_pred CcHHHHHHHHHHHHCCCccceEEEEEcCCCCeEeecCCCccEeeeccCCCCCEEEEEeC
Confidence 69999999999999999999999 58998 6766689999999999999999853
No 52
>cd01763 Sumo Small ubiquitin-related modifier (SUMO). Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
Probab=98.78 E-value=9.7e-09 Score=80.22 Aligned_cols=52 Identities=17% Similarity=0.312 Sum_probs=48.6
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.|++.||..+++..|+|+++|+|+|+|+.|.+ ..|+++||+++||+|+++..
T Consensus 32 ~~l~~l~~~y~~~~gi~~~~~rf~f~G~~L~~-~~T~~~l~m~d~d~I~v~l~ 83 (87)
T cd01763 32 TPLKKLMEAYCQRQGLSMNSVRFLFDGQRIRD-NQTPDDLGMEDGDEIEVMLE 83 (87)
T ss_pred CHHHHHHHHHHHHhCCCccceEEEECCeECCC-CCCHHHcCCCCCCEEEEEEe
Confidence 37899999999999999999999999999997 68999999999999999864
No 53
>cd01769 UBL Ubiquitin-like domain of UBL. UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
Probab=98.78 E-value=9.6e-09 Score=75.28 Aligned_cols=50 Identities=20% Similarity=0.347 Sum_probs=47.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+||++||.+|+..+|+|++.|+|+|+|+.|.| ..+|++|||++|+.|++.
T Consensus 18 ~ti~~lK~~i~~~~~~~~~~~~l~~~g~~l~d-~~~l~~~~v~~~~~i~v~ 67 (69)
T cd01769 18 DTVAELKAKIAAKEGVPPEQQRLIYAGKILKD-DKTLSDYGIQDGSTLHLV 67 (69)
T ss_pred ChHHHHHHHHHHHHCcChHHEEEEECCcCCCC-cCCHHHCCCCCCCEEEEE
Confidence 58999999999999999999999999999987 689999999999999986
No 54
>KOG0004|consensus
Probab=98.74 E-value=4.8e-09 Score=89.08 Aligned_cols=51 Identities=20% Similarity=0.344 Sum_probs=48.5
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||+++|++|+...|||+++|+|||.|++|+| ..||+||+|+..++|+|+.
T Consensus 21 ~ti~~~Kakiq~~egIp~dqqrlifag~qLed-grtlSDY~Iqkestl~l~l 71 (156)
T KOG0004|consen 21 DTIDNVKAKIQDKEGIPPDQQRLIFAGKQLED-GRTLSDYNIQKESTLHLVL 71 (156)
T ss_pred ccHHHHHHhhhcccCCCchhhhhhhhhccccc-CCccccccccccceEEEEE
Confidence 38999999999999999999999999999998 6999999999999999984
No 55
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.63 E-value=2.2e-07 Score=82.99 Aligned_cols=110 Identities=23% Similarity=0.334 Sum_probs=76.3
Q ss_pred ceEEEEEEE--cC---EEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeecc-c-eeEEEeEEEEEEEEEcCEEE
Q psy17672 179 VMLYINCKV--NG---YPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGV-G-VQQIIGRIHMVQVAIEKDFL 251 (319)
Q Consensus 179 ~~l~v~v~I--ng---~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gv-g-~~~~~g~v~~~~i~IG~~~~ 251 (319)
...||.+++ .| ..+.++|||||+..+++..+.-.--+... .... .+.|. + ...+.-++....|.|++..|
T Consensus 3 NsiyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~we~~-~~~i--~v~~an~~~~~i~~~~~~~~i~I~~~~F 79 (201)
T PF02160_consen 3 NSIYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYWEKS-KKPI--KVKGANGSIIQINKKAKNGKIQIADKIF 79 (201)
T ss_pred ccEEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHHHhC-CCcE--EEEEecCCceEEEEEecCceEEEccEEE
Confidence 345666554 55 35799999999999888766522222111 1111 34443 2 34667788889999999999
Q ss_pred EeeEEEecCCCCceeechhhhhhccEEEEccCCEEEEccCC
Q psy17672 252 TTSLSILEEQPMDMLLGLDMLRRHECCIDLRKNVLRIGTTG 292 (319)
Q Consensus 252 ~~~f~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~~ 292 (319)
..++.-.-..+.|+|||+.|++.++.-|.+.+ .+.|+..+
T Consensus 80 ~IP~iYq~~~g~d~IlG~NF~r~y~Pfiq~~~-~I~f~~~~ 119 (201)
T PF02160_consen 80 RIPTIYQQESGIDIILGNNFLRLYEPFIQTED-RIQFHKKG 119 (201)
T ss_pred eccEEEEecCCCCEEecchHHHhcCCcEEEcc-EEEEEeCC
Confidence 98755554578999999999999997788864 57777644
No 56
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=98.62 E-value=9.9e-08 Score=74.52 Aligned_cols=79 Identities=15% Similarity=0.212 Sum_probs=59.2
Q ss_pred EEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCE-EEEeeEEEecCCCCceeech
Q psy17672 191 PVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKD-FLTTSLSILEEQPMDMLLGL 269 (319)
Q Consensus 191 ~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~-~~~~~f~Vl~~~~~d~ILG~ 269 (319)
.+++||||||.+|+|.....++- .....+.-.|.+.-..+..| .+...+.+|.. .|+..|+|.|- ...|||.
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~~l~AANgt~I~tyG-~~~l~ldlGlrr~~~w~FvvAdv--~~pIlGa 81 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPLTLQAANGTPIATYG-TRSLTLDLGLRRPFAWNFVVADV--PHPILGA 81 (89)
T ss_pred CcEEEEeCCCceEeecccccccc----ccCCceEEEeCCCCeEeeee-eEEEEEEcCCCcEEeEEEEEcCC--CcceecH
Confidence 47899999999999998887763 12223322333333457888 56678999985 89999999875 3589999
Q ss_pred hhhhhcc
Q psy17672 270 DMLRRHE 276 (319)
Q Consensus 270 D~L~~~~ 276 (319)
|||++|+
T Consensus 82 DfL~~~~ 88 (89)
T cd06094 82 DFLQHYG 88 (89)
T ss_pred HHHHHcC
Confidence 9999986
No 57
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=98.41 E-value=1.4e-06 Score=68.13 Aligned_cols=86 Identities=15% Similarity=0.164 Sum_probs=59.6
Q ss_pred EEEEcCEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEecCCCC
Q psy17672 184 NCKVNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSILEEQPM 263 (319)
Q Consensus 184 ~v~Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~~~~~ 263 (319)
.++|+|+++.+|+||||..|+|+.....+--- ...++ ....|+|.....-+...+.+++.+......+.|.+....
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~p~~w~--~~~~~--~~i~GIGG~~~~~~~~~v~i~i~~~~~~g~vlv~~~~~P 77 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAENDWPKNWP--IQPAP--SNLTGIGGAITPSQSSVLLLEIDGEGHLGTILVYVLSLP 77 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEcccccCCCCc--cCCCC--eEEEeccceEEEEEEeeEEEEEcCCeEEEEEEEccCCCc
Confidence 57899999999999999999999744322210 11111 156777754333444568899999877777888775345
Q ss_pred ceeechhhhh
Q psy17672 264 DMLLGLDMLR 273 (319)
Q Consensus 264 d~ILG~D~L~ 273 (319)
.-|+|-|.|.
T Consensus 78 ~nllGRd~L~ 87 (87)
T cd05482 78 VNLWGRDILS 87 (87)
T ss_pred ccEEccccCC
Confidence 5799988873
No 58
>PF14560 Ubiquitin_2: Ubiquitin-like domain; PDB: 1WJN_A 2KJ6_A 2KJR_A 1V6E_A 1T0Y_A.
Probab=98.38 E-value=5.7e-07 Score=70.04 Aligned_cols=53 Identities=26% Similarity=0.349 Sum_probs=43.2
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEec----Ce---eccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNN----GK---HLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~----Gk---~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+||.+||.+|+..+|+|++.|+|.+. |. .+.||.++|..||+++|+.|||.-.
T Consensus 24 ~Tv~eLK~kl~~~~Gi~~~~m~L~l~~~~~~~~~~~~~dd~~~L~~y~~~dg~~i~V~D~ 83 (87)
T PF14560_consen 24 ITVSELKQKLEKLTGIPPSDMRLQLKSDKDDSKIEELDDDDATLGSYGIKDGMRIHVVDT 83 (87)
T ss_dssp SBHHHHHHHHHHHHTS-TTTEEEEEE-TSSSSEEEESSGSSSBCCHHT-STTEEEEEEE-
T ss_pred CCHHHHHHHHHHHhCCCcccEEEEEEecCCCccccccCCCccEeecCCCCCCCEEEEEeC
Confidence 69999999999999999999999765 22 2456689999999999999999743
No 59
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=98.36 E-value=1.5e-06 Score=75.28 Aligned_cols=83 Identities=17% Similarity=0.272 Sum_probs=55.3
Q ss_pred EEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccc-e--eEEEeEE-EEEEEEEcCEEEEeeEEEecC-----
Q psy17672 190 YPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVG-V--QQIIGRI-HMVQVAIEKDFLTTSLSILEE----- 260 (319)
Q Consensus 190 ~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg-~--~~~~g~v-~~~~i~IG~~~~~~~f~Vl~~----- 260 (319)
+++++|+||||+.|||+.++|++|+|+...... ...|++ . .+....+ ..+.+++++..+.+.+.+++.
T Consensus 11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~~~~~---~~~~~~g~~~~~~~~~~~~~i~~~~~~~~~~i~alvv~~I~~~l 87 (164)
T PF05585_consen 11 VEARALLDSGSQRSFITESLANKLNLPGTGEKI---LVIGTFGSSSPKSKKCVRVKISSRTSNNSLEIEALVVPKITGNL 87 (164)
T ss_pred EEEEEEEecCCchhHHhHHHHHHhCCCCCCceE---EEEeccCccCccceeEEEEEEEEecCCCceEEEEEecCcccccc
Confidence 368999999999999999999999999653322 223332 1 1222211 123456666667777777762
Q ss_pred --------------------------CCCceeechhhhhhc
Q psy17672 261 --------------------------QPMDMLLGLDMLRRH 275 (319)
Q Consensus 261 --------------------------~~~d~ILG~D~L~~~ 275 (319)
..+|+|||.|++.++
T Consensus 88 ~~~~i~~~~~~~~~~l~lad~~f~~~~~iDiLIG~D~~~~l 128 (164)
T PF05585_consen 88 PSAPIDDSDWKHLNNLPLADPNFRESSPIDILIGADYFWQL 128 (164)
T ss_pred cccccCHHHHhhhcCCccccccccCCCCCeEEEccchHHHH
Confidence 148999999998764
No 60
>PF11976 Rad60-SLD: Ubiquitin-2 like Rad60 SUMO-like; InterPro: IPR022617 This entry includes small ubiquitin-related modifier (SUMO) proteins. SUMOs are small proteins that are covalently attached to lysines as post-translational modifications and are used to control multiple cellular process including signal transduction, nuclear transport and DNA replication and repair []. Unlike ubiquitin, they are not involved in protein degradation. This entry also contains the C-terminal Rad60 DNA repair protein SUMO-like domain.; PDB: 3RD2_A 2JXX_A 3RCZ_A 3GOE_A 3A4S_D 3A4R_B 2IO1_D 1U4A_A 2K1F_A 1WZ0_A ....
Probab=98.32 E-value=9.3e-07 Score=66.00 Aligned_cols=49 Identities=27% Similarity=0.352 Sum_probs=44.0
Q ss_pred CHHHHHHHHHhhhCCCC-CCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDA-TDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~-~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
++..|+..+....|+|+ +..+|+|+|+.|.. ++|++++|+++||+|.|.
T Consensus 22 ~~~~l~~~~~~~~~i~~~~~~~l~fdG~~L~~-~~T~~~~~ied~d~Idv~ 71 (72)
T PF11976_consen 22 TVSKLIEKYCEKKGIPPEESIRLIFDGKRLDP-NDTPEDLGIEDGDTIDVI 71 (72)
T ss_dssp CCHHHHHHHHHHHTTTT-TTEEEEETTEEE-T-TSCHHHHT-STTEEEEEE
T ss_pred cHHHHHHHHHHhhCCCccceEEEEECCEEcCC-CCCHHHCCCCCCCEEEEE
Confidence 67889999999999999 99999999999987 689999999999999985
No 61
>KOG4248|consensus
Probab=98.17 E-value=1.8e-06 Score=91.56 Aligned_cols=51 Identities=20% Similarity=0.357 Sum_probs=48.7
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
|||.+||.+|.++.+|+.+.|+|||.||+|.| ++++.+||| ||.+|||+-+
T Consensus 23 ~ti~~~~d~~r~~~ni~s~~qr~i~~grvl~~-~k~vq~~~v-dgk~~hlver 73 (1143)
T KOG4248|consen 23 MTIKEFKDHIRASVNIPSEKQRLIYQGRVLQD-DKKVQEYNV-DGKVIHLVER 73 (1143)
T ss_pred HHHHHHHHHHHHhcccccccceeeecceeecc-chhhhhccC-CCeEEEeecc
Confidence 79999999999999999999999999999998 689999999 9999999965
No 62
>cd01801 Tsc13_N Ubiquitin-like domain of Tsc13. Tsc13_N N-terminal domain of Tsc13. Tsc13 is an enoyl reductase involved in elongation of long chain fatty acids that localizes to the endoplasmic reticulum and is highly enriched in a novel structure marking nuclear-vacuolar junctions.
Probab=98.07 E-value=5.9e-06 Score=62.91 Aligned_cols=49 Identities=18% Similarity=0.312 Sum_probs=43.5
Q ss_pred CHHHHHHHHHhhhC-CCCCCeEEE--ecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECG-IDATDMILL--NNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~g-ip~~~q~Li--~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
||++||..|+...+ ++++.|+|. +.|+.|.| ++||++|||++|++|++.
T Consensus 24 TV~dlk~~i~~~~~~~~~~Rqrl~~~~~g~~L~d-~~tL~~~gv~~g~~lyvK 75 (77)
T cd01801 24 TIADLKKLIAKSSPQLTVNRQSLRLEPKGKSLKD-DDTLVDLGVGAGATLYVR 75 (77)
T ss_pred cHHHHHHHHHHHcCCCCcceeEEEeCCCCcccCC-cccHhhcCCCCCCEEEEe
Confidence 89999999998875 589999985 89999997 579999999999999873
No 63
>KOG0001|consensus
Probab=97.75 E-value=5.7e-05 Score=54.71 Aligned_cols=51 Identities=22% Similarity=0.397 Sum_probs=47.3
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+|+.+|..|+...|+|.++|++.+.|+.|.| ..+|.+|+|..+.+++++..
T Consensus 21 ~i~~~k~~i~~~~~~~~~~q~~~~~~~~l~d-~~~l~~~~i~~~~~~~l~~~ 71 (75)
T KOG0001|consen 21 TIEVVKAKIRDKEGIPVDQQRLIFGGKPLED-GRTLADYNIQEGSTLHLVLS 71 (75)
T ss_pred HHHHHHHHHHhhcCCCCeeEEEEECCEECcC-CCcHHHhCCCCCCEEEEEEe
Confidence 6788999999999999999999999999998 58999999999999998754
No 64
>PF13881 Rad60-SLD_2: Ubiquitin-2 like Rad60 SUMO-like; PDB: 1SE9_A 1WGH_A 2GOW_A.
Probab=97.75 E-value=7e-05 Score=61.24 Aligned_cols=54 Identities=15% Similarity=0.259 Sum_probs=40.2
Q ss_pred CCHHHHHHHHHhhh-------CCCCCCeEEEecCeeccCCcccccccCCCCCc------EEEEEccCC
Q psy17672 1 MDVEQFRLICQDEC-------GIDATDMILLNNGKHLLEDGSCLKQAGVRDGD------IILVAMPGR 55 (319)
Q Consensus 1 ~tv~~lK~~i~~e~-------gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d------~l~l~~~~~ 55 (319)
+||.+||+.|-.+. -..+...+|||.||.|.| ++||+++++.-|+ ++||+.+..
T Consensus 24 ~TV~~lKe~i~~~WP~d~~~~p~s~~~lRLI~~GriL~d-~~tL~~~~~~~~~~~~~~~vmHlvvrp~ 90 (111)
T PF13881_consen 24 TTVADLKERIWAEWPEDWEERPKSPSDLRLIYAGRILED-NKTLSDCRLPSGETPGGPTVMHLVVRPN 90 (111)
T ss_dssp SBHHHHHHHHHHSSSTTSSSTT-SGGGEEEEETTEEE-S-SSBTGGGT--TTSETT--EEEEEEE-SS
T ss_pred ChHHHHHHHHHHHCccccccCCCChhhEEEEeCCeecCC-cCcHHHhCCCCCCCCCCCEEEEEEecCC
Confidence 58999999998653 234578899999999997 7999999999888 567776543
No 65
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=97.67 E-value=8.5e-05 Score=53.90 Aligned_cols=46 Identities=26% Similarity=0.510 Sum_probs=38.4
Q ss_pred CCCCh--HHHHHHHhhCHH----HHHHHHhcCchhHHHHhcChHHHHHHHHH
Q psy17672 77 QLQDP--AHVRDLLLACPD----QLALLKQNNPRLSEALSTGKDAFTKVFKE 122 (319)
Q Consensus 77 ~~~dp--e~~r~~il~nP~----~l~~L~~~nP~La~ai~~~~~~f~~~l~~ 122 (319)
+.++| ..+|+++.+||+ ++.+|.++||+|++.|.+|++.|.+++.+
T Consensus 4 Lr~~Pqf~~lR~~vq~NP~lL~~lLqql~~~nP~l~q~I~~n~e~Fl~ll~~ 55 (59)
T PF09280_consen 4 LRNNPQFQQLRQLVQQNPQLLPPLLQQLGQSNPQLLQLIQQNPEEFLRLLNE 55 (59)
T ss_dssp GTTSHHHHHHHHHHHC-GGGHHHHHHHHHCCSHHHHHHHHHTHHHHHHHHHS
T ss_pred HHcChHHHHHHHHHHHCHHHHHHHHHHHhccCHHHHHHHHHCHHHHHHHHcC
Confidence 55677 469999999985 55689999999999999999999998864
No 66
>cd00196 UBQ Ubiquitin-like proteins. Ubiquitin homologs; Includes ubiquitin and ubiquitin-like proteins. Ubiquitin-mediated proteolysis is part of the regulated turnover of proteins required for controlling cell cycle progression. Other family members are protein modifiers that perform a wide range of functions. Ubiquitination usually results in a covalent bond between the C-terminus of ubiquitin and the epsilon-amino group of a substrate lysine. The three-step mechanism requires an activating enzyme (E1) that forms a thiol ester with the C-terminal carboxy group, a conjugating enzyme (E2) that transiently carries the activated ubiquitin molecule as a thiol ester, and a ligase (E3) that transfers the activated ubiquitin from the E2 to the substrate lysine residue. In poly-ubiquitination, ubiquitin itself is the substrate.
Probab=97.63 E-value=0.0001 Score=50.52 Aligned_cols=51 Identities=25% Similarity=0.387 Sum_probs=46.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.|+++||..|..+.|++++.+.|+++|..+.+ ..++.++++.+|+.|++..
T Consensus 18 ~tv~~l~~~i~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~i~~~~ 68 (69)
T cd00196 18 TTVADLKEKLAKKLGLPPEQQRLLVNGKILPD-SLTLEDYGLQDGDELVLVP 68 (69)
T ss_pred CcHHHHHHHHHHHHCcChHHeEEEECCeECCC-CCcHHHcCCCCCCEEEEEe
Confidence 48999999999999999999999999999987 5667899999999999863
No 67
>cd01788 ElonginB Ubiquitin-like domain of Elongin B. Elongin B is part of an E3 ubiquitin ligase complex called VEC that activates ubiquitylation by the E2 ubiquitin-conjugating enzyme Ubc5. VEC is composed of von Hippel-Lindau tumor suppressor protein (pVHL), elongin C, cullin 2, NEDD8, and Rbx1. ElonginB binds elonginC to form the elonginBC complex which is a positive regulator of RNA polymerase II elongation factor Elongin A. The BC complex then binds VHL (von Hippel-Lindau) tumour suppressor protein to form a VCB ternary complex. Elongin B has a ubiquitin-llike domain.
Probab=97.54 E-value=0.0001 Score=59.89 Aligned_cols=51 Identities=18% Similarity=0.216 Sum_probs=46.1
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCC-------CCCcEEEEEcc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGV-------RDGDIILVAMP 53 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI-------~~~d~l~l~~~ 53 (319)
||-+||.+|+.-...|++.|+|+.++.+|+| ++||++||+ +...+|-|..+
T Consensus 23 tVlelK~~iegI~k~pp~dQrL~kd~qvLeD-~kTL~d~g~t~~~akaq~pA~vgLa~r 80 (119)
T cd01788 23 TVYELKRIVEGILKRPPEDQRLYKDDQLLDD-GKTLGDCGFTSQTARPQAPATVGLAFR 80 (119)
T ss_pred cHHHHHHHHHHHhcCChhHheeecCceeecc-cccHHHcCccccccccCCCCeEEEEEe
Confidence 7999999999999999999999988888887 799999999 67888888765
No 68
>PF11543 UN_NPL4: Nuclear pore localisation protein NPL4; InterPro: IPR024682 Npl4, along with Ufd1, forms the heterodimer adaptor complex UN, which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a N-terminal ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert []. This entry represents the ubiquitin-like domain.; PDB: 2PJH_A 1WF9_A.
Probab=97.36 E-value=0.00016 Score=55.61 Aligned_cols=51 Identities=16% Similarity=0.343 Sum_probs=30.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEec--C-eec-cCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNN--G-KHL-LEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~--G-k~L-~d~~~tL~~~gI~~~d~l~l~ 51 (319)
.|+.+||++|++.+++|...|.|+.+ + ..| .++.+||+++||+.||+|+|.
T Consensus 24 ~t~~~L~~kI~~~l~~~~~~~~L~~~~~~~~~l~s~~~~tl~~lglkHGdmlyL~ 78 (80)
T PF11543_consen 24 STLSDLKEKISEQLSIPDSSQSLSKDRNNKEELKSSDSKTLSSLGLKHGDMLYLK 78 (80)
T ss_dssp SBHHHHHHHHHHHS---TTT---BSSGGGGGCSSS-TT-CCCCT---TT-EEE--
T ss_pred ccHHHHHHHHHHHcCCCCcceEEEecCCCCcccccCCcCCHHHcCCCCccEEEEe
Confidence 37999999999999999999998643 2 244 245789999999999999974
No 69
>PLN02560 enoyl-CoA reductase
Probab=97.36 E-value=0.00025 Score=67.84 Aligned_cols=50 Identities=20% Similarity=0.230 Sum_probs=43.8
Q ss_pred CCHHHHHHHHHhhhCC-CCCCeEEEec---Ce----eccCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGI-DATDMILLNN---GK----HLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gi-p~~~q~Li~~---Gk----~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
.||+|||..|+...++ ++++|+|++. |+ .|+| ++||+++|+++|++|++.
T Consensus 24 aTV~dLK~~Isk~~~~~~~~RqRL~~~~~~gk~~g~~L~d-~ktL~d~gv~~gstLy~k 81 (308)
T PLN02560 24 ATVADLKKAIHKRKKKYYPSRQRLTLPLPPGKTRPTVLDD-SKSLKDYGLGDGGTVVFK 81 (308)
T ss_pred CcHHHHHHHHHHHcCCCChhheEEEEecCCCCcCccccCC-CCCHHhcCCCCCceEEEE
Confidence 4899999999999987 8999999973 43 7777 689999999999999986
No 70
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=96.90 E-value=0.0042 Score=49.56 Aligned_cols=78 Identities=15% Similarity=0.299 Sum_probs=55.1
Q ss_pred EEEEEcCCccccccCHHHHHHcCCcccccccee-eEeec-cceeEEEeEEEEEEEEEcCEEEEeeEEEecCCCCceeech
Q psy17672 192 VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWA-GVAKG-VGVQQIIGRIHMVQVAIEKDFLTTSLSILEEQPMDMLLGL 269 (319)
Q Consensus 192 v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~-g~a~G-vg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~~~~~d~ILG~ 269 (319)
+..|+|||||.++|..+.++...|+.. +|. .+..| |=-.++-.+....-+.+.|..+...|.|+..-+...-+..
T Consensus 48 ipclidtgaq~niiteetvrahklptr---pw~~sviyggvyp~kinrkt~kl~i~lngisikteflvvkkfshpaaisf 124 (137)
T PF12382_consen 48 IPCLIDTGAQVNIITEETVRAHKLPTR---PWSQSVIYGGVYPNKINRKTIKLNINLNGISIKTEFLVVKKFSHPAAISF 124 (137)
T ss_pred ceeEEccCceeeeeehhhhhhccCCCC---cchhheEeccccccccccceEEEEEEecceEEEEEEEEEEeccCcceEEE
Confidence 357899999999999999999988733 332 13443 3344666666677889999999999999985444444443
Q ss_pred hhh
Q psy17672 270 DML 272 (319)
Q Consensus 270 D~L 272 (319)
..|
T Consensus 125 ttl 127 (137)
T PF12382_consen 125 TTL 127 (137)
T ss_pred EEE
Confidence 333
No 71
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=96.27 E-value=0.018 Score=53.96 Aligned_cols=89 Identities=21% Similarity=0.470 Sum_probs=58.2
Q ss_pred EEEEEEEc--CEEEEEEEcCCccccccCHH-HHHH--cCCccc----cccce--ee----EeeccceeEEEeEEEEEEEE
Q psy17672 181 LYINCKVN--GYPVKAFVDSGAQTTIMSAK-CAER--CNIMRL----IDTRW--AG----VAKGVGVQQIIGRIHMVQVA 245 (319)
Q Consensus 181 l~v~v~In--g~~v~aLVDTGA~~siIs~~-~a~r--lgl~~~----~~~~~--~g----~a~Gvg~~~~~g~v~~~~i~ 245 (319)
+++++.|+ +++++++||||++.+.+... |... |..... ..+.+ .+ ...|-|. ..|.+....+.
T Consensus 2 Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~~~y~~g~--~~G~~~~D~v~ 79 (317)
T PF00026_consen 2 YYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFSISYGDGS--VSGNLVSDTVS 79 (317)
T ss_dssp EEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEEEEETTEE--EEEEEEEEEEE
T ss_pred eEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeeeeeccCcc--cccccccceEe
Confidence 57899997 99999999999999988643 2222 221111 00111 11 2234333 88899999999
Q ss_pred EcCEEEE-eeEEEecC--------CCCceeechhh
Q psy17672 246 IEKDFLT-TSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 246 IG~~~~~-~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
||+..++ ..|..+.. ...|+||||-+
T Consensus 80 ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg~ 114 (317)
T PF00026_consen 80 IGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLGF 114 (317)
T ss_dssp ETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-SS
T ss_pred eeeccccccceeccccccccccccccccccccccC
Confidence 9998776 66666654 25799999983
No 72
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=96.01 E-value=0.037 Score=55.75 Aligned_cols=91 Identities=20% Similarity=0.297 Sum_probs=60.0
Q ss_pred ceEEEEEEEc--CEEEEEEEcCCccccccCHHHHH--HcCCccccc----c------ceeeEeeccceeEEEeEEEEEEE
Q psy17672 179 VMLYINCKVN--GYPVKAFVDSGAQTTIMSAKCAE--RCNIMRLID----T------RWAGVAKGVGVQQIIGRIHMVQV 244 (319)
Q Consensus 179 ~~l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~--rlgl~~~~~----~------~~~g~a~Gvg~~~~~g~v~~~~i 244 (319)
.++|.+++|+ ++++.+++|||++.+.+...-|. .|.....-+ . ....+..|-|. ..|.+....|
T Consensus 137 ~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~i~YG~Gs--v~G~~~~Dtv 214 (450)
T PTZ00013 137 IMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVDITYGSGT--VKGFFSKDLV 214 (450)
T ss_pred CEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEEEEECCce--EEEEEEEEEE
Confidence 4678999996 89999999999999999643222 222111111 0 11113455553 6888888999
Q ss_pred EEcCEEEEeeEEEec----------CCCCceeechhh
Q psy17672 245 AIEKDFLTTSLSILE----------EQPMDMLLGLDM 271 (319)
Q Consensus 245 ~IG~~~~~~~f~Vl~----------~~~~d~ILG~D~ 271 (319)
.+|+..++..|..+. ....|+||||.|
T Consensus 215 ~iG~~~~~~~f~~~~~~~~~~~~~~~~~~dGIlGLg~ 251 (450)
T PTZ00013 215 TLGHLSMPYKFIEVTDTDDLEPIYSSSEFDGILGLGW 251 (450)
T ss_pred EECCEEEccEEEEEEeccccccceecccccceecccC
Confidence 999987765554442 124799999975
No 73
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=95.91 E-value=0.044 Score=43.36 Aligned_cols=84 Identities=15% Similarity=0.258 Sum_probs=51.4
Q ss_pred EEEEEcC--EEEEEEEcCCccccccCHHHHHHcCCccc--------------cccceeeEeeccceeEEEeEEEEEEEEE
Q psy17672 183 INCKVNG--YPVKAFVDSGAQTTIMSAKCAERCNIMRL--------------IDTRWAGVAKGVGVQQIIGRIHMVQVAI 246 (319)
Q Consensus 183 v~v~Ing--~~v~aLVDTGA~~siIs~~~a~rlgl~~~--------------~~~~~~g~a~Gvg~~~~~g~v~~~~i~I 246 (319)
+++.|+. +++.+++|||++.+.+..+-+..|+.... ....+. +..|-|. ..|.+....|.|
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~~~~~~~-~~Y~~g~--~~g~~~~D~v~i 77 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSDNGCTFS-ITYGTGS--LSGGLSTDTVSI 77 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCCCCcEEE-EEeCCCe--EEEEEEEEEEEE
Confidence 4678875 89999999999999887654444432110 001111 2344443 356677778889
Q ss_pred cCEEEE-eeEEEecC--------CCCceeech
Q psy17672 247 EKDFLT-TSLSILEE--------QPMDMLLGL 269 (319)
Q Consensus 247 G~~~~~-~~f~Vl~~--------~~~d~ILG~ 269 (319)
|+..++ ..|-+... ...|+||||
T Consensus 78 g~~~~~~~~fg~~~~~~~~~~~~~~~~GilGL 109 (109)
T cd05470 78 GDIEVVGQAFGCATDEPGATFLPALFDGILGL 109 (109)
T ss_pred CCEEECCEEEEEEEecCCccccccccccccCC
Confidence 887544 44554442 246889886
No 74
>KOG0006|consensus
Probab=95.76 E-value=0.0074 Score=57.29 Aligned_cols=49 Identities=12% Similarity=0.270 Sum_probs=43.9
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|.+||+.++...|+|+++.++||.||.|.| +.|++.+-+..-+.+|++
T Consensus 25 ~I~~lke~Vak~~gvp~D~L~viFaGKeLs~-~ttv~~cDL~qqs~~hi~ 73 (446)
T KOG0006|consen 25 SIFQLKEVVAKRQGVPADQLRVIFAGKELSN-DTTVQNCDLSQQSATHIM 73 (446)
T ss_pred CHHHHHHHHHHhhCCChhheEEEEecccccc-Cceeecccccccchhhhh
Confidence 5899999999999999999999999999998 689997777777788877
No 75
>PTZ00147 plasmepsin-1; Provisional
Probab=95.59 E-value=0.075 Score=53.64 Aligned_cols=92 Identities=23% Similarity=0.318 Sum_probs=60.5
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCH-HHH-HHcCCcccccc----------ceeeEeeccceeEEEeEEEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSA-KCA-ERCNIMRLIDT----------RWAGVAKGVGVQQIIGRIHMVQ 243 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a-~rlgl~~~~~~----------~~~g~a~Gvg~~~~~g~v~~~~ 243 (319)
...+|.+++|+ ++++.+++|||++.+.+.. .|. ..|.-...-+. ....+..|-| ...|.+-...
T Consensus 137 n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~i~Yg~G--svsG~~~~Dt 214 (453)
T PTZ00147 137 NVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVEMNYVSG--TVSGFFSKDL 214 (453)
T ss_pred CCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEEEEeCCC--CEEEEEEEEE
Confidence 45788999997 8999999999999999854 332 12322111111 0111345555 3578888889
Q ss_pred EEEcCEEEEeeEEEec----------CCCCceeechhh
Q psy17672 244 VAIEKDFLTTSLSILE----------EQPMDMLLGLDM 271 (319)
Q Consensus 244 i~IG~~~~~~~f~Vl~----------~~~~d~ILG~D~ 271 (319)
|.||+..++..|..+. ....|+||||-|
T Consensus 215 VtiG~~~v~~qF~~~~~~~~f~~~~~~~~~DGILGLG~ 252 (453)
T PTZ00147 215 VTIGNLSVPYKFIEVTDTNGFEPFYTESDFDGIFGLGW 252 (453)
T ss_pred EEECCEEEEEEEEEEEeccCcccccccccccceecccC
Confidence 9999987775554432 124799999976
No 76
>KOG1872|consensus
Probab=95.38 E-value=0.022 Score=56.71 Aligned_cols=51 Identities=6% Similarity=0.066 Sum_probs=46.7
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
|...||+.+.+.+|+||+.|++.+.|+.+.|| -.+...+|++|.+|+|+-.
T Consensus 25 t~~vlKaqlf~LTgV~PeRQKv~vKGg~a~dd-~~~~al~iKpn~~lmMmGt 75 (473)
T KOG1872|consen 25 TPSVLKAQLFALTGVPPERQKVMVKGGLAKDD-VDWGALQIKPNETLMMMGT 75 (473)
T ss_pred chHHHHHHHHHhcCCCccceeEEEeccccccc-ccccccccCCCCEEEeecc
Confidence 67899999999999999999999999999984 4788999999999999854
No 77
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=95.28 E-value=0.063 Score=49.65 Aligned_cols=91 Identities=19% Similarity=0.171 Sum_probs=53.9
Q ss_pred EEEcCEEE---------------EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccce-eEEEeEEEEEEEEEcC
Q psy17672 185 CKVNGYPV---------------KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGV-QQIIGRIHMVQVAIEK 248 (319)
Q Consensus 185 v~Ing~~v---------------~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~-~~~~g~v~~~~i~IG~ 248 (319)
++|+++.+ .++||||++.+.++.++. + .+.-...| |. ..+...-... ..+
T Consensus 155 i~v~~~~~~~~~~~~~~~~~~~~~ai~DTGTs~~~lp~~~~-----P-----~i~~~f~~-~~~~~i~~~~y~~--~~~- 220 (265)
T cd05476 155 ISVGGKRLPIPPSVFAIDSDGSGGTIIDSGTTLTYLPDPAY-----P-----DLTLHFDG-GADLELPPENYFV--DVG- 220 (265)
T ss_pred EEECCEEecCCchhcccccCCCCcEEEeCCCcceEcCcccc-----C-----CEEEEECC-CCEEEeCcccEEE--ECC-
Confidence 57787764 389999999999998887 1 11111121 11 1111000000 111
Q ss_pred EEEEeeEEEecC-CCCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 249 DFLTTSLSILEE-QPMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 249 ~~~~~~f~Vl~~-~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
....| +..+.. ...-.|||..||+.+-.+.|+.+++|-|..
T Consensus 221 ~~~~C-~~~~~~~~~~~~ilG~~fl~~~~~vFD~~~~~iGfa~ 262 (265)
T cd05476 221 EGVVC-LAILSSSSGGVSILGNIQQQNFLVEYDLENSRLGFAP 262 (265)
T ss_pred CCCEE-EEEecCCCCCcEEEChhhcccEEEEEECCCCEEeeec
Confidence 11123 223332 345689999999999999999999998865
No 78
>PF08817 YukD: WXG100 protein secretion system (Wss), protein YukD; InterPro: IPR014921 YukD is a bacterial protein that adopts a ubiquitin-like fold []. Ubiquitin covalently binds to protein and flags them for protein degradation, however conjugation assays have indicated that YukD lacks the capacity for covalent bond formation with other proteins []. ; PDB: 2BPS_B.
Probab=95.28 E-value=0.022 Score=43.40 Aligned_cols=49 Identities=27% Similarity=0.434 Sum_probs=31.3
Q ss_pred CHHHHHHHHHhhhCCCC---C---CeEEE-ecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDA---T---DMILL-NNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~---~---~q~Li-~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|.+|...|....+.+. . .-+|. -+|..|.+ +.||+++||.|||+|+|.
T Consensus 24 pv~~li~~l~~~~~~~~~~~~~~~~~~L~~~~g~~L~~-~~tL~~~gV~dGd~L~L~ 79 (79)
T PF08817_consen 24 PVAELIPELVELLGLPGDDPPGHGQWVLARAGGRPLDP-DQTLADAGVRDGDVLVLR 79 (79)
T ss_dssp BTTHHHHHHHHHS---S---TT-E-EEEG-GGTEEEET-TSBCGGGT--TT-EEEE-
T ss_pred cHHHHHHHHHHHhCCccCCCCCcceEEEEecCCcccCC-cCcHhHcCCCCCCEEEeC
Confidence 56667766666655432 2 35666 67888987 689999999999999874
No 79
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=95.09 E-value=0.069 Score=45.88 Aligned_cols=101 Identities=20% Similarity=0.275 Sum_probs=65.6
Q ss_pred EEEEcCE--EEEEEEcCCccccccCHHHHHHc---C-------Ccc---cccc------ceee--Eeec-cceeEEEeEE
Q psy17672 184 NCKVNGY--PVKAFVDSGAQTTIMSAKCAERC---N-------IMR---LIDT------RWAG--VAKG-VGVQQIIGRI 239 (319)
Q Consensus 184 ~v~Ing~--~v~aLVDTGA~~siIs~~~a~rl---g-------l~~---~~~~------~~~g--~a~G-vg~~~~~g~v 239 (319)
.+.++|- .++|=+||||..|.++..-.++. | +.. .... +..+ ...+ .|.....--|
T Consensus 30 ~~~l~~l~~~~kAkiDTGA~TSsL~A~dI~~fkRdGe~WVRF~~~~~~~~~~~~~~~e~pvi~~ikvR~s~~~~~e~RpV 109 (162)
T COG4067 30 WVSLPGLKIQLKAKIDTGAVTSSLSASDIERFKRDGERWVRFRLADTDNLDQRSEECEAPVIRKIKVRSSSGSRAERRPV 109 (162)
T ss_pred EEEcCcccceeeeeecccceeeeEEeecceeeeeCCceEEEEEeecccCccccceeeccceEEEEEEecCCCCccccccE
Confidence 4555665 47999999999999987654433 2 110 0000 0011 0111 1110111224
Q ss_pred EEEEEEEcCEEEEeeEEEecCCC--CceeechhhhhhccEEEEccCC
Q psy17672 240 HMVQVAIEKDFLTTSLSILEEQP--MDMLLGLDMLRRHECCIDLRKN 284 (319)
Q Consensus 240 ~~~~i~IG~~~~~~~f~Vl~~~~--~d~ILG~D~L~~~~~~ID~~~~ 284 (319)
-...+.+|+...+..|+.-|..+ |.+|||--+|+.+++.+|....
T Consensus 110 V~~~l~lG~~~~~~E~tLtDR~~m~Yp~LlGrk~l~~~~~~VDpSr~ 156 (162)
T COG4067 110 VRLTLCLGGRILPIEFTLTDRSNMRYPVLLGRKALRHFGAVVDPSRK 156 (162)
T ss_pred EEEEEeeCCeeeeEEEEeecccccccceEecHHHHhhCCeEECchhh
Confidence 45689999999999999999654 8999999999999999998764
No 80
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=95.08 E-value=0.099 Score=48.62 Aligned_cols=89 Identities=15% Similarity=0.118 Sum_probs=56.0
Q ss_pred EEEEEEcC--EEEEEEEcCCccccccCHHHHHHcCCc------ccc-------ccceeeEeeccceeEEEeEEEEEEEEE
Q psy17672 182 YINCKVNG--YPVKAFVDSGAQTTIMSAKCAERCNIM------RLI-------DTRWAGVAKGVGVQQIIGRIHMVQVAI 246 (319)
Q Consensus 182 ~v~v~Ing--~~v~aLVDTGA~~siIs~~~a~rlgl~------~~~-------~~~~~g~a~Gvg~~~~~g~v~~~~i~I 246 (319)
+++++|+. +++.++||||++.+.+..+-+..|+.. ... +.....+..|-|+ ...|.+....|.+
T Consensus 2 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~i~Y~~G~-~~~G~~~~D~v~i 80 (278)
T cd06097 2 LTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWSISYGDGS-SASGIVYTDTVSI 80 (278)
T ss_pred eeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEEEEeCCCC-eEEEEEEEEEEEE
Confidence 67899988 899999999999999865433333211 000 0001112344332 3678888889999
Q ss_pred cCEEEE-eeEEEec--------CCCCceeechhh
Q psy17672 247 EKDFLT-TSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 247 G~~~~~-~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
|+..++ ..|.+.. ....|+||||-+
T Consensus 81 g~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 114 (278)
T cd06097 81 GGVEVPNQAIELATAVSASFFSDTASDGLLGLAF 114 (278)
T ss_pred CCEEECCeEEEEEeecCccccccccccceeeecc
Confidence 997654 3444433 135899999964
No 81
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=94.66 E-value=0.26 Score=45.86 Aligned_cols=75 Identities=15% Similarity=0.231 Sum_probs=55.1
Q ss_pred eEEEEEEEcC--EEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEE-eeEE
Q psy17672 180 MLYINCKVNG--YPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLT-TSLS 256 (319)
Q Consensus 180 ~l~v~v~Ing--~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~-~~f~ 256 (319)
.+++++.|+. +++.++||||++.+.+. .+. +..|-| ....|.+....+.+|+..++ ..|.
T Consensus 2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~~-~~Y~~g-~~~~G~~~~D~v~~g~~~~~~~~fg 64 (295)
T cd05474 2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP---------------DFS-ISYGDG-TSASGTWGTDTVSIGGATVKNLQFA 64 (295)
T ss_pred eEEEEEEECCCCcEEEEEEeCCCCcceee---------------eeE-EEeccC-CcEEEEEEEEEEEECCeEecceEEE
Confidence 4688999987 89999999999999999 121 333432 35678888889999987665 4565
Q ss_pred Eec-CCCCceeechhh
Q psy17672 257 ILE-EQPMDMLLGLDM 271 (319)
Q Consensus 257 Vl~-~~~~d~ILG~D~ 271 (319)
+.. ....|+||||.+
T Consensus 65 ~~~~~~~~~GilGLg~ 80 (295)
T cd05474 65 VANSTSSDVGVLGIGL 80 (295)
T ss_pred EEecCCCCcceeeECC
Confidence 555 345889999764
No 82
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=94.59 E-value=0.23 Score=47.13 Aligned_cols=92 Identities=16% Similarity=0.270 Sum_probs=57.7
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCHHHHHH--cCCccccc----ccee------eEeeccceeEEEeEEEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSAKCAER--CNIMRLID----TRWA------GVAKGVGVQQIIGRIHMVQ 243 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~r--lgl~~~~~----~~~~------g~a~Gvg~~~~~g~v~~~~ 243 (319)
...+++++.|+ .+++.++||||++.+.+...-|.. |.-....+ +.+. ....|-|. ..|.+....
T Consensus 8 ~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~~~~~~~~yg~gs--~~G~~~~D~ 85 (317)
T cd05478 8 DMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQSTGQPLSIQYGTGS--MTGILGYDT 85 (317)
T ss_pred CCEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeCCcEEEEEECCce--EEEEEeeeE
Confidence 45788999997 688999999999999996432222 22111111 0010 12345443 578888889
Q ss_pred EEEcCEEEE-eeEEEecC--------CCCceeechhh
Q psy17672 244 VAIEKDFLT-TSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 244 i~IG~~~~~-~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
|.||+..++ ..|.+... ...|+||||.+
T Consensus 86 v~ig~~~i~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 122 (317)
T cd05478 86 VQVGGISDTNQIFGLSETEPGSFFYYAPFDGILGLAY 122 (317)
T ss_pred EEECCEEECCEEEEEEEecCccccccccccceeeecc
Confidence 999987654 34444431 13789999975
No 83
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=94.38 E-value=0.35 Score=46.16 Aligned_cols=92 Identities=17% Similarity=0.275 Sum_probs=58.3
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCH-HHHH---HcCCccccc----cce------eeEeeccceeEEEeEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSA-KCAE---RCNIMRLID----TRW------AGVAKGVGVQQIIGRIHM 241 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a~---rlgl~~~~~----~~~------~g~a~Gvg~~~~~g~v~~ 241 (319)
....+++++|+ +++++++||||++.+.+.. .|.. .|.....-+ +.+ ..+..|-| .+.|.+..
T Consensus 6 ~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~~~Yg~g--~~~G~~~~ 83 (326)
T cd05487 6 DTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFTIHYASG--TVKGFLSQ 83 (326)
T ss_pred CCeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEEEEeCCc--eEEEEEee
Confidence 35678999997 8999999999999999954 3321 233221111 101 11234544 36888888
Q ss_pred EEEEEcCEEEEeeEEEecC--------CCCceeechhh
Q psy17672 242 VQVAIEKDFLTTSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 242 ~~i~IG~~~~~~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
..|.+|+......|.++.. ...|+||||.+
T Consensus 84 D~v~~g~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 121 (326)
T cd05487 84 DIVTVGGIPVTQMFGEVTALPAIPFMLAKFDGVLGMGY 121 (326)
T ss_pred eEEEECCEEeeEEEEEEEeccCCccceeecceEEecCC
Confidence 8999998755433333321 24799999975
No 84
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=94.20 E-value=0.17 Score=46.73 Aligned_cols=76 Identities=16% Similarity=0.267 Sum_probs=53.6
Q ss_pred EEEEEEEc--CEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCE--EEE-eeE
Q psy17672 181 LYINCKVN--GYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKD--FLT-TSL 255 (319)
Q Consensus 181 l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~--~~~-~~f 255 (319)
++++++|+ .+++.+++|||++.+.+.- | .+. +..|- ...+.|.+....+.+|+. .++ ..|
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-----~--------~~~-~~Y~d-g~~~~G~~~~D~v~~g~~~~~~~~~~F 66 (265)
T cd05476 2 YLVTLSIGTPPQPFSLIVDTGSDLTWTQC-----C--------SYE-YSYGD-GSSTSGVLATETFTFGDSSVSVPNVAF 66 (265)
T ss_pred eEEEEecCCCCcceEEEecCCCCCEEEcC-----C--------ceE-eEeCC-CceeeeeEEEEEEEecCCCCccCCEEE
Confidence 57889998 7899999999999999863 1 121 23332 235678787888899987 443 456
Q ss_pred EEecCC------CCceeechhh
Q psy17672 256 SILEEQ------PMDMLLGLDM 271 (319)
Q Consensus 256 ~Vl~~~------~~d~ILG~D~ 271 (319)
-+.... ..|+||||.+
T Consensus 67 g~~~~~~~~~~~~~~GIlGLg~ 88 (265)
T cd05476 67 GCGTDNEGGSFGGADGILGLGR 88 (265)
T ss_pred EecccccCCccCCCCEEEECCC
Confidence 555532 4899999976
No 85
>KOG3206|consensus
Probab=94.13 E-value=0.13 Score=46.36 Aligned_cols=54 Identities=28% Similarity=0.269 Sum_probs=47.0
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEE-EecC-----eeccCCcccccccCCCCCcEEEEEccC
Q psy17672 1 MDVEQFRLICQDECGIDATDMIL-LNNG-----KHLLEDGSCLKQAGVRDGDIILVAMPG 54 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~L-i~~G-----k~L~d~~~tL~~~gI~~~d~l~l~~~~ 54 (319)
|||++||.+|+-.+|.+++.+.| +|.| -.|.|++..|..|+..||-.||++-..
T Consensus 23 ltl~q~K~KLe~~~G~~~~~M~l~l~~~~d~~~~~lsn~d~~lg~~~~~Dg~rihviD~~ 82 (234)
T KOG3206|consen 23 LTLAQFKDKLELLTGTEAESMELELYDGDDKKVSALSNEDADLGFYKVEDGLRIHVIDSN 82 (234)
T ss_pred CcHHHHHhhhhhhhCCCccceEEEEEcCCCceeeeccCCcccccccCCCCceEEEEEecC
Confidence 79999999999999999999998 4766 356676789999999999999998653
No 86
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=94.02 E-value=0.083 Score=44.82 Aligned_cols=45 Identities=13% Similarity=0.250 Sum_probs=31.0
Q ss_pred EEEEEEcCEEEEeeEEEecCCC--CceeechhhhhhccEEEEccCCE
Q psy17672 241 MVQVAIEKDFLTTSLSILEEQP--MDMLLGLDMLRRHECCIDLRKNV 285 (319)
Q Consensus 241 ~~~i~IG~~~~~~~f~Vl~~~~--~d~ILG~D~L~~~~~~ID~~~~~ 285 (319)
...+.||+..++..|+.-+... +.+|||.--+.+..+.+|..+..
T Consensus 87 ~~~~~lg~~~~~~e~tL~dR~~m~yp~LlGrR~~l~~~~lVD~s~~~ 133 (138)
T PF05618_consen 87 ETTLCLGGKTWKIEFTLTDRSNMKYPMLLGRRNFLRGRFLVDVSRSF 133 (138)
T ss_dssp EEEEEETTEEEEEEEEEE-S--SS-SEEE-HHHHHHTTEEEETT---
T ss_pred EEEEEECCEEEEEEEEEcCCCcCcCCEEEEehHHhcCCEEECCChhh
Confidence 5689999999999999998654 89999955555667888887653
No 87
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=93.84 E-value=0.41 Score=45.35 Aligned_cols=90 Identities=16% Similarity=0.294 Sum_probs=58.7
Q ss_pred ceEEEEEEEc--CEEEEEEEcCCccccccCHHHHH--HcCCccccc-----------cceeeEeeccceeEEEeEEEEEE
Q psy17672 179 VMLYINCKVN--GYPVKAFVDSGAQTTIMSAKCAE--RCNIMRLID-----------TRWAGVAKGVGVQQIIGRIHMVQ 243 (319)
Q Consensus 179 ~~l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~--rlgl~~~~~-----------~~~~g~a~Gvg~~~~~g~v~~~~ 243 (319)
..+++++.|+ .+++.++||||++.+.+...-+. .|+-....+ ..+. +..|-|. ..|.+....
T Consensus 2 ~~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~-~~Yg~Gs--~~G~~~~D~ 78 (318)
T cd05477 2 MSYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFS-LQYGSGS--LTGIFGYDT 78 (318)
T ss_pred cEEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEE-EEECCcE--EEEEEEeeE
Confidence 3578899997 48999999999999998643222 233211111 1111 3345453 578888889
Q ss_pred EEEcCEEEE-eeEEEecC--------CCCceeechhh
Q psy17672 244 VAIEKDFLT-TSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 244 i~IG~~~~~-~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
|.+|+..++ ..|.+... ...|+||||-+
T Consensus 79 i~~g~~~i~~~~Fg~~~~~~~~~~~~~~~~GilGLg~ 115 (318)
T cd05477 79 VTVQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAY 115 (318)
T ss_pred EEECCEEEcCEEEEEEEecccccccccceeeEeecCc
Confidence 999998765 45665543 13689999974
No 88
>KOG1769|consensus
Probab=93.83 E-value=0.15 Score=40.70 Aligned_cols=50 Identities=18% Similarity=0.276 Sum_probs=43.1
Q ss_pred HHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 3 VEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 3 v~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+..|+..-....|++..+.+++|+|+.+.+ ..|=.+++..+||.|-++..
T Consensus 43 LkKLM~aYc~r~Gl~~~s~RFlFdG~rI~~-~~TP~~L~mEd~D~Iev~~~ 92 (99)
T KOG1769|consen 43 LKKLMKAYCERQGLSMNSLRFLFDGQRIRE-THTPADLEMEDGDEIEVVQE 92 (99)
T ss_pred HHHHHHHHHHHcCCccceEEEEECCcCcCC-CCChhhhCCcCCcEEEEEee
Confidence 445666667788999999999999999987 68999999999999998853
No 89
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=93.83 E-value=0.42 Score=45.41 Aligned_cols=92 Identities=16% Similarity=0.255 Sum_probs=58.7
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCH-HHH--HHcCCccccc----cce------eeEeeccceeEEEeEEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSA-KCA--ERCNIMRLID----TRW------AGVAKGVGVQQIIGRIHMV 242 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a--~rlgl~~~~~----~~~------~g~a~Gvg~~~~~g~v~~~ 242 (319)
...++++++|+ ++++++++|||++...+.. .|. ..|.....-+ ..+ ..+..|-|. +.|.+...
T Consensus 8 ~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~D 85 (317)
T cd06098 8 DAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTGS--ISGFFSQD 85 (317)
T ss_pred CCEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCce--EEEEEEee
Confidence 45689999998 7899999999999888854 332 2343221111 001 113345553 57888888
Q ss_pred EEEEcCEEEE-eeEEEec--------CCCCceeechhh
Q psy17672 243 QVAIEKDFLT-TSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 243 ~i~IG~~~~~-~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
.|.||+..++ ..|.+.. ....|+||||-+
T Consensus 86 ~v~ig~~~v~~~~f~~~~~~~~~~~~~~~~dGilGLg~ 123 (317)
T cd06098 86 SVTVGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLGF 123 (317)
T ss_pred EEEECCEEECCEEEEEEEecCCccccccccceeccccc
Confidence 9999997655 3444332 124699999975
No 90
>KOG4495|consensus
Probab=93.70 E-value=0.045 Score=43.32 Aligned_cols=41 Identities=15% Similarity=0.153 Sum_probs=34.6
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEe-cC-eeccCCcccccccCCC
Q psy17672 2 DVEQFRLICQDECGIDATDMILLN-NG-KHLLEDGSCLKQAGVR 43 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~-~G-k~L~d~~~tL~~~gI~ 43 (319)
||-+||.+++.-..-|++.|+|+. +- ..|.| .+||+++|..
T Consensus 23 tV~elK~~l~gi~~~Pvn~qrL~kmd~eqlL~D-~ktL~d~gft 65 (110)
T KOG4495|consen 23 TVFELKRKLEGILKRPVNEQRLYKMDTEQLLDD-GKTLGDCGFT 65 (110)
T ss_pred cHHHHHHHHHHHHhCCCcchheeecCHHHHhhc-cchhhhcccc
Confidence 789999999999999999999986 54 44555 8999999653
No 91
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=93.65 E-value=0.44 Score=45.20 Aligned_cols=102 Identities=16% Similarity=0.100 Sum_probs=61.3
Q ss_pred EEEEcCEEE------EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEEee-
Q psy17672 184 NCKVNGYPV------KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLTTS- 254 (319)
Q Consensus 184 ~v~Ing~~v------~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~~~- 254 (319)
.++|||+.+ .++||||++.+.++.+...++--.. ..... ..|.-...+. .......+.+++..+.++
T Consensus 193 ~v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~~--~~~~~--~~~~~~~~C~~~~~~P~~~f~f~g~~~~i~~ 268 (317)
T cd05478 193 SVTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSDI--GASQN--QNGEMVVNCSSISSMPDVVFTINGVQYPLPP 268 (317)
T ss_pred EEEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHHh--CCccc--cCCcEEeCCcCcccCCcEEEEECCEEEEECH
Confidence 367888865 6899999999999999887752110 00000 0010000111 112234556666554432
Q ss_pred ------------EEEecCC-CCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 255 ------------LSILEEQ-PMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 255 ------------f~Vl~~~-~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
..+.... ....|||-.||+.+-.+.|+.+++|-|.
T Consensus 269 ~~y~~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~vfD~~~~~iG~A 316 (317)
T cd05478 269 SAYILQDQGSCTSGFQSMGLGELWILGDVFIRQYYSVFDRANNKVGLA 316 (317)
T ss_pred HHheecCCCEEeEEEEeCCCCCeEEechHHhcceEEEEeCCCCEEeec
Confidence 2222211 2458999999999999999999998774
No 92
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=93.62 E-value=0.074 Score=42.45 Aligned_cols=39 Identities=13% Similarity=0.188 Sum_probs=30.9
Q ss_pred CCHHHHHHHHHhhh--CCCCCCeEEEecCeeccCCccccccc
Q psy17672 1 MDVEQFRLICQDEC--GIDATDMILLNNGKHLLEDGSCLKQA 40 (319)
Q Consensus 1 ~tv~~lK~~i~~e~--gip~~~q~Li~~Gk~L~d~~~tL~~~ 40 (319)
.||..||.+|.... ...-..++|||+||.|.| +..|+..
T Consensus 24 ~Tv~~LK~lIR~~~p~~~s~~rLRlI~~Gr~L~d-~t~l~~~ 64 (97)
T PF10302_consen 24 TTVAWLKQLIRERLPPEPSRRRLRLIYAGRLLND-HTDLSSE 64 (97)
T ss_pred ccHHHHHHHHHhhcCCCCccccEEeeecCcccCc-cchhhhh
Confidence 38999999999887 455567899999999998 5555544
No 93
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=93.49 E-value=0.4 Score=45.56 Aligned_cols=91 Identities=15% Similarity=0.272 Sum_probs=58.1
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCHH-HH-HHcCCcccc-----------ccceeeEeeccceeEEEeEEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSAK-CA-ERCNIMRLI-----------DTRWAGVAKGVGVQQIIGRIHMV 242 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~~-~a-~rlgl~~~~-----------~~~~~g~a~Gvg~~~~~g~v~~~ 242 (319)
...+++++.|+ ++++.+++|||++.+.+... |. ..|.....- ...+ .+..|-| ...|.+...
T Consensus 8 ~~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~-~~~y~~g--~~~G~~~~D 84 (320)
T cd05488 8 NAQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEF-KIQYGSG--SLEGFVSQD 84 (320)
T ss_pred CCEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEE-EEEECCc--eEEEEEEEe
Confidence 35689999997 58999999999999988543 32 123322111 1111 1234444 357888888
Q ss_pred EEEEcCEEEE-eeEEEec--------CCCCceeechhh
Q psy17672 243 QVAIEKDFLT-TSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 243 ~i~IG~~~~~-~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
.+.||+..++ ..|.... ....|+||||.+
T Consensus 85 ~v~ig~~~~~~~~f~~a~~~~g~~~~~~~~dGilGLg~ 122 (320)
T cd05488 85 TLSIGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLAY 122 (320)
T ss_pred EEEECCEEECCEEEEEEecCCCcceeeeeeceEEecCC
Confidence 9999987665 3444442 124699999985
No 94
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=93.09 E-value=0.42 Score=45.28 Aligned_cols=88 Identities=17% Similarity=0.335 Sum_probs=54.8
Q ss_pred EEEEEEEc--CEEEEEEEcCCccccccCH-HHH-HHcCCccccc-----------cceeeEeeccceeEEEeEEEEEEEE
Q psy17672 181 LYINCKVN--GYPVKAFVDSGAQTTIMSA-KCA-ERCNIMRLID-----------TRWAGVAKGVGVQQIIGRIHMVQVA 245 (319)
Q Consensus 181 l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a-~rlgl~~~~~-----------~~~~g~a~Gvg~~~~~g~v~~~~i~ 245 (319)
+++++.|+ +++++++||||++...+.. .|. ..|.-...-+ ..+ .+..|-| ...|.+....|.
T Consensus 1 Y~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~-~i~Yg~g--~~~G~~~~D~v~ 77 (316)
T cd05486 1 YFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAF-SIQYGTG--SLTGIIGIDQVT 77 (316)
T ss_pred CeEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEE-EEEeCCc--EEEEEeeecEEE
Confidence 36788897 7899999999999998853 332 1232211111 111 1334555 357888888899
Q ss_pred EcCEEEE-eeEEEec--------CCCCceeechhh
Q psy17672 246 IEKDFLT-TSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 246 IG~~~~~-~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
||+..++ ..|.+.. ....|+||||-+
T Consensus 78 ig~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 112 (316)
T cd05486 78 VEGITVQNQQFAESVSEPGSTFQDSEFDGILGLAY 112 (316)
T ss_pred ECCEEEcCEEEEEeeccCcccccccccceEeccCc
Confidence 9986554 3444321 235799999975
No 95
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=93.06 E-value=0.59 Score=44.30 Aligned_cols=102 Identities=19% Similarity=0.166 Sum_probs=60.2
Q ss_pred EEEcCEEE-------EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEEee-
Q psy17672 185 CKVNGYPV-------KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLTTS- 254 (319)
Q Consensus 185 v~Ing~~v-------~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~~~- 254 (319)
+.|+|..+ .++||||.+.+.++.+.++++--. ...... ..|.-...+. .......+.+++..+.+.
T Consensus 188 i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~~--~~~~~~~~C~~~~~~p~l~~~f~g~~~~v~~ 263 (318)
T cd05477 188 FQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQS--IGAQQD--QYGQYVVNCNNIQNLPTLTFTINGVSFPLPP 263 (318)
T ss_pred EEECCEEecccCCCceeeECCCCccEECCHHHHHHHHHH--hCCccc--cCCCEEEeCCccccCCcEEEEECCEEEEECH
Confidence 56787753 589999999999999888775211 011000 0110000110 112234556666544422
Q ss_pred ------------EEEecC------CCCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 255 ------------LSILEE------QPMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 255 ------------f~Vl~~------~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
+.+.+. .....|||..||+.+-.+.|+.++++-|..
T Consensus 264 ~~y~~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~ 317 (318)
T cd05477 264 SAYILQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT 317 (318)
T ss_pred HHeEecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence 222211 123589999999999999999999988753
No 96
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=92.95 E-value=0.54 Score=43.73 Aligned_cols=96 Identities=17% Similarity=0.227 Sum_probs=56.5
Q ss_pred EEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEE-EEEEEEEcCEEEEee----------------
Q psy17672 192 VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRI-HMVQVAIEKDFLTTS---------------- 254 (319)
Q Consensus 192 v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v-~~~~i~IG~~~~~~~---------------- 254 (319)
..++||||++.+.++.+.++.+--. ....+.. ..+.....|.-.. ..+.+.+++..+.++
T Consensus 179 ~~~iiDSGt~~~~lP~~~~~~l~~~--~~~~~~~-~~~~~~~~C~~~~~p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~~ 255 (295)
T cd05474 179 LPALLDSGTTLTYLPSDIVDAIAKQ--LGATYDS-DEGLYVVDCDAKDDGSLTFNFGGATISVPLSDLVLPASTDDGGDG 255 (295)
T ss_pred ccEEECCCCccEeCCHHHHHHHHHH--hCCEEcC-CCcEEEEeCCCCCCCEEEEEECCeEEEEEHHHhEeccccCCCCCC
Confidence 5889999999999999988875211 0111000 0011111110000 245566666444322
Q ss_pred ---EEEecCCCCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 255 ---LSILEEQPMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 255 ---f~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
+.+........|||..||+.+-.+.|+.++++-|..
T Consensus 256 ~C~~~i~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~a~ 294 (295)
T cd05474 256 ACYLGIQPSTSDYNILGDTFLRSAYVVYDLDNNEISLAQ 294 (295)
T ss_pred CeEEEEEeCCCCcEEeChHHhhcEEEEEECCCCEEEeec
Confidence 222222224589999999999999999999998754
No 97
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=92.93 E-value=0.52 Score=45.11 Aligned_cols=93 Identities=16% Similarity=0.282 Sum_probs=59.1
Q ss_pred ccceEEEEEEEc--CEEEEEEEcCCccccccCHH-HH---HHcCCccccc----ccee------eEeeccceeEEEeEEE
Q psy17672 177 SVVMLYINCKVN--GYPVKAFVDSGAQTTIMSAK-CA---ERCNIMRLID----TRWA------GVAKGVGVQQIIGRIH 240 (319)
Q Consensus 177 ~~~~l~v~v~In--g~~v~aLVDTGA~~siIs~~-~a---~rlgl~~~~~----~~~~------g~a~Gvg~~~~~g~v~ 240 (319)
....+++++.|+ ++++.+++|||++.+.+... |. ..|+.....+ +.+. .+..|.|. ..|.+.
T Consensus 8 ~~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~--~~G~~~ 85 (329)
T cd05485 8 MDAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGS--LSGFLS 85 (329)
T ss_pred cCCeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCce--EEEEEe
Confidence 456789999998 68999999999999988543 32 1343321111 1111 12345553 578888
Q ss_pred EEEEEEcCEEEE-eeEEEecC--------CCCceeechhh
Q psy17672 241 MVQVAIEKDFLT-TSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 241 ~~~i~IG~~~~~-~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
...+.||+..++ ..|.+... ...|+||||-+
T Consensus 86 ~D~v~ig~~~~~~~~fg~~~~~~~~~~~~~~~~GilGLg~ 125 (329)
T cd05485 86 TDTVSVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMGY 125 (329)
T ss_pred cCcEEECCEEECCEEEEEEEecCCccccccccceEEEcCC
Confidence 889999987654 34444321 23699999875
No 98
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=92.84 E-value=0.33 Score=46.32 Aligned_cols=89 Identities=13% Similarity=0.103 Sum_probs=51.0
Q ss_pred EEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEE--EEEcCEEEEeeEEEecCCCCceeech
Q psy17672 192 VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQ--VAIEKDFLTTSLSILEEQPMDMLLGL 269 (319)
Q Consensus 192 v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~--i~IG~~~~~~~f~Vl~~~~~d~ILG~ 269 (319)
..++||||.+.++++.+..+++--.. ..+.-...+ |. ++. +.... +...+. .|-+.. .....-.|||-
T Consensus 232 ~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~-g~-~~~--i~p~~y~~~~~~~--~c~~~~-~~~~~~~ILG~ 301 (326)
T cd06096 232 LGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFEN-NL-KID--WKPSSYLYKKESF--WCKGGE-KSVSNKPILGA 301 (326)
T ss_pred CCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcC-Cc-EEE--ECHHHhccccCCc--eEEEEE-ecCCCceEECh
Confidence 46899999999999999988863211 111101111 11 000 00000 011111 121222 22223579999
Q ss_pred hhhhhccEEEEccCCEEEEcc
Q psy17672 270 DMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 270 D~L~~~~~~ID~~~~~L~i~~ 290 (319)
.||+.+-.+.|+.+++|-|..
T Consensus 302 ~flr~~y~vFD~~~~riGfa~ 322 (326)
T cd06096 302 SFFKNKQIIFDLDNNRIGFVE 322 (326)
T ss_pred HHhcCcEEEEECcCCEEeeEc
Confidence 999999999999999998864
No 99
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=92.76 E-value=0.5 Score=44.91 Aligned_cols=92 Identities=14% Similarity=0.248 Sum_probs=58.1
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccCH-HHH---HHcCCccccc----cce------eeEeeccceeEEEeEEEE
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMSA-KCA---ERCNIMRLID----TRW------AGVAKGVGVQQIIGRIHM 241 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a---~rlgl~~~~~----~~~------~g~a~Gvg~~~~~g~v~~ 241 (319)
...+++++.|+ ++++.+++|||++.+.+.. .|. ..|+....-+ ..+ ..+..|-|. ..|.+..
T Consensus 4 ~~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~ 81 (325)
T cd05490 4 DAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFAIQYGSGS--LSGYLSQ 81 (325)
T ss_pred CCEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEEEEECCcE--EEEEEee
Confidence 34678999997 5889999999999998843 332 1343221111 011 113455553 5788888
Q ss_pred EEEEEcCEEEE-eeEEEecC--------CCCceeechhh
Q psy17672 242 VQVAIEKDFLT-TSLSILEE--------QPMDMLLGLDM 271 (319)
Q Consensus 242 ~~i~IG~~~~~-~~f~Vl~~--------~~~d~ILG~D~ 271 (319)
..|.+|+..++ ..|.+... ...|+||||-+
T Consensus 82 D~v~~g~~~~~~~~Fg~~~~~~~~~~~~~~~dGilGLg~ 120 (325)
T cd05490 82 DTVSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY 120 (325)
T ss_pred eEEEECCEEEcCEEEEEEeeccCCcccceeeeEEEecCC
Confidence 99999997655 34543321 23699999964
No 100
>PTZ00165 aspartyl protease; Provisional
Probab=92.69 E-value=0.8 Score=46.65 Aligned_cols=99 Identities=17% Similarity=0.230 Sum_probs=60.9
Q ss_pred CCCCcc--ccceEEEEEEEcC--EEEEEEEcCCccccccCHHHHHH--cCCccccc----ccee-----------eEeec
Q psy17672 171 NPETFG--SVVMLYINCKVNG--YPVKAFVDSGAQTTIMSAKCAER--CNIMRLID----TRWA-----------GVAKG 229 (319)
Q Consensus 171 ~pe~~~--~~~~l~v~v~Ing--~~v~aLVDTGA~~siIs~~~a~r--lgl~~~~~----~~~~-----------g~a~G 229 (319)
.||.+. ....++.++.|+. +++++++|||++...+...-+.. |.....-+ +.+. -+..|
T Consensus 109 ~~~~l~n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~~~~~yd~s~SSTy~~~~~~~~~~~~~i~YG 188 (482)
T PTZ00165 109 LQQDLLNFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCAPHRKFDPKKSSTYTKLKLGDESAETYIQYG 188 (482)
T ss_pred cceecccccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccccccCCCCccccCCcEecCCCCccceEEEEeC
Confidence 344443 5567899999986 89999999999999885433322 22111111 0110 12345
Q ss_pred cceeEEEeEEEEEEEEEcCEEEE-eeEEEec--------CCCCceeechhh
Q psy17672 230 VGVQQIIGRIHMVQVAIEKDFLT-TSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 230 vg~~~~~g~v~~~~i~IG~~~~~-~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
-|. ..|.+-...|.+|+..++ ..|-+.. ....|+||||-|
T Consensus 189 sGs--~~G~l~~DtV~ig~l~i~~q~FG~a~~~s~~~f~~~~~DGILGLg~ 237 (482)
T PTZ00165 189 TGE--CVLALGKDTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGF 237 (482)
T ss_pred CCc--EEEEEEEEEEEECCEEEccEEEEEEEeccccccccccccceeecCC
Confidence 553 457777788999987665 3443332 124799999987
No 101
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=92.68 E-value=0.73 Score=43.79 Aligned_cols=91 Identities=12% Similarity=0.066 Sum_probs=54.9
Q ss_pred EEEcCEE-------EEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEE------
Q psy17672 185 CKVNGYP-------VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFL------ 251 (319)
Q Consensus 185 v~Ing~~-------v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~------ 251 (319)
++|||.. ..|+||||.+.+.++.++++.+.....-... . ......+.+|+..+
T Consensus 197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~~~~~C~~~-----~---------~~P~i~f~f~g~~~~l~~~~ 262 (317)
T cd06098 197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQINSAVDCNSL-----S---------SMPNVSFTIGGKTFELTPEQ 262 (317)
T ss_pred EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhhccCCcccc-----c---------cCCcEEEEECCEEEEEChHH
Confidence 5788875 3589999999999999998876521110000 0 00011122222111
Q ss_pred -----------EeeEEEe--cC---CCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 252 -----------TTSLSIL--EE---QPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 252 -----------~~~f~Vl--~~---~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
.|-+.+. +. .+...|||-.||+.+-.+.|+.+++|-|.
T Consensus 263 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGfA 316 (317)
T cd06098 263 YILKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGFA 316 (317)
T ss_pred eEEeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEeec
Confidence 2322222 21 12347999999999999999999998764
No 102
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=92.50 E-value=1.1 Score=42.88 Aligned_cols=101 Identities=14% Similarity=0.112 Sum_probs=57.2
Q ss_pred EEEcCEEE-----EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEE--EeEEEEEEEEEcCEEEEee---
Q psy17672 185 CKVNGYPV-----KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQI--IGRIHMVQVAIEKDFLTTS--- 254 (319)
Q Consensus 185 v~Ing~~v-----~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~--~g~v~~~~i~IG~~~~~~~--- 254 (319)
++|+|... .++||||.+.+.++.+.++.+.-. +.... ...+.-...+ ......+.+.+|+..+.++
T Consensus 199 i~v~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~~--~~~~~--~~~~~~~~~C~~~~~~p~i~f~fgg~~~~i~~~~ 274 (329)
T cd05485 199 VSVGEGEFCSGGCQAIADTGTSLIAGPVDEIEKLNNA--IGAKP--IIGGEYMVNCSAIPSLPDITFVLGGKSFSLTGKD 274 (329)
T ss_pred EEECCeeecCCCcEEEEccCCcceeCCHHHHHHHHHH--hCCcc--ccCCcEEEeccccccCCcEEEEECCEEeEEChHH
Confidence 35676654 699999999999999887765211 10000 0001000000 0012234455555443322
Q ss_pred --------------EEEec-----CCCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 255 --------------LSILE-----EQPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 255 --------------f~Vl~-----~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
+.+.. ..+...|||..||+.+-.+.|+++++|-|.
T Consensus 275 yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlG~~fl~~~y~vFD~~~~~ig~a 328 (329)
T cd05485 275 YVLKVTQMGQTICLSGFMGIDIPPPAGPLWILGDVFIGKYYTEFDLGNNRVGFA 328 (329)
T ss_pred eEEEecCCCCCEEeeeEEECcCCCCCCCeEEEchHHhccceEEEeCCCCEEeec
Confidence 11121 112247999999999999999999998774
No 103
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=92.22 E-value=0.7 Score=44.81 Aligned_cols=89 Identities=18% Similarity=0.208 Sum_probs=54.0
Q ss_pred eEEEEEEEc--CEEEEEEEcCCccccccCHH-HHHHcC-Cccc-------cccceeeEeeccceeEEEeEEEEEEEEEcC
Q psy17672 180 MLYINCKVN--GYPVKAFVDSGAQTTIMSAK-CAERCN-IMRL-------IDTRWAGVAKGVGVQQIIGRIHMVQVAIEK 248 (319)
Q Consensus 180 ~l~v~v~In--g~~v~aLVDTGA~~siIs~~-~a~rlg-l~~~-------~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~ 248 (319)
-+++++.|+ ++++.+++|||++.+.+... |+...+ .... ....+. +..|-|. +.|.+....|.||+
T Consensus 3 ~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~~~~~~f~~~~SsT~~~~~~~~~-i~Yg~Gs--~~G~~~~D~v~ig~ 79 (364)
T cd05473 3 GYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHPFIHTYFHRELSSTYRDLGKGVT-VPYTQGS--WEGELGTDLVSIPK 79 (364)
T ss_pred ceEEEEEecCCCceEEEEEecCCcceEEEcCCCccccccCCchhCcCcccCCceEE-EEECcce--EEEEEEEEEEEECC
Confidence 478899997 68999999999999988543 321100 0000 111111 3456553 46888888999985
Q ss_pred ---EEEEeeEEEec--------CCCCceeechhh
Q psy17672 249 ---DFLTTSLSILE--------EQPMDMLLGLDM 271 (319)
Q Consensus 249 ---~~~~~~f~Vl~--------~~~~d~ILG~D~ 271 (319)
..+...+.... ....|+||||-|
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~dGIlGLg~ 113 (364)
T cd05473 80 GPNVTFRANIAAITESENFFLNGSNWEGILGLAY 113 (364)
T ss_pred CCccceEEeeEEEeccccceecccccceeeeecc
Confidence 23433333321 124699999975
No 104
>KOG3493|consensus
Probab=92.05 E-value=0.044 Score=40.26 Aligned_cols=49 Identities=10% Similarity=0.152 Sum_probs=42.7
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILV 50 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l 50 (319)
.||.|+|.+|++.+|-.++..+|---+.+++| .-||++|-|.+|--+.|
T Consensus 22 DtiGD~KKliaaQtGT~~~kivl~k~~~i~kd-~I~L~dyeihdg~~lel 70 (73)
T KOG3493|consen 22 DTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKD-HITLSDYEIHDGMNLEL 70 (73)
T ss_pred ccccCHHHHHHHhhCCChhHhHHHhhhhhhhc-ccceeeEEeccCccEEE
Confidence 48999999999999999999999867777887 78999999999976554
No 105
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=91.74 E-value=0.58 Score=43.88 Aligned_cols=79 Identities=16% Similarity=0.279 Sum_probs=53.6
Q ss_pred EEEEEEEc--CEEEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCE-EEE-eeEE
Q psy17672 181 LYINCKVN--GYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKD-FLT-TSLS 256 (319)
Q Consensus 181 l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~-~~~-~~f~ 256 (319)
+++++.|+ .+++.+++|||++.+.+.- ..| ..+. +..|-|+ ...|.+....+.+|+. .++ ..|-
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~c---~~c-------~~~~-i~Yg~Gs-~~~G~~~~D~v~ig~~~~~~~~~Fg 69 (299)
T cd05472 2 YVVTVGLGTPARDQTVIVDTGSDLTWVQC---QPC-------CLYQ-VSYGDGS-YTTGDLATDTLTLGSSDVVPGFAFG 69 (299)
T ss_pred eEEEEecCCCCcceEEEecCCCCcccccC---CCC-------Ceee-eEeCCCc-eEEEEEEEEEEEeCCCCccCCEEEE
Confidence 57889997 6889999999999998842 122 1222 3455443 3468888888999986 444 4555
Q ss_pred EecCC-----CCceeechhh
Q psy17672 257 ILEEQ-----PMDMLLGLDM 271 (319)
Q Consensus 257 Vl~~~-----~~d~ILG~D~ 271 (319)
+.... ..|+||||-+
T Consensus 70 ~~~~~~~~~~~~~GilGLg~ 89 (299)
T cd05472 70 CGHDNEGLFGGAAGLLGLGR 89 (299)
T ss_pred CCccCCCccCCCCEEEECCC
Confidence 54421 4789999964
No 106
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=90.64 E-value=0.83 Score=42.42 Aligned_cols=87 Identities=17% Similarity=0.095 Sum_probs=52.1
Q ss_pred EEEcCEE-----EEEEEcCCccccccCHHHHHHcCCccccccceeeEeecc-ceeEEEeEEEEEEEEEcC--EE------
Q psy17672 185 CKVNGYP-----VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGV-GVQQIIGRIHMVQVAIEK--DF------ 250 (319)
Q Consensus 185 v~Ing~~-----v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gv-g~~~~~g~v~~~~i~IG~--~~------ 250 (319)
++|||++ ..++||||++.+.+..+.. ++ .+.-...|. +. +.+.|-. ..
T Consensus 166 i~vg~~~~~~~~~~~ivDTGTt~t~lp~~~y----~p-----~i~~~f~~~~~~---------~~~~l~~~~y~~~~~~~ 227 (273)
T cd05475 166 LLFNGQPTGGKGLEVVFDSGSSYTYFNAQAY----FK-----PLTLKFGKGWRT---------RLLEIPPENYLIISEKG 227 (273)
T ss_pred EEECCEECcCCCceEEEECCCceEEcCCccc----cc-----cEEEEECCCCce---------eEEEeCCCceEEEcCCC
Confidence 5678775 4689999999999998765 22 111111221 00 1111111 11
Q ss_pred EEeeEEEecCC----CCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 251 LTTSLSILEEQ----PMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 251 ~~~~f~Vl~~~----~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
..| +..+... ....|||-.||+.+-.+.|+.+++|-|..
T Consensus 228 ~~C-l~~~~~~~~~~~~~~ilG~~~l~~~~~vfD~~~~riGfa~ 270 (273)
T cd05475 228 NVC-LGILNGSEIGLGNTNIIGDISMQGLMVIYDNEKQQIGWVR 270 (273)
T ss_pred CEE-EEEecCCCcCCCceEEECceEEEeeEEEEECcCCEeCccc
Confidence 124 2333322 12479999999999999999999887754
No 107
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=90.55 E-value=2 Score=40.83 Aligned_cols=101 Identities=15% Similarity=0.107 Sum_probs=58.2
Q ss_pred EEEcCEE-----EEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEEee---
Q psy17672 185 CKVNGYP-----VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLTTS--- 254 (319)
Q Consensus 185 v~Ing~~-----v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~~~--- 254 (319)
++|+|+. ..++||||++.+.++.++++.+.-. +.... ...+.-...+. .....+.+.+++..+.++
T Consensus 194 i~vg~~~~~~~~~~~ivDSGtt~~~lp~~~~~~l~~~--~~~~~--~~~~~~~~~C~~~~~~P~i~f~f~g~~~~i~~~~ 269 (320)
T cd05488 194 IGLGDEELELENTGAAIDTGTSLIALPSDLAEMLNAE--IGAKK--SWNGQYTVDCSKVDSLPDLTFNFDGYNFTLGPFD 269 (320)
T ss_pred EEECCEEeccCCCeEEEcCCcccccCCHHHHHHHHHH--hCCcc--ccCCcEEeeccccccCCCEEEEECCEEEEECHHH
Confidence 4566654 3689999999999999988765211 00000 00000000110 112234556666544322
Q ss_pred ----------EEEecC-----CCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 255 ----------LSILEE-----QPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 255 ----------f~Vl~~-----~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
..+... .+...|||-.||+.+-.+.|+.+++|-|.
T Consensus 270 y~~~~~g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a 319 (320)
T cd05488 270 YTLEVSGSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLA 319 (320)
T ss_pred heecCCCeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeec
Confidence 222211 12358999999999999999999998774
No 108
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=90.44 E-value=0.53 Score=43.89 Aligned_cols=98 Identities=16% Similarity=0.160 Sum_probs=60.3
Q ss_pred EEEEcCE------EEEEEEcCCccccccCHHHHHHc----CCccccccceeeEeeccceeEE--EeEEEEEEEEEcCEEE
Q psy17672 184 NCKVNGY------PVKAFVDSGAQTTIMSAKCAERC----NIMRLIDTRWAGVAKGVGVQQI--IGRIHMVQVAIEKDFL 251 (319)
Q Consensus 184 ~v~Ing~------~v~aLVDTGA~~siIs~~~a~rl----gl~~~~~~~~~g~a~Gvg~~~~--~g~v~~~~i~IG~~~~ 251 (319)
.+.|++. ...++||||++...++.+....+ +..... +.....+ ........+.+++..+
T Consensus 186 ~i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~---------~~~~~~c~~~~~~p~l~f~~~~~~~ 256 (317)
T PF00026_consen 186 SISIGGESVFSSSGQQAILDTGTSYIYLPRSIFDAIIKALGGSYSD---------GVYSVPCNSTDSLPDLTFTFGGVTF 256 (317)
T ss_dssp EEEETTEEEEEEEEEEEEEETTBSSEEEEHHHHHHHHHHHTTEEEC---------SEEEEETTGGGGSEEEEEEETTEEE
T ss_pred cccccccccccccceeeecccccccccccchhhHHHHhhhcccccc---------eeEEEecccccccceEEEeeCCEEE
Confidence 3666666 35899999999999999887765 221111 1000000 0011233445555433
Q ss_pred E-----------------eeEEEec----CCCCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 252 T-----------------TSLSILE----EQPMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 252 ~-----------------~~f~Vl~----~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
. |-+.+.. ......+||..||+++=.+.|+.++++-|..
T Consensus 257 ~i~~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A~ 316 (317)
T PF00026_consen 257 TIPPSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFAQ 316 (317)
T ss_dssp EEEHHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEEE
T ss_pred EecchHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEec
Confidence 2 2222222 3456799999999999999999999998753
No 109
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=89.98 E-value=1.1 Score=42.68 Aligned_cols=92 Identities=15% Similarity=0.182 Sum_probs=57.6
Q ss_pred eEEEEEEEc--CEEEEEEEcCCccccccCHHHHHHcCCccc------cccc----------------------eeeEeec
Q psy17672 180 MLYINCKVN--GYPVKAFVDSGAQTTIMSAKCAERCNIMRL------IDTR----------------------WAGVAKG 229 (319)
Q Consensus 180 ~l~v~v~In--g~~v~aLVDTGA~~siIs~~~a~rlgl~~~------~~~~----------------------~~g~a~G 229 (319)
.+++++.|+ .+++.++||||++.+.+.-.-+..|+.... ..+. ...+..|
T Consensus 3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~i~Y~ 82 (326)
T cd06096 3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYSISYS 82 (326)
T ss_pred eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEEEEEC
Confidence 578999998 689999999999999986654455653210 0000 1112344
Q ss_pred cceeEEEeEEEEEEEEEcCEEEE--------eeEEEec-------CCCCceeechhhh
Q psy17672 230 VGVQQIIGRIHMVQVAIEKDFLT--------TSLSILE-------EQPMDMLLGLDML 272 (319)
Q Consensus 230 vg~~~~~g~v~~~~i~IG~~~~~--------~~f~Vl~-------~~~~d~ILG~D~L 272 (319)
-| ..+.|.+....+.||+...+ ..|.+.. ....|+||||.+=
T Consensus 83 ~g-s~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~ 139 (326)
T cd06096 83 EG-SSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLFLTQQATGILGLSLT 139 (326)
T ss_pred CC-CceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcccccccceEEEccCC
Confidence 44 24678888888999875432 1232221 1247999999873
No 110
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=89.89 E-value=0.74 Score=42.72 Aligned_cols=84 Identities=15% Similarity=0.157 Sum_probs=50.2
Q ss_pred EEEcCE------EEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEEeeEEEe
Q psy17672 185 CKVNGY------PVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSIL 258 (319)
Q Consensus 185 v~Ing~------~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl 258 (319)
++|+|. ...++||||++.+.++.+.++++--. +.|.... . .-|...++|+-. +
T Consensus 186 i~v~~~~~~~~~~~~~iiDSGTs~~~lP~~~~~~l~~~------l~g~~~~----~----------~~~~~~~~C~~~-~ 244 (278)
T cd06097 186 YTVGGDAPWSRSGFSAIADTGTTLILLPDAIVEAYYSQ------VPGAYYD----S----------EYGGWVFPCDTT-L 244 (278)
T ss_pred EEECCcceeecCCceEEeecCCchhcCCHHHHHHHHHh------CcCCccc----C----------CCCEEEEECCCC-C
Confidence 456665 35799999999999999888766211 0000000 0 001112222211 1
Q ss_pred cCCC--CceeechhhhhhccEEEEccCCEEEEc
Q psy17672 259 EEQP--MDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 259 ~~~~--~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
+.-. +..|||-.||+.+=.+.|+.+++|-|.
T Consensus 245 P~i~f~~~~ilGd~fl~~~y~vfD~~~~~ig~A 277 (278)
T cd06097 245 PDLSFAVFSILGDVFLKAQYVVFDVGGPKLGFA 277 (278)
T ss_pred CCEEEEEEEEEcchhhCceeEEEcCCCceeeec
Confidence 1000 157999999999999999999988664
No 111
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=89.78 E-value=1.5 Score=39.99 Aligned_cols=90 Identities=14% Similarity=0.201 Sum_probs=55.3
Q ss_pred EEEEEEcCE--EEEEEEcCCccccccCHHHHHHcCCccc--------------cccceeeEeeccceeEEEeEEEEEEEE
Q psy17672 182 YINCKVNGY--PVKAFVDSGAQTTIMSAKCAERCNIMRL--------------IDTRWAGVAKGVGVQQIIGRIHMVQVA 245 (319)
Q Consensus 182 ~v~v~Ing~--~v~aLVDTGA~~siIs~~~a~rlgl~~~--------------~~~~~~g~a~Gvg~~~~~g~v~~~~i~ 245 (319)
++++.|+.- ++.++||||++.+.+...-+..|+.... ..........+.| ...|.+....+.
T Consensus 2 ~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~Y~~g--~~~g~~~~D~v~ 79 (283)
T cd05471 2 YGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKDTGCTFSITYGDG--SVTGGLGTDTVT 79 (283)
T ss_pred EEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeecCCCEEEEEECCC--eEEEEEEEeEEE
Confidence 577888654 8999999999999996665555432111 0111111234433 345666678899
Q ss_pred EcCEEEE-eeEEEec-------CCCCceeechhhhh
Q psy17672 246 IEKDFLT-TSLSILE-------EQPMDMLLGLDMLR 273 (319)
Q Consensus 246 IG~~~~~-~~f~Vl~-------~~~~d~ILG~D~L~ 273 (319)
+|+...+ ..|.... ....++||||.+=.
T Consensus 80 ~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~~ 115 (283)
T cd05471 80 IGGLTIPNQTFGCATSESGDFSSSGFDGILGLGFPS 115 (283)
T ss_pred ECCEEEeceEEEEEeccCCcccccccceEeecCCcc
Confidence 9986543 3444443 23689999987643
No 112
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=89.53 E-value=1.1 Score=42.11 Aligned_cols=27 Identities=7% Similarity=0.074 Sum_probs=24.3
Q ss_pred ceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 264 DMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 264 d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
-.|||-.||+.+-.+.|+.+++|-|..
T Consensus 270 ~~ilG~~fl~~~~vvfD~~~~~igfa~ 296 (299)
T cd05472 270 LSIIGNVQQQTFRVVYDVAGGRIGFAP 296 (299)
T ss_pred CEEEchHHccceEEEEECCCCEEeEec
Confidence 479999999999999999999998864
No 113
>cd01811 OASL_repeat1 2'-5' oligoadenylate synthetase-like protein, repeat 1 of 2. OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Probab=89.49 E-value=0.77 Score=34.83 Aligned_cols=49 Identities=14% Similarity=0.142 Sum_probs=38.5
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEec---C-eeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNN---G-KHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~---G-k~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|..+|+.|..+.|++- +|+|.|- | ++|-.+..||++|||=.+-.|.|.
T Consensus 22 pI~k~K~kI~~~~~~~g-~qrLsfQepgg~rqlL~s~~sLA~yGiFs~~~i~ll 74 (80)
T cd01811 22 PIRKIKEKIRRSRNCSG-LQRLSFQEPGGERQLLSSRKSLADYGIFSKTNICLL 74 (80)
T ss_pred hHHHHHHHHHHhhCccc-ceEEEeecCCcccccccccccHhhhcceeccEEEEE
Confidence 57899999999999997 9999983 3 344333789999999877666655
No 114
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=88.71 E-value=0.71 Score=42.14 Aligned_cols=80 Identities=18% Similarity=0.164 Sum_probs=48.4
Q ss_pred EEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcCEEEE-eeEEEecCCCCceeech
Q psy17672 191 PVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEKDFLT-TSLSILEEQPMDMLLGL 269 (319)
Q Consensus 191 ~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~~~~~-~~f~Vl~~~~~d~ILG~ 269 (319)
...++||||++.+.++.+.++.+--. ...... .... ...+.. .... .+| +.|.. ..|||.
T Consensus 202 ~~~~iiDsGt~~~~lp~~~~~~l~~~--~~~~~~---~~~~----~~~~~~--~~~~--~~p~i~f~f------~~ilG~ 262 (283)
T cd05471 202 GGGAIVDSGTSLIYLPSSVYDAILKA--LGAAVS---SSDG----GYGVDC--SPCD--TLPDITFTF------LWILGD 262 (283)
T ss_pred CcEEEEecCCCCEeCCHHHHHHHHHH--hCCccc---ccCC----cEEEeC--cccC--cCCCEEEEE------EEEccH
Confidence 46899999999999999998886321 111100 0000 000000 0000 011 12222 799999
Q ss_pred hhhhhccEEEEccCCEEEEc
Q psy17672 270 DMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 270 D~L~~~~~~ID~~~~~L~i~ 289 (319)
.||+.+-.+.|+.+.+|-|.
T Consensus 263 ~fl~~~y~vfD~~~~~igfa 282 (283)
T cd05471 263 VFLRNYYTVFDLDNNRIGFA 282 (283)
T ss_pred hhhhheEEEEeCCCCEEeec
Confidence 99999999999999988764
No 115
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=88.46 E-value=1.7 Score=40.41 Aligned_cols=84 Identities=15% Similarity=0.254 Sum_probs=50.1
Q ss_pred eEEEEEEEc--CEEEEEEEcCCccccccCH-HHHHHcCCccccccceeeEeeccceeEEEeEEEEEEEEEcC----EEEE
Q psy17672 180 MLYINCKVN--GYPVKAFVDSGAQTTIMSA-KCAERCNIMRLIDTRWAGVAKGVGVQQIIGRIHMVQVAIEK----DFLT 252 (319)
Q Consensus 180 ~l~v~v~In--g~~v~aLVDTGA~~siIs~-~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~g~v~~~~i~IG~----~~~~ 252 (319)
.+++++.|+ .+++.+++|||++.+.+.- ..+..|+. .+. +..|-| ....|.+-...|.++. ..++
T Consensus 2 ~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c~c------~~~-i~Ygd~-~~~~G~~~~D~v~~~~~~~~~~~~ 73 (273)
T cd05475 2 YYYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGCQC------DYE-IEYADG-GSSMGVLVTDIFSLKLTNGSRAKP 73 (273)
T ss_pred ceEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCCcC------ccE-eEeCCC-CceEEEEEEEEEEEeecCCCcccC
Confidence 468899998 8899999999999999942 11233422 222 334422 2446777666777753 1211
Q ss_pred -eeEEEe---------cCCCCceeechhh
Q psy17672 253 -TSLSIL---------EEQPMDMLLGLDM 271 (319)
Q Consensus 253 -~~f~Vl---------~~~~~d~ILG~D~ 271 (319)
..|-.. .....|+||||-+
T Consensus 74 ~~~Fgc~~~~~~~~~~~~~~~dGIlGLg~ 102 (273)
T cd05475 74 RIAFGCGYDQQGPLLNPPPPTDGILGLGR 102 (273)
T ss_pred CEEEEeeeccCCcccCCCccCCEEEECCC
Confidence 223222 1135799999954
No 116
>PF00789 UBX: UBX domain; InterPro: IPR001012 The UBX domain is found in ubiquitin-regulatory proteins, which are members of the ubiquitination pathway, as well as a number of other proteins including FAF-1 (FAS-associated factor 1), the human Rep-8 reproduction protein and several hypothetical proteins from yeast. The function of the UBX domain is not known although the fragment of avian FAF-1 containing the UBX domain causes apoptosis of transfected cells.; GO: 0005515 protein binding; PDB: 3QX1_A 1H8C_A 3QCA_B 3QQ8_B 3QC8_B 3R3M_A 3QWZ_B 1S3S_H 1JRU_A 1I42_A ....
Probab=88.38 E-value=1.3 Score=33.43 Aligned_cols=51 Identities=12% Similarity=0.189 Sum_probs=41.6
Q ss_pred CCHHHHHHHHHhhhCCCCCC-eEEE--ecCeeccCCc-ccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATD-MILL--NNGKHLLEDG-SCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~-q~Li--~~Gk~L~d~~-~tL~~~gI~~~d~l~l~ 51 (319)
.||.+|...|......+... ..|+ |--+.|.+++ +||+++|+..+.+|+|.
T Consensus 27 ~tl~~l~~~v~~~~~~~~~~~f~L~~~~Pr~~l~~~~~~tl~e~~l~p~~~l~v~ 81 (82)
T PF00789_consen 27 DTLQDLYDFVESQLFSPEESDFELITAFPRRELTDEDSKTLEEAGLLPSATLIVE 81 (82)
T ss_dssp SBHHHHHHHHHHHHHCTTTSSEEEEESSSTEECCSTTTSBTCCCTTSSCEEEEEE
T ss_pred chHHHHHHHHHHhcCCCCCccEEEEeCCCCcCCCccccccHHHhcCCCCeEEEEE
Confidence 37899999999888777765 7776 5678886644 79999999999999875
No 117
>smart00727 STI1 Heat shock chaperonin-binding motif.
Probab=87.82 E-value=0.64 Score=30.60 Aligned_cols=31 Identities=35% Similarity=0.492 Sum_probs=19.7
Q ss_pred CCChHH--HHHHHhhCHHHHHHHHhcCchhHHH
Q psy17672 78 LQDPAH--VRDLLLACPDQLALLKQNNPRLSEA 108 (319)
Q Consensus 78 ~~dpe~--~r~~il~nP~~l~~L~~~nP~La~a 108 (319)
+.||.. +.+.+.+||+.+..+.+.||+++..
T Consensus 8 l~~P~~~~~l~~~~~nP~~~~~~~~~nP~~~~~ 40 (41)
T smart00727 8 LQNPQVQSLLQDMQQNPDMLAQMLQENPQLLQL 40 (41)
T ss_pred HcCHHHHHHHHHHHHCHHHHHHHHHhCHHhHhh
Confidence 346643 3444667888777776668877754
No 118
>PF03539 Spuma_A9PTase: Spumavirus aspartic protease (A9); InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=86.99 E-value=4.5 Score=34.75 Aligned_cols=78 Identities=18% Similarity=0.232 Sum_probs=43.3
Q ss_pred EcCEEEEEEEcCCccccccCHHHHHHcCCccccccceee--EeeccceeEEEeEEEEEEEEEcCEEEEeeEEEecCCCCc
Q psy17672 187 VNGYPVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAG--VAKGVGVQQIIGRIHMVQVAIEKDFLTTSLSILEEQPMD 264 (319)
Q Consensus 187 Ing~~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g--~a~Gvg~~~~~g~v~~~~i~IG~~~~~~~f~Vl~~~~~d 264 (319)
|.|..+++.-||||+.|+|...+.+. ....| ....+-+ ...+.+.-+.++|.|..+.+.+.--+ +|
T Consensus 1 ikg~~l~~~wDsga~ITCiP~~fl~~--------E~Pi~~~~i~Tihg-~~~~~vYYl~fKi~grkv~aEVi~s~---~d 68 (163)
T PF03539_consen 1 IKGTKLKGHWDSGAQITCIPESFLEE--------EQPIGKTLIKTIHG-EKEQDVYYLTFKINGRKVEAEVIASP---YD 68 (163)
T ss_dssp ETTEEEEEEE-TT-SSEEEEGGGTTT-----------SEEEEEE-SS--EEEEEEEEEEEEESS-EEEEEEEEES---SS
T ss_pred CCCceeeEEecCCCeEEEccHHHhCc--------cccccceEEEEecC-ceeccEEEEEEEEcCeEEEEEEecCc---cc
Confidence 56899999999999999999887421 11111 1122211 33455677889999977665443332 34
Q ss_pred eee----chhhhhhcc
Q psy17672 265 MLL----GLDMLRRHE 276 (319)
Q Consensus 265 ~IL----G~D~L~~~~ 276 (319)
-+| -++|++..-
T Consensus 69 y~li~p~diPw~~~~p 84 (163)
T PF03539_consen 69 YILISPSDIPWYKKKP 84 (163)
T ss_dssp SEEE-TTT-HHHHS--
T ss_pred eEEEcccccccccCCC
Confidence 332 367888765
No 119
>PF15044 CLU_N: Mitochondrial function, CLU-N-term
Probab=85.94 E-value=1.5 Score=33.35 Aligned_cols=52 Identities=21% Similarity=0.272 Sum_probs=42.1
Q ss_pred CCHHHHHHHHHhhh-CCCCCCeEEEecCeeccCCccccccc-CCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDEC-GIDATDMILLNNGKHLLEDGSCLKQA-GVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~-gip~~~q~Li~~Gk~L~d~~~tL~~~-gI~~~d~l~l~~~ 53 (319)
.+|.|+++.|.... ...-....|.|+|+.|++ ...|++. |+++|..|.|+-.
T Consensus 5 d~v~dvrq~L~~~~~t~~~Tn~~L~~~g~~L~~-~~el~~i~~~~~~~~L~lve~ 58 (76)
T PF15044_consen 5 DTVQDVRQVLAESPETCYLTNFSLEHNGQRLDD-FVELSEIEGIKDGCVLELVEE 58 (76)
T ss_pred hHHHHHHHHHHhCccccceeEEEEEECCCccCC-chhhhhhhCCCCCcEEEEEec
Confidence 37889998887664 577788999999999976 5677776 6999999998843
No 120
>COG5417 Uncharacterized small protein [Function unknown]
Probab=85.55 E-value=1.7 Score=32.97 Aligned_cols=31 Identities=26% Similarity=0.401 Sum_probs=25.9
Q ss_pred CCeEEEecCeeccCCcccccccCCCCCcEEEE
Q psy17672 19 TDMILLNNGKHLLEDGSCLKQAGVRDGDIILV 50 (319)
Q Consensus 19 ~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l 50 (319)
.+.+..-.++.|.+ +..|.+|||.+||.|.+
T Consensus 50 ~~Ikv~nKa~llsg-d~kL~d~~IadGD~Lei 80 (81)
T COG5417 50 TQIKVMNKAQLLSG-DDKLIDYQIADGDILEI 80 (81)
T ss_pred CEEEEeccceEecC-CceEEeccccCCCEEEe
Confidence 45677778999987 57999999999999875
No 121
>KOG0013|consensus
Probab=85.54 E-value=1.1 Score=40.48 Aligned_cols=45 Identities=16% Similarity=0.289 Sum_probs=41.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGD 46 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d 46 (319)
.|+.++|..+.+..|+++-.|++.|+|+.|.| ...|++|+|..|.
T Consensus 167 Dtv~eik~~L~Aaeg~D~~sQrif~Sg~~l~d-kt~LeEc~iekg~ 211 (231)
T KOG0013|consen 167 DTVGEIKRALRAAEGVDPLSQRIFFSGGVLVD-KTDLEECKIEKGQ 211 (231)
T ss_pred CcHHHHHHHHHHhhccchhhheeeccCCceec-cccceeeeecCCC
Confidence 37889999999999999999999999999997 6789999999994
No 122
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=85.36 E-value=5.8 Score=37.50 Aligned_cols=100 Identities=17% Similarity=0.117 Sum_probs=55.8
Q ss_pred EEEcCEEE------EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEEe---
Q psy17672 185 CKVNGYPV------KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLTT--- 253 (319)
Q Consensus 185 v~Ing~~v------~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~~--- 253 (319)
++|+|+.+ .++||||++.+.++.+.++++-- .+.... ..|.-...+. .....+.+.+|+..+.+
T Consensus 186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l~~--~~~~~~---~~~~~~~~C~~~~~~p~i~f~f~g~~~~l~~~ 260 (316)
T cd05486 186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQLQN--YIGATA---TDGEYGVDCSTLSLMPSVTFTINGIPYSLSPQ 260 (316)
T ss_pred EEEecceEecCCCCEEEECCCcchhhcCHHHHHHHHH--HhCCcc---cCCcEEEeccccccCCCEEEEECCEEEEeCHH
Confidence 56787653 59999999999999987766510 000000 0000000110 01122344555543332
Q ss_pred --------------eEEE--ec---CCCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 254 --------------SLSI--LE---EQPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 254 --------------~f~V--l~---~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
-+.+ ++ ..+...|||-.||+.+-.+.|+.+++|-|.
T Consensus 261 ~y~~~~~~~~~~~C~~~~~~~~~~~~~~~~~ILGd~flr~~y~vfD~~~~~IGfA 315 (316)
T cd05486 261 AYTLEDQSDGGGYCSSGFQGLDIPPPAGPLWILGDVFIRQYYSVFDRGNNRVGFA 315 (316)
T ss_pred HeEEecccCCCCEEeeEEEECCCCCCCCCeEEEchHHhcceEEEEeCCCCEeecc
Confidence 1111 11 111237999999999999999999988663
No 123
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=84.16 E-value=4.2 Score=39.68 Aligned_cols=26 Identities=19% Similarity=0.331 Sum_probs=23.4
Q ss_pred eeechhhhhhccEEEEccCCEEEEcc
Q psy17672 265 MLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 265 ~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
.|||--+|+.+..+.|+++++|-|..
T Consensus 335 ~IlG~~~~~~~~vvyD~~~~riGfa~ 360 (362)
T cd05489 335 VVIGGHQMEDNLLVFDLEKSRLGFSS 360 (362)
T ss_pred EEEeeheecceEEEEECCCCEeeccc
Confidence 58999999999999999999988753
No 124
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=83.86 E-value=4 Score=38.70 Aligned_cols=94 Identities=10% Similarity=0.060 Sum_probs=53.1
Q ss_pred EEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEE-----------------
Q psy17672 192 VKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLT----------------- 252 (319)
Q Consensus 192 v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~----------------- 252 (319)
..|+||||++.+.++.+.++.+.-... ... ...|.-...+. .......+.+|+..+.
T Consensus 207 ~~aiiDSGTt~~~~p~~~~~~l~~~~~-~~~---~~~~~~~~~C~~~~~~P~i~f~fgg~~~~l~~~~y~~~~~~~~~~~ 282 (325)
T cd05490 207 CEAIVDTGTSLITGPVEEVRALQKAIG-AVP---LIQGEYMIDCEKIPTLPVISFSLGGKVYPLTGEDYILKVSQRGTTI 282 (325)
T ss_pred CEEEECCCCccccCCHHHHHHHHHHhC-Ccc---ccCCCEEecccccccCCCEEEEECCEEEEEChHHeEEeccCCCCCE
Confidence 479999999999999988876621100 000 00111011111 1112234455554333
Q ss_pred eeEEEe--cC---CCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 253 TSLSIL--EE---QPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 253 ~~f~Vl--~~---~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
|-+.+. +. .....|||-.||+.+-.+.|+.+++|-|.
T Consensus 283 C~~~~~~~~~~~~~~~~~ilGd~flr~~y~vfD~~~~~IGfA 324 (325)
T cd05490 283 CLSGFMGLDIPPPAGPLWILGDVFIGRYYTVFDRDNDRVGFA 324 (325)
T ss_pred EeeEEEECCCCCCCCceEEEChHhheeeEEEEEcCCcEeecc
Confidence 211121 11 12347999999999999999999988764
No 125
>KOG0010|consensus
Probab=82.49 E-value=1.4 Score=44.54 Aligned_cols=91 Identities=20% Similarity=0.328 Sum_probs=52.4
Q ss_pred HHHHHhhCHHHHHHHHhcCchhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHhhcCC------CCHHHHHHHHHHHHH
Q psy17672 84 VRDLLLACPDQLALLKQNNPRLSEALSTGKDAFTKVFKEQLVEREKRQKQKIKLMNAHP------FDTHAQRLIAEEIRQ 157 (319)
Q Consensus 84 ~r~~il~nP~~l~~L~~~nP~La~ai~~~~~~f~~~l~~~~~~~~~~~~~~~~~l~~dP------~~~e~q~~i~~~i~~ 157 (319)
+..++.+||+++.++ ..||-+...+ +|++-+++++...-+ -.++|..+| .+|+..+--.|..|.
T Consensus 148 ~q~~~~snpe~~~~~-m~nP~vq~ll-~Npd~mrq~I~anPq--------mq~lm~~npei~h~ln~p~i~rQtle~arN 217 (493)
T KOG0010|consen 148 MQNQLLSNPEALRQM-MENPIVQSLL-NNPDLMRQLIMANPQ--------MQDLMQRNPEIGHLLNNPLILRQTLESARN 217 (493)
T ss_pred chhccccCHHHHHHh-hhChHHHHHh-cChHHHHHHHhcCHH--------HHHHHhhCCcchhhhcChHHHHHHHHhccC
Confidence 445577788777776 5677777666 567677766655211 112333444 467777777777665
Q ss_pred HHHH-HH---HHHh---HhcCCCCccccceEEEE
Q psy17672 158 KNVE-AN---MEAA---MEYNPETFGSVVMLYIN 184 (319)
Q Consensus 158 ~~~~-~~---~~~a---~~~~pe~~~~~~~l~v~ 184 (319)
-.-. |+ -..| +|-.|..|.-..+.|..
T Consensus 218 P~m~qemmrn~d~a~SnlesiPgG~n~l~~my~d 251 (493)
T KOG0010|consen 218 PEMMQEMMRNQDRAMSNLESIPGGYNALRRMYTD 251 (493)
T ss_pred HHHHHHHHhhccccccChhcCccHHHHHHHHhhh
Confidence 3322 22 2234 47777777655555433
No 126
>PLN03146 aspartyl protease family protein; Provisional
Probab=82.17 E-value=3.3 Score=41.47 Aligned_cols=27 Identities=11% Similarity=0.030 Sum_probs=24.2
Q ss_pred ceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 264 DMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 264 d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
..|||-.+++.+.++.|+++++|-|..
T Consensus 399 ~~IlG~~~q~~~~vvyDl~~~~igFa~ 425 (431)
T PLN03146 399 IAIFGNLAQMNFLVGYDLESKTVSFKP 425 (431)
T ss_pred ceEECeeeEeeEEEEEECCCCEEeeec
Confidence 379999999999999999999998865
No 127
>PTZ00147 plasmepsin-1; Provisional
Probab=81.99 E-value=11 Score=38.15 Aligned_cols=103 Identities=13% Similarity=0.179 Sum_probs=57.1
Q ss_pred EEEEEcCE---EEEEEEcCCccccccCHHHHHHcCCccccccceeeEeeccc--eeEEE-eEEEEEEEEEcCEEEE----
Q psy17672 183 INCKVNGY---PVKAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVG--VQQII-GRIHMVQVAIEKDFLT---- 252 (319)
Q Consensus 183 v~v~Ing~---~v~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg--~~~~~-g~v~~~~i~IG~~~~~---- 252 (319)
+.+.+++. ...|+||||.+.+.++.+.+..+-=. ++.. .....+ ...|. .......+.+++..++
T Consensus 321 l~~~vg~~~~~~~~aIiDSGTsli~lP~~~~~ai~~~--l~~~---~~~~~~~y~~~C~~~~lP~~~f~f~g~~~~L~p~ 395 (453)
T PTZ00147 321 LDVHFGNVSSEKANVIVDSGTSVITVPTEFLNKFVES--LDVF---KVPFLPLYVTTCNNTKLPTLEFRSPNKVYTLEPE 395 (453)
T ss_pred EEEEECCEecCceeEEECCCCchhcCCHHHHHHHHHH--hCCe---ecCCCCeEEEeCCCCCCCeEEEEECCEEEEECHH
Confidence 33444443 35799999999999999887654100 0000 000000 00000 0111223344443222
Q ss_pred -------------eeEEEec--CCCCceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 253 -------------TSLSILE--EQPMDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 253 -------------~~f~Vl~--~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
|-+.+.+ ......|||-.||+.+-.+.|+.++++-|..
T Consensus 396 ~yi~~~~~~~~~~C~~~i~~~~~~~~~~ILGd~FLr~~YtVFD~~n~rIGfA~ 448 (453)
T PTZ00147 396 YYLQPIEDIGSALCMLNIIPIDLEKNTFILGDPFMRKYFTVFDYDNHTVGFAL 448 (453)
T ss_pred HheeccccCCCcEEEEEEEECCCCCCCEEECHHHhccEEEEEECCCCEEEEEE
Confidence 3222332 1223579999999999999999999998875
No 128
>smart00166 UBX Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p.
Probab=81.92 E-value=3.2 Score=31.32 Aligned_cols=51 Identities=6% Similarity=0.021 Sum_probs=40.1
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEE--ecCeeccCC--cccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILL--NNGKHLLED--GSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li--~~Gk~L~d~--~~tL~~~gI~~~d~l~l~ 51 (319)
.||++|.+.|....+.+.....|+ |--|.+.++ ++||.++|+....+|+|.
T Consensus 25 ~tl~~v~~~v~~~~~~~~~~f~L~t~~Prk~l~~~d~~~tL~e~gL~p~~~l~v~ 79 (80)
T smart00166 25 DTLRTVYEFVSAALTDGNDPFTLNSPFPRRTFTKDDYSKTLLELALLPSSTLVLE 79 (80)
T ss_pred CcHHHHHHHHHHcccCCCCCEEEEeCCCCcCCccccccCCHHHCCCCCceEEEEe
Confidence 378999999976667777778886 567788653 579999999999888864
No 129
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=81.49 E-value=9 Score=36.40 Aligned_cols=101 Identities=14% Similarity=0.109 Sum_probs=56.3
Q ss_pred EEEEcCEEE------EEEEcCCccccccCHHHHHHcCCccccccceeeEeeccceeEEE--eEEEEEEEEEcCEEEEe--
Q psy17672 184 NCKVNGYPV------KAFVDSGAQTTIMSAKCAERCNIMRLIDTRWAGVAKGVGVQQII--GRIHMVQVAIEKDFLTT-- 253 (319)
Q Consensus 184 ~v~Ing~~v------~aLVDTGA~~siIs~~~a~rlgl~~~~~~~~~g~a~Gvg~~~~~--g~v~~~~i~IG~~~~~~-- 253 (319)
.++|+|..+ .++||||++.+.++.+.++++--. +..... .+.=...+. .....+.+.+|+..+.+
T Consensus 194 ~i~vg~~~~~~~~~~~aiiDSGts~~~lP~~~~~~l~~~--~~~~~~---~~~y~~~C~~~~~~P~i~f~fgg~~~~v~~ 268 (326)
T cd05487 194 GVSVGSSTLLCEDGCTAVVDTGASFISGPTSSISKLMEA--LGAKER---LGDYVVKCNEVPTLPDISFHLGGKEYTLSS 268 (326)
T ss_pred EEEECCEEEecCCCCEEEECCCccchhCcHHHHHHHHHH--hCCccc---CCCEEEeccccCCCCCEEEEECCEEEEeCH
Confidence 346676653 589999999999999877665110 000000 000000000 01122344555543332
Q ss_pred ---------------eEEEec-----CCCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 254 ---------------SLSILE-----EQPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 254 ---------------~f~Vl~-----~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
-+.+.. ..+...|||-.||+.+-.+.|+.+++|-|.
T Consensus 269 ~~yi~~~~~~~~~~C~~~~~~~~~~~~~~~~~ilG~~flr~~y~vfD~~~~~IGfA 324 (326)
T cd05487 269 SDYVLQDSDFSDKLCTVAFHAMDIPPPTGPLWVLGATFIRKFYTEFDRQNNRIGFA 324 (326)
T ss_pred HHhEEeccCCCCCEEEEEEEeCCCCCCCCCeEEEehHHhhccEEEEeCCCCEEeee
Confidence 111211 111247999999999999999999988775
No 130
>PF11470 TUG-UBL1: GLUT4 regulating protein TUG; InterPro: IPR021569 TUG is a GLUT4 regulating protein and functions to retain membrane vesicles containing GLUT4 intracellularly. TUG releases the GLUT4 containing vesicles to the cellular exocytic machinery in response to insulin stimulation which allows translocation to the plasma membrane []. TUG has an N-terminal ubiquitin-like domain (UBL1) which in similar proteins appears to participate in protein-protein interactions []. The region does have a area of negative electrostatic potential and increased backbone motility which leads to suggestions of a potential protein-protein interaction site []. ; PDB: 2AL3_A.
Probab=79.69 E-value=4.1 Score=30.05 Aligned_cols=48 Identities=15% Similarity=0.131 Sum_probs=32.1
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILV 50 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l 50 (319)
++.++-+..-..+|+++++-.|.|++|.|+- +.++.-.|+-.|..|.|
T Consensus 18 ~l~~VL~eac~k~~l~~~~~~L~h~~k~ldl-slp~R~snL~n~akLeL 65 (65)
T PF11470_consen 18 TLNQVLEEACKKFGLDPSSYDLKHNNKPLDL-SLPFRLSNLPNNAKLEL 65 (65)
T ss_dssp BHHHHHHHHHHHTT--GGG-EEEETTEEESS-S-BHHHH---SS-EEEE
T ss_pred CHHHHHHHHHHHcCCCccceEEEECCEEecc-ccceeecCCCCCCEEeC
Confidence 4556666666678999999999999999975 67899999999988764
No 131
>cd01774 Faf1_like2_UBX Faf1 ike-2 UBX domain. Faf1_like2 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=79.39 E-value=4.5 Score=31.30 Aligned_cols=49 Identities=12% Similarity=0.173 Sum_probs=38.3
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecC--eecc-------CCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNG--KHLL-------EDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~G--k~L~-------d~~~tL~~~gI~~~d~l~l~ 51 (319)
||++|.+.|.. .+..++...|+.+= |.+. |...||+++|+....+|+|.
T Consensus 26 tl~~l~~fv~~-~~~~~~~f~L~t~FPrr~~~~~~~~~~~~~~TL~eaGL~~s~~L~V~ 83 (85)
T cd01774 26 SLRVIHDFLFS-LKETPEKFQIVTNFPRRVLPCLPSEGDPPPPTLLEAGLSNSEVLFVQ 83 (85)
T ss_pred cHHHHHHHHHh-CCCCCCcEEEecCCCCccccccccccCcCCCCHHHcCCCCccEEEEe
Confidence 78899999964 45566888998766 7885 33579999999998888774
No 132
>KOG1639|consensus
Probab=77.33 E-value=2.9 Score=38.88 Aligned_cols=50 Identities=24% Similarity=0.326 Sum_probs=37.7
Q ss_pred CHHHHH-HHHHhhhCCCCCCeEEE----ecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFR-LICQDECGIDATDMILL----NNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK-~~i~~e~gip~~~q~Li----~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
|+.|++ +++..+..+.+..+++. -.|++|.| +++|++||...|++|.+.-
T Consensus 24 ti~d~~~~~~~~~~k~~~~~~r~tlr~e~kgkpl~~-~s~l~e~~~~s~~~i~vKD 78 (297)
T KOG1639|consen 24 TIDDLLKAISAKNLKITPYRIRLTLRVEPKGKPLID-NSKLQEYGDGSGATIYVKD 78 (297)
T ss_pred cHHHHHHHHHHhhhccCccchhheeeccCCCccccc-hhHHHHhccCCCCEEEEec
Confidence 688888 44455567776444443 47999998 5789999999999998763
No 133
>PTZ00165 aspartyl protease; Provisional
Probab=77.29 E-value=16 Score=37.34 Aligned_cols=28 Identities=14% Similarity=0.196 Sum_probs=25.5
Q ss_pred CceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 263 MDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 263 ~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
...|||-.||+++-.+.|+.++++-|..
T Consensus 418 ~~~ILGd~Flr~yy~VFD~~n~rIGfA~ 445 (482)
T PTZ00165 418 PLFVLGNNFIRKYYSIFDRDHMMVGLVP 445 (482)
T ss_pred ceEEEchhhheeEEEEEeCCCCEEEEEe
Confidence 3479999999999999999999999976
No 134
>cd01770 p47_UBX p47-like ubiquitin domain. p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=76.07 E-value=5.6 Score=30.26 Aligned_cols=48 Identities=21% Similarity=0.212 Sum_probs=36.1
Q ss_pred CCHHHHHHHHHhhhCC-CCCCeEEE--ecCeeccCCcccccccCCCCCcEEE
Q psy17672 1 MDVEQFRLICQDECGI-DATDMILL--NNGKHLLEDGSCLKQAGVRDGDIIL 49 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gi-p~~~q~Li--~~Gk~L~d~~~tL~~~gI~~~d~l~ 49 (319)
.||++|.+.|..+.+- ......|. |=.|.|.|++.||+++|+.. ..|.
T Consensus 25 ~tv~~l~~~v~~~~~~~~~~~f~L~t~fP~k~l~~~~~Tl~eagL~~-s~v~ 75 (79)
T cd01770 25 HRVSDVRDFIVNARPEFAARPFTLMTAFPVKELSDESLTLKEANLLN-AVIV 75 (79)
T ss_pred CcHHHHHHHHHHhCCCCCCCCEEEecCCCCcccCCCCCcHHHCCCcC-cEEE
Confidence 3799999999987543 33567776 56889987789999999985 4443
No 135
>cd01767 UBX UBX (ubiquitin regulatory X) domain. The UBX (ubiquitin regulatory X) domain has a beta-grasp fold that is structurally quite similar to ubiquitin although UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Most UBX-containing proteins including p47, FAF1, and SAKS1 (Y33K) also contain a UBA (ubiquitin-associated) domain and are thought to serve as adaptor molecules that shuttle proteins to the proteasome for degradation.
Probab=75.96 E-value=6.5 Score=29.35 Aligned_cols=48 Identities=15% Similarity=0.221 Sum_probs=35.2
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEE--ecCeeccC--CcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILL--NNGKHLLE--DGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li--~~Gk~L~d--~~~tL~~~gI~~~d~l~l~ 51 (319)
||++|.+.|.....- .....|+ |-.|.+.+ .+.||+++|+. .+++++.
T Consensus 24 tl~~l~~fv~~~~~~-~~~f~L~t~~Pr~~~~~~~~~~TL~e~gL~-~s~~~~~ 75 (77)
T cd01767 24 KLSDVRDFVESNGPP-AEPFTLMTSFPRRVLTDLDYELTLQEAGLV-NEVVFQR 75 (77)
T ss_pred CHHHHHHHHHHcCCC-CCCEEEEeCCCCccCCCCCccCcHHHcCCc-cceEEEE
Confidence 789999999876543 4566776 45677865 57899999999 5555553
No 136
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=73.22 E-value=7 Score=37.81 Aligned_cols=27 Identities=7% Similarity=0.166 Sum_probs=24.9
Q ss_pred ceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 264 DMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 264 d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
-.|||--||+.+-.+.|+.+++|-|..
T Consensus 318 ~~ILG~~flr~~yvvfD~~~~rIGfa~ 344 (364)
T cd05473 318 GTVIGAVIMEGFYVVFDRANKRVGFAV 344 (364)
T ss_pred ceEEeeeeEcceEEEEECCCCEEeeEe
Confidence 379999999999999999999999876
No 137
>PF14836 Ubiquitin_3: Ubiquitin-like domain; PDB: 3JYU_A 4A3O_B 3PPA_A 3T9L_A 4A3P_A 3PV1_A.
Probab=72.67 E-value=11 Score=29.62 Aligned_cols=53 Identities=19% Similarity=0.236 Sum_probs=36.0
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEe----cCee-ccCCcccccccCCCCCcEEEEEccC
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLN----NGKH-LLEDGSCLKQAGVRDGDIILVAMPG 54 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~----~Gk~-L~d~~~tL~~~gI~~~d~l~l~~~~ 54 (319)
.||..++.-+...+.| ...-||.- ++.. |.+.+.||+++||.+|-+|++-.+.
T Consensus 24 DTI~~v~~~~rklf~i-~~E~RLW~~~~~~~~e~L~~~~~Tv~da~L~~gQ~vliE~rn 81 (88)
T PF14836_consen 24 DTIGFVEKEMRKLFNI-QEETRLWNKYSENSYELLNNPEITVEDAGLYDGQVVLIEERN 81 (88)
T ss_dssp SBHHHHHHHHHHHCT--TS-EEEEEECTTTCEEEE--TTSBTTTTT--TTEEEEEEE--
T ss_pred ChHHHHHHHHHHHhCC-CccceehhccCCcchhhhCCCCccHHHccCcCCCEEEEEeec
Confidence 4888999999999999 66678863 2333 5666689999999999999987653
No 138
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=72.66 E-value=55 Score=30.86 Aligned_cols=36 Identities=19% Similarity=0.365 Sum_probs=29.1
Q ss_pred cceEEEEEEEcCEE--EEEEEcCCccc---------cccCHHHHHHc
Q psy17672 178 VVMLYINCKVNGYP--VKAFVDSGAQT---------TIMSAKCAERC 213 (319)
Q Consensus 178 ~~~l~v~v~Ing~~--v~aLVDTGA~~---------siIs~~~a~rl 213 (319)
.....|.+.++|+. ++|++|||-+. .+++.+.++++
T Consensus 155 ~~~~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~ 201 (293)
T PF03419_consen 155 DYLYPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL 201 (293)
T ss_pred cEEEEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence 35677889999995 69999999765 47788888887
No 139
>cd01772 SAKS1_UBX SAKS1-like UBX domain. SAKS1 (SAPK-substrate-1), also known as Y33K, is a widely expressed protein containing N-terminal UBA (ubiquitin-associated) and C-terminal UBX (ubiqiutin-like) domains that was identified as a substrate of stress-activated protein kinases (SAPKs). SAKS1 is related evolutionarily to two other UBA/UBX-containing proteins, p47 and Faf1. The UBA and UBX domains of SAKS1 bind ubiquitin tetramers and valosin-containing protein (VCP), respectively suggesting a role for SAKS1 as an adaptor that directs VCP to polyubiquitinated proteins facilitating its destruction by the proteasome. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=72.22 E-value=9.4 Score=28.87 Aligned_cols=49 Identities=18% Similarity=0.264 Sum_probs=37.6
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEE--ecCeeccC--CcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILL--NNGKHLLE--DGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li--~~Gk~L~d--~~~tL~~~gI~~~d~l~l~ 51 (319)
||++|.+.|+...+-+ ....|+ |--|.+.+ .+.||+++|+....+|+|.
T Consensus 26 tl~~v~~fV~~~~~~~-~~f~L~t~fPrk~~~~~d~~~TL~elgL~Psa~L~v~ 78 (79)
T cd01772 26 QLAAVRLFVELNTGNG-GPFTLMTPFPRKVFTEDDMEKPLQELGLVPSAVLIVT 78 (79)
T ss_pred hHHHHHHHHHHcCCCC-CCEEEEeCCCCeECCcccccCCHHHCCCCCceEEEEe
Confidence 7899999998765433 556776 56788853 3579999999999998874
No 140
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=71.12 E-value=9.4 Score=36.23 Aligned_cols=36 Identities=17% Similarity=0.327 Sum_probs=28.9
Q ss_pred ceEEEEEEEcCEE--EEEEEcCCcc---------ccccCHHHHHHcC
Q psy17672 179 VMLYINCKVNGYP--VKAFVDSGAQ---------TTIMSAKCAERCN 214 (319)
Q Consensus 179 ~~l~v~v~Ing~~--v~aLVDTGA~---------~siIs~~~a~rlg 214 (319)
...-|.+.++|+. +++|+|||-+ +.+++.+.++++-
T Consensus 157 ~~~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~l 203 (288)
T TIGR02854 157 QIYELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSIL 203 (288)
T ss_pred eEEEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhhC
Confidence 5677888999995 6999999976 4577888887773
No 141
>PF14732 UAE_UbL: Ubiquitin/SUMO-activating enzyme ubiquitin-like domain; PDB: 1Y8Q_B 1Y8R_E 3KYD_B 3KYC_B.
Probab=69.19 E-value=4.2 Score=31.61 Aligned_cols=49 Identities=20% Similarity=0.331 Sum_probs=25.6
Q ss_pred CCHHHHHHHH-HhhhCCCCCCeEEEecCeecc---------CCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLIC-QDECGIDATDMILLNNGKHLL---------EDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i-~~e~gip~~~q~Li~~Gk~L~---------d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|+.+|-..| ..+.|+...... ++|+.|- ...++|+++||++|++|.+.
T Consensus 9 ~TL~~lv~~Vlk~~Lg~~~P~v~--~~~~ilyd~de~~~~~~l~k~L~elgi~~gs~L~v~ 67 (87)
T PF14732_consen 9 MTLGDLVEKVLKKKLGMNEPDVS--VGGTILYDSDEEEYDDNLPKKLSELGIVNGSILTVD 67 (87)
T ss_dssp -BHHHHHHHCCCCCS--SSEEEE--ES-EEEE-SSSSSSTTCTTSBGGGGT--TT-EEEEE
T ss_pred CcHHHHHHHHHHhccCCCCCEEE--eCCCEEEcCCcchhhhcccCChhHcCCCCCCEEEEE
Confidence 5677776654 445565543222 1444441 12478999999999999876
No 142
>PF11620 GABP-alpha: GA-binding protein alpha chain; InterPro: IPR024668 GA-binding protein alpha is a transcription factor capable of interacting with purine rich repeats (GA repeats). This N-terminal domain found in the transcription factor GABP alpha consists of a five-stranded beta-sheet crossed by a distorted helix and has been termed OST domain. The surface of the GABP alpha OST domain contains two clusters of negatively-charged residues suggesting there are positively-charged partner proteins. The OST domain binds to the CH1 and CH3 domains of the co-activator histone acetyltransferase CBP/p300 [].; PDB: 2JUO_A.
Probab=67.56 E-value=11 Score=29.48 Aligned_cols=50 Identities=12% Similarity=0.196 Sum_probs=34.7
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.+.+||.+++.+.++.-+...++.-...|.. +++|-+-||+-.-+|-+..
T Consensus 14 pl~~Lk~lLe~Rl~~~L~~~~f~LQD~~L~~-~k~L~dQcVqgeGlVQlnv 63 (88)
T PF11620_consen 14 PLSTLKKLLERRLGISLSDYEFWLQDIQLEP-HKSLVDQCVQGEGLVQLNV 63 (88)
T ss_dssp BGGGHHHHSHHHH-S--SS-EEEETTEE--T-TSBTTTSS----SEEEEEE
T ss_pred cHHHHHHHHHHhhCCCcCCCeEEeccceecC-CccHHHhhccccCEEEEEE
Confidence 4678999999999999999999888888876 6899999999888887763
No 143
>PF11834 DUF3354: Domain of unknown function (DUF3354); InterPro: IPR021789 Potassium channels take part in important processes of higher plants, including opening and closing of stomatal pores and leaf movement. Inward rectifying potassium (K(+)in) channels play an important role in turgor regulation and ion uptake in higher plants. All of them comprise, from their N-terminal to their C-terminal ends: a short hydrophilic region, a hydrophobic region structurally analogous and partially homologous to the transmembrane domain of voltage-gated animal channels from the Shaker superfamily, a putative cyclic nucleotide-binding domain, and a conserved C-terminal KHA domain. Between these last two regions, some of them (AKT1, AKT2 and SKT1) contain an ankyrin-repeat domain with six repeats homologous to those of human erythrocyte ankyrin. This entry represents the KHA domain which is unique to plant K(+)in channels. The KHA domain contains two high-homology blocks enriched for hydrophobic and acidic residues, respectively. The KHA domain is essential for interaction of plant K(+)in channels. The KHA domain mediates tetramerization and/or stabilisation of the heteromers [, , ].
Probab=67.36 E-value=8 Score=28.90 Aligned_cols=43 Identities=21% Similarity=0.451 Sum_probs=33.3
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
|+++|...-+.++|++ ..-.+.-+|-.+.| =.. |.|||.|+++
T Consensus 27 SleeLl~ia~~kfg~~-~~~v~~~dgaeIdD-I~~-----IRDgD~L~~~ 69 (69)
T PF11834_consen 27 SLEELLKIASEKFGFS-ATKVLNEDGAEIDD-IDV-----IRDGDHLYLV 69 (69)
T ss_pred cHHHHHHHHHHHhCCC-ceEEEcCCCCEEeE-EEE-----EEcCCEEEEC
Confidence 7899999999999997 55556667777765 222 7899999874
No 144
>PRK06437 hypothetical protein; Provisional
Probab=67.19 E-value=15 Score=26.87 Aligned_cols=34 Identities=26% Similarity=0.552 Sum_probs=27.2
Q ss_pred hhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 13 ECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 13 e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+.|+++....+..||+++.. ++-+++||.|-+..
T Consensus 29 ~Lgi~~~~vaV~vNg~iv~~------~~~L~dgD~Veiv~ 62 (67)
T PRK06437 29 DLGLDEEEYVVIVNGSPVLE------DHNVKKEDDVLILE 62 (67)
T ss_pred HcCCCCccEEEEECCEECCC------ceEcCCCCEEEEEe
Confidence 36899888888999999973 23578999998874
No 145
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=66.86 E-value=19 Score=30.55 Aligned_cols=29 Identities=10% Similarity=0.072 Sum_probs=24.8
Q ss_pred CCCceeechhhhhhccEEEEccCCEEEEc
Q psy17672 261 QPMDMLLGLDMLRRHECCIDLRKNVLRIG 289 (319)
Q Consensus 261 ~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 289 (319)
...-.|||.-.++.+.+..|+.++++.|.
T Consensus 132 ~~~~~viG~~~~~~~~v~fDl~~~~igF~ 160 (161)
T PF14541_consen 132 DDGVSVIGNFQQQNYHVVFDLENGRIGFA 160 (161)
T ss_dssp TSSSEEE-HHHCCTEEEEEETTTTEEEEE
T ss_pred CCCcEEECHHHhcCcEEEEECCCCEEEEe
Confidence 34669999999999999999999999875
No 146
>COG5227 SMT3 Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones]
Probab=65.98 E-value=8.7 Score=30.29 Aligned_cols=47 Identities=15% Similarity=0.145 Sum_probs=38.2
Q ss_pred HHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 5 QFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 5 ~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.|........|-..++.+++|+|+.+.- .+|=++++..++|.|-++.
T Consensus 49 klm~af~~rqGK~m~slRfL~dG~rI~~-dqTP~dldmEdnd~iEav~ 95 (103)
T COG5227 49 KLMDAFSRRQGKNMSSLRFLFDGKRIDL-DQTPGDLDMEDNDEIEAVT 95 (103)
T ss_pred HHHHHHHHHhCcCcceeEEEEcceecCC-CCChhhcCCccchHHHHHH
Confidence 3444445667999999999999999975 5899999999999887654
No 147
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=65.89 E-value=30 Score=34.98 Aligned_cols=28 Identities=18% Similarity=0.281 Sum_probs=24.7
Q ss_pred CceeechhhhhhccEEEEccCCEEEEcc
Q psy17672 263 MDMLLGLDMLRRHECCIDLRKNVLRIGT 290 (319)
Q Consensus 263 ~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 290 (319)
...|||-.||+.+=.+.|+.++++-|..
T Consensus 420 ~~~ILGd~FLr~~Y~VFD~~n~rIGfA~ 447 (450)
T PTZ00013 420 NTFILGDPFMRKYFTVFDYDKESVGFAI 447 (450)
T ss_pred CCEEECHHHhccEEEEEECCCCEEEEEE
Confidence 3579999999999999999999988764
No 148
>PF14453 ThiS-like: ThiS-like ubiquitin
Probab=64.56 E-value=15 Score=26.46 Aligned_cols=40 Identities=20% Similarity=0.284 Sum_probs=27.4
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
|+.+||..+.. +.-.+||||=+.++| +-+++||.|++..+
T Consensus 17 tl~~lr~~~k~------~~DI~I~NGF~~~~d------~~L~e~D~v~~Ikk 56 (57)
T PF14453_consen 17 TLFELRKESKP------DADIVILNGFPTKED------IELKEGDEVFLIKK 56 (57)
T ss_pred CHHHHHHhhCC------CCCEEEEcCcccCCc------cccCCCCEEEEEeC
Confidence 45666665442 333789999988763 34689999988754
No 149
>cd01771 Faf1_UBX Faf1 UBX domain. Faf1 (fas-associated factor1) is a nucleolar protein that was first identified as an interaction partner of the death receptor Fas. Faf1 contains N-terminal UAS (ubiquitin-associated) and C-terminal UBX (ubiquitin-like) domains and is closely related to other UBA/UBX-containing proteins like p47, Rep8 and SAKS1. Faf1 is thought to be involved in 18S rRNA synthesis and/or 40S ribosomal subunit assembly. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=63.74 E-value=17 Score=27.66 Aligned_cols=49 Identities=10% Similarity=0.151 Sum_probs=38.5
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEE--ecCeecc--CCcccccccCCCCCcEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDMILL--NNGKHLL--EDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li--~~Gk~L~--d~~~tL~~~gI~~~d~l~l~ 51 (319)
++++|...|..+ |.+....+|+ |==|.+. |.+.||+++|+....+|+|.
T Consensus 26 ~L~~l~~fv~~~-~~~~~~f~L~t~fPRk~~~~~d~~~TL~e~gL~p~~~L~Ve 78 (80)
T cd01771 26 PLQVLLNFVASK-GYPIDEYKLLSSWPRRDLTQLDPNFTLLELKLYPQETLILE 78 (80)
T ss_pred cHHHHHHHHHhc-CCCCCCEEEecCCCCCCCcCCCCCCcHHHcCCCCCcEEEEE
Confidence 678888888765 7777888886 4467774 44579999999999999874
No 150
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=63.58 E-value=16 Score=26.30 Aligned_cols=45 Identities=20% Similarity=0.271 Sum_probs=32.5
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|.+. .+++.+...+..||+.+.. ..-++.-+++||.|.+..
T Consensus 16 tv~~ll~~----l~~~~~~i~V~vNg~~v~~--~~~~~~~L~~gD~V~ii~ 60 (65)
T cd00565 16 TLAELLEE----LGLDPRGVAVALNGEIVPR--SEWASTPLQDGDRIEIVT 60 (65)
T ss_pred CHHHHHHH----cCCCCCcEEEEECCEEcCH--HHcCceecCCCCEEEEEE
Confidence 55655554 4578888888899999864 234455699999998864
No 151
>cd01773 Faf1_like1_UBX Faf1 ike-1 UBX domain. Faf1_like1 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=61.24 E-value=21 Score=27.53 Aligned_cols=50 Identities=18% Similarity=0.183 Sum_probs=39.0
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEE--ecCeecc--CCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILL--NNGKHLL--EDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li--~~Gk~L~--d~~~tL~~~gI~~~d~l~l~~ 52 (319)
++++|-..++. .|.+++...|+ |==|.+. |.+.||+++|+..-.+|+|.-
T Consensus 27 ~L~~v~~fv~~-~g~~~~~f~L~t~FPRr~~~~~d~~~TL~e~GL~P~~~LfVq~ 80 (82)
T cd01773 27 KLLALVRHVQS-KGYPNERFELLTNFPRRKLSHLDYDITLQEAGLCPQETVFVQE 80 (82)
T ss_pred cHHHHHHHHHh-cCCCCCCEEEecCCCCcccCCcccCCCHHHcCCCCCcEEEEec
Confidence 57777777877 57888999997 4466773 335799999999999999853
No 152
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=60.81 E-value=23 Score=25.44 Aligned_cols=45 Identities=18% Similarity=0.251 Sum_probs=32.0
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|... .++++....+..||+.+.. ..-.++-+++||.|.+..
T Consensus 15 tv~~ll~~----l~~~~~~v~v~vN~~iv~~--~~~~~~~L~~gD~veii~ 59 (64)
T TIGR01683 15 TLAALLES----LGLDPRRVAVAVNGEIVPR--SEWDDTILKEGDRIEIVT 59 (64)
T ss_pred cHHHHHHH----cCCCCCeEEEEECCEEcCH--HHcCceecCCCCEEEEEE
Confidence 55555554 4677777777899999853 235556799999998764
No 153
>KOG1339|consensus
Probab=60.39 E-value=42 Score=33.02 Aligned_cols=36 Identities=14% Similarity=0.221 Sum_probs=27.7
Q ss_pred ceEEEEEEEcC--EEEEEEEcCCccccccCHHHHH-HcC
Q psy17672 179 VMLYINCKVNG--YPVKAFVDSGAQTTIMSAKCAE-RCN 214 (319)
Q Consensus 179 ~~l~v~v~Ing--~~v~aLVDTGA~~siIs~~~a~-rlg 214 (319)
.-+|+++.|+- +++.+++|||++...+.-..+. .|.
T Consensus 45 ~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~ 83 (398)
T KOG1339|consen 45 GEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACY 83 (398)
T ss_pred cccEEEEecCCCCeeeEEEEeCCCCceeecccccccccc
Confidence 35788999954 4589999999999999775554 453
No 154
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=59.81 E-value=54 Score=28.57 Aligned_cols=27 Identities=41% Similarity=0.543 Sum_probs=14.5
Q ss_pred HHHhhcCCCCHHHHHHHHHHHHHHHHH
Q psy17672 135 IKLMNAHPFDTHAQRLIAEEIRQKNVE 161 (319)
Q Consensus 135 ~~~l~~dP~~~e~q~~i~~~i~~~~~~ 161 (319)
..++.+||||+++=+-..+.+.+..++
T Consensus 88 ~~Li~Ad~FDeaAvra~~~kma~~~~e 114 (162)
T PRK12751 88 HKLITADKFDEAAVRAQAEKMSQNQIE 114 (162)
T ss_pred HHHHhcCCCCHHHHHHHHHHHHHHHHH
Confidence 445556666666555555544444443
No 155
>PF12754 Blt1: Cell-cycle control medial ring component; InterPro: IPR024737 During size-dependent cell cycle transitions controlled by the ubiquitous cyclin-dependent kinase Cdk1, Blt1 has been shown to co-localise with Cdr2 in the medial interphase nodes, as well as with Mid1 which was previously shown to localise to similar interphase structures. Physical interactions between Blt1-Mid1, Blt1-Cdr2 and Cdr2-Mid1 were detected, indicating that medial cortical nodes are formed by the ordered, Cdr2-dependent assembly of multiple interacting proteins during interphase[].; PDB: 2LO0_A.
Probab=59.00 E-value=3.1 Score=39.79 Aligned_cols=41 Identities=17% Similarity=0.193 Sum_probs=0.0
Q ss_pred CHHHHHHHHHh----------hhCCCCCCeE-----EEecCeeccCCcccccccCCC
Q psy17672 2 DVEQFRLICQD----------ECGIDATDMI-----LLNNGKHLLEDGSCLKQAGVR 43 (319)
Q Consensus 2 tv~~lK~~i~~----------e~gip~~~q~-----Li~~Gk~L~d~~~tL~~~gI~ 43 (319)
||.++|..++. +.++|.+..+ |+|+-|++.| .+||.+..-.
T Consensus 105 Sv~dlk~~v~~rv~~~~~~~~~~~vp~dKik~~~~~lL~~kkPv~~-~ktl~e~l~~ 160 (309)
T PF12754_consen 105 SVQDLKDAVQQRVHPSQATYDETRVPLDKIKNFRCRLLYKKKPVGD-SKTLAEVLAD 160 (309)
T ss_dssp ---------------------------------------------------------
T ss_pred cHHHHHHHHHhhhcccccccccccCCHHHhhhhhhhheecCccCCC-cCcHHHHHhc
Confidence 68899999999 8999999999 9999999976 6888876433
No 156
>KOG4583|consensus
Probab=58.52 E-value=3 Score=40.28 Aligned_cols=50 Identities=18% Similarity=0.232 Sum_probs=37.5
Q ss_pred CHHHHHHHHHhhh-CCCC-CCeEEEecCeeccCCcccccccCCC--CCcEEEEEc
Q psy17672 2 DVEQFRLICQDEC-GIDA-TDMILLNNGKHLLEDGSCLKQAGVR--DGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~-gip~-~~q~Li~~Gk~L~d~~~tL~~~gI~--~~d~l~l~~ 52 (319)
||++||..++.-. +-|. ..|+|||.||.|.| ...|++.=+| ..+++||+-
T Consensus 33 tv~~Lk~hls~VyPskpl~~dqrliYsgkllld-~qcl~d~lrkq~k~Hv~hlvc 86 (391)
T KOG4583|consen 33 TVGDLKVHLSQVYPSKPLELDQRLIYSGKLLLD-HQCLTDWLRKQVKEHVKHLVC 86 (391)
T ss_pred hHHHHhhhHhhcCCCCCchhhHHHHhhcccccc-chhHHHHHHHHHHHHHHHHhc
Confidence 7899999998765 4443 57999999999987 6788877554 345667764
No 157
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=58.03 E-value=26 Score=25.75 Aligned_cols=46 Identities=13% Similarity=0.087 Sum_probs=32.6
Q ss_pred CCHHHHHHHHHhhhCC----CCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGI----DATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gi----p~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||.+|.+.+....+- ......+..||+.... +.-|++||.|.+..
T Consensus 26 ~tv~~ll~~l~~~~~~~~~~~~~~~~v~vNg~~v~~------~~~l~~gD~v~i~p 75 (80)
T cd00754 26 ATVGELLDALEARYPGLLEELLARVRIAVNGEYVRL------DTPLKDGDEVAIIP 75 (80)
T ss_pred CcHHHHHHHHHHHCchHHHhhhhcEEEEECCeEcCC------CcccCCCCEEEEeC
Confidence 3788888888877532 2344566679998863 23489999999874
No 158
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=55.28 E-value=32 Score=24.72 Aligned_cols=45 Identities=13% Similarity=0.088 Sum_probs=31.9
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|... .+++.....+-.||+.+.. ....+.-+++||.|.+..
T Consensus 16 tl~~Ll~~----l~~~~~~vavavN~~iv~~--~~~~~~~L~dgD~Ieiv~ 60 (65)
T PRK06488 16 TLALLLAE----LDYEGNWLATAVNGELVHK--EARAQFVLHEGDRIEILS 60 (65)
T ss_pred cHHHHHHH----cCCCCCeEEEEECCEEcCH--HHcCccccCCCCEEEEEE
Confidence 45555544 3677665667799999863 356677799999998874
No 159
>PF11925 DUF3443: Protein of unknown function (DUF3443); InterPro: IPR021847 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG.
Probab=54.79 E-value=37 Score=33.45 Aligned_cols=93 Identities=12% Similarity=0.176 Sum_probs=0.0
Q ss_pred cceEEEEEEE------cCEEE-EEEEcCCccccccCHHHH-HHc--CCcccc-ccceeeEeeccceeEEEeEEEEEEEEE
Q psy17672 178 VVMLYINCKV------NGYPV-KAFVDSGAQTTIMSAKCA-ERC--NIMRLI-DTRWAGVAKGVGVQQIIGRIHMVQVAI 246 (319)
Q Consensus 178 ~~~l~v~v~I------ng~~v-~aLVDTGA~~siIs~~~a-~rl--gl~~~~-~~~~~g~a~Gvg~~~~~g~v~~~~i~I 246 (319)
..++||.|+| +=+.| .+|||||+..-=|-.+.. .-+ .|+... .......=..-+....-|.|+.+.|+|
T Consensus 21 ~N~p~VsVtVC~PGts~CqTIdnvlVDTGS~GLRi~~sAl~~~l~~~Lp~~t~~g~~laEC~~F~sgytWGsVr~AdV~i 100 (370)
T PF11925_consen 21 INIPTVSVTVCAPGTSNCQTIDNVLVDTGSYGLRIFASALPSSLAGSLPQQTGGGAPLAECAQFASGYTWGSVRTADVTI 100 (370)
T ss_pred ccceeeEEEEeCCCCCCceeeCcEEEeccchhhhHHHhhhchhhhccCCcccCCCcchhhhhhccCcccccceEEEEEEE
Q ss_pred cCEE-EEeeEEEecCC----------------------CCceeechh
Q psy17672 247 EKDF-LTTSLSILEEQ----------------------PMDMLLGLD 270 (319)
Q Consensus 247 G~~~-~~~~f~Vl~~~----------------------~~d~ILG~D 270 (319)
|++. -.+.+-|+.+. +.++|||+.
T Consensus 101 gge~A~~iPiQvI~D~~~~~~P~sC~~~g~~~~t~~~lgaNGILGIg 147 (370)
T PF11925_consen 101 GGETASSIPIQVIGDSAAPSVPSSCSNSGASMNTVADLGANGILGIG 147 (370)
T ss_pred cCeeccccCEEEEcCCCCCCCCchhhcCCCCCCCcccccCceEEeec
No 160
>TIGR02958 sec_mycoba_snm4 secretion protein snm4. Members of this family are the 12-transmembrane domain protein snm4, where snm stands for secretion in mycocbacteria. This system acts on Mycobacterium tuberculosis related pair of virulence factors ESAT-6 and CFP-10 and on other homologs. The system is conserved in many Actinobacteria, including the non-pathogenic Mycobacterium smegmatis.
Probab=54.76 E-value=11 Score=38.10 Aligned_cols=33 Identities=30% Similarity=0.514 Sum_probs=27.2
Q ss_pred CeEEE-ecCeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 20 DMILL-NNGKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 20 ~q~Li-~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
.-+|. -+|.+|+- +.||.+.||.|||+|+|+..
T Consensus 47 ~w~L~r~gG~pL~~-~~sL~~~gV~DG~~L~L~p~ 80 (452)
T TIGR02958 47 RWALARAGGSPLDP-DASLAEAGVRDGELLVLVPA 80 (452)
T ss_pred ceEEecCCCCCCCC-CCCHHHcCCCCCCeEEEeeC
Confidence 34554 57889986 69999999999999999964
No 161
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=53.65 E-value=27 Score=25.23 Aligned_cols=43 Identities=16% Similarity=0.162 Sum_probs=32.2
Q ss_pred HHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 6 FRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 6 lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+..+++. .+++.....+.+||.++.. ...+.+ +++||.|-+..
T Consensus 18 l~~ll~~-l~~~~~~vav~~N~~iv~r--~~~~~~-L~~gD~ieIv~ 60 (65)
T PRK05863 18 VAALLDS-LGFPEKGIAVAVDWSVLPR--SDWATK-LRDGARLEVVT 60 (65)
T ss_pred HHHHHHH-cCCCCCcEEEEECCcCcCh--hHhhhh-cCCCCEEEEEe
Confidence 4444443 6899999999999999864 235556 99999998874
No 162
>PF14327 CSTF2_hinge: Hinge domain of cleavage stimulation factor subunit 2; PDB: 4EBA_G.
Probab=53.37 E-value=16 Score=28.21 Aligned_cols=27 Identities=37% Similarity=0.522 Sum_probs=18.4
Q ss_pred HHHHHHhhCHHHHHHHHhcCchhHHHH
Q psy17672 83 HVRDLLLACPDQLALLKQNNPRLSEAL 109 (319)
Q Consensus 83 ~~r~~il~nP~~l~~L~~~nP~La~ai 109 (319)
.++.+...||+...+|-..||+|+-|+
T Consensus 37 ~mK~l~~~~p~~ar~lL~~nPqLa~Al 63 (84)
T PF14327_consen 37 QMKQLAQQNPEQARQLLQQNPQLAYAL 63 (84)
T ss_dssp HHHHHHC----HHHHHHHS-THHHHHH
T ss_pred HHHHHHHhCHHHHHHHHHHCcHHHHHH
Confidence 466778899998888889999999888
No 163
>smart00727 STI1 Heat shock chaperonin-binding motif.
Probab=50.59 E-value=25 Score=22.80 Aligned_cols=8 Identities=38% Similarity=0.530 Sum_probs=4.6
Q ss_pred CHHHHHHH
Q psy17672 91 CPDQLALL 98 (319)
Q Consensus 91 nP~~l~~L 98 (319)
||.+++.+
T Consensus 10 ~P~~~~~l 17 (41)
T smart00727 10 NPQVQSLL 17 (41)
T ss_pred CHHHHHHH
Confidence 66665554
No 164
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=50.11 E-value=17 Score=31.10 Aligned_cols=25 Identities=24% Similarity=0.514 Sum_probs=20.1
Q ss_pred EEEEEEEcC--EEEEEEEcCCcccccc
Q psy17672 181 LYINCKVNG--YPVKAFVDSGAQTTIM 205 (319)
Q Consensus 181 l~v~v~Ing--~~v~aLVDTGA~~siI 205 (319)
+++++.|+- +++.++||||++.+.+
T Consensus 1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~ 27 (164)
T PF14543_consen 1 YYVSVSIGTPPQPFSLVVDTGSDLTWV 27 (164)
T ss_dssp EEEEEECTCTTEEEEEEEETT-SSEEE
T ss_pred CEEEEEeCCCCceEEEEEECCCCceEE
Confidence 467888876 6899999999988886
No 165
>PRK07440 hypothetical protein; Provisional
Probab=49.10 E-value=43 Score=24.74 Aligned_cols=38 Identities=21% Similarity=0.302 Sum_probs=29.4
Q ss_pred hhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 13 ECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 13 e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+.++++....+-+||.++.- ..-.+.-+++||.|-+..
T Consensus 28 ~l~~~~~~vav~~N~~iv~r--~~w~~~~L~~gD~IEIv~ 65 (70)
T PRK07440 28 QLGFNPRLVAVEYNGEILHR--QFWEQTQVQPGDRLEIVT 65 (70)
T ss_pred HcCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEE
Confidence 45788888888899999863 356667789999988764
No 166
>PF02153 PDH: Prephenate dehydrogenase; InterPro: IPR003099 Members of this family are prephenate dehydrogenases 1.3.1.12 from EC involved in tyrosine biosynthesis. ; GO: 0004665 prephenate dehydrogenase (NADP+) activity, 0008977 prephenate dehydrogenase activity, 0006571 tyrosine biosynthetic process, 0055114 oxidation-reduction process; PDB: 2F1K_B 2PV7_A 3DZB_B 3KTD_B 3B1F_A 2G5C_D 3GGP_C 3GGG_C 3GGO_D.
Probab=49.03 E-value=29 Score=32.03 Aligned_cols=30 Identities=40% Similarity=0.484 Sum_probs=16.9
Q ss_pred hhCHHHHHHHHhcCch-hHHHHhcChHHHHHHHHH
Q psy17672 89 LACPDQLALLKQNNPR-LSEALSTGKDAFTKVFKE 122 (319)
Q Consensus 89 l~nP~~l~~L~~~nP~-La~ai~~~~~~f~~~l~~ 122 (319)
.+||.+-..+...||+ +.+++ ++|.+.|..
T Consensus 209 ~~~p~l~~~I~~~N~~~~~~~l----~~~~~~L~~ 239 (258)
T PF02153_consen 209 SSDPELWADIFLSNPENLLEAL----DEFIKELNE 239 (258)
T ss_dssp GS-HHHHHHHHHHTHHHHHHHH----HHHHHHHHH
T ss_pred cCChHHHHHHHHHCHHHHHHHH----HHHHHHHHH
Confidence 3666666666666665 66666 355555544
No 167
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=47.90 E-value=1.9e+02 Score=25.26 Aligned_cols=28 Identities=29% Similarity=0.348 Sum_probs=16.9
Q ss_pred HHHHHhhcCCCCHHHHHHHHHHHHHHHH
Q psy17672 133 QKIKLMNAHPFDTHAQRLIAEEIRQKNV 160 (319)
Q Consensus 133 ~~~~~l~~dP~~~e~q~~i~~~i~~~~~ 160 (319)
+...++.++|||+++=+.+.+.+.+..+
T Consensus 93 ~~~~Ll~a~~FDeaavral~~~~~~~~~ 120 (170)
T PRK12750 93 KVQALVLADDFDEAAANDLAKQMVEKQV 120 (170)
T ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHH
Confidence 3456777888888865555444444333
No 168
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=47.41 E-value=1e+02 Score=27.00 Aligned_cols=21 Identities=19% Similarity=0.252 Sum_probs=9.8
Q ss_pred CCCCHHHHHHHHHHHHHHHHHH
Q psy17672 141 HPFDTHAQRLIAEEIRQKNVEA 162 (319)
Q Consensus 141 dP~~~e~q~~i~~~i~~~~~~~ 162 (319)
+-..||-++. .+..+++++++
T Consensus 122 ~lLTPEQKaq-~~~~~~~rm~~ 142 (166)
T PRK10363 122 RLLTPEQQAV-LNEKHQQRMEQ 142 (166)
T ss_pred HhCCHHHHHH-HHHHHHHHHHH
Confidence 3355654433 34445555544
No 169
>PF10790 DUF2604: Protein of Unknown function (DUF2604); InterPro: IPR019726 This entry represents bacterial proteins with undetermined function.
Probab=47.32 E-value=18 Score=26.80 Aligned_cols=40 Identities=15% Similarity=0.257 Sum_probs=32.9
Q ss_pred hCCCCCCeEEE-ecCeeccCCcccccccCCCCCcEEEEEccC
Q psy17672 14 CGIDATDMILL-NNGKHLLEDGSCLKQAGVRDGDIILVAMPG 54 (319)
Q Consensus 14 ~gip~~~q~Li-~~Gk~L~d~~~tL~~~gI~~~d~l~l~~~~ 54 (319)
.|-|++.-.|- -+|..|+- ++.+++||+..|-+|+|..+.
T Consensus 32 vgQP~ENWElkDe~G~vlD~-~kKveD~GftngvkLFLsLKA 72 (76)
T PF10790_consen 32 VGQPPENWELKDESGQVLDV-NKKVEDFGFTNGVKLFLSLKA 72 (76)
T ss_pred cCCCcccceeeccCCcEeec-cchhhhccccccceEEEEeec
Confidence 46788888876 57888865 689999999999999998763
No 170
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=47.28 E-value=49 Score=24.11 Aligned_cols=44 Identities=18% Similarity=0.182 Sum_probs=32.8
Q ss_pred HHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 6 FRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 6 lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
+.++++ ..++++..-.+-+||.++.- ..-+++-+++||.|-+..
T Consensus 19 v~~lL~-~l~~~~~~vav~vN~~iv~r--~~w~~~~L~~gD~iEIv~ 62 (67)
T PRK07696 19 VAELLT-HLELDNKIVVVERNKDILQK--DDHTDTSVFDGDQIEIVT 62 (67)
T ss_pred HHHHHH-HcCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEE
Confidence 344443 46788887788899999964 357777799999998864
No 171
>PRK07417 arogenate dehydrogenase; Reviewed
Probab=46.86 E-value=49 Score=30.77 Aligned_cols=14 Identities=7% Similarity=-0.175 Sum_probs=7.0
Q ss_pred CChHHHHHHHhhCH
Q psy17672 79 QDPAHVRDLLLACP 92 (319)
Q Consensus 79 ~dpe~~r~~il~nP 92 (319)
.||++-+++++.|+
T Consensus 221 ~~p~~w~~i~~~N~ 234 (279)
T PRK07417 221 GNPELGVMMAEYNR 234 (279)
T ss_pred CChHHHHHHHHHhH
Confidence 34555555555554
No 172
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=46.49 E-value=1.4e+02 Score=23.13 Aligned_cols=43 Identities=26% Similarity=0.314 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHH
Q psy17672 115 AFTKVFKEQLVEREKRQKQKIKLMNAHPFDTHAQRLIAEEIRQ 157 (319)
Q Consensus 115 ~f~~~l~~~~~~~~~~~~~~~~~l~~dP~~~e~q~~i~~~i~~ 157 (319)
+|..-++....+.+..+.+...++.++|+|++.=+.+.+.|..
T Consensus 56 ~~~~~~~~~r~~~~~~r~~l~~ll~~~~~D~~~i~a~~~~~~~ 98 (125)
T PF13801_consen 56 EFRQEMRALRQELRAARQELRALLAAPPPDEAAIEALLEEIRE 98 (125)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHCCSSS-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHH
Confidence 4444444433333444456777888999999966655555554
No 173
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=45.23 E-value=57 Score=23.26 Aligned_cols=38 Identities=18% Similarity=0.335 Sum_probs=28.3
Q ss_pred hhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 13 ECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 13 e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
..+++...-.+.+||.++.- ..-.+.-+++||.|-+..
T Consensus 24 ~l~~~~~~vav~vNg~iv~r--~~~~~~~l~~gD~vei~~ 61 (66)
T PRK05659 24 REGLAGRRVAVEVNGEIVPR--SQHASTALREGDVVEIVH 61 (66)
T ss_pred hcCCCCCeEEEEECCeEeCH--HHcCcccCCCCCEEEEEE
Confidence 36888888888899988863 234445589999998764
No 174
>PF11069 DUF2870: Protein of unknown function (DUF2870); InterPro: IPR021298 This is a eukaryotic family of proteins with unknown function.
Probab=43.91 E-value=74 Score=25.44 Aligned_cols=18 Identities=33% Similarity=0.320 Sum_probs=16.0
Q ss_pred EEEecCeeccCCccccccc
Q psy17672 22 ILLNNGKHLLEDGSCLKQA 40 (319)
Q Consensus 22 ~Li~~Gk~L~d~~~tL~~~ 40 (319)
+|.|.||.|.. .++|++|
T Consensus 3 ~LW~aGK~l~~-~k~l~dy 20 (98)
T PF11069_consen 3 QLWWAGKELQR-GKKLSDY 20 (98)
T ss_pred eEEeccccccC-CCcHHHh
Confidence 68999999976 6899999
No 175
>PHA00431 internal virion protein C
Probab=43.53 E-value=1.1e+02 Score=32.40 Aligned_cols=82 Identities=17% Similarity=0.273 Sum_probs=48.7
Q ss_pred ChHHHHHHHh-------hCHHHHHHHHhc---------CchhHHHHhcChHHHHHHHHHHHHHHHH-HHHHHHHHhhcCC
Q psy17672 80 DPAHVRDLLL-------ACPDQLALLKQN---------NPRLSEALSTGKDAFTKVFKEQLVEREK-RQKQKIKLMNAHP 142 (319)
Q Consensus 80 dpe~~r~~il-------~nP~~l~~L~~~---------nP~La~ai~~~~~~f~~~l~~~~~~~~~-~~~~~~~~l~~dP 142 (319)
-||..|+.+. -||-.|..|+.. +-+++..|.++.=+=|+-|.+.+.+|-+ .-..+-+...-||
T Consensus 81 TPEQrReAi~nGTLLYQDDPYAMeALr~KtGRnaAy~VDdev~~Ki~nGeFrtRqemEEyRhsrL~e~aksyAE~~GInp 160 (746)
T PHA00431 81 TPEQRREAIKNGTLLYQDDPYAMEALRFKTGRNAAYLVDDEVAQKIKNGEFRTRQEMEEYRHSRLQESAKSYAEQFGINP 160 (746)
T ss_pred CHHHHHHHHhcCceeecCCHHHHHHHHHHhccchhhhccHHHHHHHhccCcccHHHHHHHHHHHHHHHHHHHHHHhCCCC
Confidence 5677777643 788888877543 4556666643321112233333222211 1122333345799
Q ss_pred CCHHHHHHHHHHHHHHHHH
Q psy17672 143 FDTHAQRLIAEEIRQKNVE 161 (319)
Q Consensus 143 ~~~e~q~~i~~~i~~~~~~ 161 (319)
.|++.|+-.-.-|-++||-
T Consensus 161 ~D~~fq~GFnsdIteRNia 179 (746)
T PHA00431 161 TDEEYQRGFNSDITERNIA 179 (746)
T ss_pred CCHHHHhhcccchHhhhHH
Confidence 9999999999999998884
No 176
>PF09269 DUF1967: Domain of unknown function (DUF1967); InterPro: IPR015349 The Obg family comprises a group of ancient P-loop small G proteins (GTPases) belonging to the TRAFAC (for translation factors) class and can be subdivided into several distinct protein subfamilies []. OBG GTPases have been found in both prokaryotes and eukaryotes []. The structure of the OBG GTPase from Thermus thermophilus has been determined []. This entry represents a C-terminal domain found in certain OBG GTPases. This domain contains a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. It is predominantly found in the bacterial GTP-binding protein Obg, and is functionally uncharacterised. ; GO: 0000166 nucleotide binding; PDB: 1UDX_A.
Probab=43.43 E-value=11 Score=27.92 Aligned_cols=16 Identities=31% Similarity=0.792 Sum_probs=11.3
Q ss_pred ccccccCCCCCcEEEE
Q psy17672 35 SCLKQAGVRDGDIILV 50 (319)
Q Consensus 35 ~tL~~~gI~~~d~l~l 50 (319)
+.|...|+++||+|.+
T Consensus 47 ~~L~~~G~~~GD~V~I 62 (69)
T PF09269_consen 47 KALRKAGAKEGDTVRI 62 (69)
T ss_dssp HHHHTTT--TT-EEEE
T ss_pred HHHHHcCCCCCCEEEE
Confidence 4799999999999986
No 177
>TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence.
Probab=42.16 E-value=12 Score=27.71 Aligned_cols=16 Identities=38% Similarity=0.841 Sum_probs=14.4
Q ss_pred ccccccCCCCCcEEEE
Q psy17672 35 SCLKQAGVRDGDIILV 50 (319)
Q Consensus 35 ~tL~~~gI~~~d~l~l 50 (319)
+.|...|+++||+|.+
T Consensus 47 ~~L~~~G~~~GD~V~I 62 (69)
T TIGR03595 47 DALRKAGAKDGDTVRI 62 (69)
T ss_pred HHHHHcCCCCCCEEEE
Confidence 5799999999999986
No 178
>COG0287 TyrA Prephenate dehydrogenase [Amino acid transport and metabolism]
Probab=41.44 E-value=54 Score=30.97 Aligned_cols=22 Identities=27% Similarity=0.307 Sum_probs=11.6
Q ss_pred HhhCHHHHHHHHhcCch-hHHHH
Q psy17672 88 LLACPDQLALLKQNNPR-LSEAL 109 (319)
Q Consensus 88 il~nP~~l~~L~~~nP~-La~ai 109 (319)
..+||.|-..+...||+ +.+++
T Consensus 221 a~~~P~m~~dI~~~N~~~~l~~i 243 (279)
T COG0287 221 ASSDPEMYADIQLSNKEALLEAI 243 (279)
T ss_pred HcCChHHHHHHHHhCcHHHHHHH
Confidence 33555555555555555 44455
No 179
>PRK14806 bifunctional cyclohexadienyl dehydrogenase/ 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional
Probab=41.24 E-value=35 Score=36.33 Aligned_cols=47 Identities=26% Similarity=0.421 Sum_probs=36.1
Q ss_pred CCCChHHHHHHHhhCHHH----HHHHHhcCchhHHHHhcCh-HHHHHHHHHH
Q psy17672 77 QLQDPAHVRDLLLACPDQ----LALLKQNNPRLSEALSTGK-DAFTKVFKEQ 123 (319)
Q Consensus 77 ~~~dpe~~r~~il~nP~~----l~~L~~~nP~La~ai~~~~-~~f~~~l~~~ 123 (319)
...+|++.++++.+|+.. ++.+.+.--+|++++.+++ ++|.+.|.+-
T Consensus 226 a~~~p~~~~di~~~n~~~~~~~l~~~~~~l~~~~~~l~~~d~~~~~~~~~~~ 277 (735)
T PRK14806 226 AASDPVMWHDIFLANKEAVLRALDHFRDDLDALRAAIEAGDGHALLGVFTRA 277 (735)
T ss_pred ccCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 467999999999999753 3455555578899998777 8888888763
No 180
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=41.11 E-value=1.3e+02 Score=29.00 Aligned_cols=20 Identities=25% Similarity=0.302 Sum_probs=16.1
Q ss_pred cCCCCHHHHHHHHHHHHHHH
Q psy17672 140 AHPFDTHAQRLIAEEIRQKN 159 (319)
Q Consensus 140 ~dP~~~e~q~~i~~~i~~~~ 159 (319)
..-+|||.|+|++|--+++.
T Consensus 299 ~~~lspeeQrK~eeKe~kk~ 318 (321)
T PF07946_consen 299 LSKLSPEEQRKYEEKERKKE 318 (321)
T ss_pred HhcCCHHHHHHHHHHHHHHh
Confidence 36699999999998876653
No 181
>PRK07502 cyclohexadienyl dehydrogenase; Validated
Probab=40.44 E-value=69 Score=30.16 Aligned_cols=13 Identities=38% Similarity=0.590 Sum_probs=5.5
Q ss_pred ChHHHHHHHhhCH
Q psy17672 80 DPAHVRDLLLACP 92 (319)
Q Consensus 80 dpe~~r~~il~nP 92 (319)
||++.+++++.|+
T Consensus 233 ~~~~w~~i~~~N~ 245 (307)
T PRK07502 233 DPTMWRDVFLHNK 245 (307)
T ss_pred ChHHHHHHHHHhH
Confidence 3444444444443
No 182
>PLN03146 aspartyl protease family protein; Provisional
Probab=39.86 E-value=75 Score=31.78 Aligned_cols=29 Identities=17% Similarity=0.235 Sum_probs=24.5
Q ss_pred cceEEEEEEEc--CEEEEEEEcCCccccccC
Q psy17672 178 VVMLYINCKVN--GYPVKAFVDSGAQTTIMS 206 (319)
Q Consensus 178 ~~~l~v~v~In--g~~v~aLVDTGA~~siIs 206 (319)
..-+++++.|+ .+++.+++|||++.+.+.
T Consensus 82 ~~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~ 112 (431)
T PLN03146 82 GGEYLMNISIGTPPVPILAIADTGSDLIWTQ 112 (431)
T ss_pred CccEEEEEEcCCCCceEEEEECCCCCcceEc
Confidence 34688999997 567899999999999984
No 183
>COG2104 ThiS Sulfur transfer protein involved in thiamine biosynthesis [Coenzyme metabolism]
Probab=38.26 E-value=88 Score=23.16 Aligned_cols=37 Identities=22% Similarity=0.305 Sum_probs=25.3
Q ss_pred hCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 14 CGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 14 ~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.|++.+.-...+||.++.. .--++.-+++||.|-++.
T Consensus 27 l~~~~~~vav~vNg~iVpr--~~~~~~~l~~gD~ievv~ 63 (68)
T COG2104 27 LGLNPEGVAVAVNGEIVPR--SQWADTILKEGDRIEVVR 63 (68)
T ss_pred hCCCCceEEEEECCEEccc--hhhhhccccCCCEEEEEE
Confidence 5677777777788888763 234556677878776653
No 184
>KOG3391|consensus
Probab=38.15 E-value=39 Score=28.67 Aligned_cols=27 Identities=30% Similarity=0.469 Sum_probs=22.4
Q ss_pred CeeccCCcccccccCCCCCcEEEEEcc
Q psy17672 27 GKHLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 27 Gk~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
|+-.-||.+||.+++++-||.|-+...
T Consensus 110 g~Kg~ddnktL~~~kf~iGD~lDVaI~ 136 (151)
T KOG3391|consen 110 GRKGIDDNKTLQQTKFEIGDYLDVAIT 136 (151)
T ss_pred CcccCCccchhhhCCccccceEEEEec
Confidence 455555689999999999999999865
No 185
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=37.80 E-value=1e+02 Score=22.58 Aligned_cols=41 Identities=32% Similarity=0.392 Sum_probs=27.7
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|.+. .+++...-.+..||..+.. +.-+++||.|-+..
T Consensus 25 tv~~ll~~----l~~~~~~v~v~vNg~iv~~------~~~l~~gD~Veii~ 65 (70)
T PRK08364 25 KVADILRA----VGFNTESAIAKVNGKVALE------DDPVKDGDYVEVIP 65 (70)
T ss_pred cHHHHHHH----cCCCCccEEEEECCEECCC------CcCcCCCCEEEEEc
Confidence 45555443 3677777777789998853 23479999888763
No 186
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=37.80 E-value=75 Score=24.48 Aligned_cols=37 Identities=14% Similarity=0.129 Sum_probs=28.3
Q ss_pred hCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 14 CGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 14 ~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.+++...-.+-.||.++.- ..-+++-+++||.|.+..
T Consensus 43 l~~~~~~vAVevNg~iVpr--~~w~~t~L~egD~IEIv~ 79 (84)
T PRK06083 43 LSLPELGCVFAINNQVVPR--SEWQSTVLSSGDAISLFQ 79 (84)
T ss_pred cCCCCceEEEEECCEEeCH--HHcCcccCCCCCEEEEEE
Confidence 5788777777789998863 456777789999888764
No 187
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=37.62 E-value=41 Score=24.23 Aligned_cols=13 Identities=46% Similarity=0.583 Sum_probs=5.5
Q ss_pred HHHhcCchhHHHH
Q psy17672 97 LLKQNNPRLSEAL 109 (319)
Q Consensus 97 ~L~~~nP~La~ai 109 (319)
++.++||++-..+
T Consensus 15 ~~vq~NP~lL~~l 27 (59)
T PF09280_consen 15 QLVQQNPQLLPPL 27 (59)
T ss_dssp HHHHC-GGGHHHH
T ss_pred HHHHHCHHHHHHH
Confidence 3445565544333
No 188
>PF06487 SAP18: Sin3 associated polypeptide p18 (SAP18); InterPro: IPR010516 This family consists of several eukaryotic Sin3 associated polypeptide p18 (SAP18) sequences. SAP18 is known to be a component of the Sin3-containing complex, which is responsible for the repression of transcription via the modification of histone polypeptides []. SAP18 is also present in the ASAP complex which is thought to be involved in the regulation of splicing during the execution of programmed cell death [].; PDB: 2HDE_A 4A90_A 4A6Q_A 4A8X_C.
Probab=37.11 E-value=39 Score=27.98 Aligned_cols=28 Identities=29% Similarity=0.289 Sum_probs=17.9
Q ss_pred EecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 24 LNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 24 i~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
++.|+...|+.+||++++..-||.|-+.
T Consensus 92 v~~g~~~~d~~kTL~~~~F~iGDyidva 119 (120)
T PF06487_consen 92 VVSGRKGPDDNKTLADLRFVIGDYIDVA 119 (120)
T ss_dssp EETTB--TTTTSBCGGGT--TT-EEEEE
T ss_pred EECCCCCCCcccCHhhCCcccCCEEEEe
Confidence 3445555567899999999999999764
No 189
>KOG2561|consensus
Probab=35.48 E-value=23 Score=35.84 Aligned_cols=47 Identities=21% Similarity=0.336 Sum_probs=40.4
Q ss_pred HHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 4 EQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 4 ~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
.++.++|+..+||+-..-+.|-+||+|.- .+||.+-|++.+--+.+.
T Consensus 63 selqa~iakklgi~enhvKci~~~Kils~-~ktlaeQglk~nq~~mv~ 109 (568)
T KOG2561|consen 63 SELQALIAKKLGIKENHVKCIINGKILSC-RKTLAEQGLKINQELMVA 109 (568)
T ss_pred HHHHHHHHHHcCCchhhhheeeccceeec-ccchhhhhhhhhhHHHHH
Confidence 57899999999999998999999999987 689999999877544443
No 190
>PF07319 DnaI_N: Primosomal protein DnaI N-terminus; InterPro: IPR009928 This entry represents the N terminus (approximately 120 residues) of bacterial primosomal DnaI proteins, although one family member appears to be of viral origin. DnaI is one of the components of the Bacillus subtilis replication restart primosome, and is required for the DnaB75-dependent loading of the DnaC helicase [].; PDB: 2K7R_A.
Probab=34.19 E-value=57 Score=25.50 Aligned_cols=29 Identities=21% Similarity=0.496 Sum_probs=19.9
Q ss_pred hHHHHHHHhhCHHHHHHHHhcCchhHHHH
Q psy17672 81 PAHVRDLLLACPDQLALLKQNNPRLSEAL 109 (319)
Q Consensus 81 pe~~r~~il~nP~~l~~L~~~nP~La~ai 109 (319)
-+.+++.+++||...+-|.+..|+|-+..
T Consensus 21 ~~~l~~~vl~dp~V~~Fl~~h~~eLt~~~ 49 (94)
T PF07319_consen 21 YEQLKQEVLSDPEVQAFLQEHQPELTQEM 49 (94)
T ss_dssp HHHHHHHHTT-HHHHHHHHHSTTT--HHH
T ss_pred HHHHHHHHHcCHHHHHHHHHhHHhcCHHH
Confidence 35677888899999988877778887544
No 191
>PRK10455 periplasmic protein; Reviewed
Probab=33.86 E-value=2.6e+02 Score=24.25 Aligned_cols=11 Identities=0% Similarity=0.172 Sum_probs=4.6
Q ss_pred CCHHHHHHHHH
Q psy17672 143 FDTHAQRLIAE 153 (319)
Q Consensus 143 ~~~e~q~~i~~ 153 (319)
++||-++...+
T Consensus 130 LTPEQr~q~~~ 140 (161)
T PRK10455 130 LTPEQKKQFNA 140 (161)
T ss_pred CCHHHHHHHHH
Confidence 44444443333
No 192
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=33.54 E-value=91 Score=26.77 Aligned_cols=48 Identities=21% Similarity=0.351 Sum_probs=28.7
Q ss_pred hCHHHHHHHHhcCchhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHH
Q psy17672 90 ACPDQLALLKQNNPRLSEALSTGKDAFTKVFKEQLVEREKRQKQKIKLMNAHPFDTHAQRLIAEEIRQ 157 (319)
Q Consensus 90 ~nP~~l~~L~~~nP~La~ai~~~~~~f~~~l~~~~~~~~~~~~~~~~~l~~dP~~~e~q~~i~~~i~~ 157 (319)
+--++++.|++.+|.|++.+ +.+...++.. -|=++||+|..+.+.|.+
T Consensus 41 ~~de~i~~LK~ksP~L~~k~----~~l~~~~k~k----------------i~~L~peak~Fv~~li~~ 88 (154)
T PF05823_consen 41 NEDEMIAALKEKSPSLYEKA----EKLRDKLKKK----------------IDKLSPEAKAFVKELIAK 88 (154)
T ss_dssp --TTHHHHHHHH-HHHHHHH----HHHHHHHHHT----------------TTT--HHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHhCHHHHHHH----HHHHHHHHHH----------------HHcCCHHHHHHHHHHHHH
Confidence 44567888889999999777 3444443321 244578888877777765
No 193
>KOG3048|consensus
Probab=33.27 E-value=25 Score=30.12 Aligned_cols=27 Identities=26% Similarity=0.326 Sum_probs=22.2
Q ss_pred ccceEEEEEEEcCEEEEEEEcCCccccc
Q psy17672 177 SVVMLYINCKVNGYPVKAFVDSGAQTTI 204 (319)
Q Consensus 177 ~~~~l~v~v~Ing~~v~aLVDTGA~~si 204 (319)
-..++||+++++. +-++|||-|-..-+
T Consensus 69 LTsSlYVPGkl~d-~~k~lVDIGTGYyV 95 (153)
T KOG3048|consen 69 LTSSLYVPGKLSD-NSKFLVDIGTGYYV 95 (153)
T ss_pred cccceeccceecc-ccceeEeccCceEE
Confidence 3467999999998 78999999976554
No 194
>PRK14011 prefoldin subunit alpha; Provisional
Probab=33.23 E-value=2.8e+02 Score=23.62 Aligned_cols=24 Identities=21% Similarity=0.415 Sum_probs=18.9
Q ss_pred eEEEEEEEcCEEEEEEEcCCccccc
Q psy17672 180 MLYINCKVNGYPVKAFVDSGAQTTI 204 (319)
Q Consensus 180 ~l~v~v~Ing~~v~aLVDTGA~~si 204 (319)
.+||+++|.. +=++|||-|+...+
T Consensus 59 s~yV~g~i~d-~dkVlVdIGtGy~V 82 (144)
T PRK14011 59 GAFLKAKIVD-PDKAILGVGSDIYL 82 (144)
T ss_pred CcEEeEEecC-CCeEEEEccCCeEE
Confidence 5889998863 34889999988765
No 195
>PRK08053 sulfur carrier protein ThiS; Provisional
Probab=31.70 E-value=1.5e+02 Score=21.32 Aligned_cols=45 Identities=22% Similarity=0.111 Sum_probs=30.7
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|... .+++.....+-.||.++.. ..-++.-+++||.|.+..
T Consensus 17 tl~~ll~~----l~~~~~~vaVavN~~iv~r--~~w~~~~L~~gD~Ieii~ 61 (66)
T PRK08053 17 TVHELLEQ----LNQLQPGAALAINQQIIPR--EQWAQHIVQDGDQILLFQ 61 (66)
T ss_pred CHHHHHHH----cCCCCCcEEEEECCEEeCh--HHcCccccCCCCEEEEEE
Confidence 55555443 4566666777799999863 345556689999998864
No 196
>PRK08655 prephenate dehydrogenase; Provisional
Probab=31.57 E-value=1e+02 Score=31.01 Aligned_cols=31 Identities=32% Similarity=0.433 Sum_probs=19.4
Q ss_pred HhhCHHHHHHHHhcCchhHHHHhcChHHHHHHHHH
Q psy17672 88 LLACPDQLALLKQNNPRLSEALSTGKDAFTKVFKE 122 (319)
Q Consensus 88 il~nP~~l~~L~~~nP~La~ai~~~~~~f~~~l~~ 122 (319)
.-+||.|-..+...||.+.+++ ++|.+.+..
T Consensus 217 a~~~p~lw~dI~~~N~~~~~~l----~~~~~~l~~ 247 (437)
T PRK08655 217 LGQNPYLYASIQMNNPQIPEIH----ETFIKECEE 247 (437)
T ss_pred hcCCHHHHHHHHHhCHHHHHHH----HHHHHHHHH
Confidence 3377777777777777766666 355554443
No 197
>TIGR01682 moaD molybdopterin converting factor, subunit 1, non-archaeal. The C-terminal Gly-Gly of this model is critical to function.
Probab=30.89 E-value=1.6e+02 Score=21.84 Aligned_cols=45 Identities=13% Similarity=0.175 Sum_probs=31.1
Q ss_pred CHHHHHHHHHhhhC-C-C-CCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECG-I-D-ATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~g-i-p-~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|+..+.+..+ + . .....+..||+...+ +.-|++||.|.+..
T Consensus 28 tv~~L~~~L~~~~p~l~~~~~~~~v~vn~~~v~~------~~~l~dgDevai~P 75 (80)
T TIGR01682 28 TVGELKEHLAKEGPELAASRGQVMVAVNEEYVTD------DALLNEGDEVAFIP 75 (80)
T ss_pred CHHHHHHHHHHhCchhhhhccceEEEECCEEcCC------CcCcCCCCEEEEeC
Confidence 78899999987753 1 1 122355678888764 24589999998873
No 198
>KOG0011|consensus
Probab=30.76 E-value=1.2e+02 Score=29.60 Aligned_cols=46 Identities=24% Similarity=0.521 Sum_probs=39.2
Q ss_pred CCCChH--HHHHHHhhCHHHHH----HHHhcCchhHHHHhcChHHHHHHHHH
Q psy17672 77 QLQDPA--HVRDLLLACPDQLA----LLKQNNPRLSEALSTGKDAFTKVFKE 122 (319)
Q Consensus 77 ~~~dpe--~~r~~il~nP~~l~----~L~~~nP~La~ai~~~~~~f~~~l~~ 122 (319)
+.++|. .+|+++.+||+++. +|-++||.|.+.|..|++.|.++|.+
T Consensus 219 Lr~~~qf~~lR~~iqqNP~ll~~~Lqqlg~~nP~L~q~Iq~nqe~Fl~mlne 270 (340)
T KOG0011|consen 219 LRNQPQFQQLRQMIQQNPELLHPLLQQLGKQNPQLLQLIQENQEAFLQLLNE 270 (340)
T ss_pred hhccHHHHHHHHHHhhCHHHHHHHHHHHhhhCHHHHHHHHHHHHHHHHHhhc
Confidence 455664 58999999998765 78889999999998888999999985
No 199
>PRK11199 tyrA bifunctional chorismate mutase/prephenate dehydrogenase; Provisional
Probab=30.34 E-value=84 Score=30.81 Aligned_cols=15 Identities=20% Similarity=0.255 Sum_probs=6.6
Q ss_pred hCHHHHHHHHhcCch
Q psy17672 90 ACPDQLALLKQNNPR 104 (319)
Q Consensus 90 ~nP~~l~~L~~~nP~ 104 (319)
+||.|-..+...||+
T Consensus 298 ~~p~lw~dI~~~N~~ 312 (374)
T PRK11199 298 QDPQLYADIIMSSPE 312 (374)
T ss_pred CCHHHHHHHHHhChh
Confidence 444444444444443
No 200
>PF13019 Telomere_Sde2: Telomere stability and silencing
Probab=29.71 E-value=1.2e+02 Score=26.50 Aligned_cols=50 Identities=14% Similarity=0.102 Sum_probs=33.3
Q ss_pred CHHHHHHHHHhhhCCCCCCe-EEEe-cCeec-cCCcccccccCCCCC--cEEEEE
Q psy17672 2 DVEQFRLICQDECGIDATDM-ILLN-NGKHL-LEDGSCLKQAGVRDG--DIILVA 51 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q-~Li~-~Gk~L-~d~~~tL~~~gI~~~--d~l~l~ 51 (319)
||.+|+..|....++|...| .|++ .|+.| .+++..+..+.-.++ +++.+.
T Consensus 26 tv~dL~~~l~~~~~~~~~~~~~L~~~~n~~l~~~~~~~~s~l~~~~~~~~~~~l~ 80 (162)
T PF13019_consen 26 TVSDLKDRLSERLPIPSSSQLYLTTNSNGQLSPSSDIPLSSLLSSSQDSDFITLR 80 (162)
T ss_pred cHHHHHHHHHhhcCCCccceeEEEEeCCCeeCCCccccHHhhccCcCCCCceEEE
Confidence 79999999999999999884 5555 35555 333456666654444 344443
No 201
>PF14841 FliG_M: FliG middle domain; PDB: 3HJL_A 3AJC_A 1LKV_X 3SOH_D 3USY_B 3USW_A.
Probab=29.33 E-value=2.5e+02 Score=21.06 Aligned_cols=26 Identities=12% Similarity=0.106 Sum_probs=21.3
Q ss_pred HHHhhcCCCCHHHHHHHHHHHHHHHH
Q psy17672 135 IKLMNAHPFDTHAQRLIAEEIRQKNV 160 (319)
Q Consensus 135 ~~~l~~dP~~~e~q~~i~~~i~~~~~ 160 (319)
.++...+..+|++-+.|++.++++--
T Consensus 46 ~Ria~~~~v~~~~i~~ie~~L~~~l~ 71 (79)
T PF14841_consen 46 RRIARLESVSPEVIEEIEEVLEEKLE 71 (79)
T ss_dssp HHHHTCCCCSHHHHHHHHHHHHHHCH
T ss_pred HHHHccCCCCHHHHHHHHHHHHHHHH
Confidence 44566799999999999999988643
No 202
>TIGR01687 moaD_arch MoaD family protein, archaeal. Members of this family appear to be archaeal versions of MoaD, subunit 1 of molybdopterin converting factor. This model has been split from the bacterial/eukaryotic equivalog model TIGR01682 because the presence of two members of this family in a substantial number of archaeal species suggests that roles might not be interchangeable.
Probab=29.08 E-value=2.2e+02 Score=21.37 Aligned_cols=47 Identities=17% Similarity=0.194 Sum_probs=31.2
Q ss_pred CHHHHHHHHHhhhCC-----------CCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGI-----------DATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gi-----------p~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|.+.+.++..- ......+..||+....+ .. .-+++||.|.+..
T Consensus 26 tv~~l~~~l~~~~p~~~~~~l~~~~~~~~~~~v~vN~~~v~~~-~~---~~l~dgdev~i~P 83 (88)
T TIGR01687 26 TVGDLLNELMARYPKEFSELFKEGLGLVPNVIILVNGRNVDWG-LG---TELKDGDVVAIFP 83 (88)
T ss_pred CHHHHHHHHHHHCcHHHHHhCccCCcccccEEEEECCEecCcc-CC---CCCCCCCEEEEeC
Confidence 788888888776531 01234556789877642 21 4689999999863
No 203
>PRK06545 prephenate dehydrogenase; Validated
Probab=29.03 E-value=1.3e+02 Score=29.28 Aligned_cols=7 Identities=43% Similarity=0.368 Sum_probs=2.8
Q ss_pred HHHHHHH
Q psy17672 151 IAEEIRQ 157 (319)
Q Consensus 151 i~~~i~~ 157 (319)
+++.+.+
T Consensus 265 l~~~~~~ 271 (359)
T PRK06545 265 IAELFDA 271 (359)
T ss_pred HHHHHHH
Confidence 4444433
No 204
>PF02597 ThiS: ThiS family; InterPro: IPR003749 ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in Escherichia coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS []. MoaD, a protein involved in sulphur transfer during molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end.; PDB: 1JW9_D 1JWB_D 1JWA_D 3BII_D 1NVI_D 1FMA_D 1FM0_D 2QIE_G 2Q5W_D 2K5P_A ....
Probab=28.73 E-value=1e+02 Score=22.17 Aligned_cols=49 Identities=12% Similarity=0.080 Sum_probs=34.3
Q ss_pred CCHHHHHHHHHhhhCC--CCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDECGI--DATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gi--p~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||.+|.+.+..+..- ......+..||+.+.+ .-.+.-+++||.|.+..
T Consensus 22 ~tv~~ll~~l~~~~p~~~~~~~~~v~vN~~~v~~---~~~~~~l~~gD~V~i~p 72 (77)
T PF02597_consen 22 STVRDLLEALAERYPELALRDRVAVAVNGEIVPD---DGLDTPLKDGDEVAILP 72 (77)
T ss_dssp SBHHHHHHHHCHHTGGGHTTTTEEEEETTEEEGG---GTTTSBEETTEEEEEEE
T ss_pred CcHHHHHHHHHhhccccccCccEEEEECCEEcCC---ccCCcCcCCCCEEEEEC
Confidence 4788888888766421 2355677789998874 24455679999999874
No 205
>KOG4250|consensus
Probab=28.52 E-value=90 Score=33.44 Aligned_cols=49 Identities=18% Similarity=0.156 Sum_probs=34.6
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeec--cCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHL--LEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L--~d~~~tL~~~gI~~~d~l~l~ 51 (319)
+|+..|++.|...+|||.+.|.|+|.|... .| ..+-.--| .+.++.++.
T Consensus 335 ntl~~~~~~I~~~Tgipe~~qeLL~e~~~~h~~~-~~Q~~~dg-~~~~l~l~~ 385 (732)
T KOG4250|consen 335 NTLHSLIERISKQTGIPEGKQELLFEGGLSHLED-SAQCIPDG-LDSPLYLVS 385 (732)
T ss_pred hhHHHHHHHHHHhhCCCCccceeeeecCccccCc-ccccCCCC-CCCceEEEe
Confidence 478899999999999999999999987644 33 33333334 345555544
No 206
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=27.41 E-value=84 Score=24.23 Aligned_cols=35 Identities=14% Similarity=0.144 Sum_probs=29.6
Q ss_pred EEEEEEcCEEEEEEEcCCccc--cccCHHHHHHcCCc
Q psy17672 182 YINCKVNGYPVKAFVDSGAQT--TIMSAKCAERCNIM 216 (319)
Q Consensus 182 ~v~v~Ing~~v~aLVDTGA~~--siIs~~~a~rlgl~ 216 (319)
.|++..||..+++.++..+.. .=+...++++|++.
T Consensus 2 ~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~ 38 (81)
T cd06396 2 NLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN 38 (81)
T ss_pred EEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC
Confidence 578999999999999996644 45577999999998
No 207
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=26.10 E-value=2.2e+02 Score=20.00 Aligned_cols=44 Identities=14% Similarity=0.147 Sum_probs=26.7
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
||.+|.+.+ +++ ....+-+||+.+.. ..-.+.-+++||.|.+..
T Consensus 17 tl~~ll~~l----~~~-~~~~v~vN~~~v~~--~~~~~~~L~~gD~vei~~ 60 (65)
T PRK06944 17 TVADALAAY----GAR-PPFAVAVNGDFVAR--TQHAARALAAGDRLDLVQ 60 (65)
T ss_pred cHHHHHHhh----CCC-CCeEEEECCEEcCc--hhcccccCCCCCEEEEEe
Confidence 455554433 454 34566789998853 123334489999998874
No 208
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=25.95 E-value=2.9e+02 Score=23.76 Aligned_cols=18 Identities=17% Similarity=0.407 Sum_probs=11.7
Q ss_pred cCCCCHHH-HHHHHHHHHH
Q psy17672 140 AHPFDTHA-QRLIAEEIRQ 157 (319)
Q Consensus 140 ~dP~~~e~-q~~i~~~i~~ 157 (319)
-||.|+|. |+...++|..
T Consensus 78 p~p~d~~e~E~~Fl~eV~~ 96 (148)
T TIGR00985 78 PDPTDPSEKEAFFLQEVQL 96 (148)
T ss_pred CCCCCHHHHHHHHHHHHHH
Confidence 46778864 5566666665
No 209
>PRK11861 bifunctional prephenate dehydrogenase/3-phosphoshikimate 1-carboxyvinyltransferase; Provisional
Probab=25.65 E-value=1.5e+02 Score=31.43 Aligned_cols=46 Identities=28% Similarity=0.359 Sum_probs=30.2
Q ss_pred CCCChHHHHHHHhhCHHH----HHHHHhcCchhHHHHhcCh-HHHHHHHHH
Q psy17672 77 QLQDPAHVRDLLLACPDQ----LALLKQNNPRLSEALSTGK-DAFTKVFKE 122 (319)
Q Consensus 77 ~~~dpe~~r~~il~nP~~----l~~L~~~nP~La~ai~~~~-~~f~~~l~~ 122 (319)
...||++-++++++|++- ++.++..--++.++|.+++ +.+.+.|.+
T Consensus 160 a~~~p~lw~di~~~N~~~i~~~l~~~~~~l~~~~~~l~~~d~~~l~~~~~~ 210 (673)
T PRK11861 160 AASSPEMWRDVCLANRAALLDELDAYTAVLARLRAAIDAGDGAALEAVFAR 210 (673)
T ss_pred ccCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHH
Confidence 356777778888887764 3455556666777776666 666666654
No 210
>KOG1464|consensus
Probab=25.08 E-value=1.2e+02 Score=29.19 Aligned_cols=55 Identities=22% Similarity=0.347 Sum_probs=34.6
Q ss_pred HhhCHHHHHHHHhcCchhHHHHhcCh-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHH
Q psy17672 88 LLACPDQLALLKQNNPRLSEALSTGK-DAFTKVFKEQLVEREKRQKQKIKLMNAHPFDTHAQRLIAEEIRQK 158 (319)
Q Consensus 88 il~nP~~l~~L~~~nP~La~ai~~~~-~~f~~~l~~~~~~~~~~~~~~~~~l~~dP~~~e~q~~i~~~i~~~ 158 (319)
+.++|+.++. -.|..|-++|+ .+|.++|+..+. ..| +|||--|.-.-+..-||-+
T Consensus 298 yKNdPEIlAM-----Tnlv~aYQ~NdI~eFE~Il~~~~~----------~IM-~DpFIReh~EdLl~niRTQ 353 (440)
T KOG1464|consen 298 YKNDPEILAM-----TNLVAAYQNNDIIEFERILKSNRS----------NIM-DDPFIREHIEDLLRNIRTQ 353 (440)
T ss_pred CCCCHHHHHH-----HHHHHHHhcccHHHHHHHHHhhhc----------ccc-ccHHHHHHHHHHHHHHHHH
Confidence 5566666553 23456777777 899998877532 234 7888777655555555544
No 211
>PLN02799 Molybdopterin synthase sulfur carrier subunit
Probab=24.95 E-value=1.4e+02 Score=22.24 Aligned_cols=46 Identities=15% Similarity=0.158 Sum_probs=29.1
Q ss_pred CCHHHHHHHHHhhh-CCCC--CCeEEEecCeeccCCcccccccCCCCCcEEEEEc
Q psy17672 1 MDVEQFRLICQDEC-GIDA--TDMILLNNGKHLLEDGSCLKQAGVRDGDIILVAM 52 (319)
Q Consensus 1 ~tv~~lK~~i~~e~-gip~--~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~~ 52 (319)
.||++|+..+..+. ++.. ..-.+..||+...+ + .=+++||.|.+..
T Consensus 29 ~tv~~L~~~l~~~~p~l~~~~~~~~vavN~~~v~~-~-----~~l~dgDeVai~P 77 (82)
T PLN02799 29 STTADCLAELVAKFPSLEEVRSCCVLALNEEYTTE-S-----AALKDGDELAIIP 77 (82)
T ss_pred CcHHHHHHHHHHHChhHHHHhhCcEEEECCEEcCC-C-----cCcCCCCEEEEeC
Confidence 37888888886654 2211 11235678887653 2 2379999998863
No 212
>cd06406 PB1_P67 A PB1 domain is present in p67 proteins which forms a signaling complex with p40, a crucial step for activation of NADPH oxidase during phagocytosis. PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes . A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The p67 proteins contain
Probab=24.25 E-value=96 Score=23.87 Aligned_cols=26 Identities=8% Similarity=0.033 Sum_probs=23.3
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEecC
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNNG 27 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~G 27 (319)
+..+|..+|.++.++|++..+|.|.-
T Consensus 22 ~y~~L~~ki~~kLkl~~e~i~LsYkd 47 (80)
T cd06406 22 SYATLLQKISSKLELPAEHITLSYKS 47 (80)
T ss_pred CHHHHHHHHHHHhCCCchhcEEEecc
Confidence 56899999999999999999999864
No 213
>PF13856 Gifsy-2: ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=24.18 E-value=80 Score=24.52 Aligned_cols=18 Identities=28% Similarity=0.436 Sum_probs=15.8
Q ss_pred EEEEEcCEEEEEEEcCCc
Q psy17672 183 INCKVNGYPVKAFVDSGA 200 (319)
Q Consensus 183 v~v~Ing~~v~aLVDTGA 200 (319)
-.+.|||+++++++|..-
T Consensus 22 ~~~~InG~~~~~v~d~~~ 39 (95)
T PF13856_consen 22 EEHTINGKPYRAVFDEPE 39 (95)
T ss_dssp EEEEETTEEEEEEEGGGT
T ss_pred CeEEECCEEEEEEECCch
Confidence 578999999999999763
No 214
>PF12436 USP7_ICP0_bdg: ICP0-binding domain of Ubiquitin-specific protease 7; InterPro: IPR024729 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found in proteins of the peptidase C19 family, which contains contains ubiquitinyl hydrolases like ubiquitin-specific protease 7 (USP7). USP7 regulates the turnover of p53 [].; PDB: 2KVR_A 2YLM_A.
Probab=23.71 E-value=77 Score=29.30 Aligned_cols=51 Identities=20% Similarity=0.392 Sum_probs=33.6
Q ss_pred CHHHHHHHHHhhhCCCCCCeEEEec----Ce--eccCCcccccccCCCCCcEEEEEcc
Q psy17672 2 DVEQFRLICQDECGIDATDMILLNN----GK--HLLEDGSCLKQAGVRDGDIILVAMP 53 (319)
Q Consensus 2 tv~~lK~~i~~e~gip~~~q~Li~~----Gk--~L~d~~~tL~~~gI~~~d~l~l~~~ 53 (319)
+|+++-..|-...|+|++.-.++|. ++ .+ |...|+..+.|.+||+|..-..
T Consensus 96 ~v~~l~~~i~~~~g~p~~t~l~lyEEi~~~~ie~i-~~~~t~~~~el~~GdIi~fQ~~ 152 (249)
T PF12436_consen 96 KVSELVPLINERAGLPPDTPLLLYEEIKPNMIEPI-DPNQTFEKAELQDGDIICFQRA 152 (249)
T ss_dssp BGGGTHHHHHHHHT--TT--EEEEEEEETTEEEE---SSSBHHHTT--TTEEEEEEE-
T ss_pred CHHHHHHHHHHHcCCCCCCceEEEEEeccceeeEc-CCCCchhhcccCCCCEEEEEec
Confidence 5677888888999999987777765 33 24 4578999999999999997653
No 215
>COG3159 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.29 E-value=2.1e+02 Score=26.22 Aligned_cols=27 Identities=30% Similarity=0.375 Sum_probs=21.9
Q ss_pred CChHHHHHHHhhCHHHHHHHHhcCchhHHHH
Q psy17672 79 QDPAHVRDLLLACPDQLALLKQNNPRLSEAL 109 (319)
Q Consensus 79 ~dpe~~r~~il~nP~~l~~L~~~nP~La~ai 109 (319)
-|++.+.+++..+|++..+ ||+|.+++
T Consensus 4 lda~~VadyL~~hPeFf~~----h~~Ll~~L 30 (218)
T COG3159 4 LDAEDVADYLRQHPEFFIQ----HAELLEEL 30 (218)
T ss_pred CCHHHHHHHHHhCcHHHHh----CHHHHHHc
Confidence 4788888888888888755 88888887
No 216
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=22.33 E-value=88 Score=25.99 Aligned_cols=52 Identities=15% Similarity=0.215 Sum_probs=35.6
Q ss_pred CCHHHHHHHHHhhhCCCCCCeEEEecCeeccCCccccccc---CCCCCcEEEEEcc
Q psy17672 1 MDVEQFRLICQDECGIDATDMILLNNGKHLLEDGSCLKQA---GVRDGDIILVAMP 53 (319)
Q Consensus 1 ~tv~~lK~~i~~e~gip~~~q~Li~~Gk~L~d~~~tL~~~---gI~~~d~l~l~~~ 53 (319)
+||.+|...|....++++++.-|+-|+..+.. +.|+++. -=.++-+|++.-+
T Consensus 51 ~tV~qF~~iIRkrl~l~~~k~flfVnn~lp~~-s~~mg~lYe~~KDeDGFLYi~Ys 105 (121)
T PTZ00380 51 ATVAELEAAVRQALGTSAKKVTLAIEGSTPAV-TATVGDIADACKRDDGFLYVSVR 105 (121)
T ss_pred CcHHHHHHHHHHHcCCChhHEEEEECCccCCc-cchHHHHHHHhcCCCCeEEEEEc
Confidence 68999999999999999998444445544443 5676643 1124558888754
No 217
>PRK11130 moaD molybdopterin synthase small subunit; Provisional
Probab=20.54 E-value=3.5e+02 Score=20.07 Aligned_cols=45 Identities=16% Similarity=0.153 Sum_probs=28.1
Q ss_pred CCHHHHHHHHHhhhC-----CCCCCeEEEecCeeccCCcccccccCCCCCcEEEEE
Q psy17672 1 MDVEQFRLICQDECG-----IDATDMILLNNGKHLLEDGSCLKQAGVRDGDIILVA 51 (319)
Q Consensus 1 ~tv~~lK~~i~~e~g-----ip~~~q~Li~~Gk~L~d~~~tL~~~gI~~~d~l~l~ 51 (319)
.||++|+..|..+.. ......+..-|++.... +| -|++||.|.+.
T Consensus 26 ~tv~~l~~~L~~~~~~~~~~~~~~~~~~aVN~~~~~~--~~----~l~dgDeVai~ 75 (81)
T PRK11130 26 PTVEALRQHLAQKGDRWALALEDGKLLAAVNQTLVSF--DH----PLTDGDEVAFF 75 (81)
T ss_pred CCHHHHHHHHHHhCccHHhhhcCCCEEEEECCEEcCC--CC----CCCCCCEEEEe
Confidence 489999999987752 11223344456654332 22 48999999876
Done!