Query psy8018
Match_columns 280
No_of_seqs 335 out of 1356
Neff 6.6
Searched_HMMs 46136
Date Sat Aug 17 00:42:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8018.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8018hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0012|consensus 100.0 1.1E-61 2.3E-66 446.7 19.3 273 1-274 1-359 (380)
2 PF09668 Asp_protease: Asparty 100.0 3E-44 6.5E-49 292.2 11.8 124 126-249 1-124 (124)
3 cd05480 NRIP_C NRIP_C; putativ 100.0 1.6E-36 3.5E-41 235.0 10.2 101 152-252 1-103 (103)
4 cd05479 RP_DDI RP_DDI; retrope 100.0 7.3E-31 1.6E-35 214.2 14.7 124 134-257 1-124 (124)
5 PF08284 RVP_2: Retroviral asp 99.8 3E-20 6.5E-25 154.0 12.1 112 148-260 20-132 (135)
6 cd01807 GDX_N ubiquitin-like d 99.8 1.5E-19 3.3E-24 134.5 9.3 72 1-73 1-72 (74)
7 cd01797 NIRF_N amino-terminal 99.8 7.9E-19 1.7E-23 132.3 9.3 75 1-76 1-77 (78)
8 cd01793 Fubi Fubi ubiquitin-li 99.8 1.4E-18 3E-23 129.4 9.6 70 1-73 1-70 (74)
9 PTZ00044 ubiquitin; Provisiona 99.8 4.1E-18 8.8E-23 127.0 9.6 72 1-73 1-72 (76)
10 cd01791 Ubl5 UBL5 ubiquitin-li 99.7 4.4E-18 9.6E-23 126.7 8.3 70 1-71 2-71 (73)
11 TIGR02281 clan_AA_DTGA clan AA 99.7 2.5E-17 5.5E-22 134.0 13.5 108 145-253 7-117 (121)
12 cd01805 RAD23_N Ubiquitin-like 99.7 9.5E-18 2.1E-22 125.2 9.7 72 1-73 1-74 (77)
13 cd01796 DDI1_N DNA damage indu 99.7 4.9E-18 1.1E-22 125.6 7.9 69 3-71 1-70 (71)
14 cd01810 ISG15_repeat2 ISG15 ub 99.7 8.4E-18 1.8E-22 125.1 8.7 70 3-73 1-70 (74)
15 cd01806 Nedd8 Nebb8-like ubiq 99.7 2E-17 4.4E-22 122.6 10.1 72 1-73 1-72 (76)
16 cd01802 AN1_N ubiquitin-like d 99.7 1.1E-17 2.5E-22 132.5 9.3 72 1-73 28-99 (103)
17 cd01798 parkin_N amino-termina 99.7 1E-17 2.2E-22 123.3 8.2 70 3-73 1-70 (70)
18 cd01803 Ubiquitin Ubiquitin. U 99.7 3E-17 6.4E-22 121.7 9.5 72 1-73 1-72 (76)
19 cd01804 midnolin_N Ubiquitin-l 99.7 2.9E-17 6.3E-22 123.7 8.7 71 1-73 2-72 (78)
20 cd01809 Scythe_N Ubiquitin-lik 99.7 4.4E-17 9.5E-22 119.6 9.2 72 1-73 1-72 (72)
21 cd01792 ISG15_repeat1 ISG15 ub 99.7 3.5E-17 7.5E-22 123.7 8.2 75 1-76 3-79 (80)
22 cd01794 DC_UbP_C dendritic cel 99.7 4.9E-17 1.1E-21 120.1 8.2 68 4-72 2-69 (70)
23 cd01808 hPLIC_N Ubiquitin-like 99.7 1.7E-16 3.7E-21 117.1 9.0 71 1-73 1-71 (71)
24 cd05484 retropepsin_like_LTR_2 99.7 8.3E-16 1.8E-20 118.4 12.1 88 151-242 2-91 (91)
25 PF00240 ubiquitin: Ubiquitin 99.7 3E-16 6.4E-21 114.5 8.9 67 6-73 1-67 (69)
26 cd01812 BAG1_N Ubiquitin-like 99.6 7.4E-16 1.6E-20 113.0 8.2 70 1-72 1-70 (71)
27 cd01790 Herp_N Homocysteine-re 99.6 5.9E-16 1.3E-20 116.7 7.5 71 1-72 2-78 (79)
28 PF13650 Asp_protease_2: Aspar 99.6 4.8E-15 1E-19 112.0 11.3 88 152-240 1-90 (90)
29 cd01800 SF3a120_C Ubiquitin-li 99.6 1.7E-15 3.6E-20 113.4 8.4 65 8-73 5-69 (76)
30 cd05483 retropepsin_like_bacte 99.6 6E-15 1.3E-19 112.6 10.5 92 149-242 2-96 (96)
31 cd01813 UBP_N UBP ubiquitin pr 99.6 3.1E-15 6.8E-20 111.6 8.4 70 1-72 1-73 (74)
32 KOG0005|consensus 99.6 2.5E-15 5.5E-20 105.7 4.6 70 1-71 1-70 (70)
33 TIGR00601 rad23 UV excision re 99.6 9.3E-15 2E-19 139.6 9.7 72 1-73 1-75 (378)
34 PF00077 RVP: Retroviral aspar 99.6 2.7E-14 5.8E-19 111.2 10.6 95 149-249 5-100 (100)
35 smart00213 UBQ Ubiquitin homol 99.6 1.1E-14 2.5E-19 103.7 7.7 64 1-66 1-64 (64)
36 KOG0003|consensus 99.5 6.5E-16 1.4E-20 120.9 -0.1 72 1-73 1-72 (128)
37 cd01799 Hoil1_N Ubiquitin-like 99.5 1.6E-14 3.4E-19 108.2 7.1 68 3-71 3-73 (75)
38 cd01763 Sumo Small ubiquitin-r 99.5 1.1E-13 2.4E-18 106.3 10.1 73 1-74 12-84 (87)
39 cd06095 RP_RTVL_H_like Retrope 99.5 9.6E-14 2.1E-18 106.1 9.6 85 152-242 1-86 (86)
40 cd01815 BMSC_UbP_N Ubiquitin-l 99.5 2.7E-14 5.9E-19 106.5 5.3 57 16-73 15-75 (75)
41 KOG0004|consensus 99.5 2.8E-14 6.1E-19 118.8 4.2 72 1-73 1-72 (156)
42 KOG0010|consensus 99.4 1.7E-13 3.7E-18 132.2 7.4 71 1-73 16-86 (493)
43 TIGR03698 clan_AA_DTGF clan AA 99.4 2.3E-12 5E-17 102.7 10.9 92 159-255 15-107 (107)
44 KOG0011|consensus 99.4 4.4E-13 9.5E-18 123.8 7.5 73 1-74 1-75 (340)
45 cd01795 USP48_C USP ubiquitin- 99.3 2.1E-12 4.6E-17 100.3 7.0 62 12-73 16-77 (107)
46 cd01769 UBL Ubiquitin-like dom 99.3 4.6E-12 1E-16 91.3 7.9 68 4-72 1-68 (69)
47 COG3577 Predicted aspartyl pro 99.3 8.7E-12 1.9E-16 108.8 9.0 97 147-245 103-202 (215)
48 PF11976 Rad60-SLD: Ubiquitin- 99.3 2.1E-11 4.7E-16 89.6 8.1 70 1-71 1-71 (72)
49 cd00303 retropepsin_like Retro 99.2 1.4E-10 3E-15 83.6 10.2 89 152-242 1-92 (92)
50 cd01789 Alp11_N Ubiquitin-like 99.2 1.5E-10 3.1E-15 88.5 8.7 71 2-73 3-81 (84)
51 cd01814 NTGP5 Ubiquitin-like N 99.2 5.3E-11 1.1E-15 94.9 6.2 71 2-73 6-90 (113)
52 PF13975 gag-asp_proteas: gag- 99.0 1.5E-09 3.3E-14 80.3 7.5 46 145-190 4-49 (72)
53 cd05481 retropepsin_like_LTR_1 99.0 3.6E-09 7.9E-14 82.2 9.9 86 153-240 2-91 (93)
54 PF12384 Peptidase_A2B: Ty3 tr 98.9 1.1E-08 2.4E-13 86.6 11.3 95 149-245 34-130 (177)
55 PF14560 Ubiquitin_2: Ubiquiti 98.9 5.6E-09 1.2E-13 79.9 8.1 72 2-73 3-83 (87)
56 KOG4248|consensus 98.9 1.9E-09 4.1E-14 111.6 7.0 71 2-74 4-74 (1143)
57 PLN02560 enoyl-CoA reductase 98.9 5.6E-09 1.2E-13 97.7 8.6 71 1-72 1-82 (308)
58 KOG0001|consensus 98.8 3E-08 6.5E-13 70.9 9.4 70 3-73 2-71 (75)
59 PF11543 UN_NPL4: Nuclear pore 98.7 2.6E-08 5.5E-13 75.5 5.5 70 1-71 5-78 (80)
60 COG5550 Predicted aspartyl pro 98.7 3.8E-07 8.2E-12 73.9 11.3 94 160-258 26-120 (125)
61 cd01801 Tsc13_N Ubiquitin-like 98.6 9.2E-08 2E-12 71.6 6.9 69 2-71 2-75 (77)
62 cd05480 NRIP_C NRIP_C; putativ 98.6 4.9E-08 1.1E-12 76.4 3.6 44 233-280 53-96 (103)
63 cd00196 UBQ Ubiquitin-like pro 98.5 5E-07 1.1E-11 61.1 7.8 67 5-72 2-68 (69)
64 PF02160 Peptidase_A3: Caulifl 98.5 8.3E-07 1.8E-11 78.0 9.0 108 150-261 5-119 (201)
65 cd01788 ElonginB Ubiquitin-lik 98.4 7.6E-07 1.6E-11 71.1 7.6 72 1-73 1-80 (119)
66 PF13881 Rad60-SLD_2: Ubiquiti 98.4 2.2E-06 4.7E-11 68.9 9.3 70 3-73 5-88 (111)
67 cd06094 RP_Saci_like RP_Saci_l 98.4 1E-06 2.3E-11 67.7 6.8 77 160-245 9-88 (89)
68 cd05482 HIV_retropepsin_like R 98.4 2.3E-06 4.9E-11 65.9 8.4 86 153-242 2-87 (87)
69 PF05585 DUF1758: Putative pep 98.3 1.7E-06 3.6E-11 73.5 7.5 66 160-228 12-81 (164)
70 KOG0006|consensus 97.8 4.3E-05 9.3E-10 71.0 6.2 71 1-72 1-74 (446)
71 cd01811 OASL_repeat1 2'-5' oli 97.8 0.00014 3E-09 54.0 7.7 70 1-72 1-75 (80)
72 PF08817 YukD: WXG100 protein 97.7 7.9E-05 1.7E-09 56.0 6.1 69 2-71 4-79 (79)
73 KOG1872|consensus 97.7 9.3E-05 2E-09 72.0 7.7 65 8-73 10-75 (473)
74 PF09668 Asp_protease: Asparty 97.5 0.0001 2.2E-09 60.4 4.0 46 231-280 75-120 (124)
75 PF00789 UBX: UBX domain; Int 97.3 0.0019 4.1E-08 48.3 8.7 70 2-71 8-81 (82)
76 KOG1769|consensus 97.3 0.0029 6.2E-08 49.5 9.5 74 2-76 20-95 (99)
77 KOG0012|consensus 97.2 0.00019 4.2E-09 67.7 2.6 46 231-280 286-331 (380)
78 KOG4495|consensus 97.1 0.00056 1.2E-08 53.1 4.1 63 1-63 1-65 (110)
79 KOG3206|consensus 97.1 0.0065 1.4E-07 53.6 11.1 72 2-73 3-81 (234)
80 smart00166 UBX Domain present 97.1 0.0038 8.2E-08 46.8 8.1 70 2-71 6-79 (80)
81 PF13019 Telomere_Sde2: Telome 97.0 0.043 9.3E-07 46.9 14.3 66 1-66 1-73 (162)
82 COG5417 Uncharacterized small 96.8 0.0071 1.5E-07 44.9 7.5 69 1-70 5-80 (81)
83 KOG3493|consensus 96.8 0.00052 1.1E-08 49.6 1.4 68 2-70 3-70 (73)
84 cd01767 UBX UBX (ubiquitin reg 96.8 0.0091 2E-07 44.3 8.3 69 2-72 4-76 (77)
85 cd01770 p47_UBX p47-like ubiqu 96.8 0.0098 2.1E-07 44.8 7.9 67 2-69 6-75 (79)
86 PF11470 TUG-UBL1: GLUT4 regul 96.7 0.0092 2E-07 43.4 7.1 63 7-70 3-65 (65)
87 cd01772 SAKS1_UBX SAKS1-like U 96.6 0.018 3.9E-07 43.2 8.3 69 2-71 6-78 (79)
88 PF10302 DUF2407: DUF2407 ubiq 96.5 0.0068 1.5E-07 47.6 5.6 57 2-59 2-63 (97)
89 KOG1639|consensus 96.4 0.0069 1.5E-07 54.9 5.8 71 1-72 1-78 (297)
90 cd01774 Faf1_like2_UBX Faf1 ik 96.4 0.031 6.6E-07 42.7 8.7 69 2-71 6-83 (85)
91 cd01771 Faf1_UBX Faf1 UBX doma 96.0 0.059 1.3E-06 40.7 8.3 70 2-72 6-79 (80)
92 cd01773 Faf1_like1_UBX Faf1 ik 95.9 0.086 1.9E-06 40.1 8.7 71 2-73 7-81 (82)
93 PF12382 Peptidase_A2E: Retrot 95.6 0.055 1.2E-06 42.7 6.9 65 162-229 49-115 (137)
94 PF14453 ThiS-like: ThiS-like 95.5 0.06 1.3E-06 38.2 6.1 56 1-73 1-56 (57)
95 COG4067 Uncharacterized protei 95.4 0.046 1E-06 46.2 6.2 101 153-254 30-157 (162)
96 cd05470 pepsin_retropepsin_lik 95.2 0.13 2.7E-06 39.8 7.9 84 152-238 1-109 (109)
97 PF00026 Asp: Eukaryotic aspar 95.1 0.092 2E-06 47.9 8.1 89 150-240 2-114 (317)
98 PTZ00013 plasmepsin 4 (PM4); P 95.1 0.13 2.9E-06 50.8 9.6 91 148-240 137-251 (450)
99 cd05476 pepsin_A_like_plant Ch 94.9 0.1 2.2E-06 47.2 7.5 30 231-260 234-263 (265)
100 PRK06437 hypothetical protein; 94.9 0.19 4.2E-06 36.5 7.6 58 5-72 5-62 (67)
101 PTZ00147 plasmepsin-1; Provisi 94.9 0.17 3.7E-06 50.0 9.6 92 147-240 137-252 (453)
102 KOG0013|consensus 94.7 0.061 1.3E-06 47.7 5.3 63 9-72 155-217 (231)
103 PRK05863 sulfur carrier protei 94.5 0.18 3.8E-06 36.3 6.5 60 1-72 1-60 (65)
104 PRK06488 sulfur carrier protei 94.5 0.24 5.1E-06 35.5 7.1 60 1-72 1-60 (65)
105 COG5227 SMT3 Ubiquitin-like pr 94.5 0.064 1.4E-06 41.4 4.2 70 7-77 31-100 (103)
106 PRK08364 sulfur carrier protei 93.7 0.51 1.1E-05 34.4 7.7 51 12-72 15-65 (70)
107 PRK05659 sulfur carrier protei 93.5 0.54 1.2E-05 33.5 7.5 61 1-72 1-61 (66)
108 cd05474 SAP_like SAPs, pepsin- 93.5 0.58 1.3E-05 42.5 9.5 75 149-240 2-80 (295)
109 cd06097 Aspergillopepsin_like 93.5 0.4 8.7E-06 43.6 8.4 89 151-240 2-114 (278)
110 cd05487 renin_like Renin stimu 93.4 0.55 1.2E-05 43.8 9.4 91 148-240 7-121 (326)
111 PRK07696 sulfur carrier protei 93.1 0.65 1.4E-05 33.6 7.4 61 1-72 1-62 (67)
112 PRK08053 sulfur carrier protei 92.6 1 2.3E-05 32.3 7.8 61 1-72 1-61 (66)
113 PF15044 CLU_N: Mitochondrial 92.5 0.24 5.2E-06 37.0 4.5 56 17-73 1-58 (76)
114 cd05478 pepsin_A Pepsin A, asp 92.5 0.84 1.8E-05 42.3 9.2 100 153-258 193-316 (317)
115 cd05477 gastricsin Gastricsins 92.3 1.2 2.7E-05 41.2 10.0 100 154-259 188-317 (318)
116 cd05476 pepsin_A_like_plant Ch 92.1 0.47 1E-05 42.9 6.8 76 150-240 2-88 (265)
117 PRK07440 hypothetical protein; 92.0 0.98 2.1E-05 33.0 7.2 56 9-72 10-65 (70)
118 PLN02799 Molybdopterin synthas 92.0 0.94 2E-05 33.7 7.3 66 1-72 2-77 (82)
119 cd05474 SAP_like SAPs, pepsin- 92.0 1.4 3.1E-05 40.0 9.9 95 161-259 179-294 (295)
120 cd05477 gastricsin Gastricsins 92.0 1.1 2.5E-05 41.4 9.4 90 149-240 3-115 (318)
121 cd06098 phytepsin Phytepsin, a 91.9 0.98 2.1E-05 42.0 8.9 91 148-240 9-123 (317)
122 PRK06083 sulfur carrier protei 91.7 0.93 2E-05 34.5 6.9 56 9-72 24-79 (84)
123 PF09379 FERM_N: FERM N-termin 91.3 1.7 3.7E-05 31.7 8.0 67 5-72 1-76 (80)
124 PF05618 Zn_protease: Putative 91.2 0.28 6E-06 40.9 3.9 46 210-255 87-134 (138)
125 cd06096 Plasmepsin_5 Plasmepsi 91.2 0.7 1.5E-05 43.1 7.1 90 160-260 231-323 (326)
126 cd05486 Cathespin_E Cathepsin 91.0 0.95 2.1E-05 42.0 7.8 88 151-240 2-112 (316)
127 cd05478 pepsin_A Pepsin A, asp 91.0 1.6 3.4E-05 40.5 9.2 91 148-240 9-122 (317)
128 cd05485 Cathepsin_D_like Cathe 91.0 1.9 4E-05 40.4 9.8 26 233-258 303-328 (329)
129 cd05488 Proteinase_A_fungi Fun 91.0 1.4 3.1E-05 40.9 8.9 91 148-240 9-122 (320)
130 cd05490 Cathepsin_D2 Cathepsin 90.9 1.2 2.5E-05 41.5 8.3 91 148-240 5-120 (325)
131 PF14836 Ubiquitin_3: Ubiquiti 90.9 1.6 3.5E-05 33.6 7.5 63 10-73 13-80 (88)
132 cd06098 phytepsin Phytepsin, a 90.8 1.3 2.7E-05 41.2 8.4 95 153-258 196-316 (317)
133 PF00026 Asp: Eukaryotic aspar 90.6 0.98 2.1E-05 41.1 7.4 99 154-259 187-316 (317)
134 cd05485 Cathepsin_D_like Cathe 90.3 1.5 3.2E-05 41.1 8.4 91 148-240 10-125 (329)
135 PF10790 DUF2604: Protein of U 90.2 1.3 2.9E-05 32.2 6.1 65 8-73 3-71 (76)
136 cd00565 ThiS ThiaminS ubiquiti 89.6 1.5 3.3E-05 31.2 6.1 56 9-72 5-60 (65)
137 cd05472 cnd41_like Chloroplast 89.2 1.1 2.5E-05 41.0 6.7 79 150-240 2-89 (299)
138 PTZ00165 aspartyl protease; Pr 89.1 1.9 4E-05 43.1 8.5 99 140-240 109-237 (482)
139 PRK06944 sulfur carrier protei 89.0 3.6 7.8E-05 29.0 7.8 60 1-72 1-60 (65)
140 cd01760 RBD Ubiquitin-like dom 88.8 2.7 5.9E-05 31.1 7.2 65 3-71 2-70 (72)
141 cd06097 Aspergillopepsin_like 88.6 1 2.2E-05 40.9 5.8 78 160-258 198-277 (278)
142 COG2104 ThiS Sulfur transfer p 88.4 3.6 7.7E-05 30.1 7.4 63 1-72 1-63 (68)
143 cd05488 Proteinase_A_fungi Fun 88.4 4.1 8.8E-05 37.8 9.9 101 154-258 194-319 (320)
144 TIGR01683 thiS thiamine biosyn 88.3 2 4.3E-05 30.5 6.1 56 9-72 4-59 (64)
145 cd05473 beta_secretase_like Be 88.3 2.1 4.5E-05 40.6 8.0 87 149-240 3-113 (364)
146 cd05472 cnd41_like Chloroplast 88.0 5.2 0.00011 36.6 10.2 28 233-260 270-297 (299)
147 cd00754 MoaD Ubiquitin domain 87.9 2.6 5.7E-05 30.7 6.7 55 12-72 17-75 (80)
148 PF11069 DUF2870: Protein of u 87.8 1.5 3.2E-05 34.5 5.3 30 42-73 3-32 (98)
149 cd05475 nucellin_like Nucellin 87.7 1.6 3.4E-05 39.7 6.5 28 233-260 244-271 (273)
150 PRK11840 bifunctional sulfur c 87.2 2.9 6.2E-05 39.7 7.9 67 1-79 1-67 (326)
151 cd06406 PB1_P67 A PB1 domain i 87.0 2.4 5.2E-05 32.1 6.0 36 12-47 12-47 (80)
152 cd05471 pepsin_like Pepsin-lik 86.1 1.9 4.2E-05 38.3 6.1 81 160-258 202-282 (283)
153 TIGR02958 sec_mycoba_snm4 secr 85.8 3.4 7.4E-05 41.0 8.1 70 2-73 4-80 (452)
154 KOG2086|consensus 85.4 1.8 3.9E-05 41.8 5.6 66 2-67 307-375 (380)
155 PRK01777 hypothetical protein; 85.0 7 0.00015 30.4 7.9 62 1-72 4-75 (95)
156 cd06096 Plasmepsin_5 Plasmepsi 85.0 3.3 7.2E-05 38.6 7.3 91 149-240 3-138 (326)
157 smart00455 RBD Raf-like Ras-bi 84.1 4.2 9E-05 29.8 6.0 48 3-50 2-51 (70)
158 smart00666 PB1 PB1 domain. Pho 83.7 5.5 0.00012 29.2 6.6 38 9-46 9-46 (81)
159 PF11620 GABP-alpha: GA-bindin 83.7 4.2 9.1E-05 31.2 5.9 61 11-72 3-63 (88)
160 KOG4250|consensus 83.3 1.8 3.9E-05 44.9 4.9 65 7-72 321-386 (732)
161 cd05471 pepsin_like Pepsin-lik 83.0 6 0.00013 35.1 7.9 90 151-242 2-115 (283)
162 cd01768 RA RA (Ras-associating 81.5 18 0.0004 26.7 8.9 49 10-58 12-68 (87)
163 cd05487 renin_like Renin stimu 81.0 10 0.00022 35.2 8.9 26 233-258 299-324 (326)
164 cd06407 PB1_NLP A PB1 domain i 81.0 5.1 0.00011 30.3 5.6 42 4-45 2-45 (82)
165 cd06409 PB1_MUG70 The MUG70 pr 80.6 6.5 0.00014 30.1 6.1 43 3-45 3-48 (86)
166 PF03539 Spuma_A9PTase: Spumav 80.5 14 0.00031 31.3 8.4 77 156-244 1-83 (163)
167 cd05475 nucellin_like Nucellin 80.2 5.9 0.00013 35.9 6.8 83 150-240 3-102 (273)
168 PF14451 Ub-Mut7C: Mut7-C ubiq 80.1 7.2 0.00016 29.5 6.1 53 10-72 22-75 (81)
169 PTZ00147 plasmepsin-1; Provisi 79.5 13 0.00029 36.8 9.5 29 233-261 422-450 (453)
170 cd05490 Cathepsin_D2 Cathepsin 79.5 7.9 0.00017 35.9 7.6 26 233-258 299-324 (325)
171 PF08825 E2_bind: E2 binding d 79.5 3.2 7E-05 31.6 4.1 61 15-76 1-74 (84)
172 TIGR01682 moaD molybdopterin c 78.9 14 0.00031 27.1 7.4 55 12-72 17-75 (80)
173 PF00564 PB1: PB1 domain; Int 78.6 8.8 0.00019 28.1 6.2 38 9-46 9-47 (84)
174 cd05486 Cathespin_E Cathepsin 78.4 11 0.00024 34.8 8.2 26 233-258 290-315 (316)
175 PLN03146 aspartyl protease fam 78.1 9 0.00019 37.5 7.8 30 233-262 399-428 (431)
176 smart00295 B41 Band 4.1 homolo 76.6 27 0.00058 29.6 9.6 71 2-73 5-83 (207)
177 TIGR01687 moaD_arch MoaD famil 74.9 31 0.00068 25.6 8.4 57 11-72 16-83 (88)
178 COG5100 NPL4 Nuclear pore prot 74.2 15 0.00032 36.0 7.8 70 1-73 1-79 (571)
179 PF12754 Blt1: Cell-cycle cont 73.6 1.1 2.3E-05 42.1 0.0 43 20-63 103-160 (309)
180 KOG4583|consensus 73.6 1.7 3.8E-05 41.2 1.4 60 3-63 12-75 (391)
181 cd05473 beta_secretase_like Be 73.5 8.1 0.00018 36.5 6.0 31 234-264 319-349 (364)
182 PF14732 UAE_UbL: Ubiquitin/SU 73.3 4.3 9.2E-05 31.0 3.3 57 15-71 2-67 (87)
183 cd05489 xylanase_inhibitor_I_l 73.2 19 0.00041 34.4 8.4 26 234-259 335-360 (362)
184 PTZ00165 aspartyl protease; Pr 72.3 33 0.00072 34.3 10.2 32 232-263 418-449 (482)
185 KOG2982|consensus 71.6 5.1 0.00011 38.1 3.9 59 14-72 351-416 (418)
186 cd05992 PB1 The PB1 domain is 71.6 11 0.00025 27.3 5.2 37 9-45 8-45 (81)
187 PF02196 RBD: Raf-like Ras-bin 71.4 15 0.00033 26.8 5.8 49 3-51 3-53 (71)
188 PTZ00013 plasmepsin 4 (PM4); P 70.8 28 0.0006 34.5 9.2 28 233-260 421-448 (450)
189 cd06408 PB1_NoxR The PB1 domai 70.6 20 0.00042 27.5 6.3 36 2-37 2-38 (86)
190 PF03419 Peptidase_U4: Sporula 69.7 51 0.0011 30.4 10.3 36 147-182 155-201 (293)
191 smart00314 RA Ras association 69.0 38 0.00083 25.2 7.7 51 9-59 14-71 (90)
192 cd01775 CYR1_RA Ubiquitin doma 68.9 22 0.00049 27.9 6.4 65 7-71 9-85 (97)
193 cd06396 PB1_NBR1 The PB1 domai 67.9 17 0.00036 27.6 5.4 35 3-37 1-38 (81)
194 cd06411 PB1_p51 The PB1 domain 67.3 12 0.00027 28.1 4.5 35 12-46 8-42 (78)
195 PF08337 Plexin_cytopl: Plexin 66.6 22 0.00048 36.1 7.6 65 10-74 201-290 (539)
196 PF02597 ThiS: ThiS family; I 65.6 28 0.00061 24.8 6.2 58 12-72 13-72 (77)
197 cd01787 GRB7_RA RA (RAS-associ 63.5 23 0.0005 27.1 5.4 40 5-44 7-47 (85)
198 PF10209 DUF2340: Uncharacteri 63.5 21 0.00046 29.1 5.5 57 16-72 21-107 (122)
199 cd06410 PB1_UP2 Uncharacterize 63.4 27 0.00058 27.3 6.0 40 5-45 17-56 (97)
200 PF14541 TAXi_C: Xylanase inhi 61.5 28 0.00062 28.8 6.3 28 231-258 133-160 (161)
201 TIGR02854 spore_II_GA sigma-E 61.2 20 0.00044 33.3 5.8 35 148-182 157-202 (288)
202 cd01817 RGS12_RBD Ubiquitin do 59.7 70 0.0015 23.8 7.6 65 5-73 4-70 (73)
203 cd01818 TIAM1_RBD Ubiquitin do 59.7 23 0.0005 26.6 4.7 41 5-45 4-44 (77)
204 PTZ00380 microtubule-associate 59.7 10 0.00022 30.9 3.1 57 15-72 45-104 (121)
205 PF02505 MCR_D: Methyl-coenzym 58.5 19 0.00042 30.4 4.6 49 10-65 75-124 (153)
206 PF06234 TmoB: Toluene-4-monoo 57.8 51 0.0011 25.3 6.4 59 13-72 17-83 (85)
207 PRK12751 cpxP periplasmic stre 57.6 58 0.0013 27.8 7.5 32 103-134 87-118 (162)
208 PF12436 USP7_ICP0_bdg: ICP0-b 56.7 16 0.00036 33.1 4.3 59 14-73 88-152 (249)
209 cd01782 AF6_RA_repeat1 Ubiquit 56.5 79 0.0017 25.4 7.5 57 1-57 24-92 (112)
210 PF02824 TGS: TGS domain; Int 56.0 38 0.00083 23.7 5.2 59 3-71 1-59 (60)
211 PF11834 DUF3354: Domain of un 54.3 16 0.00035 26.8 3.1 44 21-71 26-69 (69)
212 PF00788 RA: Ras association ( 53.1 57 0.0012 23.9 6.1 56 4-59 6-73 (93)
213 cd06398 PB1_Joka2 The PB1 doma 51.8 53 0.0011 25.3 5.8 42 4-45 2-50 (91)
214 PRK10363 cpxP periplasmic repr 50.0 83 0.0018 27.1 7.2 54 83-136 57-114 (166)
215 cd01776 Rin1_RA Ubiquitin doma 49.9 38 0.00082 25.9 4.5 44 10-53 13-61 (87)
216 TIGR03260 met_CoM_red_D methyl 49.7 32 0.00069 29.1 4.5 45 10-60 74-118 (150)
217 PF12436 USP7_ICP0_bdg: ICP0-b 49.0 37 0.00079 30.8 5.3 43 2-44 178-223 (249)
218 cd01766 Ufm1 Urm1-like ubiquit 48.7 96 0.0021 23.2 6.4 61 12-73 17-78 (82)
219 PRK11130 moaD molybdopterin sy 48.2 1.1E+02 0.0023 22.5 7.6 53 14-72 18-76 (81)
220 cd01611 GABARAP Ubiquitin doma 47.1 27 0.00059 28.0 3.6 56 16-73 46-106 (112)
221 PF13801 Metal_resist: Heavy-m 46.8 90 0.002 23.6 6.6 27 101-127 73-99 (125)
222 PF03671 Ufm1: Ubiquitin fold 46.3 1.2E+02 0.0026 22.6 6.5 58 12-70 17-75 (76)
223 PF06487 SAP18: Sin3 associate 46.0 27 0.00058 28.4 3.5 59 11-71 37-119 (120)
224 PF00276 Ribosomal_L23: Riboso 44.6 58 0.0013 24.9 5.0 42 10-51 20-62 (91)
225 PF02991 Atg8: Autophagy prote 44.3 38 0.00083 26.8 4.1 54 17-72 39-97 (104)
226 PRK12750 cpxP periplasmic repr 42.8 91 0.002 26.8 6.5 28 103-130 94-121 (170)
227 PF14533 USP7_C2: Ubiquitin-sp 42.1 1.2E+02 0.0027 26.6 7.5 119 11-187 34-161 (213)
228 KOG3391|consensus 41.9 27 0.00058 29.2 2.9 55 18-73 59-136 (151)
229 KOG2689|consensus 41.8 78 0.0017 29.5 6.2 70 2-72 212-286 (290)
230 TIGR00691 spoT_relA (p)ppGpp s 40.8 4.1E+02 0.0088 27.9 12.0 62 2-73 361-422 (683)
231 KOG2561|consensus 40.8 23 0.0005 35.2 2.7 56 16-72 55-110 (568)
232 KOG2507|consensus 40.6 41 0.00089 33.2 4.4 72 2-73 316-391 (506)
233 cd05479 RP_DDI RP_DDI; retrope 39.0 45 0.00098 26.6 3.8 36 245-280 77-112 (124)
234 PF08154 NLE: NLE (NUC135) dom 38.0 1.4E+02 0.003 21.2 5.9 45 13-60 18-64 (65)
235 KOG1339|consensus 37.8 1.5E+02 0.0032 28.5 7.9 91 148-240 45-181 (398)
236 PF09269 DUF1967: Domain of un 37.0 19 0.00042 26.1 1.2 16 55-70 47-62 (69)
237 cd01777 SNX27_RA Ubiquitin dom 36.7 79 0.0017 24.3 4.6 41 2-42 3-43 (87)
238 PF14543 TAXi_N: Xylanase inhi 36.4 41 0.00089 28.2 3.3 25 150-174 1-27 (164)
239 TIGR03595 Obg_CgtA_exten Obg f 34.7 25 0.00054 25.6 1.5 17 55-71 47-63 (69)
240 PF00077 RVP: Retroviral aspar 34.1 72 0.0016 23.8 4.1 37 243-280 60-96 (100)
241 PRK10872 relA (p)ppGpp synthet 34.0 6.1E+02 0.013 27.1 13.5 62 2-73 405-466 (743)
242 cd01764 Urm1 Urm1-like ubuitin 34.0 1.1E+02 0.0023 23.5 5.1 56 15-72 23-89 (94)
243 PRK10455 periplasmic protein; 33.3 2.5E+02 0.0053 23.9 7.6 48 83-130 63-114 (161)
244 PRK05738 rplW 50S ribosomal pr 33.0 73 0.0016 24.5 4.0 40 10-49 20-60 (92)
245 cd06397 PB1_UP1 Uncharacterize 32.9 1.2E+02 0.0026 23.1 4.9 37 9-45 8-44 (82)
246 cd01778 RASSF1_RA Ubiquitin-li 32.8 2.4E+02 0.0052 22.1 7.3 51 7-57 13-73 (96)
247 PF03931 Skp1_POZ: Skp1 family 32.2 34 0.00075 24.0 1.9 32 1-32 1-32 (62)
248 PF03658 Ub-RnfH: RnfH family 31.8 1.9E+02 0.004 22.1 5.9 66 1-72 1-72 (84)
249 PF08284 RVP_2: Retroviral asp 30.6 67 0.0014 26.3 3.6 37 244-280 81-117 (135)
250 PF02192 PI3K_p85B: PI3-kinase 30.2 64 0.0014 24.2 3.1 21 13-33 2-22 (78)
251 PF14847 Ras_bdg_2: Ras-bindin 30.0 2.4E+02 0.0052 22.3 6.5 36 3-38 3-38 (105)
252 KOG4572|consensus 28.4 86 0.0019 33.7 4.6 64 9-72 3-69 (1424)
253 cd01612 APG12_C Ubiquitin-like 27.7 79 0.0017 24.1 3.3 59 15-73 20-81 (87)
254 TIGR03636 L23_arch archaeal ri 27.7 1E+02 0.0022 23.0 3.8 33 10-42 14-46 (77)
255 PF13856 Gifsy-2: ATP-binding 27.6 64 0.0014 24.6 2.8 18 152-169 22-39 (95)
256 PRK14548 50S ribosomal protein 27.0 1.1E+02 0.0024 23.3 3.9 33 10-42 21-53 (84)
257 PF04126 Cyclophil_like: Cyclo 26.2 59 0.0013 26.2 2.5 29 1-30 1-29 (120)
258 COG1730 GIM5 Predicted prefold 25.2 83 0.0018 26.5 3.3 42 131-174 47-89 (145)
259 KOG3048|consensus 25.1 47 0.001 28.0 1.7 52 115-174 41-96 (153)
260 PF12685 SpoIIIAH: SpoIIIAH-li 23.9 4.8E+02 0.01 22.6 10.4 56 104-169 107-169 (196)
261 PF14533 USP7_C2: Ubiquitin-sp 23.5 75 0.0016 28.1 2.9 30 10-39 132-161 (213)
262 PF07946 DUF1682: Protein of u 23.4 2.1E+02 0.0044 27.0 6.0 19 109-127 299-317 (321)
263 cd01666 TGS_DRG_C TGS_DRG_C: 23.1 2.4E+02 0.0052 20.8 5.1 53 13-71 17-74 (75)
264 PF00031 Cystatin: Cystatin do 22.9 1.7E+02 0.0037 21.5 4.4 23 108-130 5-27 (94)
265 PF07929 PRiA4_ORF3: Plasmid p 22.8 2.6E+02 0.0057 23.6 6.0 60 12-71 19-96 (179)
266 COG0089 RplW Ribosomal protein 22.6 1.3E+02 0.0028 23.5 3.6 40 9-48 20-60 (94)
267 smart00144 PI3K_rbd PI3-kinase 22.4 3.8E+02 0.0083 20.9 7.8 62 10-72 28-103 (108)
268 cd06396 PB1_NBR1 The PB1 domai 21.9 1.2E+02 0.0027 22.9 3.3 35 151-185 2-38 (81)
269 KOG3439|consensus 21.7 1.8E+02 0.0038 23.5 4.3 38 13-50 47-84 (116)
270 CHL00030 rpl23 ribosomal prote 21.7 1.5E+02 0.0032 23.0 3.8 40 9-48 18-58 (93)
271 COG2080 CoxS Aerobic-type carb 21.4 1.3E+02 0.0028 25.7 3.7 47 1-51 2-58 (156)
272 PRK15326 type III secretion sy 21.1 3.7E+02 0.0081 20.3 7.9 32 103-134 30-61 (80)
273 smart00143 PI3K_p85B PI3-kinas 20.7 1.2E+02 0.0025 22.9 2.9 21 13-33 2-22 (78)
No 1
>KOG0012|consensus
Probab=100.00 E-value=1.1e-61 Score=446.68 Aligned_cols=273 Identities=42% Similarity=0.684 Sum_probs=245.3
Q ss_pred CEEEEEcC--CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCC-cCCccccCCCCCCEEEEEEec---
Q psy8018 1 MKVTVTSL--DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDN-KKTLKDYNIQDGDLVLLKRVQ--- 74 (280)
Q Consensus 1 M~ItV~~~--~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~-~~tL~~~gI~~gd~l~l~~~~--- 74 (280)
|.+||.+. ....++++|..+..+.+|+++++.++|++.+.-.|+||++++.++ +.+|.+||+++||++.+..+.
T Consensus 1 M~~tvs~~l~~~~~~~i~v~~dg~L~nl~aL~~~d~g~~~~~~~li~n~~~l~s~~s~~l~Q~g~~~~dsl~lr~ks~d~ 80 (380)
T KOG0012|consen 1 MSLTVSVALNFEKKFPIPVTTDGELNNLAALCWKDTGIVYDPSDLIYNPRPLVSNESQGLTQIGLKDGDSLALRCKSSDP 80 (380)
T ss_pred CeEEEEEEecceeeeccccccccchhhHHHHHHHHhCcccchhhcccCCCccccchhhhhhhcccccceeEeccCCCCCC
Confidence 78888865 677889999999999999999999999999999999999999765 778999999999999997760
Q ss_pred ------------------cCCCC-----------------------------C---------------------------
Q psy8018 75 ------------------VARTS-----------------------------N--------------------------- 80 (280)
Q Consensus 75 ------------------~p~~s-----------------------------~--------------------------- 80 (280)
.|+.+ .
T Consensus 81 r~~v~~~~~~~~dFat~A~~~fs~q~a~~~~gaq~~rg~~~~di~~a~~~~ldsp~~~Rq~~la~pf~L~~~r~~lp~La 160 (380)
T KOG0012|consen 81 RPIVPIQVRLISDFATIAVPMFSSQRARQLQGAQRTRGRLQTDIPEASSLSLDSPATFRQALLAMPFFLHLDRAYLPPLA 160 (380)
T ss_pred CccccccceehhcccccccccccchhccccccccccccccccccccccccCcCCHHHHHHHHhcCchhhhhchhhcCccc
Confidence 00000 0
Q ss_pred ------ChhHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhhHHHHHHHhHhhCCCcceeeeEEEEEE
Q psy8018 81 ------PSDDFARILRQQYEEREKREQLRQRMLTADPFDTEAQKLIAEEIKKSNIQANMEAAMEYNPETFGSVIMLYINC 154 (280)
Q Consensus 81 ------~~~~f~~~~~~~~~~~~~~~~~~~~~~~~dp~~~e~q~~i~e~i~q~~i~~~~~~a~e~~Pe~~~~~~~l~i~~ 154 (280)
+.++|.+.+++.+..+..++..+++++++||||+|+|++|+|.|+|++|+|||.+|+||+||.|+.|+|+||+|
T Consensus 161 ~~l~~g~~~k~~~~~~~~q~d~~rr~~~~~rl~eanPfd~E~q~rIee~irq~~i~eq~~~ai~~~pe~f~~v~ML~iN~ 240 (380)
T KOG0012|consen 161 ALLVLGDLEKFDRTLKEMQKDYQRRSVHQRRLLEANPFDLEAQRRIEEKIRQNAIDEQMSHAIEYHPEDFTQVTMLYINC 240 (380)
T ss_pred hhhcccchhhhhhhhhhhccccchhhhhhHHHHhcCCcchhhhhhhhHHHHHHHHHHHHHHhhhcCccccccceEEEEEE
Confidence 34555555555555556566778899999999999999999999999999999999999999999999999999
Q ss_pred EEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCccc
Q psy8018 155 RVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPMDM 234 (280)
Q Consensus 155 ~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~d~ 234 (280)
+|||+|||||||||||.|+||.+||+||||.+++|+||.|++.|||+.++.|+||.++++||+.+++|+|.|++..++|+
T Consensus 241 ~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~c~ftV~d~~~~d~ 320 (380)
T KOG0012|consen 241 EINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLPCSFTVLDRRDMDL 320 (380)
T ss_pred EECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeeccceEEecCCCcch
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccchhhhhcceEEEEecccEEEeeeeecCCCccceeeccc
Q psy8018 235 LLGLDMLKRHQVQIAIENDFLTTSFSVLEDQPMDMLLGLD 274 (280)
Q Consensus 235 ILG~D~L~~~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~ 274 (280)
+||+|+||+|+|+||+++|+|+|+.+.- ++||-+.-+++
T Consensus 321 llGLd~Lrr~~ccIdL~~~~L~ig~~~t-eiPfl~~~~lp 359 (380)
T KOG0012|consen 321 LLGLDMLRRHQCCIDLKTNVLRIGNTET-EIPFLPSNELP 359 (380)
T ss_pred hhhHHHHHhccceeecccCeEEecCCCc-cccccccccCC
Confidence 9999999999999999999999999866 78887665543
No 2
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=100.00 E-value=3e-44 Score=292.19 Aligned_cols=124 Identities=62% Similarity=1.028 Sum_probs=87.2
Q ss_pred hhhHHHHHHHhHhhCCCcceeeeEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEE
Q psy8018 126 KSNIQANMEAAMEYNPETFGSVIMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRII 205 (280)
Q Consensus 126 q~~i~~~~~~a~e~~Pe~~~~~~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~ 205 (280)
|+||+|||++|+|++||.|++++|+||+|+|||++++||||||||+|+||.+||+||||++++|+|+.|++.|+|+++++
T Consensus 1 q~~i~~~~~~a~e~~PE~f~~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~ 80 (124)
T PF09668_consen 1 QENIDENLENAMEHSPESFGQVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKIL 80 (124)
T ss_dssp -------------------------EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEE
T ss_pred ChhHHHHHHHHHHhCcHhhcCcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCcee
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eEEEEEEEEECCeEEEeEEEEecCCCcccccchhhhhcceEEEE
Q psy8018 206 GRIHMVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQVQIA 249 (280)
Q Consensus 206 g~i~~~~i~IG~~~~~~~~~Vl~~~~~d~ILG~D~L~~~~~~ID 249 (280)
|++|.++++||+.++||+|.|+++.+.|+|||+|||++|+|+||
T Consensus 81 G~Ih~~~l~ig~~~~~~s~~Vle~~~~d~llGld~L~~~~c~ID 124 (124)
T PF09668_consen 81 GRIHSVQLKIGGLFFPCSFTVLEDQDVDLLLGLDMLKRHKCCID 124 (124)
T ss_dssp EEEEEEEEEETTEEEEEEEEEETTSSSSEEEEHHHHHHTT-EEE
T ss_pred EEEEEEEEEECCEEEEEEEEEeCCCCcceeeeHHHHHHhCcccC
Confidence 99999999999999999999999999999999999999999998
No 3
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=100.00 E-value=1.6e-36 Score=235.01 Aligned_cols=101 Identities=25% Similarity=0.467 Sum_probs=96.8
Q ss_pred EEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCcccccc-ceeeeeecCce-eeEEeEEEEEEEEECCeEEEeEEEEecC
Q psy8018 152 INCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDT-RWAGIAKGVGV-QRIIGRIHMVQIAIENDFLTTSFSVLED 229 (280)
Q Consensus 152 i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~-r~~g~a~gvG~-~~~~g~i~~~~i~IG~~~~~~~~~Vl~~ 229 (280)
|+|++||+|++||||||||+|+||++||+||||+++++. |+.|+|.|||+ .+++||+|.++++||+.++||+|+|+++
T Consensus 1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~CSftVld~ 80 (103)
T cd05480 1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVECSAQVVDD 80 (103)
T ss_pred CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEeeEEEEEEcC
Confidence 589999999999999999999999999999999988776 88899999998 6999999999999999999999999999
Q ss_pred CCcccccchhhhhcceEEEEecc
Q psy8018 230 QPMDMLLGLDMLKRHQVQIAIEN 252 (280)
Q Consensus 230 ~~~d~ILG~D~L~~~~~~ID~~~ 252 (280)
.++|++||+|+|++|+|+||+++
T Consensus 81 ~~~d~llGLdmLkrhqc~IdL~k 103 (103)
T cd05480 81 NEKNFSLGLQTLKSLKCVINLEK 103 (103)
T ss_pred CCcceEeeHHHHhhcceeeeccC
Confidence 99999999999999999999975
No 4
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.97 E-value=7.3e-31 Score=214.22 Aligned_cols=124 Identities=60% Similarity=0.968 Sum_probs=116.6
Q ss_pred HHhHhhCCCcceeeeEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEE
Q psy8018 134 EAAMEYNPETFGSVIMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQI 213 (280)
Q Consensus 134 ~~a~e~~Pe~~~~~~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i 213 (280)
++|+|++||.|..++++|++++|||++++|||||||++|+||+++|+|+||+...+.++.+.+.|+|+....|+++.+++
T Consensus 1 ~~~~~~~~~~~~~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l 80 (124)
T cd05479 1 QNAMEHHPESFGKVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQV 80 (124)
T ss_pred CchhhcCcchhceeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEE
Confidence 35799999999999999999999999999999999999999999999999987666677777889888888999999999
Q ss_pred EECCeEEEeEEEEecCCCcccccchhhhhcceEEEEecccEEEe
Q psy8018 214 AIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQVQIAIENDFLTT 257 (280)
Q Consensus 214 ~IG~~~~~~~~~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i 257 (280)
+||++.++++|.|++..++|+|||||||++++++|||++++|+|
T Consensus 81 ~i~~~~~~~~~~Vl~~~~~d~ILG~d~L~~~~~~ID~~~~~i~~ 124 (124)
T cd05479 81 KIGNLFLPCSFTVLEDDDVDFLIGLDMLKRHQCVIDLKENVLRI 124 (124)
T ss_pred EECCEEeeeEEEEECCCCcCEEecHHHHHhCCeEEECCCCEEEC
Confidence 99999999999999988999999999999999999999999985
No 5
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=99.84 E-value=3e-20 Score=153.99 Aligned_cols=112 Identities=21% Similarity=0.270 Sum_probs=95.1
Q ss_pred eEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccc-cccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEE
Q psy8018 148 IMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRL-IDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSV 226 (280)
Q Consensus 148 ~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~-~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~V 226 (280)
.-+...+.|+++++.+||||||++||||.++|.+++|+.. ....+ -+..+.|.....+.++.+++.+++..+..+|.|
T Consensus 20 ~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~-~V~~~g~~~~~~~~~~~~~~~i~g~~~~~dl~v 98 (135)
T PF08284_consen 20 DVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPI-VVSAPGGSINCEGVCPDVPLSIQGHEFVVDLLV 98 (135)
T ss_pred CeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCee-EEecccccccccceeeeEEEEECCeEEEeeeEE
Confidence 4456789999999999999999999999999999999765 33322 222222345567888889999999999999999
Q ss_pred ecCCCcccccchhhhhcceEEEEecccEEEeeee
Q psy8018 227 LEDQPMDMLLGLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 227 l~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
++..++|+|||||||++|++.|||.+++++|..+
T Consensus 99 l~l~~~DvILGm~WL~~~~~~IDw~~k~v~f~~p 132 (135)
T PF08284_consen 99 LDLGGYDVILGMDWLKKHNPVIDWATKTVTFNSP 132 (135)
T ss_pred ecccceeeEeccchHHhCCCEEEccCCEEEEeCC
Confidence 9988999999999999999999999999999765
No 6
>cd01807 GDX_N ubiquitin-like domain of GDX. GDX contains an N-terminal ubiquitin-like domain as well as an uncharacterized c-terminal domain. The function of GDX is unknown.
Probab=99.81 E-value=1.5e-19 Score=134.47 Aligned_cols=72 Identities=28% Similarity=0.359 Sum_probs=70.1
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.++++|++++||++||++|++++|+|+++|+|+|+|++|.|+ .||++|||++|++|+|+.+
T Consensus 1 m~i~vk~~~G~~~~l~v~~~~tV~~lK~~i~~~~gi~~~~q~L~~~G~~L~d~-~~L~~~~i~~~~~l~l~~~ 72 (74)
T cd01807 1 MFLTVKLLQGRECSLQVSEKESVSTLKKLVSEHLNVPEEQQRLLFKGKALADD-KRLSDYSIGPNAKLNLVVR 72 (74)
T ss_pred CEEEEEeCCCCEEEEEECCCCcHHHHHHHHHHHHCCCHHHeEEEECCEECCCC-CCHHHCCCCCCCEEEEEEc
Confidence 89999999999999999999999999999999999999999999999999986 9999999999999999985
No 7
>cd01797 NIRF_N amino-terminal ubiquitin-like domain of Np95 and NIRF. NIRF_N This CD represents the amino-terminal ubiquitin-like domain of a family of nuclear proteins that includes Np95 and NIRF (Np95/ICBP90-like RING finger) protein. Both Np95 and NIRF have a domain architecture consisting of a ubiquitin-like domain, a PHD finger, a YDG/SRA domain, Rb-binding motifs and a RING finger domain. Both Np95 and NIRF are ubiquitin ligases that ubiquitinate PCNP (PEST-containing nuclear proteins). While Np95 is capable of binding histones, NIRF is involved in cell cycle regulation.
Probab=99.78 E-value=7.9e-19 Score=132.35 Aligned_cols=75 Identities=21% Similarity=0.284 Sum_probs=70.3
Q ss_pred CEEEEEcCCCcE-EEEE-eCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEeccC
Q psy8018 1 MKVTVTSLDNEC-FLLD-VSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQVA 76 (280)
Q Consensus 1 M~ItV~~~~g~~-~~le-V~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~~p 76 (280)
|+|+|++.+|+. ++++ ++++.||++||++|++.+|+|+++|+|+|+|++|.|+ .||++|||++|++|+|+.++-|
T Consensus 1 M~I~vk~~~G~~~~~l~~v~~~~TV~~lK~~i~~~~gi~~~~QrLi~~Gk~L~D~-~tL~~y~i~~~~~i~l~~~~~~ 77 (78)
T cd01797 1 MWIQVRTMDGKETRTVDSLSRLTKVEELREKIQELFNVEPECQRLFYRGKQMEDG-HTLFDYNVGLNDIIQLLVRQDP 77 (78)
T ss_pred CEEEEEcCCCCEEEEeeccCCcCcHHHHHHHHHHHhCCCHHHeEEEeCCEECCCC-CCHHHcCCCCCCEEEEEEecCC
Confidence 999999999997 7895 8999999999999999999999999999999999986 8999999999999999987544
No 8
>cd01793 Fubi Fubi ubiquitin-like protein. Fubi is a ubiquitin-like protein encoded by the fau gene which has an N-terminal ubiquitin-like domain (also referred to as FUBI) fused to the ribosomal protein S30. Fubi is thought to be a tumor suppressor protein and the FUBI domain may act as a substitute or an inhibitor of ubiquitin or one of ubiquitin's close relatives UCRP, FAT10, and Nedd8.
Probab=99.78 E-value=1.4e-18 Score=129.37 Aligned_cols=70 Identities=19% Similarity=0.254 Sum_probs=66.9
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+||+. ++++++|+|++||++||++|++++|+|+++|+|+|+|++|.|+ +||++|||+++++|+++.+
T Consensus 1 mqi~vk~~--~~~~l~v~~~~tV~~lK~~i~~~~gip~~~q~Li~~Gk~L~D~-~tL~~~~i~~~~tl~l~~~ 70 (74)
T cd01793 1 MQLFVRAQ--NTHTLEVTGQETVSDIKAHVAGLEGIDVEDQVLLLAGVPLEDD-ATLGQCGVEELCTLEVAGR 70 (74)
T ss_pred CEEEEECC--CEEEEEECCcCcHHHHHHHHHhhhCCCHHHEEEEECCeECCCC-CCHHHcCCCCCCEEEEEEe
Confidence 89999983 7899999999999999999999999999999999999999986 9999999999999999985
No 9
>PTZ00044 ubiquitin; Provisional
Probab=99.76 E-value=4.1e-18 Score=126.95 Aligned_cols=72 Identities=18% Similarity=0.373 Sum_probs=69.8
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|++++++|++++||++||++|++.+|+|+++|+|+|+|++|.|+ .+|++|||++|++|++..+
T Consensus 1 m~i~vk~~~G~~~~l~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d~-~~l~~~~i~~~~~i~l~~~ 72 (76)
T PTZ00044 1 MQILIKTLTGKKQSFNFEPDNTVQQVKMALQEKEGIDVKQIRLIYSGKQMSDD-LKLSDYKVVPGSTIHMVLQ 72 (76)
T ss_pred CEEEEEeCCCCEEEEEECCCCcHHHHHHHHHHHHCCCHHHeEEEECCEEccCC-CcHHHcCCCCCCEEEEEEE
Confidence 89999999999999999999999999999999999999999999999999875 8999999999999999986
No 10
>cd01791 Ubl5 UBL5 ubiquitin-like modifier. UBL5 (also known as HUB1) is a ubiquitin-like modifier that is both widely expressed and highly phylogenetically conserved. At the C-terminal end of the ubiquitin-like fold of UBL5 is a di-tyrosine motif followed by a single variable residue instead of the characteristic di-glycine found in all other ubiquitin-like modifiers. ULB5 interacts with a cyclin-like kinase called CLK4 but not with other cyclin-like kinase family members.
Probab=99.75 E-value=4.4e-18 Score=126.75 Aligned_cols=70 Identities=20% Similarity=0.269 Sum_probs=67.5
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|+|+|++..|+.+.++|+|+.||++||++|++..|+|+++|+|+|.|++|.|+ +||++|||++|++|||-
T Consensus 2 ~~i~vkt~~Gk~~~~~v~~~~TV~~LK~~I~~~~~~~~~~qrLi~~Gk~L~D~-~tL~~ygi~~~stv~l~ 71 (73)
T cd01791 2 IEVVCNDRLGKKVRVKCNPDDTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKDH-ISLGDYEIHDGMNLELY 71 (73)
T ss_pred EEEEEECCCCCEEEEEeCCCCcHHHHHHHHHHHhCCChHHEEEEeCCcCCCCC-CCHHHcCCCCCCEEEEE
Confidence 78999999999999999999999999999999999999999999999999986 89999999999999985
No 11
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.75 E-value=2.5e-17 Score=134.00 Aligned_cols=108 Identities=20% Similarity=0.231 Sum_probs=84.8
Q ss_pred eeeeEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEE-eE
Q psy8018 145 GSVIMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLT-TS 223 (280)
Q Consensus 145 ~~~~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~-~~ 223 (280)
.+..+++++++|||+++.|+|||||++|++|+++|+++|+..... .+.....+.++.....++....+++|+..+. ++
T Consensus 7 ~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~-~~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~ 85 (121)
T TIGR02281 7 DGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRL-GYTVTVSTANGQIKAARVTLDRVAIGGIVVNDVD 85 (121)
T ss_pred cCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccC-CceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcE
Confidence 355789999999999999999999999999999999999975321 1122334444444466677779999998776 88
Q ss_pred EEEecCC-Ccccccchhhhhcce-EEEEeccc
Q psy8018 224 FSVLEDQ-PMDMLLGLDMLKRHQ-VQIAIEND 253 (280)
Q Consensus 224 ~~Vl~~~-~~d~ILG~D~L~~~~-~~ID~~~~ 253 (280)
+.|++.. ..++|||||||++++ +.||-.+-
T Consensus 86 ~~v~~~~~~~~~LLGm~fL~~~~~~~~~~~~l 117 (121)
T TIGR02281 86 AMVAEGGALSESLLGMSFLNRLSRFTVRGGKL 117 (121)
T ss_pred EEEeCCCcCCceEcCHHHHhccccEEEECCEE
Confidence 9999865 357999999999996 87776443
No 12
>cd01805 RAD23_N Ubiquitin-like domain of RAD23. RAD23 belongs to a family of adaptor molecules having affinity for both the proteasome and ubiquitinylated proteins and thought to shuttle these ubiquitinylated proteins to the proteasome for destruction. RAD23 interacts with ubiquitin through its C-terminal ubiquitin-associated domains (UBA) and with the proteasome through its N-terminal ubiquitin-like domain (UBL).
Probab=99.74 E-value=9.5e-18 Score=125.15 Aligned_cols=72 Identities=26% Similarity=0.491 Sum_probs=69.6
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCC--CCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGI--SAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gi--p~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.++++|++++||.+||+.|++.+|+ |+++|+|+|+|++|.|+ .+|++|||++|++|+++.+
T Consensus 1 m~i~vk~~~g~~~~l~v~~~~TV~~lK~~i~~~~~i~~~~~~q~L~~~G~~L~d~-~~L~~~~i~~~~~i~~~~~ 74 (77)
T cd01805 1 MKITFKTLKQQTFPIEVDPDDTVAELKEKIEEEKGCDYPPEQQKLIYSGKILKDD-TTLEEYKIDEKDFVVVMVS 74 (77)
T ss_pred CEEEEEeCCCCEEEEEECCCCcHHHHHHHHHHhhCCCCChhHeEEEECCEEccCC-CCHHHcCCCCCCEEEEEEe
Confidence 999999999999999999999999999999999999 99999999999999876 8999999999999999876
No 13
>cd01796 DDI1_N DNA damage inducible protein 1 ubiquitin-like domain. DDI1_N DDI1 (DNA damage inducible protein 1) has an amino-terminal ubiquitin-like domain, an retroviral protease-like (RVP-like) domain, and a UBA (ubiquitin-associated) domain. This CD represents the amino-terminal ubiquitin-like domain of DDI1.
Probab=99.74 E-value=4.9e-18 Score=125.65 Aligned_cols=69 Identities=51% Similarity=0.837 Sum_probs=65.7
Q ss_pred EEEEcC-CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 3 VTVTSL-DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 3 ItV~~~-~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|||++. +|++++++|++++||++||++|++++|+|+++|+|+|+|++|.|++.+|++|||++|++|+|.
T Consensus 1 l~v~~~~~g~~~~l~v~~~~TV~~lK~~I~~~~gip~~~q~Li~~Gk~L~D~~~~L~~~gi~~~~~l~l~ 70 (71)
T cd01796 1 ITVYTARSETTFSLDVDPDLELENFKALCEAESGIPASQQQLIYNGRELVDNKRLLALYGVKDGDLVVLR 70 (71)
T ss_pred CEEEECCCCCEEEEEECCcCCHHHHHHHHHHHhCCCHHHeEEEECCeEccCCcccHHHcCCCCCCEEEEe
Confidence 689998 999999999999999999999999999999999999999999887679999999999999985
No 14
>cd01810 ISG15_repeat2 ISG15 ubiquitin-like protein, second repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains and becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.74 E-value=8.4e-18 Score=125.13 Aligned_cols=70 Identities=16% Similarity=0.312 Sum_probs=67.7
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|++++|++++++|++++||++||++|++..|+|+++|+|+|+|++|.|+ +||++|||++|++|+|..+
T Consensus 1 i~vk~~~g~~~~l~v~~~~tV~~lK~~I~~~~gi~~~~q~L~~~G~~L~D~-~tL~~~~i~~~~tl~l~~~ 70 (74)
T cd01810 1 ILVRNDKGRSSIYEVQLTQTVATLKQQVSQRERVQADQFWLSFEGRPMEDE-HPLGEYGLKPGCTVFMNLR 70 (74)
T ss_pred CEEECCCCCEEEEEECCcChHHHHHHHHHHHhCCCHHHeEEEECCEECCCC-CCHHHcCCCCCCEEEEEEE
Confidence 689999999999999999999999999999999999999999999999987 9999999999999999885
No 15
>cd01806 Nedd8 Nebb8-like ubiquitin protein. Nedd8 (also known as Rub1) has a single conserved ubiquitin-like domain that is part of a protein modification pathway similar to that of ubiquitin. Nedd8 modifies a family of molecular scaffold proteins called cullins that are responsible for assembling the ROC1/Rbx1 RING-based E3 ubiquitin ligases, of which several play a direct role in tumorigenesis.
Probab=99.73 E-value=2e-17 Score=122.62 Aligned_cols=72 Identities=25% Similarity=0.432 Sum_probs=69.8
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.+.++|+++.||++||++|+..+|+|++.|+|+|+|+.|.|+ +||++|||++|++|++..+
T Consensus 1 m~i~v~~~~g~~~~~~v~~~~tv~~lK~~i~~~~g~~~~~qrL~~~g~~L~d~-~tl~~~~i~~g~~i~l~~~ 72 (76)
T cd01806 1 MLIKVKTLTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLIYSGKQMNDD-KTAADYKLEGGSVLHLVLA 72 (76)
T ss_pred CEEEEEeCCCCEEEEEECCCCCHHHHHHHHhHhhCCChhhEEEEECCeEccCC-CCHHHcCCCCCCEEEEEEE
Confidence 89999999999999999999999999999999999999999999999999876 8999999999999999986
No 16
>cd01802 AN1_N ubiquitin-like domain of AN1. AN1 (also known as ANUBL1 and RSD-7) is ubiquitin-like protein with a testis-specific expression in rats that has an N-terminal ubiquitin-like domain and a C-terminal zinc-binding domain. Unlike ubiquitin polyproteins and most ubiquitin fusion proteins, the N-terminal ubiquitin-like domain of An1 does not undergo proteolytic processing. The function of AN1 is unknown.
Probab=99.73 E-value=1.1e-17 Score=132.47 Aligned_cols=72 Identities=28% Similarity=0.363 Sum_probs=69.9
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.++++|++++||.+||++|++..|+|+++|+|+|+|++|.|+ .+|++|||++|++|+|+.+
T Consensus 28 M~I~Vk~l~G~~~~leV~~~~TV~~lK~kI~~~~gip~~~QrLi~~Gk~L~D~-~tL~dy~I~~~stL~l~~~ 99 (103)
T cd01802 28 MELFIETLTGTCFELRVSPFETVISVKAKIQRLEGIPVAQQHLIWNNMELEDE-YCLNDYNISEGCTLKLVLA 99 (103)
T ss_pred EEEEEEcCCCCEEEEEeCCCCcHHHHHHHHHHHhCCChHHEEEEECCEECCCC-CcHHHcCCCCCCEEEEEEe
Confidence 89999999999999999999999999999999999999999999999999986 8999999999999999885
No 17
>cd01798 parkin_N amino-terminal ubiquitin-like of parkin protein. parkin_N parkin protein is a RING-type E3 ubiquitin ligase with an amino-terminal ubiquitin-like (Ubl) domain and an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain. Naturally occurring mutations in parkin are thought to cause the disease AR_JP (autosomal-recessive juvenile parkinsonism). Parkin binds the Rpn10 subunit of 26S proteasomes through its Ubl domain.
Probab=99.73 E-value=1e-17 Score=123.27 Aligned_cols=70 Identities=17% Similarity=0.395 Sum_probs=67.1
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|++.+|+.++++|++++||++||++|++++|+|+++|+|+|+|++|.|+ .||++|||++|++|+|+.|
T Consensus 1 i~vk~~~g~~~~~~v~~~~tV~~lK~~i~~~~gi~~~~q~Li~~G~~L~d~-~~l~~~~i~~~stl~l~~~ 70 (70)
T cd01798 1 VYVRTNTGHTFPVEVDPDTDIKQLKEVVAKRQGVPPDQLRVIFAGKELRNT-TTIQECDLGQQSILHAVRR 70 (70)
T ss_pred CEEEcCCCCEEEEEECCCChHHHHHHHHHHHHCCCHHHeEEEECCeECCCC-CcHHHcCCCCCCEEEEEeC
Confidence 689999999999999999999999999999999999999999999999876 9999999999999999864
No 18
>cd01803 Ubiquitin Ubiquitin. Ubiquitin (includes Ubq/RPL40e and Ubq/RPS27a fusions as well as homopolymeric multiubiquitin protein chains)
Probab=99.72 E-value=3e-17 Score=121.72 Aligned_cols=72 Identities=32% Similarity=0.448 Sum_probs=69.9
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.+.++|++++||++||++|+..+|+|++.|+|+|+|+.|.|+ +||++||+++|++|++..+
T Consensus 1 m~i~v~~~~g~~~~~~v~~~~tV~~lK~~i~~~~g~~~~~q~L~~~g~~L~d~-~~L~~~~i~~~~~i~l~~~ 72 (76)
T cd01803 1 MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLSDYNIQKESTLHLVLR 72 (76)
T ss_pred CEEEEEcCCCCEEEEEECCcCcHHHHHHHHHHHhCCCHHHeEEEECCEECCCC-CcHHHcCCCCCCEEEEEEE
Confidence 89999999999999999999999999999999999999999999999999976 8999999999999999986
No 19
>cd01804 midnolin_N Ubiquitin-like domain of midnolin. midnolin_N Midnolin (midbrain nucleolar protein) is expressed in the nucleolus and is thought to regulate genes involved in neurogenesis. Midnolin contains an amino-terminal ubiquitin-like domain.
Probab=99.72 E-value=2.9e-17 Score=123.68 Aligned_cols=71 Identities=21% Similarity=0.302 Sum_probs=68.3
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.++++|+++.||++||+.|+++.|+|+++|+|+|+|++|.|+ +|++|||++|++|+|+..
T Consensus 2 m~I~Vk~~~G~~~~l~v~~~~TV~~LK~~I~~~~~~~~~~qrL~~~Gk~L~d~--~L~~~gi~~~~~i~l~~~ 72 (78)
T cd01804 2 MNLNIHSTTGTRFDLSVPPDETVEGLKKRISQRLKVPKERLALLHRETRLSSG--KLQDLGLGDGSKLTLVPT 72 (78)
T ss_pred eEEEEEECCCCEEEEEECCcCHHHHHHHHHHHHhCCChHHEEEEECCcCCCCC--cHHHcCCCCCCEEEEEee
Confidence 89999999999999999999999999999999999999999999999999874 899999999999999885
No 20
>cd01809 Scythe_N Ubiquitin-like domain of Scythe protein. Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
Probab=99.71 E-value=4.4e-17 Score=119.58 Aligned_cols=72 Identities=24% Similarity=0.505 Sum_probs=69.1
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|+.++++++++.||.+||+.|+..+|+|++.|+|+|+|+.|.|+ ++|++||+++|++|+|..+
T Consensus 1 i~i~vk~~~g~~~~~~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d~-~~L~~~~i~~~~~l~l~~~ 72 (72)
T cd01809 1 IEIKVKTLDSQTHTFTVEEEITVLDLKEKIAEEVGIPVEQQRLIYSGRVLKDD-ETLSEYKVEDGHTIHLVKR 72 (72)
T ss_pred CEEEEEeCCCCEEEEEECCCCcHHHHHHHHHHHHCcCHHHeEEEECCEECCCc-CcHHHCCCCCCCEEEEEeC
Confidence 89999999999999999999999999999999999999999999999999986 8999999999999999763
No 21
>cd01792 ISG15_repeat1 ISG15 ubiquitin-like protein, first repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains that becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.71 E-value=3.5e-17 Score=123.70 Aligned_cols=75 Identities=19% Similarity=0.252 Sum_probs=70.9
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEE--EECCeeeCCCcCCccccCCCCCCEEEEEEeccC
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAI--EFQGNALVDNKKTLKDYNIQDGDLVLLKRVQVA 76 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~L--i~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~~p 76 (280)
|+|+|++.+|+.++++|+++.||.+||+.|+..+|+|+++|+| +|+|++|.|+ +||++|||++|++|+|+.++.|
T Consensus 3 ~~i~Vk~~~G~~~~~~v~~~~TV~~lK~~I~~~~~i~~~~qrL~~~~~G~~L~D~-~tL~~~gi~~gs~l~l~~~~~~ 79 (80)
T cd01792 3 WDLKVKMLGGNEFLVSLRDSMTVSELKQQIAQKIGVPAFQQRLAHLDSREVLQDG-VPLVSQGLGPGSTVLLVVQNCS 79 (80)
T ss_pred eEEEEEeCCCCEEEEEcCCCCcHHHHHHHHHHHhCCCHHHEEEEeccCCCCCCCC-CCHHHcCCCCCCEEEEEEEccC
Confidence 7899999999999999999999999999999999999999999 8999999986 8999999999999999987544
No 22
>cd01794 DC_UbP_C dendritic cell derived ubiquitin-like protein. DC_UbP (dendritic cell derived ubiquitin-like protein) is a ubiquitin-like protein from human dendritic cells that is expressed in the mitochondrion. The ubiquitin-like domain of this protein is found at the C-terminus and lacks the canonical gly-gly motif of ubiquitin required for ubiquitinization. DC_UbP is expressed in tumor cells but not in normal human adult tissue suggesting a role for DC_UbP in tumorogenesis.
Probab=99.70 E-value=4.9e-17 Score=120.12 Aligned_cols=68 Identities=21% Similarity=0.193 Sum_probs=65.0
Q ss_pred EEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 4 TVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 4 tV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.|+.++|+.++++|+|++||.+||++|++..|+|+++|+|+|+|++|.|+ .+|++|||++|++|+|..
T Consensus 2 ~vk~~~G~~~~l~v~~~~TV~~lK~~I~~~~gi~~~~q~Li~~G~~L~D~-~~l~~~~i~~~~tv~~~~ 69 (70)
T cd01794 2 KVRLSTGKDVKLSVSSKDTVGQLKKQLQAAEGVDPCCQRWFFSGKLLTDK-TRLQETKIQKDYVVQVIV 69 (70)
T ss_pred eEEcCCCCEEEEEECCcChHHHHHHHHHHHhCCCHHHeEEEECCeECCCC-CCHHHcCCCCCCEEEEEe
Confidence 57889999999999999999999999999999999999999999999986 899999999999999975
No 23
>cd01808 hPLIC_N Ubiquitin-like domain of hPLIC-1 and hPLIC2. hPLIC-1 and hPLIC-2 (human homologs of the yeast ubiquitin-like Dsk2 protein) are type2 UBL's (ubiquitin-like) proteins that are thought to serve as adaptors that link the ubiquitination machinery to the proteasome. The hPLIC's have an N-terminal UBL domain that binds the S5a subunit of the proteasome and a C-terminal UBA (ubiquitin-associated) domain that binds a ubiquitylated protein.
Probab=99.68 E-value=1.7e-16 Score=117.06 Aligned_cols=71 Identities=28% Similarity=0.404 Sum_probs=66.7
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|++.+|. .+++|+++.||.+||+.|++.+|+|+++|+|+|+|++|.|+ +||++|||++|++|+++.+
T Consensus 1 ~~i~vk~~~g~-~~l~v~~~~TV~~lK~~I~~~~~i~~~~~~Li~~Gk~L~d~-~tL~~~~i~~~stl~l~~~ 71 (71)
T cd01808 1 IKVTVKTPKDK-EEIEIAEDASVKDFKEAVSKKFKANQEQLVLIFAGKILKDT-DTLTQHNIKDGLTVHLVIK 71 (71)
T ss_pred CEEEEEcCCCC-EEEEECCCChHHHHHHHHHHHhCCCHHHEEEEECCeEcCCC-CcHHHcCCCCCCEEEEEEC
Confidence 68999999997 58999999999999999999999999999999999999986 8999999999999999763
No 24
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.67 E-value=8.3e-16 Score=118.40 Aligned_cols=88 Identities=23% Similarity=0.317 Sum_probs=74.4
Q ss_pred EEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCc--eeeEEeEEEEEEEEECCeEEEeEEEEec
Q psy8018 151 YINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVG--VQRIIGRIHMVQIAIENDFLTTSFSVLE 228 (280)
Q Consensus 151 ~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG--~~~~~g~i~~~~i~IG~~~~~~~~~Vl~ 228 (280)
|+.+.|||++++++|||||++|+||.+.+.++|++...+... ...+.+ ..+..|.+ .+.+++|+..+..+|.|++
T Consensus 2 ~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~--~v~~a~G~~~~~~G~~-~~~v~~~~~~~~~~~~v~~ 78 (91)
T cd05484 2 TVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKK--RLRTATGTKLSVLGQI-LVTVKYGGKTKVLTLYVVK 78 (91)
T ss_pred EEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccE--EEEecCCCEeeEeEEE-EEEEEECCEEEEEEEEEEE
Confidence 789999999999999999999999999999999986433322 223333 46778888 6799999999999999999
Q ss_pred CCCcccccchhhhh
Q psy8018 229 DQPMDMLLGLDMLK 242 (280)
Q Consensus 229 ~~~~d~ILG~D~L~ 242 (280)
.. .+.|||+|||.
T Consensus 79 ~~-~~~lLG~~wl~ 91 (91)
T cd05484 79 NE-GLNLLGRDWLD 91 (91)
T ss_pred CC-CCCccChhhcC
Confidence 76 99999999984
No 25
>PF00240 ubiquitin: Ubiquitin family; InterPro: IPR000626 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin is a protein of 76 amino acid residues, found in all eukaryotic cells and whose sequence is extremely well conserved from protozoan to vertebrates. Ubiquitin acts through its post-translational attachment (ubiquitinylation) to other proteins, where these modifications alter the function, location or trafficking of the protein, or targets it for destruction by the 26S proteasome []. The terminal glycine in the C-terminal 4-residue tail of ubiquitin can form an isopeptide bond with a lysine residue in the target protein, or with a lysine in another ubiquitin molecule to form a ubiquitin chain that attaches itself to a target protein. Ubiquitin has seven lysine residues, any one of which can be used to link ubiquitin molecules together, resulting in different structures that alter the target protein in different ways. It appears that Lys(11)-, Lys(29) and Lys(48)-linked poly-ubiquitin chains target the protein to the proteasome for degradation, while mono-ubiquitinylated and Lys(6)- or Lys(63)-linked poly-ubiquitin chains signal reversible modifications in protein activity, location or trafficking []. For example, Lys(63)-linked poly-ubiquitinylation is known to be involved in DNA damage tolerance, inflammatory response, protein trafficking and signal transduction through kinase activation []. In addition, the length of the ubiquitin chain alters the fate of the target protein. Regulatory proteins such as transcription factors and histones are frequent targets of ubquitinylation [].; GO: 0005515 protein binding; PDB: 2DZI_A 2XEW_E 3NOB_E 2KWU_B 2Y5B_F 3PHD_G 2KWV_B 2KOX_A 2XK5_B 3NHE_B ....
Probab=99.67 E-value=3e-16 Score=114.54 Aligned_cols=67 Identities=30% Similarity=0.504 Sum_probs=63.6
Q ss_pred EcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 6 TSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 6 ~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+++|+.++++|+++.||.+||++|++++|+|+++|+|+|+|+.|.|+ .||++|||++|++|+|..+
T Consensus 1 k~~~g~~~~~~v~~~~tV~~lK~~i~~~~~~~~~~~~L~~~G~~L~d~-~tL~~~~i~~~~~I~l~~k 67 (69)
T PF00240_consen 1 KTLSGKTFTLEVDPDDTVADLKQKIAEETGIPPEQQRLIYNGKELDDD-KTLSDYGIKDGSTIHLVIK 67 (69)
T ss_dssp EETTSEEEEEEEETTSBHHHHHHHHHHHHTSTGGGEEEEETTEEESTT-SBTGGGTTSTTEEEEEEES
T ss_pred CCCCCcEEEEEECCCCCHHHhhhhcccccccccccceeeeeeecccCc-CcHHHcCCCCCCEEEEEEe
Confidence 578999999999999999999999999999999999999999999765 9999999999999999875
No 26
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein. This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=99.64 E-value=7.4e-16 Score=112.96 Aligned_cols=70 Identities=23% Similarity=0.369 Sum_probs=66.0
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+|+.. |+.++++|+++.||++||++|+..+|+|++.|+|+|+|+.|.|+ ++|++|||++|++|+++.
T Consensus 1 i~i~vk~~-g~~~~i~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~l~d~-~~L~~~~i~~g~~l~v~~ 70 (71)
T cd01812 1 IRVRVKHG-GESHDLSISSQATFGDLKKMLAPVTGVEPRDQKLIFKGKERDDA-ETLDMSGVKDGSKVMLLE 70 (71)
T ss_pred CEEEEEEC-CEEEEEEECCCCcHHHHHHHHHHhhCCChHHeEEeeCCcccCcc-CcHHHcCCCCCCEEEEec
Confidence 68999996 99999999999999999999999999999999999999999875 899999999999999863
No 27
>cd01790 Herp_N Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein. Herp (Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein) , is an integral membrane protein that is induced by the endoplasmic reticulum (ER) stress response pathway and is involved in improving the balance of folding capacity and protein loads in the ER. Herp has an N-terminal ubiquitin-like domain that is involved in Herp degradation, but is not necessary for its enhancement of amyloid beta-protein generation.
Probab=99.64 E-value=5.9e-16 Score=116.73 Aligned_cols=71 Identities=14% Similarity=0.134 Sum_probs=63.0
Q ss_pred CEEEEEcCCCcE--EEEEeCCCccHHHHHHHHHHHhC--CCCcceEEEECCeeeCCCcCCccccC--CCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNEC--FLLDVSEDLELENFKAFCEVQSG--ISAQEIAIEFQGNALVDNKKTLKDYN--IQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~--~~leV~~~~tV~~LK~~Ie~e~g--ip~~~Q~Li~~Gk~L~d~~~tL~~~g--I~~gd~l~l~~ 72 (280)
+.|+||+++|+. +++++++++||.+||++|+...+ .|+++|+|||+||+|+|+ .||++|+ +++|.+|||+-
T Consensus 2 i~l~IK~~~~~~~~~~ve~~~~~TV~~lK~~i~~~~~~~~~~~~QrLIy~GKiLkD~-~tL~~~~~~~~~~~tiHLV~ 78 (79)
T cd01790 2 VTLLIKSPNQKYEDQTVSCFLNWTVGELKTHLSRVYPSKPLEQDQRLIYSGKLLPDH-LKLRDVLRKQDEYHMVHLVC 78 (79)
T ss_pred eEEEEECCCCCeEEEEEecCCcChHHHHHHHHHHhcCCCCChhHeEEEEcCeeccch-hhHHHHhhcccCCceEEEEe
Confidence 578999999999 55566899999999999999874 558999999999999976 9999997 99999999974
No 28
>PF13650 Asp_protease_2: Aspartyl protease
Probab=99.62 E-value=4.8e-15 Score=112.01 Aligned_cols=88 Identities=27% Similarity=0.450 Sum_probs=70.9
Q ss_pred EEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEE-EeEEEEec-C
Q psy8018 152 INCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFL-TTSFSVLE-D 229 (280)
Q Consensus 152 i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~-~~~~~Vl~-~ 229 (280)
|+++|||++++|++||||+.|+|++++|+++|+...... ......|.++...........+++|+..+ .+.+.|++ .
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~~~~~~v~~~~ 79 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKS-VPISVSGAGGSVTVYRGRVDSITIGGITLKNVPFLVVDLG 79 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCc-eeEEEEeCCCCEEEEEEEEEEEEECCEEEEeEEEEEECCC
Confidence 689999999999999999999999999999999755321 12234566655555555666999999876 68899998 6
Q ss_pred CCcccccchhh
Q psy8018 230 QPMDMLLGLDM 240 (280)
Q Consensus 230 ~~~d~ILG~D~ 240 (280)
...|+|||+||
T Consensus 80 ~~~~~iLG~df 90 (90)
T PF13650_consen 80 DPIDGILGMDF 90 (90)
T ss_pred CCCEEEeCCcC
Confidence 78999999998
No 29
>cd01800 SF3a120_C Ubiquitin-like domain of Mammalian splicing factor SF3a_120. SF3a120_C Mammalian splicing factor SF3a consists of three subunits of 60, 66, and 120 kDa and functions early during pre-mRNA splicing by converting the U2 snRNP to its active form. The 120kDa subunit (SF3a120) has a carboxy-terminal ubiquitin-like domain and two SWAP (suppressor-of-white-apricot) domains, referred to collectively as the SURP module, at its amino-terminus.
Probab=99.62 E-value=1.7e-15 Score=113.38 Aligned_cols=65 Identities=17% Similarity=0.345 Sum_probs=62.4
Q ss_pred CCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 8 LDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 8 ~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
++|++++++|++++||++||++|+..+|+|+++|+|+|+|+.|.|+ +||++|||++|++|+|+.+
T Consensus 5 l~g~~~~l~v~~~~TV~~lK~~i~~~~gip~~~q~L~~~G~~L~d~-~tL~~~~i~~g~~l~v~~~ 69 (76)
T cd01800 5 LNGQMLNFTLQLSDPVSVLKVKIHEETGMPAGKQKLQYEGIFIKDS-NSLAYYNLANGTIIHLQLK 69 (76)
T ss_pred cCCeEEEEEECCCCcHHHHHHHHHHHHCCCHHHEEEEECCEEcCCC-CcHHHcCCCCCCEEEEEEe
Confidence 5799999999999999999999999999999999999999999986 8999999999999999986
No 30
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.61 E-value=6e-15 Score=112.63 Aligned_cols=92 Identities=25% Similarity=0.414 Sum_probs=72.9
Q ss_pred EEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEE-eEEEEe
Q psy8018 149 MLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLT-TSFSVL 227 (280)
Q Consensus 149 ~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~-~~~~Vl 227 (280)
+++++++|||++++|+|||||+.|+|+.++|+++++....... ....++++...........+++|+..+. +.+.|+
T Consensus 2 ~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~--~~~~~~~G~~~~~~~~~~~i~ig~~~~~~~~~~v~ 79 (96)
T cd05483 2 HFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGK--VTVQTANGRVRAARVRLDSLQIGGITLRNVPAVVL 79 (96)
T ss_pred cEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCc--EEEEecCCCccceEEEcceEEECCcEEeccEEEEe
Confidence 6899999999999999999999999999999999973221111 2334555554555555669999998665 789999
Q ss_pred cCCC--cccccchhhhh
Q psy8018 228 EDQP--MDMLLGLDMLK 242 (280)
Q Consensus 228 ~~~~--~d~ILG~D~L~ 242 (280)
+... .|+|||+|||+
T Consensus 80 d~~~~~~~gIlG~d~l~ 96 (96)
T cd05483 80 PGDALGVDGLLGMDFLR 96 (96)
T ss_pred CCcccCCceEeChHHhC
Confidence 8766 99999999995
No 31
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=99.60 E-value=3.1e-15 Score=111.61 Aligned_cols=70 Identities=10% Similarity=0.208 Sum_probs=65.1
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE---CCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF---QGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~---~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|.|+|+- +|+.|+++|+++.||++||+.|++.+|+|+++|+|+| .|+.|.|+ .+|++|||++|+.|+|+.
T Consensus 1 ~~i~vk~-~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~~~QKLi~~~~~Gk~l~D~-~~L~~~~i~~g~~i~lmG 73 (74)
T cd01813 1 VPVIVKW-GGQEYSVTTLSEDTVLDLKQFIKTLTGVLPERQKLLGLKVKGKPAEDD-VKISALKLKPNTKIMMMG 73 (74)
T ss_pred CEEEEEE-CCEEEEEEECCCCCHHHHHHHHHHHHCCCHHHEEEEeecccCCcCCCC-cCHHHcCCCCCCEEEEEe
Confidence 5788887 7889999999999999999999999999999999997 99999876 899999999999999874
No 32
>KOG0005|consensus
Probab=99.57 E-value=2.5e-15 Score=105.72 Aligned_cols=70 Identities=26% Similarity=0.431 Sum_probs=67.6
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|.|.|++++|+.+.++++|++.|+.+|+.++++.||||.+|||+|.||++.|| +|-++|++.-|++||++
T Consensus 1 m~iKvktLt~KeIeidIep~DkverIKErvEEkeGIPp~qqrli~~gkqm~DD-~tA~~Y~~~~GSVlHlv 70 (70)
T KOG0005|consen 1 MLIKVKTLTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLIYAGKQMNDD-KTAAHYNLLGGSVLHLV 70 (70)
T ss_pred CeeeEeeeccceEEEeeCcchHHHHHHHHhhhhcCCCchhhhhhhcccccccc-ccHHHhhhccceeEeeC
Confidence 88999999999999999999999999999999999999999999999999987 89999999999999974
No 33
>TIGR00601 rad23 UV excision repair protein Rad23. All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=99.56 E-value=9.3e-15 Score=139.62 Aligned_cols=72 Identities=25% Similarity=0.492 Sum_probs=69.5
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhC---CCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSG---ISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~g---ip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+||||+++|++++|+|++++||.+||++|+.+.| +|+++|+|+|+||+|.|+ +||++|||++|++|+++.+
T Consensus 1 MkItVKtl~g~~~~IeV~~~~TV~dLK~kI~~~~g~~~ip~~~QkLIy~GkiL~Dd-~tL~dy~I~e~~~Ivvmv~ 75 (378)
T TIGR00601 1 MTLTFKTLQQQKFKIDMEPDETVKELKEKIEAEQGKDAYPVAQQKLIYSGKILSDD-KTVREYKIKEKDFVVVMVS 75 (378)
T ss_pred CEEEEEeCCCCEEEEEeCCcChHHHHHHHHHHhhCCCCCChhHeEEEECCEECCCC-CcHHHcCCCCCCEEEEEec
Confidence 99999999999999999999999999999999998 999999999999999986 8999999999999998876
No 34
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=99.56 E-value=2.7e-14 Score=111.21 Aligned_cols=95 Identities=29% Similarity=0.510 Sum_probs=77.1
Q ss_pred EEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceee-EEeEEEEEEEEECCeEEEeEEEEe
Q psy8018 149 MLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQR-IIGRIHMVQIAIENDFLTTSFSVL 227 (280)
Q Consensus 149 ~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~-~~g~i~~~~i~IG~~~~~~~~~Vl 227 (280)
..++.++|||++++|+|||||..|+|+.+.+...+... .......|+|+.. ..|.. .+.+++++..+...|.|+
T Consensus 5 rp~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~~----~~~~~v~~~~g~~~~~~~~-~~~v~~~~~~~~~~~~v~ 79 (100)
T PF00077_consen 5 RPYITVKINGKKIKALLDTGADVSIISEKDWKKLGPPP----KTSITVRGAGGSSSILGST-TVEVKIGGKEFNHTFLVV 79 (100)
T ss_dssp SSEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSEE----EEEEEEEETTEEEEEEEEE-EEEEEETTEEEEEEEEES
T ss_pred CceEEEeECCEEEEEEEecCCCcceecccccccccccc----cCCceeccCCCcceeeeEE-EEEEEEECccceEEEEec
Confidence 34699999999999999999999999999887776541 1122456777654 34444 569999999999999999
Q ss_pred cCCCcccccchhhhhcceEEEE
Q psy8018 228 EDQPMDMLLGLDMLKRHQVQIA 249 (280)
Q Consensus 228 ~~~~~d~ILG~D~L~~~~~~ID 249 (280)
++.++| |||.|||+++++.|+
T Consensus 80 ~~~~~~-ILG~D~L~~~~~~i~ 100 (100)
T PF00077_consen 80 PDLPMN-ILGRDFLKKLNAVIN 100 (100)
T ss_dssp STCSSE-EEEHHHHTTTTCEEE
T ss_pred CCCCCC-EeChhHHHHcCCEEC
Confidence 987888 999999999999885
No 35
>smart00213 UBQ Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression
Probab=99.56 E-value=1.1e-14 Score=103.67 Aligned_cols=64 Identities=33% Similarity=0.556 Sum_probs=61.0
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCC
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGD 66 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd 66 (280)
|+|+|++.+ ..+.++|+++.||++||++|+..+|+|++.|+|+|+|+.|.|+ +||++|||++|+
T Consensus 1 ~~i~vk~~~-~~~~~~v~~~~tv~~lk~~i~~~~~~~~~~~~L~~~g~~L~d~-~tL~~~~i~~~~ 64 (64)
T smart00213 1 IELTVKTLD-GTITLEVKPSDTVSELKEKIAELTGIPVEQQRLIYKGKVLEDD-RTLADYNIQDGS 64 (64)
T ss_pred CEEEEEECC-ceEEEEECCCCcHHHHHHHHHHHHCCCHHHEEEEECCEECCCC-CCHHHcCCcCCC
Confidence 899999988 7899999999999999999999999999999999999999986 899999999986
No 36
>KOG0003|consensus
Probab=99.54 E-value=6.5e-16 Score=120.88 Aligned_cols=72 Identities=31% Similarity=0.442 Sum_probs=69.5
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|++++++..|++.+++|+|+.||.++|++|....|||+++|+|+|+|++|+|. .||++|||+.-|||++..+
T Consensus 1 ~~~~~~~~~GKT~~le~EpS~ti~~vKA~i~~~~Gi~~~~~~L~~~~k~LED~-~Tla~Y~i~~~~Tl~~~~r 72 (128)
T KOG0003|consen 1 MQIFVKTLTGKTITLEVEPSDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLADYNIQKESTLHLVLR 72 (128)
T ss_pred CcEEEEEeeCceEEEEecccchHHHHHHHhccccCCCHHHHHHHhcccccccC-CcccccCccchhhhhhhHH
Confidence 78999999999999999999999999999999999999999999999999985 9999999999999999875
No 37
>cd01799 Hoil1_N Ubiquitin-like domain of HOIL1. HOIL1_N HOIL-1 (heme-oxidized IRP2 ubiquitin ligase-1) is an E3 ubiquitin-protein ligase that recognizes heme-oxidized IRP2 (iron regulatory protein2) and is thought to affect the turnover of oxidatively damaged proteins. Hoil-1 has an amino-terminal ubiquitin-like domain as well as an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain.
Probab=99.54 E-value=1.6e-14 Score=108.18 Aligned_cols=68 Identities=22% Similarity=0.294 Sum_probs=61.1
Q ss_pred EEEE--cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCC-CCCEEEEE
Q psy8018 3 VTVT--SLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQ-DGDLVLLK 71 (280)
Q Consensus 3 ItV~--~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~-~gd~l~l~ 71 (280)
+.|. ...|.+++++|+|++||++||++|+..+|||+++|+| |+|+.|.++++||++||++ +||+++|-
T Consensus 3 ~~~~~~~~~~~t~~l~v~~~~TV~~lK~kI~~~~gip~~~QrL-~~G~~L~dD~~tL~~ygi~~~g~~~~l~ 73 (75)
T cd01799 3 VSVEDAQSHTVTIWLTVRPDMTVAQLKDKVFLDYGFPPAVQRW-VIGQRLARDQETLYSHGIRTNGDSAFLY 73 (75)
T ss_pred EEEeccccCCCeEEEEECCCCcHHHHHHHHHHHHCcCHHHEEE-EcCCeeCCCcCCHHHcCCCCCCCEEEEE
Confidence 4454 4568899999999999999999999999999999999 9999997666899999999 88999985
No 38
>cd01763 Sumo Small ubiquitin-related modifier (SUMO). Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
Probab=99.51 E-value=1.1e-13 Score=106.31 Aligned_cols=73 Identities=21% Similarity=0.361 Sum_probs=69.2
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEec
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQ 74 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~ 74 (280)
|+|+|++.+|+...+.|.++++++.||+.+++..|+|+++|+|+|+|+.|.++ .|+++||+++||+|++..++
T Consensus 12 i~I~v~~~~g~~~~~~v~~~~~l~~l~~~y~~~~gi~~~~~rf~f~G~~L~~~-~T~~~l~m~d~d~I~v~l~l 84 (87)
T cd01763 12 INLKVKGQDGNEVFFKIKRSTPLKKLMEAYCQRQGLSMNSVRFLFDGQRIRDN-QTPDDLGMEDGDEIEVMLEQ 84 (87)
T ss_pred EEEEEECCCCCEEEEEEcCCCHHHHHHHHHHHHhCCCccceEEEECCeECCCC-CCHHHcCCCCCCEEEEEEec
Confidence 57888999999999999999999999999999999999999999999999976 89999999999999998763
No 39
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=99.51 E-value=9.6e-14 Score=106.12 Aligned_cols=85 Identities=21% Similarity=0.353 Sum_probs=64.3
Q ss_pred EEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEE-EEEECCeEEEeEEEEecCC
Q psy8018 152 INCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMV-QIAIENDFLTTSFSVLEDQ 230 (280)
Q Consensus 152 i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~-~i~IG~~~~~~~~~Vl~~~ 230 (280)
+++.|||+|+.|||||||+.|+++.+.|+++.+. . ......|+|+....-..... .+++|+......+.|.+.
T Consensus 1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~~~~---~--~~~~v~gagG~~~~~v~~~~~~v~vg~~~~~~~~~v~~~- 74 (86)
T cd06095 1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQELS---T--TSVLIRGVSGQSQQPVTTYRTLVDLGGHTVSHSFLVVPN- 74 (86)
T ss_pred CEEEECCEEEEEEEECCCCeEEECHHHhhhccCC---C--CcEEEEeCCCcccccEEEeeeEEEECCEEEEEEEEEEcC-
Confidence 4789999999999999999999999999998221 1 22356777765411111122 699999988888888875
Q ss_pred Ccccccchhhhh
Q psy8018 231 PMDMLLGLDMLK 242 (280)
Q Consensus 231 ~~d~ILG~D~L~ 242 (280)
..+.|||||||+
T Consensus 75 ~~~~lLG~dfL~ 86 (86)
T cd06095 75 CPDPLLGRDLLS 86 (86)
T ss_pred CCCcEechhhcC
Confidence 368999999995
No 40
>cd01815 BMSC_UbP_N Ubiquitin-like domain of BMSC-UbP. BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain.
Probab=99.50 E-value=2.7e-14 Score=106.45 Aligned_cols=57 Identities=28% Similarity=0.414 Sum_probs=51.0
Q ss_pred EeCC-CccHHHHHHHHHHHh--CCC-CcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 16 DVSE-DLELENFKAFCEVQS--GIS-AQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 16 eV~~-~~tV~~LK~~Ie~e~--gip-~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+|+| ++||++||++|++++ |+| +++|+|+|.||+|.|+ +||++|||++|++|||+++
T Consensus 15 ~~~~~~~TV~~LK~kI~~~~~egi~~~dqQrLIy~GKiL~D~-~TL~dygI~~gstlhLv~~ 75 (75)
T cd01815 15 DVSPGGYQVSTLKQLIAAQLPDSLPDPELIDLIHCGRKLKDD-QTLDFYGIQSGSTIHILRK 75 (75)
T ss_pred CcCCccCcHHHHHHHHHHhhccCCCChHHeEEEeCCcCCCCC-CcHHHcCCCCCCEEEEEeC
Confidence 4555 899999999999996 575 9999999999999986 9999999999999999874
No 41
>KOG0004|consensus
Probab=99.47 E-value=2.8e-14 Score=118.85 Aligned_cols=72 Identities=31% Similarity=0.454 Sum_probs=70.3
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|+++.+.+.++||+++.||+++|++|+..+|||+++|+|+|.|++|.|+ +||++|+|+..++|+|+.+
T Consensus 1 m~ifVk~l~~kti~~eve~~~ti~~~Kakiq~~egIp~dqqrlifag~qLedg-rtlSDY~Iqkestl~l~l~ 72 (156)
T KOG0004|consen 1 MQIFVKTLTGKTITLEVEANDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLSDYNIQKESTLHLVLR 72 (156)
T ss_pred CccchhhccccceeeeecccccHHHHHHhhhcccCCCchhhhhhhhhcccccC-CccccccccccceEEEEEE
Confidence 89999999999999999999999999999999999999999999999999987 9999999999999999986
No 42
>KOG0010|consensus
Probab=99.44 E-value=1.7e-13 Score=132.20 Aligned_cols=71 Identities=28% Similarity=0.465 Sum_probs=68.2
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
++|+||+.++ ++.+.|..+.||.+||++|...+++|+++|+|||.||+|+|+ +||..|||+||.||||++|
T Consensus 16 irV~Vkt~~d-k~~~~V~~~ssV~qlKE~I~~~f~a~~dqlvLIfaGrILKD~-dTL~~~gI~Dg~TvHLVik 86 (493)
T KOG0010|consen 16 IRVTVKTPKD-KYEVNVASDSSVLQLKELIAQRFGAPPDQLVLIYAGRILKDD-DTLKQYGIQDGHTVHLVIK 86 (493)
T ss_pred eEEEEecCCc-ceeEecccchHHHHHHHHHHHhcCCChhHeeeeecCccccCh-hhHHHcCCCCCcEEEEEec
Confidence 3799999888 999999999999999999999999999999999999999986 8999999999999999998
No 43
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=99.41 E-value=2.3e-12 Score=102.69 Aligned_cols=92 Identities=18% Similarity=0.245 Sum_probs=69.7
Q ss_pred EEEEEEEcCCCccee-cCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCcccccc
Q psy8018 159 HPVKAFIDSGAQTTI-MSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPMDMLLG 237 (280)
Q Consensus 159 ~~v~alVDTGAq~ti-is~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~d~ILG 237 (280)
.+++|+|||||+..+ ++++.|+++||......+. . .+-| .....++....+.+|+....+.+.+.+..+ +.|||
T Consensus 15 ~~v~~LVDTGat~~~~l~~~~a~~lgl~~~~~~~~-~--tA~G-~~~~~~v~~~~v~igg~~~~~~v~~~~~~~-~~LLG 89 (107)
T TIGR03698 15 MEVRALVDTGFSGFLLVPPDIVNKLGLPELDQRRV-Y--LADG-REVLTDVAKASIIINGLEIDAFVESLGYVD-EPLLG 89 (107)
T ss_pred eEEEEEEECCCCeEEecCHHHHHHcCCCcccCcEE-E--ecCC-cEEEEEEEEEEEEECCEEEEEEEEecCCCC-ccEec
Confidence 489999999999997 9999999999987532211 1 2223 234455677899999988765544444333 79999
Q ss_pred hhhhhcceEEEEecccEE
Q psy8018 238 LDMLKRHQVQIAIENDFL 255 (280)
Q Consensus 238 ~D~L~~~~~~ID~~~~~L 255 (280)
|.||.+++..||++++++
T Consensus 90 ~~~L~~l~l~id~~~~~~ 107 (107)
T TIGR03698 90 TELLEGLGIVIDYRNQGL 107 (107)
T ss_pred HHHHhhCCEEEehhhCcC
Confidence 999999999999998764
No 44
>KOG0011|consensus
Probab=99.41 E-value=4.4e-13 Score=123.76 Aligned_cols=73 Identities=22% Similarity=0.445 Sum_probs=70.2
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhC--CCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEec
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSG--ISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQ 74 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~g--ip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~ 74 (280)
|+||||++.++.|+++|.|+.||.++|.+|+.+.| +|++.|+|||+||+|.|+ .|+++|+|++++.|+++..+
T Consensus 1 m~lt~KtL~q~~F~iev~Pe~tV~evK~kIet~~g~dyP~~~QkLIy~GkiL~D~-~tv~Eykv~E~~fiVvMlsK 75 (340)
T KOG0011|consen 1 MKLTVKTLKQQTFTIEVKPEDTVVEVKKKIETEKGPDYPAEQQKLIYSGKILKDE-TTVGEYKVKEKKFIVVMLSK 75 (340)
T ss_pred CeeEeeeccCceeEeecCcchhHHHHHHHHHhccCCCCchhhheeeecceeccCC-cchhhhccccCceEEEEEec
Confidence 89999999999999999999999999999999999 999999999999999986 99999999999999998763
No 45
>cd01795 USP48_C USP ubiquitin-specific protease. The USP (ubiquitin-specific protease) family is one of at least seven deubiquitylating enzyme (DUB) families capable of deconjugating ubiquitin and ubiquitin-like adducts. While the USP's have a conserved catalytic core domain, they differ in their domain architectures. This subfamily, which includes USP31, and USP48, has a carboxy-terminal ubiquitin-like domain in addition to a DUSP (domain of ubiquitin-specific proteases) domain,
Probab=99.35 E-value=2.1e-12 Score=100.31 Aligned_cols=62 Identities=21% Similarity=0.206 Sum_probs=59.4
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
..+++|+++.||.+||.+|...+++||.+|+|+|+|+.|.|+++||++|||..|+.|.|...
T Consensus 16 ~~~L~V~~~~TVg~LK~lImQ~f~V~P~dQkL~~dG~~L~DDsrTLssyGv~sgSvl~Llid 77 (107)
T cd01795 16 EKALLVSANQTLKELKIQIMHAFSVAPFDQNLSIDGKILSDDCATLGTLGVIPESVILLKAD 77 (107)
T ss_pred CceEEeCccccHHHHHHHHHHHhcCCcccceeeecCceeccCCccHHhcCCCCCCEEEEEec
Confidence 46799999999999999999999999999999999999999999999999999999999885
No 46
>cd01769 UBL Ubiquitin-like domain of UBL. UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
Probab=99.34 E-value=4.6e-12 Score=91.28 Aligned_cols=68 Identities=35% Similarity=0.535 Sum_probs=63.0
Q ss_pred EEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 4 TVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 4 tV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+..+|..+.++++++.||++||++|+..+|+|++.|+|+|+|+.|.|+ .+|++||+++|+.|++..
T Consensus 1 ~v~~~~~~~~~~~~~~~~ti~~lK~~i~~~~~~~~~~~~l~~~g~~l~d~-~~l~~~~v~~~~~i~v~~ 68 (69)
T cd01769 1 TVKTLTGKTFELEVSPDDTVAELKAKIAAKEGVPPEQQRLIYAGKILKDD-KTLSDYGIQDGSTLHLVL 68 (69)
T ss_pred CeEccCCCEEEEEECCCChHHHHHHHHHHHHCcChHHEEEEECCcCCCCc-CCHHHCCCCCCCEEEEEE
Confidence 36777899999999999999999999999999999999999999999865 899999999999999864
No 47
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.30 E-value=8.7e-12 Score=108.76 Aligned_cols=97 Identities=25% Similarity=0.336 Sum_probs=76.5
Q ss_pred eeEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccc-cccceeeeeecCceeeEEeEEEEEEEEECCeEE-EeEE
Q psy8018 147 VIMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRL-IDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFL-TTSF 224 (280)
Q Consensus 147 ~~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~-~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~-~~~~ 224 (280)
..|++++..|||++++++|||||+...++.+.|+|+|+..- .+..+. +...-| ...-..+...+++||++.+ .++.
T Consensus 103 ~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~-v~TANG-~~~AA~V~Ld~v~IG~I~~~nV~A 180 (215)
T COG3577 103 DGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTIT-VSTANG-RARAAPVTLDRVQIGGIRVKNVDA 180 (215)
T ss_pred CCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceE-EEccCC-ccccceEEeeeEEEccEEEcCchh
Confidence 37999999999999999999999999999999999999755 232221 112222 3445677788999999755 5899
Q ss_pred EEecCCCcc-cccchhhhhcce
Q psy8018 225 SVLEDQPMD-MLLGLDMLKRHQ 245 (280)
Q Consensus 225 ~Vl~~~~~d-~ILG~D~L~~~~ 245 (280)
.|+++...| .||||+||.+..
T Consensus 181 ~V~~~g~L~~sLLGMSfL~rL~ 202 (215)
T COG3577 181 MVAEDGALDESLLGMSFLNRLS 202 (215)
T ss_pred heecCCccchhhhhHHHHhhcc
Confidence 999876554 999999999865
No 48
>PF11976 Rad60-SLD: Ubiquitin-2 like Rad60 SUMO-like; InterPro: IPR022617 This entry includes small ubiquitin-related modifier (SUMO) proteins. SUMOs are small proteins that are covalently attached to lysines as post-translational modifications and are used to control multiple cellular process including signal transduction, nuclear transport and DNA replication and repair []. Unlike ubiquitin, they are not involved in protein degradation. This entry also contains the C-terminal Rad60 DNA repair protein SUMO-like domain.; PDB: 3RD2_A 2JXX_A 3RCZ_A 3GOE_A 3A4S_D 3A4R_B 2IO1_D 1U4A_A 2K1F_A 1WZ0_A ....
Probab=99.27 E-value=2.1e-11 Score=89.62 Aligned_cols=70 Identities=24% Similarity=0.350 Sum_probs=65.0
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC-cceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISA-QEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~-~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|+|+|+..+|+.+.+.|.++.++..|++......|+|+ +..+|+|+|+.|..+ +|+++||+++||+|.+.
T Consensus 1 I~i~v~~~~~~~~~~~v~~~~~~~~l~~~~~~~~~i~~~~~~~l~fdG~~L~~~-~T~~~~~ied~d~Idv~ 71 (72)
T PF11976_consen 1 ITIKVRSQDGKEIKFKVKPTTTVSKLIEKYCEKKGIPPEESIRLIFDGKRLDPN-DTPEDLGIEDGDTIDVI 71 (72)
T ss_dssp EEEEEEETTSEEEEEEEETTSCCHHHHHHHHHHHTTTT-TTEEEEETTEEE-TT-SCHHHHT-STTEEEEEE
T ss_pred CEEEEEeCCCCEEEEEECCCCcHHHHHHHHHHhhCCCccceEEEEECCEEcCCC-CCHHHCCCCCCCEEEEE
Confidence 68999999999999999999999999999999999999 999999999999875 89999999999999885
No 49
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=99.22 E-value=1.4e-10 Score=83.59 Aligned_cols=89 Identities=33% Similarity=0.529 Sum_probs=67.1
Q ss_pred EEEEEcCEEEEEEEcCCCcceecCHHHHHHcCC-ccccccceeeeeecCc--eeeEEeEEEEEEEEECCeEEEeEEEEec
Q psy8018 152 INCRVNGHPVKAFIDSGAQTTIMSAACAARVNI-TRLIDTRWAGIAKGVG--VQRIIGRIHMVQIAIENDFLTTSFSVLE 228 (280)
Q Consensus 152 i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL-~~~~d~r~~g~a~gvG--~~~~~g~i~~~~i~IG~~~~~~~~~Vl~ 228 (280)
+++.+||.++.+++||||+.++++..++.+.++ ........ ...++. .....+.+....+.+++..+...|.+.+
T Consensus 1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 78 (92)
T cd00303 1 LKGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPL--KVKGANGSSVKTLGVILPVTIGIGGKTFTVDFYVLD 78 (92)
T ss_pred CEEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCce--EEEecCCCEeccCcEEEEEEEEeCCEEEEEEEEEEc
Confidence 468899999999999999999999999999987 32222222 112222 2222333355688999988999999999
Q ss_pred CCCcccccchhhhh
Q psy8018 229 DQPMDMLLGLDMLK 242 (280)
Q Consensus 229 ~~~~d~ILG~D~L~ 242 (280)
....|+|||+|||+
T Consensus 79 ~~~~~~ilG~~~l~ 92 (92)
T cd00303 79 LLSYDVILGRPWLE 92 (92)
T ss_pred CCCcCEEecccccC
Confidence 88999999999985
No 50
>cd01789 Alp11_N Ubiquitin-like domain of Alp11 tubulin-folding cofactor B. Alp11, also known as tubulin-folding cofactor B, is one of at least three proteins required for the proper folding of tubulins prior to their incorporation into microtubules. These cofactors are necessary for the biogenesis of microtubules and for cell viability. Alp11 has three domains including an N-terminal ubiquitin-like domain (represented by this CD) which executes the essential function, a central coiled-coil domain necessary for maintenance of cellular alpha-tubulin levels, and a C-terminal CLIP-170 domain is required for efficient binding to alpha-tubulin.
Probab=99.17 E-value=1.5e-10 Score=88.45 Aligned_cols=71 Identities=27% Similarity=0.434 Sum_probs=59.4
Q ss_pred EEEEEcCCCcEEEEE--eCCCccHHHHHHHHHHHhCCCCcceEE-EECCe-----eeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLD--VSEDLELENFKAFCEVQSGISAQEIAI-EFQGN-----ALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~le--V~~~~tV~~LK~~Ie~e~gip~~~Q~L-i~~Gk-----~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.|+|+... ..+..| +++++||.+||++++..+|+||..|+| +|+|+ .|.+|.++|++||+++|..||+.-.
T Consensus 3 ~v~i~~~~-~~~~~ekr~~~~~Tv~~lK~kl~~~~G~~~~~mrL~l~~~~~~~~~~l~~d~~~L~~y~~~dg~~IhVvD~ 81 (84)
T cd01789 3 TVNITSSA-DSFSFEKKYSRGLTIAELKKKLELVVGTPASSMRLQLFDGDDKLVSKLDDDDALLGSYPVDDGCRIHVIDV 81 (84)
T ss_pred EEEEEeCC-CceeeeEecCCCCcHHHHHHHHHHHHCCCccceEEEEEcCCCCeEeecCCCccEeeeccCCCCCEEEEEeC
Confidence 45666533 244454 999999999999999999999999999 58988 6867779999999999999999863
No 51
>cd01814 NTGP5 Ubiquitin-like NTGP5 and ATGP4. NTGP5 and ATGP4 are plant-specific isoprenylated GTP-binding proteins with a single fold that resembles ubiquitin. The function of these proteins is unknown.
Probab=99.16 E-value=5.3e-11 Score=94.86 Aligned_cols=71 Identities=11% Similarity=-0.008 Sum_probs=60.7
Q ss_pred EEEEEcCCCc-EEEEEeCCCccHHHHHHHHHHHh-----CCC--CcceEEEECCeeeCCCcCCccccC------CCCCCE
Q psy8018 2 KVTVTSLDNE-CFLLDVSEDLELENFKAFCEVQS-----GIS--AQEIAIEFQGNALVDNKKTLKDYN------IQDGDL 67 (280)
Q Consensus 2 ~ItV~~~~g~-~~~leV~~~~tV~~LK~~Ie~e~-----gip--~~~Q~Li~~Gk~L~d~~~tL~~~g------I~~gd~ 67 (280)
.|.++..+|. +=+.-++++.||++||++|+++. |+| +++|+|||.||+|.|+ +||++|+ +.-..|
T Consensus 6 e~kfrl~dg~digp~~~~~sdTV~~lKekI~~~~p~~ke~~P~~~~~qKLIysGKiLeD~-~TL~d~~~p~g~~~~~~~T 84 (113)
T cd01814 6 EIKFRLYDGSDIGPKRYPAATTVDFLKERVVSQWPKDKEVGPKTVNEVKLISAGKILENS-KTVGECRSPVGDIAGGVIT 84 (113)
T ss_pred EEEEEccCCCccCccccChhhHHHHHHHHHHHhcccccccCCCCHHHeEEEeCCeecCCC-CcHHHhCCcccccCCCceE
Confidence 3566777774 44788999999999999999554 555 9999999999999986 9999999 778899
Q ss_pred EEEEEe
Q psy8018 68 VLLKRV 73 (280)
Q Consensus 68 l~l~~~ 73 (280)
+||+.+
T Consensus 85 mHvvlr 90 (113)
T cd01814 85 MHVVVQ 90 (113)
T ss_pred EEEEec
Confidence 999997
No 52
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=99.00 E-value=1.5e-09 Score=80.27 Aligned_cols=46 Identities=24% Similarity=0.398 Sum_probs=41.2
Q ss_pred eeeeEEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCcccccc
Q psy8018 145 GSVIMLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDT 190 (280)
Q Consensus 145 ~~~~~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~ 190 (280)
.....++++|.|||+++.+++||||++||||.++|+|||++.....
T Consensus 4 ~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~ 49 (72)
T PF13975_consen 4 PDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPP 49 (72)
T ss_pred ccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCC
Confidence 3457899999999999999999999999999999999999877443
No 53
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=99.00 E-value=3.6e-09 Score=82.22 Aligned_cols=86 Identities=20% Similarity=0.237 Sum_probs=66.9
Q ss_pred EEEEcC-EEEEEEEcCCCcceecCHHHHHHcC---CccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEec
Q psy8018 153 NCRVNG-HPVKAFIDSGAQTTIMSAACAARVN---ITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLE 228 (280)
Q Consensus 153 ~~~InG-~~v~alVDTGAq~tiis~~~A~rlg---L~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~ 228 (280)
...||| +++++.|||||+.|+|+.+..+++| ...+..+...-.+.+-+.....|. ..+.+++++..+++.|.|++
T Consensus 2 ~~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~-~~~~v~~~~~~~~~~f~Vvd 80 (93)
T cd05481 2 DMKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGG-VKLKCRYRNPKYNLTFQVVK 80 (93)
T ss_pred ceEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEE-EEEEEEECCcEEEEEEEEEC
Confidence 468999 9999999999999999999999998 555544444333344344566787 45799999999999999998
Q ss_pred CCCcccccchhh
Q psy8018 229 DQPMDMLLGLDM 240 (280)
Q Consensus 229 ~~~~d~ILG~D~ 240 (280)
. +..-|||++.
T Consensus 81 ~-~~~~lLG~~~ 91 (93)
T cd05481 81 E-EGPPLLGAKA 91 (93)
T ss_pred C-CCCceEcccc
Confidence 5 3556788653
No 54
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=98.94 E-value=1.1e-08 Score=86.61 Aligned_cols=95 Identities=18% Similarity=0.376 Sum_probs=76.6
Q ss_pred EEEEEEEEcCEEEEEEEcCCCcceecCHHHHHHcCCcccccc--ceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEE
Q psy8018 149 MLYINCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDT--RWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSV 226 (280)
Q Consensus 149 ~l~i~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~--r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~V 226 (280)
-..++..++|.++++|+||||-.|||+++.+++|+|+..... +|.|...+ .....-.....++.+++..++..++|
T Consensus 34 T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~--~~~~tsEAv~ld~~i~n~~i~i~aYV 111 (177)
T PF12384_consen 34 TAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSG--ESATTSEAVTLDFYIDNKLIDIAAYV 111 (177)
T ss_pred EEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccC--CceEEEEeEEEEEEECCeEEEEEEEE
Confidence 356788899999999999999999999999999999977332 34444333 22333444455899999999999999
Q ss_pred ecCCCcccccchhhhhcce
Q psy8018 227 LEDQPMDMLLGLDMLKRHQ 245 (280)
Q Consensus 227 l~~~~~d~ILG~D~L~~~~ 245 (280)
.+.-+.|+|+|.+.|++|.
T Consensus 112 ~d~m~~dlIIGnPiL~ryp 130 (177)
T PF12384_consen 112 TDNMDHDLIIGNPILDRYP 130 (177)
T ss_pred eccCCcceEeccHHHhhhH
Confidence 9988999999999999875
No 55
>PF14560 Ubiquitin_2: Ubiquitin-like domain; PDB: 1WJN_A 2KJ6_A 2KJR_A 1V6E_A 1T0Y_A.
Probab=98.91 E-value=5.6e-09 Score=79.89 Aligned_cols=72 Identities=21% Similarity=0.356 Sum_probs=58.4
Q ss_pred EEEEEcCCC--cEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC----C---eeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 2 KVTVTSLDN--ECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ----G---NALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 2 ~ItV~~~~g--~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~----G---k~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+|+...- ......+++++||.+||.+|+..+|+|++.|+|.+. | ..+.|+.++|.+||+++|+.|++.-
T Consensus 3 ~l~It~~~~~~~~~ekr~~~~~Tv~eLK~kl~~~~Gi~~~~m~L~l~~~~~~~~~~~~~dd~~~L~~y~~~dg~~i~V~D 82 (87)
T PF14560_consen 3 KLFITSSNSKQRSVEKRFPKSITVSELKQKLEKLTGIPPSDMRLQLKSDKDDSKIEELDDDDATLGSYGIKDGMRIHVVD 82 (87)
T ss_dssp EEEEEESSSSSSEEEEEEETTSBHHHHHHHHHHHHTS-TTTEEEEEE-TSSSSEEEESSGSSSBCCHHT-STTEEEEEEE
T ss_pred EEEEEeCCCCCeeEEEEcCCCCCHHHHHHHHHHHhCCCcccEEEEEEecCCCccccccCCCccEeecCCCCCCCEEEEEe
Confidence 567775443 588899999999999999999999999999999875 2 2255677999999999999999987
Q ss_pred e
Q psy8018 73 V 73 (280)
Q Consensus 73 ~ 73 (280)
.
T Consensus 83 ~ 83 (87)
T PF14560_consen 83 T 83 (87)
T ss_dssp -
T ss_pred C
Confidence 4
No 56
>KOG4248|consensus
Probab=98.91 E-value=1.9e-09 Score=111.59 Aligned_cols=71 Identities=24% Similarity=0.477 Sum_probs=68.1
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEec
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQ 74 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~ 74 (280)
.|+||+++.++.++.|...+||.+||.+|.++.+|+.+.|||||.|++|.|| +++++||| ||-+|||+-|.
T Consensus 4 ~v~vktld~r~~t~~ig~q~ti~~~~d~~r~~~ni~s~~qr~i~~grvl~~~-k~vq~~~v-dgk~~hlverp 74 (1143)
T KOG4248|consen 4 NVLVKTLDSRTRTFIIGAQMTIKEFKDHIRASVNIPSEKQRLIYQGRVLQDD-KKVQEYNV-DGKVIHLVERP 74 (1143)
T ss_pred ceeeeecccceeEEEechHHHHHHHHHHHHHhcccccccceeeecceeeccc-hhhhhccC-CCeEEEeeccC
Confidence 3789999999999999999999999999999999999999999999999987 99999999 99999999883
No 57
>PLN02560 enoyl-CoA reductase
Probab=98.88 E-value=5.6e-09 Score=97.68 Aligned_cols=71 Identities=30% Similarity=0.416 Sum_probs=63.1
Q ss_pred CEEEEEcCCCcEE---EEEeCCCccHHHHHHHHHHHhCC-CCcceEEEEC---C----eeeCCCcCCccccCCCCCCEEE
Q psy8018 1 MKVTVTSLDNECF---LLDVSEDLELENFKAFCEVQSGI-SAQEIAIEFQ---G----NALVDNKKTLKDYNIQDGDLVL 69 (280)
Q Consensus 1 M~ItV~~~~g~~~---~leV~~~~tV~~LK~~Ie~e~gi-p~~~Q~Li~~---G----k~L~d~~~tL~~~gI~~gd~l~ 69 (280)
|+|+|+..+|+.+ ++||+++.||++||..|++..++ ++++|+|++. | +.|.|+ +||+++|+++|++|+
T Consensus 1 M~I~Vk~~~Gk~i~~~~lev~~~aTV~dLK~~Isk~~~~~~~~RqRL~~~~~~gk~~g~~L~d~-ktL~d~gv~~gstLy 79 (308)
T PLN02560 1 MKVTVVSRSGREIIKGGLEVPDSATVADLKKAIHKRKKKYYPSRQRLTLPLPPGKTRPTVLDDS-KSLKDYGLGDGGTVV 79 (308)
T ss_pred CEEEEEcCCCCeecceeEEcCCCCcHHHHHHHHHHHcCCCChhheEEEEecCCCCcCccccCCC-CCHHhcCCCCCceEE
Confidence 8999998888887 89999999999999999999987 8999999982 4 377765 899999999999998
Q ss_pred EEE
Q psy8018 70 LKR 72 (280)
Q Consensus 70 l~~ 72 (280)
+.-
T Consensus 80 ~kD 82 (308)
T PLN02560 80 FKD 82 (308)
T ss_pred EEe
Confidence 865
No 58
>KOG0001|consensus
Probab=98.84 E-value=3e-08 Score=70.86 Aligned_cols=70 Identities=34% Similarity=0.476 Sum_probs=66.0
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+++.+..|+.+++++.+..+++.+|..|+...|+|+.+|++.+.|++|.|+ .+|.+|+|..+.++++..+
T Consensus 2 ~~~~~~~gk~~~~~~~~~~~i~~~k~~i~~~~~~~~~~q~~~~~~~~l~d~-~~l~~~~i~~~~~~~l~~~ 71 (75)
T KOG0001|consen 2 IFVKTLDGKTITLEVSPSDTIEVVKAKIRDKEGIPVDQQRLIFGGKPLEDG-RTLADYNIQEGSTLHLVLS 71 (75)
T ss_pred EEEEecCCCEEEEEecCCCHHHHHHHHHHhhcCCCCeeEEEEECCEECcCC-CcHHHhCCCCCCEEEEEEe
Confidence 567788999999999999999999999999999999999999999999986 9999999999999998875
No 59
>PF11543 UN_NPL4: Nuclear pore localisation protein NPL4; InterPro: IPR024682 Npl4, along with Ufd1, forms the heterodimer adaptor complex UN, which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a N-terminal ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert []. This entry represents the ubiquitin-like domain.; PDB: 2PJH_A 1WF9_A.
Probab=98.71 E-value=2.6e-08 Score=75.50 Aligned_cols=70 Identities=26% Similarity=0.340 Sum_probs=43.8
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC--C-eee-CCCcCCccccCCCCCCEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ--G-NAL-VDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~--G-k~L-~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|-|.|++.+| .+-++++|++|+.+|+++|++.+++|.+.|.|+.+ + ..| .++++||+++||+.||+|+|.
T Consensus 5 milRvrS~dG-~~Rie~~~~~t~~~L~~kI~~~l~~~~~~~~L~~~~~~~~~l~s~~~~tl~~lglkHGdmlyL~ 78 (80)
T PF11543_consen 5 MILRVRSKDG-MKRIEVSPSSTLSDLKEKISEQLSIPDSSQSLSKDRNNKEELKSSDSKTLSSLGLKHGDMLYLK 78 (80)
T ss_dssp -EEEEE-SSE-EEEEEE-TTSBHHHHHHHHHHHS---TTT---BSSGGGGGCSSS-TT-CCCCT---TT-EEE--
T ss_pred EEEEEECCCC-CEEEEcCCcccHHHHHHHHHHHcCCCCcceEEEecCCCCcccccCCcCCHHHcCCCCccEEEEe
Confidence 5577888777 66699999999999999999999999999998642 2 234 345689999999999999985
No 60
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=98.66 E-value=3.8e-07 Score=73.89 Aligned_cols=94 Identities=15% Similarity=0.222 Sum_probs=71.5
Q ss_pred EEEEEEcCCCc-ceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCcccccch
Q psy8018 160 PVKAFIDSGAQ-TTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPMDMLLGL 238 (280)
Q Consensus 160 ~v~alVDTGAq-~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~d~ILG~ 238 (280)
-..+|||||++ ..+++++.|+++|+++....+. + .+-|+ ...-.++.+.++|++........+.+..+.+ ++|+
T Consensus 26 ~~~~LiDTGFtg~lvlp~~vaek~~~~~~~~~~~--~-~a~~~-~v~t~V~~~~iki~g~e~~~~Vl~s~~~~~~-liG~ 100 (125)
T COG5550 26 VYDELIDTGFTGYLVLPPQVAEKLGLPLFSTIRI--V-LADGG-VVKTSVALATIKIDGVEKVAFVLASDNLPEP-LIGV 100 (125)
T ss_pred EeeeEEecCCceeEEeCHHHHHhcCCCccCChhh--h-hhcCC-EEEEEEEEEEEEECCEEEEEEEEccCCCccc-chhh
Confidence 44569999999 9999999999999997733332 1 22222 3345688889999998776655555555666 9999
Q ss_pred hhhhcceEEEEecccEEEee
Q psy8018 239 DMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 239 D~L~~~~~~ID~~~~~L~i~ 258 (280)
+||+.....+|++.++|+-+
T Consensus 101 ~~lk~l~~~vn~~~g~LEk~ 120 (125)
T COG5550 101 NLLKLLGLVVNPKTGKLEKP 120 (125)
T ss_pred hhhhhccEEEcCCcceEecc
Confidence 99999999999999998643
No 61
>cd01801 Tsc13_N Ubiquitin-like domain of Tsc13. Tsc13_N N-terminal domain of Tsc13. Tsc13 is an enoyl reductase involved in elongation of long chain fatty acids that localizes to the endoplasmic reticulum and is highly enriched in a novel structure marking nuclear-vacuolar junctions.
Probab=98.64 E-value=9.2e-08 Score=71.60 Aligned_cols=69 Identities=19% Similarity=0.293 Sum_probs=54.3
Q ss_pred EEEEEcCC-CcEEEEEeC-CCccHHHHHHHHHHHhC-CCCcceEEE--ECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLD-NECFLLDVS-EDLELENFKAFCEVQSG-ISAQEIAIE--FQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~-g~~~~leV~-~~~tV~~LK~~Ie~e~g-ip~~~Q~Li--~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
.|+++..+ .....++++ ++.||++||..|+...+ +|++.|+|. +.|+.|.|+ +||++||+++|++|++.
T Consensus 2 ~i~~~~~~~k~~~~~~~~~~~aTV~dlk~~i~~~~~~~~~~Rqrl~~~~~g~~L~d~-~tL~~~gv~~g~~lyvK 75 (77)
T cd01801 2 EILDAKRSDKPIGKLKVSSGDATIADLKKLIAKSSPQLTVNRQSLRLEPKGKSLKDD-DTLVDLGVGAGATLYVR 75 (77)
T ss_pred eeeccccCcCceeecccCCCCccHHHHHHHHHHHcCCCCcceeEEEeCCCCcccCCc-ccHhhcCCCCCCEEEEe
Confidence 34555444 333335554 89999999999999876 589999995 799999876 79999999999999874
No 62
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.57 E-value=4.9e-08 Score=76.38 Aligned_cols=44 Identities=20% Similarity=0.329 Sum_probs=42.3
Q ss_pred ccccchhhhhcceEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
--|+| |-|.+.|-+....+..+.+++|++++||+||||||||||
T Consensus 53 ~kiiG----rih~~~ikig~~~~~CSftVld~~~~d~llGLdmLkrhq 96 (103)
T cd05480 53 VKVIG----QIERLVLQLGQLTVECSAQVVDDNEKNFSLGLQTLKSLK 96 (103)
T ss_pred eeEee----EEEEEEEEeCCEEeeEEEEEEcCCCcceEeeHHHHhhcc
Confidence 47889 999999999999999999999999999999999999998
No 63
>cd00196 UBQ Ubiquitin-like proteins. Ubiquitin homologs; Includes ubiquitin and ubiquitin-like proteins. Ubiquitin-mediated proteolysis is part of the regulated turnover of proteins required for controlling cell cycle progression. Other family members are protein modifiers that perform a wide range of functions. Ubiquitination usually results in a covalent bond between the C-terminus of ubiquitin and the epsilon-amino group of a substrate lysine. The three-step mechanism requires an activating enzyme (E1) that forms a thiol ester with the C-terminal carboxy group, a conjugating enzyme (E2) that transiently carries the activated ubiquitin molecule as a thiol ester, and a ligase (E3) that transfers the activated ubiquitin from the E2 to the substrate lysine residue. In poly-ubiquitination, ubiquitin itself is the substrate.
Probab=98.54 E-value=5e-07 Score=61.11 Aligned_cols=67 Identities=27% Similarity=0.373 Sum_probs=59.9
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+..++....+.++++.|+++||+.|..+.|++++.|.|+++|..+.+. ..+.++++.+|+.|++..
T Consensus 2 v~~~~~~~~~~~~~~~~tv~~l~~~i~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~i~~~~ 68 (69)
T cd00196 2 VKLNDGKTVELLVPSGTTVADLKEKLAKKLGLPPEQQRLLVNGKILPDS-LTLEDYGLQDGDELVLVP 68 (69)
T ss_pred eEecCCCEEEEEcCCCCcHHHHHHHHHHHHCcChHHeEEEECCeECCCC-CcHHHcCCCCCCEEEEEe
Confidence 3344788899999999999999999999999999999999999999865 677899999999999864
No 64
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=98.46 E-value=8.3e-07 Score=78.00 Aligned_cols=108 Identities=17% Similarity=0.214 Sum_probs=74.5
Q ss_pred EEEEEE--EcCE---EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCc-e-eeEEeEEEEEEEEECCeEEEe
Q psy8018 150 LYINCR--VNGH---PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVG-V-QRIIGRIHMVQIAIENDFLTT 222 (280)
Q Consensus 150 l~i~~~--InG~---~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG-~-~~~~g~i~~~~i~IG~~~~~~ 222 (280)
+||.++ +.|- .+.++|||||+..+++..+.=.--+ ...+..-...|+. + ..+..++....|.|++..|..
T Consensus 5 iyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~w---e~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~~F~I 81 (201)
T PF02160_consen 5 IYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEYW---EKSKKPIKVKGANGSIIQINKKAKNGKIQIADKIFRI 81 (201)
T ss_pred EEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHHH---HhCCCcEEEEEecCCceEEEEEecCceEEEccEEEec
Confidence 455554 5562 5789999999998877654311111 1111112344543 3 566778888899999999988
Q ss_pred EEEEecCCCcccccchhhhhcceEEEEecccEEEeeeee
Q psy8018 223 SFSVLEDQPMDMLLGLDMLKRHQVQIAIENDFLTTSFSV 261 (280)
Q Consensus 223 ~~~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~~ 261 (280)
++.-.-+.+.|+|||++|++.++.-|.+.+ .+.+....
T Consensus 82 P~iYq~~~g~d~IlG~NF~r~y~Pfiq~~~-~I~f~~~~ 119 (201)
T PF02160_consen 82 PTIYQQESGIDIILGNNFLRLYEPFIQTED-RIQFHKKG 119 (201)
T ss_pred cEEEEecCCCCEEecchHHHhcCCcEEEcc-EEEEEeCC
Confidence 766665679999999999999999999974 57776654
No 65
>cd01788 ElonginB Ubiquitin-like domain of Elongin B. Elongin B is part of an E3 ubiquitin ligase complex called VEC that activates ubiquitylation by the E2 ubiquitin-conjugating enzyme Ubc5. VEC is composed of von Hippel-Lindau tumor suppressor protein (pVHL), elongin C, cullin 2, NEDD8, and Rbx1. ElonginB binds elonginC to form the elonginBC complex which is a positive regulator of RNA polymerase II elongation factor Elongin A. The BC complex then binds VHL (von Hippel-Lindau) tumour suppressor protein to form a VCB ternary complex. Elongin B has a ubiquitin-llike domain.
Probab=98.45 E-value=7.6e-07 Score=71.08 Aligned_cols=72 Identities=21% Similarity=0.127 Sum_probs=60.7
Q ss_pred CEEEEEcC-CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCC-------CCCCEEEEEE
Q psy8018 1 MKVTVTSL-DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNI-------QDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~-~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI-------~~gd~l~l~~ 72 (280)
|-+++..- ...++-++..++.||.+||.+|+.-...||++|+|+..+.+|.| ++||++||+ +...+|-|..
T Consensus 1 MdvFlmIrR~KTTiF~dakes~tVlelK~~iegI~k~pp~dQrL~kd~qvLeD-~kTL~d~g~t~~~akaq~pA~vgLa~ 79 (119)
T cd01788 1 MDVFLMIRRHKTTIFTDAKESTTVYELKRIVEGILKRPPEDQRLYKDDQLLDD-GKTLGDCGFTSQTARPQAPATVGLAF 79 (119)
T ss_pred CceEEEEEecceEEEeecCCcccHHHHHHHHHHHhcCChhHheeecCceeecc-cccHHHcCccccccccCCCCeEEEEE
Confidence 55666543 23455689999999999999999999999999999977787876 499999999 7799999888
Q ss_pred e
Q psy8018 73 V 73 (280)
Q Consensus 73 ~ 73 (280)
|
T Consensus 80 r 80 (119)
T cd01788 80 R 80 (119)
T ss_pred e
Confidence 7
No 66
>PF13881 Rad60-SLD_2: Ubiquitin-2 like Rad60 SUMO-like; PDB: 1SE9_A 1WGH_A 2GOW_A.
Probab=98.40 E-value=2.2e-06 Score=68.90 Aligned_cols=70 Identities=19% Similarity=0.232 Sum_probs=52.4
Q ss_pred EEEEcCCCc-EEEEEeCCCccHHHHHHHHHHHh-------CCCCcceEEEECCeeeCCCcCCccccCCCCCC------EE
Q psy8018 3 VTVTSLDNE-CFLLDVSEDLELENFKAFCEVQS-------GISAQEIAIEFQGNALVDNKKTLKDYNIQDGD------LV 68 (280)
Q Consensus 3 ItV~~~~g~-~~~leV~~~~tV~~LK~~Ie~e~-------gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd------~l 68 (280)
|.++..+|. +.++..++++||.+||+.|-.+- -..++..||||.||.|.|+ +||++|++..|+ ++
T Consensus 5 lkf~l~~G~d~~~~~~~~~~TV~~lKe~i~~~WP~d~~~~p~s~~~lRLI~~GriL~d~-~tL~~~~~~~~~~~~~~~vm 83 (111)
T PF13881_consen 5 LKFRLADGKDIGPFRFDPSTTVADLKERIWAEWPEDWEERPKSPSDLRLIYAGRILEDN-KTLSDCRLPSGETPGGPTVM 83 (111)
T ss_dssp EEEEETTS-EEEEEEE-TTSBHHHHHHHHHHSSSTTSSSTT-SGGGEEEEETTEEE-SS-SBTGGGT--TTSETT--EEE
T ss_pred EEEEEeCCCcccccccCccChHHHHHHHHHHHCccccccCCCChhhEEEEeCCeecCCc-CcHHHhCCCCCCCCCCCEEE
Confidence 455567888 88999999999999999998652 1245789999999999975 999999999888 56
Q ss_pred EEEEe
Q psy8018 69 LLKRV 73 (280)
Q Consensus 69 ~l~~~ 73 (280)
||+.+
T Consensus 84 Hlvvr 88 (111)
T PF13881_consen 84 HLVVR 88 (111)
T ss_dssp EEEE-
T ss_pred EEEec
Confidence 77775
No 67
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=98.38 E-value=1e-06 Score=67.74 Aligned_cols=77 Identities=16% Similarity=0.253 Sum_probs=55.0
Q ss_pred EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCce--eeEEeEEEEEEEEECCe-EEEeEEEEecCCCccccc
Q psy8018 160 PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGV--QRIIGRIHMVQIAIEND-FLTTSFSVLEDQPMDMLL 236 (280)
Q Consensus 160 ~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~--~~~~g~i~~~~i~IG~~-~~~~~~~Vl~~~~~d~IL 236 (280)
.+++||||||..|++.....++. .....+ ..++.++ .++.| .....+.+|.. .++..|.|.+- ...||
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~--~l~AANgt~I~tyG-~~~l~ldlGlrr~~~w~FvvAdv--~~pIl 79 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPL--TLQAANGTPIATYG-TRSLTLDLGLRRPFAWNFVVADV--PHPIL 79 (89)
T ss_pred CcEEEEeCCCceEeecccccccc----ccCCce--EEEeCCCCeEeeee-eEEEEEEcCCCcEEeEEEEEcCC--Cccee
Confidence 47899999999999998877653 112222 2333333 46677 44558889974 89999999763 34799
Q ss_pred chhhhhcce
Q psy8018 237 GLDMLKRHQ 245 (280)
Q Consensus 237 G~D~L~~~~ 245 (280)
|.|||++|+
T Consensus 80 GaDfL~~~~ 88 (89)
T cd06094 80 GADFLQHYG 88 (89)
T ss_pred cHHHHHHcC
Confidence 999999985
No 68
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=98.37 E-value=2.3e-06 Score=65.86 Aligned_cols=86 Identities=16% Similarity=0.179 Sum_probs=58.6
Q ss_pred EEEEcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCc
Q psy8018 153 NCRVNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPM 232 (280)
Q Consensus 153 ~~~InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~ 232 (280)
.++|+|+++.|++||||..|+++.....+-. .....+ -...|+|+....-+.....+++.+.....++.|.++...
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~p~~w--~~~~~~--~~i~GIGG~~~~~~~~~v~i~i~~~~~~g~vlv~~~~~P 77 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAENDWPKNW--PIQPAP--SNLTGIGGAITPSQSSVLLLEIDGEGHLGTILVYVLSLP 77 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEcccccCCCC--ccCCCC--eEEEeccceEEEEEEeeEEEEEcCCeEEEEEEEccCCCc
Confidence 5789999999999999999999974432211 000111 245688876555556567999988776667777664234
Q ss_pred ccccchhhhh
Q psy8018 233 DMLLGLDMLK 242 (280)
Q Consensus 233 d~ILG~D~L~ 242 (280)
.-|+|-|.|.
T Consensus 78 ~nllGRd~L~ 87 (87)
T cd05482 78 VNLWGRDILS 87 (87)
T ss_pred ccEEccccCC
Confidence 4789988773
No 69
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=98.33 E-value=1.7e-06 Score=73.51 Aligned_cols=66 Identities=17% Similarity=0.211 Sum_probs=41.7
Q ss_pred EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCcee--eEEeEEEE--EEEEECCeEEEeEEEEec
Q psy8018 160 PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQ--RIIGRIHM--VQIAIENDFLTTSFSVLE 228 (280)
Q Consensus 160 ~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~--~~~g~i~~--~~i~IG~~~~~~~~~Vl~ 228 (280)
++++|+|||||.|+|+.++|++|+|+..-. +. ...|+++. ...-.+.. ..+.+++..+.+.+.+++
T Consensus 12 ~~~~LlDsGSq~SfIt~~la~~L~L~~~~~-~~--~~~~~~g~~~~~~~~~~~~~i~~~~~~~~~~i~alvv~ 81 (164)
T PF05585_consen 12 EARALLDSGSQRSFITESLANKLNLPGTGE-KI--LVIGTFGSSSPKSKKCVRVKISSRTSNNSLEIEALVVP 81 (164)
T ss_pred EEEEEEecCCchhHHhHHHHHHhCCCCCCc-eE--EEEeccCccCccceeEEEEEEEEecCCCceEEEEEecC
Confidence 678999999999999999999999986522 21 22333321 11112222 245555555677777665
No 70
>KOG0006|consensus
Probab=97.79 E-value=4.3e-05 Score=70.99 Aligned_cols=71 Identities=17% Similarity=0.310 Sum_probs=61.1
Q ss_pred CEEEEEc---CCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTS---LDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~---~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|.+.|.. ..-..++++|+.+.+|.+||+.++...|+|+++.+++|.||.|.++ .|++.+.+..-+.++++.
T Consensus 1 m~~lvqf~~~~~~h~l~v~v~~~t~I~~lke~Vak~~gvp~D~L~viFaGKeLs~~-ttv~~cDL~qqs~~hi~~ 74 (446)
T KOG0006|consen 1 MIVLVQFNKTGSSHGLPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELSND-TTVQNCDLSQQSATHIML 74 (446)
T ss_pred CeEEEEeCCccccCceeEEEecCCCHHHHHHHHHHhhCCChhheEEEEeccccccC-ceeecccccccchhhhhc
Confidence 5666663 2345678999999999999999999999999999999999999986 899988888888888774
No 71
>cd01811 OASL_repeat1 2'-5' oligoadenylate synthetase-like protein, repeat 1 of 2. OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Probab=97.79 E-value=0.00014 Score=53.96 Aligned_cols=70 Identities=23% Similarity=0.270 Sum_probs=60.1
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC----Cee-eCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ----GNA-LVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~----Gk~-L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.||||+-.++..+++-|+|..+|..+|+.|+...|++- .|+|.|. -++ |.+ .++|++|||=.+-.|.|.-
T Consensus 1 iqVtV~q~g~~dl~l~vnPy~pI~k~K~kI~~~~~~~g-~qrLsfQepgg~rqlL~s-~~sLA~yGiFs~~~i~lle 75 (80)
T cd01811 1 IQVTVEQTGYSDWILRVNPYSPIRKIKEKIRRSRNCSG-LQRLSFQEPGGERQLLSS-RKSLADYGIFSKTNICLLE 75 (80)
T ss_pred CEEEeeecCCCceEEEeCCcchHHHHHHHHHHhhCccc-ceEEEeecCCcccccccc-cccHhhhcceeccEEEEEe
Confidence 47999988999999999999999999999999999987 9999982 233 454 4899999998888877765
No 72
>PF08817 YukD: WXG100 protein secretion system (Wss), protein YukD; InterPro: IPR014921 YukD is a bacterial protein that adopts a ubiquitin-like fold []. Ubiquitin covalently binds to protein and flags them for protein degradation, however conjugation assays have indicated that YukD lacks the capacity for covalent bond formation with other proteins []. ; PDB: 2BPS_B.
Probab=97.75 E-value=7.9e-05 Score=55.95 Aligned_cols=69 Identities=22% Similarity=0.339 Sum_probs=48.4
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC---c---ceEEE-ECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISA---Q---EIAIE-FQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~---~---~Q~Li-~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
+|||...+|+.+.+.++.+.+|.+|...+-...+.+. . ..+|. -+|.+|.++ .||+++||.+||+|.|.
T Consensus 4 rVtv~~~~~~~~Dl~lP~~vpv~~li~~l~~~~~~~~~~~~~~~~~~L~~~~g~~L~~~-~tL~~~gV~dGd~L~L~ 79 (79)
T PF08817_consen 4 RVTVDAGNGRQVDLALPADVPVAELIPELVELLGLPGDDPPGHGQWVLARAGGRPLDPD-QTLADAGVRDGDVLVLR 79 (79)
T ss_dssp EEEEE-TT--EEEEEEETTSBTTHHHHHHHHHS---S---TT-E-EEEG-GGTEEEETT-SBCGGGT--TT-EEEE-
T ss_pred EEEEEcCCCcEEEEEcCCCCcHHHHHHHHHHHhCCccCCCCCcceEEEEecCCcccCCc-CcHhHcCCCCCCEEEeC
Confidence 4566665578999999999999999999988877643 2 24665 689999865 99999999999999873
No 73
>KOG1872|consensus
Probab=97.73 E-value=9.3e-05 Score=71.96 Aligned_cols=65 Identities=12% Similarity=0.195 Sum_probs=60.8
Q ss_pred CCCcEEEEE-eCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 8 LDNECFLLD-VSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 8 ~~g~~~~le-V~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
-.|+.++++ ++.+.|...||+.+...+|+||+.|++.+.|+.+.|| -.+...+|++|.+|+++..
T Consensus 10 W~gk~y~v~~l~~d~t~~vlKaqlf~LTgV~PeRQKv~vKGg~a~dd-~~~~al~iKpn~~lmMmGt 75 (473)
T KOG1872|consen 10 WGGKKYPVETLSTDETPSVLKAQLFALTGVPPERQKVMVKGGLAKDD-VDWGALQIKPNETLMMMGT 75 (473)
T ss_pred ecCccccceeccCCCchHHHHHHHHHhcCCCccceeEEEeccccccc-ccccccccCCCCEEEeecc
Confidence 478999998 9999999999999999999999999999999999986 5788999999999999876
No 74
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=97.51 E-value=0.0001 Score=60.37 Aligned_cols=46 Identities=54% Similarity=0.858 Sum_probs=40.2
Q ss_pred CcccccchhhhhcceEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 231 PMDMLLGLDMLKRHQVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 231 ~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
+..-++| +-|.+.|-+.+..+.++.+++|+.+++|++|+|||||||
T Consensus 75 G~~~i~G----~Ih~~~l~ig~~~~~~s~~Vle~~~~d~llGld~L~~~~ 120 (124)
T PF09668_consen 75 GTQKILG----RIHSVQLKIGGLFFPCSFTVLEDQDVDLLLGLDMLKRHK 120 (124)
T ss_dssp ---EEEE----EEEEEEEEETTEEEEEEEEEETTSSSSEEEEHHHHHHTT
T ss_pred CcCceeE----EEEEEEEEECCEEEEEEEEEeCCCCcceeeeHHHHHHhC
Confidence 3445777 779999999999999999999999999999999999997
No 75
>PF00789 UBX: UBX domain; InterPro: IPR001012 The UBX domain is found in ubiquitin-regulatory proteins, which are members of the ubiquitination pathway, as well as a number of other proteins including FAF-1 (FAS-associated factor 1), the human Rep-8 reproduction protein and several hypothetical proteins from yeast. The function of the UBX domain is not known although the fragment of avian FAF-1 containing the UBX domain causes apoptosis of transfected cells.; GO: 0005515 protein binding; PDB: 3QX1_A 1H8C_A 3QCA_B 3QQ8_B 3QC8_B 3R3M_A 3QWZ_B 1S3S_H 1JRU_A 1I42_A ....
Probab=97.32 E-value=0.0019 Score=48.33 Aligned_cols=70 Identities=17% Similarity=0.253 Sum_probs=60.4
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcc-eEEE--ECCeeeCCCc-CCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQE-IAIE--FQGNALVDNK-KTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~-Q~Li--~~Gk~L~d~~-~tL~~~gI~~gd~l~l~ 71 (280)
+|-|+.++|....-...++.||.+|.+.|+.....+... ..|+ |--+.+.+++ +||+++|+.++.+|+|.
T Consensus 8 ~I~vRlpdG~~l~~~F~~~~tl~~l~~~v~~~~~~~~~~~f~L~~~~Pr~~l~~~~~~tl~e~~l~p~~~l~v~ 81 (82)
T PF00789_consen 8 RIQVRLPDGSRLQRRFPKSDTLQDLYDFVESQLFSPEESDFELITAFPRRELTDEDSKTLEEAGLLPSATLIVE 81 (82)
T ss_dssp EEEEEETTSTEEEEEEETTSBHHHHHHHHHHHHHCTTTSSEEEEESSSTEECCSTTTSBTCCCTTSSCEEEEEE
T ss_pred EEEEECCCCCEEEEEECCcchHHHHHHHHHHhcCCCCCccEEEEeCCCCcCCCccccccHHHhcCCCCeEEEEE
Confidence 477889999999999999999999999999998777765 7775 5678886654 89999999999999885
No 76
>KOG1769|consensus
Probab=97.30 E-value=0.0029 Score=49.55 Aligned_cols=74 Identities=16% Similarity=0.330 Sum_probs=64.2
Q ss_pred EEEEEc--CCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEeccC
Q psy8018 2 KVTVTS--LDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQVA 76 (280)
Q Consensus 2 ~ItV~~--~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~~p 76 (280)
+|+++. -++...-+-|.-+..+..|+..-....|++.++.+++|+|+.+..+ .|-++++.++||.|-+..-|..
T Consensus 20 hi~LKV~gqd~~~~~Fkikr~t~LkKLM~aYc~r~Gl~~~s~RFlFdG~rI~~~-~TP~~L~mEd~D~Iev~~~q~g 95 (99)
T KOG1769|consen 20 HINLKVKGQDGSVVVFKIKRHTPLKKLMKAYCERQGLSMNSLRFLFDGQRIRET-HTPADLEMEDGDEIEVVQEQTG 95 (99)
T ss_pred eEEEEEecCCCCEEEEEeecCChHHHHHHHHHHHcCCccceEEEEECCcCcCCC-CChhhhCCcCCcEEEEEeeccc
Confidence 455554 4567778899999999999999999999999999999999999876 8999999999999998875443
No 77
>KOG0012|consensus
Probab=97.21 E-value=0.00019 Score=67.72 Aligned_cols=46 Identities=52% Similarity=0.884 Sum_probs=42.6
Q ss_pred CcccccchhhhhcceEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 231 PMDMLLGLDMLKRHQVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 231 ~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
+..-|+| +-|+|.+-..+..|..+.++++.++|||+.|||||||||
T Consensus 286 g~~ki~g----~Ih~~~lki~~~~l~c~ftV~d~~~~d~llGLd~Lrr~~ 331 (380)
T KOG0012|consen 286 GTEKILG----RIHQAQLKIEDLYLPCSFTVLDRRDMDLLLGLDMLRRHQ 331 (380)
T ss_pred Ccccccc----eeEEEEEEeccEeeccceEEecCCCcchhhhHHHHHhcc
Confidence 4556777 789999999999999999999999999999999999997
No 78
>KOG4495|consensus
Probab=97.15 E-value=0.00056 Score=53.13 Aligned_cols=63 Identities=21% Similarity=0.162 Sum_probs=47.7
Q ss_pred CEEEEEcC-CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCccccCCC
Q psy8018 1 MKVTVTSL-DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDYNIQ 63 (280)
Q Consensus 1 M~ItV~~~-~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~gI~ 63 (280)
|.++++.- ...++-++.+++.||-+||.+++.-..-|+++|+|+. +--+|-+|++||+++|..
T Consensus 1 ~~~f~~VrR~kttif~da~es~tV~elK~~l~gi~~~Pvn~qrL~kmd~eqlL~D~ktL~d~gft 65 (110)
T KOG4495|consen 1 MDVFLRVRRHKTTIFTDAKESSTVFELKRKLEGILKRPVNEQRLYKMDTEQLLDDGKTLGDCGFT 65 (110)
T ss_pred CceeeeeeecceeEEeecCccccHHHHHHHHHHHHhCCCcchheeecCHHHHhhccchhhhcccc
Confidence 44555532 2335568999999999999999999999999999987 442444456999999653
No 79
>KOG3206|consensus
Probab=97.14 E-value=0.0065 Score=53.60 Aligned_cols=72 Identities=24% Similarity=0.332 Sum_probs=58.2
Q ss_pred EEEEEcCC-CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEE-EECC-----eeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLD-NECFLLDVSEDLELENFKAFCEVQSGISAQEIAI-EFQG-----NALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~-g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~L-i~~G-----k~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
++.|.+.. .-......++++||.+||.+++-.+|.+++...| +|+| -.|.|++..|..|+..||-.||+.-.
T Consensus 3 ~v~Iss~~~~~~~Ekr~~~~ltl~q~K~KLe~~~G~~~~~M~l~l~~~~d~~~~~lsn~d~~lg~~~~~Dg~rihviD~ 81 (234)
T KOG3206|consen 3 RVVISSSLNDFRTEKRLSNSLTLAQFKDKLELLTGTEAESMELELYDGDDKKVSALSNEDADLGFYKVEDGLRIHVIDS 81 (234)
T ss_pred EEEEecccccchhhhhcCCcCcHHHHHhhhhhhhCCCccceEEEEEcCCCceeeeccCCcccccccCCCCceEEEEEec
Confidence 35555432 3234567789999999999999999999999998 4665 45777779999999999999999875
No 80
>smart00166 UBX Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p.
Probab=97.08 E-value=0.0038 Score=46.78 Aligned_cols=70 Identities=11% Similarity=0.210 Sum_probs=58.6
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE--ECCeeeCCC--cCCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE--FQGNALVDN--KKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li--~~Gk~L~d~--~~tL~~~gI~~gd~l~l~ 71 (280)
+|-|+.++|..+.....++.||++|.+.|....+.+.....|+ |--+.+.++ ++||+++|+.++.+|+|.
T Consensus 6 ~I~iRlPdG~ri~~~F~~~~tl~~v~~~v~~~~~~~~~~f~L~t~~Prk~l~~~d~~~tL~e~gL~p~~~l~v~ 79 (80)
T smart00166 6 RLQIRLPDGSRLVRRFPSSDTLRTVYEFVSAALTDGNDPFTLNSPFPRRTFTKDDYSKTLLELALLPSSTLVLE 79 (80)
T ss_pred EEEEEcCCCCEEEEEeCCCCcHHHHHHHHHHcccCCCCCEEEEeCCCCcCCccccccCCHHHCCCCCceEEEEe
Confidence 5778999999999999999999999999977667666677775 566778643 579999999999999874
No 81
>PF13019 Telomere_Sde2: Telomere stability and silencing
Probab=96.97 E-value=0.043 Score=46.88 Aligned_cols=66 Identities=15% Similarity=0.173 Sum_probs=51.6
Q ss_pred CEEEEEcCCC----cEEEEEeCCCccHHHHHHHHHHHhCCCCcce-EEEE-CCeee-CCCcCCccccCCCCCC
Q psy8018 1 MKVTVTSLDN----ECFLLDVSEDLELENFKAFCEVQSGISAQEI-AIEF-QGNAL-VDNKKTLKDYNIQDGD 66 (280)
Q Consensus 1 M~ItV~~~~g----~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q-~Li~-~Gk~L-~d~~~tL~~~gI~~gd 66 (280)
|+|+|++.+| .++++.++++.||.+|+..|....++|+..| .|++ .|+.| .+++..++.+.-.+++
T Consensus 1 i~Vlvss~~g~~lp~tl~~~lp~~ttv~dL~~~l~~~~~~~~~~~~~L~~~~n~~l~~~~~~~~s~l~~~~~~ 73 (162)
T PF13019_consen 1 INVLVSSFDGLTLPPTLSLSLPSTTTVSDLKDRLSERLPIPSSSQLYLTTNSNGQLSPSSDIPLSSLLSSSQD 73 (162)
T ss_pred CeEEEecCCCCCCCCeEEeeCCCCCcHHHHHHHHHhhcCCCccceeEEEEeCCCeeCCCccccHHhhccCcCC
Confidence 7899999999 6889999999999999999999999998874 4555 35555 3344567777655544
No 82
>COG5417 Uncharacterized small protein [Function unknown]
Probab=96.85 E-value=0.0071 Score=44.94 Aligned_cols=69 Identities=25% Similarity=0.387 Sum_probs=55.4
Q ss_pred CEEEEE--cCCCcEEEEEeCCCccHHHHHHHHHHHhCCC--C---cceEEEECCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 1 MKVTVT--SLDNECFLLDVSEDLELENFKAFCEVQSGIS--A---QEIAIEFQGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 1 M~ItV~--~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip--~---~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
|+|||- .-+|..+.|-++.-.++..|-..+.+...|. . .+.+..-+++.|.++ ..|.+|||.+||.|.+
T Consensus 5 ikVTvD~t~y~g~~yDLrl~d~~pikklIdivwe~~kis~~~reg~~Ikv~nKa~llsgd-~kL~d~~IadGD~Lei 80 (81)
T COG5417 5 IKVTVDFTNYNGGTYDLRLPDYLPIKKLIDIVWESLKISIFDREGTQIKVMNKAQLLSGD-DKLIDYQIADGDILEI 80 (81)
T ss_pred EEEEEEeEecCCceEEEeccccchHHHHHHHHHHHhhccccccCCCEEEEeccceEecCC-ceEEeccccCCCEEEe
Confidence 345553 3579999999999999999999887776552 2 356777889999876 8999999999999875
No 83
>KOG3493|consensus
Probab=96.85 E-value=0.00052 Score=49.57 Aligned_cols=68 Identities=22% Similarity=0.297 Sum_probs=57.0
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
++++..--|+...+...+++||.++|.+|++++|--++..+|---+..++|. -+|++|.|.+|--+.|
T Consensus 3 ev~~nDrLGKKVRvKCn~dDtiGD~KKliaaQtGT~~~kivl~k~~~i~kd~-I~L~dyeihdg~~lel 70 (73)
T KOG3493|consen 3 EVVLNDRLGKKVRVKCNTDDTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKDH-ITLSDYEIHDGMNLEL 70 (73)
T ss_pred eehhhhhcCceEEEEeCCcccccCHHHHHHHhhCCChhHhHHHhhhhhhhcc-cceeeEEeccCccEEE
Confidence 3455555688999999999999999999999999999988887556666764 8999999999987765
No 84
>cd01767 UBX UBX (ubiquitin regulatory X) domain. The UBX (ubiquitin regulatory X) domain has a beta-grasp fold that is structurally quite similar to ubiquitin although UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Most UBX-containing proteins including p47, FAF1, and SAKS1 (Y33K) also contain a UBA (ubiquitin-associated) domain and are thought to serve as adaptor molecules that shuttle proteins to the proteasome for degradation.
Probab=96.84 E-value=0.0091 Score=44.31 Aligned_cols=69 Identities=17% Similarity=0.292 Sum_probs=54.6
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE--ECCeeeCC--CcCCccccCCCCCCEEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE--FQGNALVD--NKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li--~~Gk~L~d--~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|-|+.++|........++.||++|.+.|+....- .....|+ |-.+.+.+ .+.||+++|+. ++.+.+..
T Consensus 4 ~i~iRlpdG~~~~~~F~~~~tl~~l~~fv~~~~~~-~~~f~L~t~~Pr~~~~~~~~~~TL~e~gL~-~s~~~~~~ 76 (77)
T cd01767 4 KIQIRLPDGKRLEQRFNSTHKLSDVRDFVESNGPP-AEPFTLMTSFPRRVLTDLDYELTLQEAGLV-NEVVFQRL 76 (77)
T ss_pred EEEEEcCCCCEEEEEeCCCCCHHHHHHHHHHcCCC-CCCEEEEeCCCCccCCCCCccCcHHHcCCc-cceEEEEe
Confidence 47788999999999999999999999999877543 4556665 45677865 67899999999 56665543
No 85
>cd01770 p47_UBX p47-like ubiquitin domain. p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=96.76 E-value=0.0098 Score=44.77 Aligned_cols=67 Identities=15% Similarity=0.257 Sum_probs=54.5
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCC-CcceEEE--ECCeeeCCCcCCccccCCCCCCEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGIS-AQEIAIE--FQGNALVDNKKTLKDYNIQDGDLVL 69 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip-~~~Q~Li--~~Gk~L~d~~~tL~~~gI~~gd~l~ 69 (280)
+|-|+.++|+.+......+.||++|.+.|..+.+-+ .....|. |=.|.|.+++.||+++|+.+ +.|.
T Consensus 6 ~iqiRlpdG~r~~~rF~~~~tv~~l~~~v~~~~~~~~~~~f~L~t~fP~k~l~~~~~Tl~eagL~~-s~v~ 75 (79)
T cd01770 6 SIQIRLADGKRLVQKFNSSHRVSDVRDFIVNARPEFAARPFTLMTAFPVKELSDESLTLKEANLLN-AVIV 75 (79)
T ss_pred EEEEECCCCCEEEEEeCCCCcHHHHHHHHHHhCCCCCCCCEEEecCCCCcccCCCCCcHHHCCCcC-cEEE
Confidence 477899999999999999999999999999875432 3456775 66888987789999999985 4444
No 86
>PF11470 TUG-UBL1: GLUT4 regulating protein TUG; InterPro: IPR021569 TUG is a GLUT4 regulating protein and functions to retain membrane vesicles containing GLUT4 intracellularly. TUG releases the GLUT4 containing vesicles to the cellular exocytic machinery in response to insulin stimulation which allows translocation to the plasma membrane []. TUG has an N-terminal ubiquitin-like domain (UBL1) which in similar proteins appears to participate in protein-protein interactions []. The region does have a area of negative electrostatic potential and increased backbone motility which leads to suggestions of a potential protein-protein interaction site []. ; PDB: 2AL3_A.
Probab=96.70 E-value=0.0092 Score=43.41 Aligned_cols=63 Identities=11% Similarity=0.125 Sum_probs=45.4
Q ss_pred cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 7 SLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 7 ~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
..+++.+++.|.|++++.++-+..-.++|+++++-.|.|++++|.- +-++.-.|+-+|..|.|
T Consensus 3 ~~~~rr~~vkvtp~~~l~~VL~eac~k~~l~~~~~~L~h~~k~ldl-slp~R~snL~n~akLeL 65 (65)
T PF11470_consen 3 CYNFRRFKVKVTPNTTLNQVLEEACKKFGLDPSSYDLKHNNKPLDL-SLPFRLSNLPNNAKLEL 65 (65)
T ss_dssp -TTS-EEEE---TTSBHHHHHHHHHHHTT--GGG-EEEETTEEESS-S-BHHHH---SS-EEEE
T ss_pred ccCCcEEEEEECCCCCHHHHHHHHHHHcCCCccceEEEECCEEecc-ccceeecCCCCCCEEeC
Confidence 4578899999999999999999999999999999999999999954 58999999999998865
No 87
>cd01772 SAKS1_UBX SAKS1-like UBX domain. SAKS1 (SAPK-substrate-1), also known as Y33K, is a widely expressed protein containing N-terminal UBA (ubiquitin-associated) and C-terminal UBX (ubiqiutin-like) domains that was identified as a substrate of stress-activated protein kinases (SAPKs). SAKS1 is related evolutionarily to two other UBA/UBX-containing proteins, p47 and Faf1. The UBA and UBX domains of SAKS1 bind ubiquitin tetramers and valosin-containing protein (VCP), respectively suggesting a role for SAKS1 as an adaptor that directs VCP to polyubiquitinated proteins facilitating its destruction by the proteasome. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=96.58 E-value=0.018 Score=43.20 Aligned_cols=69 Identities=13% Similarity=0.332 Sum_probs=56.3
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE--ECCeeeCC--CcCCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE--FQGNALVD--NKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li--~~Gk~L~d--~~~tL~~~gI~~gd~l~l~ 71 (280)
+|-|+.++|....-..+.+.|+++|.+.|+...+.+ ....|+ |--|.+.+ .++||+++|+.+..+|+|.
T Consensus 6 ~i~iRlp~G~~~~~~F~~~~tl~~v~~fV~~~~~~~-~~f~L~t~fPrk~~~~~d~~~TL~elgL~Psa~L~v~ 78 (79)
T cd01772 6 RIQIRLLDGTTLKQTFKAREQLAAVRLFVELNTGNG-GPFTLMTPFPRKVFTEDDMEKPLQELGLVPSAVLIVT 78 (79)
T ss_pred EEEEECCCCCEEEEEeCCCChHHHHHHHHHHcCCCC-CCEEEEeCCCCeECCcccccCCHHHCCCCCceEEEEe
Confidence 577889999999999999999999999999775443 445665 56777853 3589999999999999874
No 88
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=96.49 E-value=0.0068 Score=47.55 Aligned_cols=57 Identities=18% Similarity=0.236 Sum_probs=40.9
Q ss_pred EEEEEcCCC-cEEEEEeC--CCccHHHHHHHHHHHhC--CCCcceEEEECCeeeCCCcCCccc
Q psy8018 2 KVTVTSLDN-ECFLLDVS--EDLELENFKAFCEVQSG--ISAQEIAIEFQGNALVDNKKTLKD 59 (280)
Q Consensus 2 ~ItV~~~~g-~~~~leV~--~~~tV~~LK~~Ie~e~g--ip~~~Q~Li~~Gk~L~d~~~tL~~ 59 (280)
.|+|+-.++ --.+|+|+ .+.||..||.+|.+... ..-..+||+|+||.|.|. ..|+.
T Consensus 2 ~l~IRFs~sipDl~L~I~~~~~~Tv~~LK~lIR~~~p~~~s~~rLRlI~~Gr~L~d~-t~l~~ 63 (97)
T PF10302_consen 2 YLTIRFSDSIPDLPLDIPSPNTTTVAWLKQLIRERLPPEPSRRRLRLIYAGRLLNDH-TDLSS 63 (97)
T ss_pred eEEEEECCCCCCceeecCCCCcccHHHHHHHHHhhcCCCCccccEEeeecCcccCcc-chhhh
Confidence 366764331 12567777 79999999999999872 334578999999999975 44443
No 89
>KOG1639|consensus
Probab=96.40 E-value=0.0069 Score=54.86 Aligned_cols=71 Identities=23% Similarity=0.388 Sum_probs=54.6
Q ss_pred CEEEEEcC-CCcEEE-EEeCCCccHHHHH-HHHHHHhCCCCcceEEEE----CCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSL-DNECFL-LDVSEDLELENFK-AFCEVQSGISAQEIAIEF----QGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~-~g~~~~-leV~~~~tV~~LK-~~Ie~e~gip~~~Q~Li~----~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|.||+..- +|...+ .+.+...|+.|++ +....+..+.+..+|+.+ +|++|.|+ .+|++||..+|++|.+.-
T Consensus 1 m~It~~srs~~~~~~~~~~s~~~ti~d~~~~~~~~~~k~~~~~~r~tlr~e~kgkpl~~~-s~l~e~~~~s~~~i~vKD 78 (297)
T KOG1639|consen 1 MEITIASRSKGLRIKEKDLSGSETIDDLLKAISAKNLKITPYRIRLTLRVEPKGKPLIDN-SKLQEYGDGSGATIYVKD 78 (297)
T ss_pred CceeeeccCCCceeeeecCCCCCcHHHHHHHHHHhhhccCccchhheeeccCCCccccch-hHHHHhccCCCCEEEEec
Confidence 88999864 444555 6777889999999 555666778775555443 79999986 789999999999998864
No 90
>cd01774 Faf1_like2_UBX Faf1 ike-2 UBX domain. Faf1_like2 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=96.40 E-value=0.031 Score=42.72 Aligned_cols=69 Identities=10% Similarity=0.057 Sum_probs=57.5
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC--eeeC-------CCcCCccccCCCCCCEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG--NALV-------DNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G--k~L~-------d~~~tL~~~gI~~gd~l~l~ 71 (280)
+|-|+.++|+...--...+.||++|.+.|.. .+.+++...|+.+= |.+. |.+.||++.|+.+..+|+|.
T Consensus 6 ~I~iRlp~G~Rl~rrF~~~~tl~~l~~fv~~-~~~~~~~f~L~t~FPrr~~~~~~~~~~~~~~TL~eaGL~~s~~L~V~ 83 (85)
T cd01774 6 KIVFKLPNGTRVERRFLFTQSLRVIHDFLFS-LKETPEKFQIVTNFPRRVLPCLPSEGDPPPPTLLEAGLSNSEVLFVQ 83 (85)
T ss_pred EEEEECCCCCEEEEEeCCCCcHHHHHHHHHh-CCCCCCcEEEecCCCCccccccccccCcCCCCHHHcCCCCccEEEEe
Confidence 5778899999999999999999999999965 45566888888765 7785 34579999999999998874
No 91
>cd01771 Faf1_UBX Faf1 UBX domain. Faf1 (fas-associated factor1) is a nucleolar protein that was first identified as an interaction partner of the death receptor Fas. Faf1 contains N-terminal UAS (ubiquitin-associated) and C-terminal UBX (ubiquitin-like) domains and is closely related to other UBA/UBX-containing proteins like p47, Rep8 and SAKS1. Faf1 is thought to be involved in 18S rRNA synthesis and/or 40S ribosomal subunit assembly. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=95.98 E-value=0.059 Score=40.67 Aligned_cols=70 Identities=17% Similarity=0.254 Sum_probs=58.8
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE--ECCeeeC--CCcCCccccCCCCCCEEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE--FQGNALV--DNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li--~~Gk~L~--d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|-|+.++|....-....+.++++|-..+..+ |.++...+|+ |==|++. |.+.||++.|+.+..+|+|..
T Consensus 6 ~i~iRlP~G~r~~rrF~~t~~L~~l~~fv~~~-~~~~~~f~L~t~fPRk~~~~~d~~~TL~e~gL~p~~~L~Vee 79 (80)
T cd01771 6 KLRVRTPSGDFLERRFLGDTPLQVLLNFVASK-GYPIDEYKLLSSWPRRDLTQLDPNFTLLELKLYPQETLILEE 79 (80)
T ss_pred EEEEECCCCCEEEEEeCCCCcHHHHHHHHHhc-CCCCCCEEEecCCCCCCCcCCCCCCcHHHcCCCCCcEEEEEc
Confidence 57788999999999999999999999999875 7787888886 4456674 445799999999999998853
No 92
>cd01773 Faf1_like1_UBX Faf1 ike-1 UBX domain. Faf1_like1 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=95.85 E-value=0.086 Score=40.11 Aligned_cols=71 Identities=14% Similarity=0.218 Sum_probs=59.8
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE--ECCeeeC--CCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE--FQGNALV--DNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li--~~Gk~L~--d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+|-|+.++|+...-....+.++.+|-..++. .|.+++...|+ |==|++. |.+.||+++|+.+-.+|+|.-|
T Consensus 7 ~i~vRlP~G~r~~rrF~~~~~L~~v~~fv~~-~g~~~~~f~L~t~FPRr~~~~~d~~~TL~e~GL~P~~~LfVq~r 81 (82)
T cd01773 7 RLMLRYPDGKREQIALPEQAKLLALVRHVQS-KGYPNERFELLTNFPRRKLSHLDYDITLQEAGLCPQETVFVQER 81 (82)
T ss_pred EEEEECCCCCEEEEEeCCCCcHHHHHHHHHh-cCCCCCCEEEecCCCCcccCCcccCCCHHHcCCCCCcEEEEecC
Confidence 5789999999999999999999999999998 57788888887 4456663 3357999999999999998643
No 93
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=95.61 E-value=0.055 Score=42.66 Aligned_cols=65 Identities=22% Similarity=0.366 Sum_probs=44.8
Q ss_pred EEEEcCCCcceecCHHHHHHcCCccccccce-eeeeec-CceeeEEeEEEEEEEEECCeEEEeEEEEecC
Q psy8018 162 KAFIDSGAQTTIMSAACAARVNITRLIDTRW-AGIAKG-VGVQRIIGRIHMVQIAIENDFLTTSFSVLED 229 (280)
Q Consensus 162 ~alVDTGAq~tiis~~~A~rlgL~~~~d~r~-~g~a~g-vG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~ 229 (280)
-.|+|||||.++|..+.++...|+.. .| ..+.+| |=-.++.-+.....|.+.|..+...|.|+..
T Consensus 49 pclidtgaq~niiteetvrahklptr---pw~~sviyggvyp~kinrkt~kl~i~lngisikteflvvkk 115 (137)
T PF12382_consen 49 PCLIDTGAQVNIITEETVRAHKLPTR---PWSQSVIYGGVYPNKINRKTIKLNINLNGISIKTEFLVVKK 115 (137)
T ss_pred eeEEccCceeeeeehhhhhhccCCCC---cchhheEeccccccccccceEEEEEEecceEEEEEEEEEEe
Confidence 46889999999999999999888632 23 223332 2223444444445788888988888888864
No 94
>PF14453 ThiS-like: ThiS-like ubiquitin
Probab=95.50 E-value=0.06 Score=38.16 Aligned_cols=56 Identities=18% Similarity=0.296 Sum_probs=43.7
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|+|+|. | ..+++..+.|+.+||..+..+ .=.++|||-+..+| +-+++||.|++..|
T Consensus 1 M~I~vN---~--k~~~~~~~~tl~~lr~~~k~~------~DI~I~NGF~~~~d------~~L~e~D~v~~Ikk 56 (57)
T PF14453_consen 1 MKIKVN---E--KEIETEENTTLFELRKESKPD------ADIVILNGFPTKED------IELKEGDEVFLIKK 56 (57)
T ss_pred CEEEEC---C--EEEEcCCCcCHHHHHHhhCCC------CCEEEEcCcccCCc------cccCCCCEEEEEeC
Confidence 777773 3 357889999999999986643 23789999998765 46899999998764
No 95
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=95.39 E-value=0.046 Score=46.17 Aligned_cols=101 Identities=18% Similarity=0.224 Sum_probs=64.1
Q ss_pred EEEEcCE--EEEEEEcCCCcceecCHHHHHHc---CCcc---------cccc-------ceee--eeecCcee--eEEeE
Q psy8018 153 NCRVNGH--PVKAFIDSGAQTTIMSAACAARV---NITR---------LIDT-------RWAG--IAKGVGVQ--RIIGR 207 (280)
Q Consensus 153 ~~~InG~--~v~alVDTGAq~tiis~~~A~rl---gL~~---------~~d~-------r~~g--~a~gvG~~--~~~g~ 207 (280)
.+.++|- ..+|=+||||..|.+|..-.++. |-.. ..+. ...+ ...+-|++ ..-..
T Consensus 30 ~~~l~~l~~~~kAkiDTGA~TSsL~A~dI~~fkRdGe~WVRF~~~~~~~~~~~~~~~e~pvi~~ikvR~s~~~~~e~RpV 109 (162)
T COG4067 30 WVSLPGLKIQLKAKIDTGAVTSSLSASDIERFKRDGERWVRFRLADTDNLDQRSEECEAPVIRKIKVRSSSGSRAERRPV 109 (162)
T ss_pred EEEcCcccceeeeeecccceeeeEEeecceeeeeCCceEEEEEeecccCccccceeeccceEEEEEEecCCCCccccccE
Confidence 3455655 57899999999999987643332 1000 0000 0001 11122222 22222
Q ss_pred EEEEEEEECCeEEEeEEEEecCC--CcccccchhhhhcceEEEEecccE
Q psy8018 208 IHMVQIAIENDFLTTSFSVLEDQ--PMDMLLGLDMLKRHQVQIAIENDF 254 (280)
Q Consensus 208 i~~~~i~IG~~~~~~~~~Vl~~~--~~d~ILG~D~L~~~~~~ID~~~~~ 254 (280)
| ...+.+|+...+..|+.-+-. .+.+|||--+|+++.+.+|..+..
T Consensus 110 V-~~~l~lG~~~~~~E~tLtDR~~m~Yp~LlGrk~l~~~~~~VDpSr~y 157 (162)
T COG4067 110 V-RLTLCLGGRILPIEFTLTDRSNMRYPVLLGRKALRHFGAVVDPSRKY 157 (162)
T ss_pred E-EEEEeeCCeeeeEEEEeecccccccceEecHHHHhhCCeEECchhhc
Confidence 3 348999999999999888733 688999999999999999987654
No 96
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=95.22 E-value=0.13 Score=39.84 Aligned_cols=84 Identities=18% Similarity=0.325 Sum_probs=50.0
Q ss_pred EEEEEcC--EEEEEEEcCCCcceecCHHHHHHcCCc---cc-----------cccceeeeeecCceeeEEeEEEEEEEEE
Q psy8018 152 INCRVNG--HPVKAFIDSGAQTTIMSAACAARVNIT---RL-----------IDTRWAGIAKGVGVQRIIGRIHMVQIAI 215 (280)
Q Consensus 152 i~~~InG--~~v~alVDTGAq~tiis~~~A~rlgL~---~~-----------~d~r~~g~a~gvG~~~~~g~i~~~~i~I 215 (280)
+++.|+. +++.+++|||++.+.+..+-+..|+.. .. ....+ .+.+|-|+ ..|.+....+.+
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~~~~~~~~~sst~~~~~~~~-~~~Y~~g~--~~g~~~~D~v~i 77 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSHSSYDDPSASSTYSDNGCTF-SITYGTGS--LSGGLSTDTVSI 77 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccccccCCcCCCCCCCCCCcEE-EEEeCCCe--EEEEEEEEEEEE
Confidence 3577765 889999999999888865533333211 11 00111 12345443 456677778999
Q ss_pred CCeEEE-eEEEEecC--------CCcccccch
Q psy8018 216 ENDFLT-TSFSVLED--------QPMDMLLGL 238 (280)
Q Consensus 216 G~~~~~-~~~~Vl~~--------~~~d~ILG~ 238 (280)
|+..++ ..|.+... ...|+||||
T Consensus 78 g~~~~~~~~fg~~~~~~~~~~~~~~~~GilGL 109 (109)
T cd05470 78 GDIEVVGQAFGCATDEPGATFLPALFDGILGL 109 (109)
T ss_pred CCEEECCEEEEEEEecCCccccccccccccCC
Confidence 886543 44544431 356889986
No 97
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=95.15 E-value=0.092 Score=47.94 Aligned_cols=89 Identities=22% Similarity=0.483 Sum_probs=57.5
Q ss_pred EEEEEEEc--CEEEEEEEcCCCcceecC-HHHHHH--cCCccccccc----e--ee----eeecCceeeEEeEEEEEEEE
Q psy8018 150 LYINCRVN--GHPVKAFIDSGAQTTIMS-AACAAR--VNITRLIDTR----W--AG----IAKGVGVQRIIGRIHMVQIA 214 (280)
Q Consensus 150 l~i~~~In--G~~v~alVDTGAq~tiis-~~~A~r--lgL~~~~d~r----~--~g----~a~gvG~~~~~g~i~~~~i~ 214 (280)
.+++++|. ++++++++|||+..+.+. ..|... |.-....+.. + .+ ...|-|. ..|.+....+.
T Consensus 2 Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~~~y~~g~--~~G~~~~D~v~ 79 (317)
T PF00026_consen 2 YYINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFSISYGDGS--VSGNLVSDTVS 79 (317)
T ss_dssp EEEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEEEEETTEE--EEEEEEEEEEE
T ss_pred eEEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeeeeeccCcc--cccccccceEe
Confidence 57899997 899999999999988775 333332 2221111111 1 11 2234333 88999999999
Q ss_pred ECCeEEE-eEEEEecC--------CCcccccchhh
Q psy8018 215 IENDFLT-TSFSVLED--------QPMDMLLGLDM 240 (280)
Q Consensus 215 IG~~~~~-~~~~Vl~~--------~~~d~ILG~D~ 240 (280)
+|+..++ ..|..+.. ...|+|||+-+
T Consensus 80 ig~~~~~~~~f~~~~~~~~~~~~~~~~~GilGLg~ 114 (317)
T PF00026_consen 80 IGGLTIPNQTFGLADSYSGDPFSPIPFDGILGLGF 114 (317)
T ss_dssp ETTEEEEEEEEEEEEEEESHHHHHSSSSEEEE-SS
T ss_pred eeeccccccceeccccccccccccccccccccccC
Confidence 9998766 56666553 25799999983
No 98
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=95.13 E-value=0.13 Score=50.80 Aligned_cols=91 Identities=21% Similarity=0.330 Sum_probs=59.9
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecCH-HHH-HHcCCccccccc----------eeeeeecCceeeEEeEEEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMSA-ACA-ARVNITRLIDTR----------WAGIAKGVGVQRIIGRIHMVQI 213 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis~-~~A-~rlgL~~~~d~r----------~~g~a~gvG~~~~~g~i~~~~i 213 (280)
.+.|.+++|+ ++++++++|||+..+-+.. .|. ..|.-....+.. ...+.+|-|+ ..|.+....+
T Consensus 137 ~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~i~YG~Gs--v~G~~~~Dtv 214 (450)
T PTZ00013 137 IMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVDITYGSGT--VKGFFSKDLV 214 (450)
T ss_pred CEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEEEEECCce--EEEEEEEEEE
Confidence 4678899996 8899999999999998853 332 123222111211 0123456553 6899999999
Q ss_pred EECCeEEEeEEEEec----------CCCcccccchhh
Q psy8018 214 AIENDFLTTSFSVLE----------DQPMDMLLGLDM 240 (280)
Q Consensus 214 ~IG~~~~~~~~~Vl~----------~~~~d~ILG~D~ 240 (280)
.+|+..++..|..+. ....|+||||.+
T Consensus 215 ~iG~~~~~~~f~~~~~~~~~~~~~~~~~~dGIlGLg~ 251 (450)
T PTZ00013 215 TLGHLSMPYKFIEVTDTDDLEPIYSSSEFDGILGLGW 251 (450)
T ss_pred EECCEEEccEEEEEEeccccccceecccccceecccC
Confidence 999987765554432 125799999975
No 99
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=94.91 E-value=0.1 Score=47.24 Aligned_cols=30 Identities=20% Similarity=0.062 Sum_probs=26.3
Q ss_pred CcccccchhhhhcceEEEEecccEEEeeee
Q psy8018 231 PMDMLLGLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 231 ~~d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
....|||..|||.+-.+.|+.++++-+...
T Consensus 234 ~~~~ilG~~fl~~~~~vFD~~~~~iGfa~~ 263 (265)
T cd05476 234 GGVSILGNIQQQNFLVEYDLENSRLGFAPA 263 (265)
T ss_pred CCcEEEChhhcccEEEEEECCCCEEeeecC
Confidence 445899999999999999999999988654
No 100
>PRK06437 hypothetical protein; Provisional
Probab=94.91 E-value=0.19 Score=36.46 Aligned_cols=58 Identities=16% Similarity=0.315 Sum_probs=45.8
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
++.-++...++++++..||.+|-+. .|++++.-.+..||+++.. ++-+++||.|.+..
T Consensus 5 ~~v~g~~~~~~~i~~~~tv~dLL~~----Lgi~~~~vaV~vNg~iv~~------~~~L~dgD~Veiv~ 62 (67)
T PRK06437 5 IRVKGHINKTIEIDHELTVNDIIKD----LGLDEEEYVVIVNGSPVLE------DHNVKKEDDVLILE 62 (67)
T ss_pred EEecCCcceEEEcCCCCcHHHHHHH----cCCCCccEEEEECCEECCC------ceEcCCCCEEEEEe
Confidence 3333556678899999999988755 5888888888999999973 35689999999875
No 101
>PTZ00147 plasmepsin-1; Provisional
Probab=94.86 E-value=0.17 Score=50.03 Aligned_cols=92 Identities=17% Similarity=0.320 Sum_probs=60.8
Q ss_pred eeEEEEEEEEc--CEEEEEEEcCCCcceecC-HHHHH-HcCCccccccc----e------eeeeecCceeeEEeEEEEEE
Q psy8018 147 VIMLYINCRVN--GHPVKAFIDSGAQTTIMS-AACAA-RVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQ 212 (280)
Q Consensus 147 ~~~l~i~~~In--G~~v~alVDTGAq~tiis-~~~A~-rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~ 212 (280)
....|.+++|+ ++++.+++|||++.+-+. ..|.. .|.-....+.. + ..+.+|-| ...|.+....
T Consensus 137 n~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~i~Yg~G--svsG~~~~Dt 214 (453)
T PTZ00147 137 NVMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVEMNYVSG--TVSGFFSKDL 214 (453)
T ss_pred CCEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEEEEeCCC--CEEEEEEEEE
Confidence 35778999997 889999999999988874 44431 23322222211 0 11345555 3679998889
Q ss_pred EEECCeEEEeEEEEec----------CCCcccccchhh
Q psy8018 213 IAIENDFLTTSFSVLE----------DQPMDMLLGLDM 240 (280)
Q Consensus 213 i~IG~~~~~~~~~Vl~----------~~~~d~ILG~D~ 240 (280)
+.||+..++..|..+. ....|+||||-|
T Consensus 215 VtiG~~~v~~qF~~~~~~~~f~~~~~~~~~DGILGLG~ 252 (453)
T PTZ00147 215 VTIGNLSVPYKFIEVTDTNGFEPFYTESDFDGIFGLGW 252 (453)
T ss_pred EEECCEEEEEEEEEEEeccCcccccccccccceecccC
Confidence 9999987765554432 125799999976
No 102
>KOG0013|consensus
Probab=94.72 E-value=0.061 Score=47.68 Aligned_cols=63 Identities=24% Similarity=0.346 Sum_probs=55.7
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.++.+.+.+..-.|+.++|..+.+..|+.+-.|+++|+|++|.+. ..|++|+|..|..-++-.
T Consensus 155 T~~d~~lta~~~Dtv~eik~~L~Aaeg~D~~sQrif~Sg~~l~dk-t~LeEc~iekg~rYvlqv 217 (231)
T KOG0013|consen 155 TREDFWLTAPHYDTVGEIKRALRAAEGVDPLSQRIFFSGGVLVDK-TDLEECKIEKGQRYVLQV 217 (231)
T ss_pred hhhheeecccCcCcHHHHHHHHHHhhccchhhheeeccCCceecc-ccceeeeecCCCEEEEEE
Confidence 577788899999999999999999999999999999999999875 789999999996555443
No 103
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=94.50 E-value=0.18 Score=36.34 Aligned_cols=60 Identities=20% Similarity=0.228 Sum_probs=45.2
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+.++ +++..|+.+|-.. .++|+..-.+.+||..+.. ...+.+ +++||.|.+..
T Consensus 1 m~i~v---NG~~~~--~~~~~tl~~ll~~----l~~~~~~vav~~N~~iv~r--~~~~~~-L~~gD~ieIv~ 60 (65)
T PRK05863 1 MIVVV---NEEQVE--VDEQTTVAALLDS----LGFPEKGIAVAVDWSVLPR--SDWATK-LRDGARLEVVT 60 (65)
T ss_pred CEEEE---CCEEEE--cCCCCcHHHHHHH----cCCCCCcEEEEECCcCcCh--hHhhhh-cCCCCEEEEEe
Confidence 66766 666554 4577887777554 5889999999999998864 244566 99999999876
No 104
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=94.48 E-value=0.24 Score=35.47 Aligned_cols=60 Identities=17% Similarity=0.223 Sum_probs=44.5
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+.+.+ + ..|+.+|.+.+ +++++...+-.||+.+.. ...++.-+++||.|.+..
T Consensus 1 m~i~~---Ng~~~~~--~-~~tl~~Ll~~l----~~~~~~vavavN~~iv~~--~~~~~~~L~dgD~Ieiv~ 60 (65)
T PRK06488 1 MKLFV---NGETLQT--E-ATTLALLLAEL----DYEGNWLATAVNGELVHK--EARAQFVLHEGDRIEILS 60 (65)
T ss_pred CEEEE---CCeEEEc--C-cCcHHHHHHHc----CCCCCeEEEEECCEEcCH--HHcCccccCCCCEEEEEE
Confidence 66666 6776666 3 45888887654 677766667899999863 356677899999999876
No 105
>COG5227 SMT3 Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones]
Probab=94.46 E-value=0.064 Score=41.39 Aligned_cols=70 Identities=14% Similarity=0.237 Sum_probs=59.3
Q ss_pred cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEeccCC
Q psy8018 7 SLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQVAR 77 (280)
Q Consensus 7 ~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~~p~ 77 (280)
.-+|...-+.|-.+.+...|........|-..++.|++|+|+.+.-+ +|-++++..+||.|..+..|+.|
T Consensus 31 ~qd~telfFkiKktT~f~klm~af~~rqGK~m~slRfL~dG~rI~~d-qTP~dldmEdnd~iEav~eQvGG 100 (103)
T COG5227 31 DQDGTELFFKIKKTTTFKKLMDAFSRRQGKNMSSLRFLFDGKRIDLD-QTPGDLDMEDNDEIEAVTEQVGG 100 (103)
T ss_pred cCCCCEEEEEEeccchHHHHHHHHHHHhCcCcceeEEEEcceecCCC-CChhhcCCccchHHHHHHHHhcC
Confidence 44677777899999999999999999999999999999999999765 89999999999988665444433
No 106
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=93.74 E-value=0.51 Score=34.42 Aligned_cols=51 Identities=14% Similarity=0.272 Sum_probs=41.1
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
...+++++..|+.+|.+.+ +++++.-.+..||..+.. +.-+++||.|.+..
T Consensus 15 ~~~~~~~~~~tv~~ll~~l----~~~~~~v~v~vNg~iv~~------~~~l~~gD~Veii~ 65 (70)
T PRK08364 15 EKEIEWRKGMKVADILRAV----GFNTESAIAKVNGKVALE------DDPVKDGDYVEVIP 65 (70)
T ss_pred ceEEEcCCCCcHHHHHHHc----CCCCccEEEEECCEECCC------CcCcCCCCEEEEEc
Confidence 5678899999999998765 777777777899999853 34599999998875
No 107
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=93.54 E-value=0.54 Score=33.49 Aligned_cols=61 Identities=16% Similarity=0.371 Sum_probs=45.3
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+. +++++..||.++-.. .+++...-.+.+||..+.. ..-.+.=+++||.|.+..
T Consensus 1 m~i~v---NG~~--~~~~~~~tl~~lL~~----l~~~~~~vav~vNg~iv~r--~~~~~~~l~~gD~vei~~ 61 (66)
T PRK05659 1 MNIQL---NGEP--RELPDGESVAALLAR----EGLAGRRVAVEVNGEIVPR--SQHASTALREGDVVEIVH 61 (66)
T ss_pred CEEEE---CCeE--EEcCCCCCHHHHHHh----cCCCCCeEEEEECCeEeCH--HHcCcccCCCCCEEEEEE
Confidence 66666 5664 466778898887654 6888888888999998863 234445699999998875
No 108
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=93.49 E-value=0.58 Score=42.51 Aligned_cols=75 Identities=17% Similarity=0.275 Sum_probs=54.3
Q ss_pred EEEEEEEEcC--EEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEE-eEEE
Q psy8018 149 MLYINCRVNG--HPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLT-TSFS 225 (280)
Q Consensus 149 ~l~i~~~InG--~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~-~~~~ 225 (280)
..++++.|+. +++.+++|||+..+.+. .+ .+.+|-| ..+.|.+....+.+|+..++ ..|.
T Consensus 2 ~Y~~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~-~~~Y~~g-~~~~G~~~~D~v~~g~~~~~~~~fg 64 (295)
T cd05474 2 YYSAELSVGTPPQKVTVLLDTGSSDLWVP---------------DF-SISYGDG-TSASGTWGTDTVSIGGATVKNLQFA 64 (295)
T ss_pred eEEEEEEECCCCcEEEEEEeCCCCcceee---------------ee-EEEeccC-CcEEEEEEEEEEEECCeEecceEEE
Confidence 4678999977 88999999999999888 11 1234433 35678888889999987654 4566
Q ss_pred Eec-CCCcccccchhh
Q psy8018 226 VLE-DQPMDMLLGLDM 240 (280)
Q Consensus 226 Vl~-~~~~d~ILG~D~ 240 (280)
+.. ....|+|||+-+
T Consensus 65 ~~~~~~~~~GilGLg~ 80 (295)
T cd05474 65 VANSTSSDVGVLGIGL 80 (295)
T ss_pred EEecCCCCcceeeECC
Confidence 655 346789999764
No 109
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=93.46 E-value=0.4 Score=43.57 Aligned_cols=89 Identities=17% Similarity=0.219 Sum_probs=54.6
Q ss_pred EEEEEEcC--EEEEEEEcCCCcceecCHHHHHHcCCc--ccccc-----------ceeeeeecCceeeEEeEEEEEEEEE
Q psy8018 151 YINCRVNG--HPVKAFIDSGAQTTIMSAACAARVNIT--RLIDT-----------RWAGIAKGVGVQRIIGRIHMVQIAI 215 (280)
Q Consensus 151 ~i~~~InG--~~v~alVDTGAq~tiis~~~A~rlgL~--~~~d~-----------r~~g~a~gvG~~~~~g~i~~~~i~I 215 (280)
+++++|+. +++.++||||+..+.+..+-+..|... ...+. ....+.+|-| ....|.+....+.+
T Consensus 2 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~i~Y~~G-~~~~G~~~~D~v~i 80 (278)
T cd06097 2 LTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWSISYGDG-SSASGIVYTDTVSI 80 (278)
T ss_pred eeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEEEEeCCC-CeEEEEEEEEEEEE
Confidence 67889987 889999999999998854322222211 00110 0011234444 24678888889999
Q ss_pred CCeEEE-eEEEEec--------CCCcccccchhh
Q psy8018 216 ENDFLT-TSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 216 G~~~~~-~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
|+..++ ..|.+.. ....|+||||-+
T Consensus 81 g~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 114 (278)
T cd06097 81 GGVEVPNQAIELATAVSASFFSDTASDGLLGLAF 114 (278)
T ss_pred CCEEECCeEEEEEeecCccccccccccceeeecc
Confidence 986554 3344332 236899999964
No 110
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=93.40 E-value=0.55 Score=43.79 Aligned_cols=91 Identities=16% Similarity=0.276 Sum_probs=57.7
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecCH-HHHH---HcCCccccccc----e------eeeeecCceeeEEeEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMSA-ACAA---RVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMV 211 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis~-~~A~---rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~ 211 (280)
...+++++|. +++++++||||+..+-+.. .|.. .|.-....+.. + -.+..|-| .+.|.+...
T Consensus 7 ~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~~~Yg~g--~~~G~~~~D 84 (326)
T cd05487 7 TQYYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFTIHYASG--TVKGFLSQD 84 (326)
T ss_pred CeEEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEEEEeCCc--eEEEEEeee
Confidence 4567899997 8899999999999998853 3432 34322222211 0 11234545 368888888
Q ss_pred EEEECCeEEEeEEEEecC--------CCcccccchhh
Q psy8018 212 QIAIENDFLTTSFSVLED--------QPMDMLLGLDM 240 (280)
Q Consensus 212 ~i~IG~~~~~~~~~Vl~~--------~~~d~ILG~D~ 240 (280)
.+.+|+....-.|.++.. ...|+|||+-+
T Consensus 85 ~v~~g~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 121 (326)
T cd05487 85 IVTVGGIPVTQMFGEVTALPAIPFMLAKFDGVLGMGY 121 (326)
T ss_pred EEEECCEEeeEEEEEEEeccCCccceeecceEEecCC
Confidence 999998655433333321 25789999875
No 111
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=93.10 E-value=0.65 Score=33.63 Aligned_cols=61 Identities=15% Similarity=0.285 Sum_probs=45.5
Q ss_pred CEEEEEcCCCcEEEEEeCCC-ccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSED-LELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~-~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+.+ ++++. .||.+|-+. .++++..-.+-+||..++. ..-+++-+++||.|.+..
T Consensus 1 m~I~v---NG~~~--~~~~~~~tv~~lL~~----l~~~~~~vav~vN~~iv~r--~~w~~~~L~~gD~iEIv~ 62 (67)
T PRK07696 1 MNLKI---NGNQI--EVPESVKTVAELLTH----LELDNKIVVVERNKDILQK--DDHTDTSVFDGDQIEIVT 62 (67)
T ss_pred CEEEE---CCEEE--EcCCCcccHHHHHHH----cCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEE
Confidence 66666 66655 45554 578777653 5788887888899999973 456777899999999876
No 112
>PRK08053 sulfur carrier protein ThiS; Provisional
Probab=92.55 E-value=1 Score=32.27 Aligned_cols=61 Identities=13% Similarity=0.206 Sum_probs=44.5
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+.+ ++++..|+.+|.+. .+++...-.+-.||+.+.. ..-++.-+++||.|.+..
T Consensus 1 m~i~v---Ng~~~--~~~~~~tl~~ll~~----l~~~~~~vaVavN~~iv~r--~~w~~~~L~~gD~Ieii~ 61 (66)
T PRK08053 1 MQILF---NDQPM--QCAAGQTVHELLEQ----LNQLQPGAALAINQQIIPR--EQWAQHIVQDGDQILLFQ 61 (66)
T ss_pred CEEEE---CCeEE--EcCCCCCHHHHHHH----cCCCCCcEEEEECCEEeCh--HHcCccccCCCCEEEEEE
Confidence 67776 56655 45677899888765 4566566777899999963 345556799999999876
No 113
>PF15044 CLU_N: Mitochondrial function, CLU-N-term
Probab=92.54 E-value=0.24 Score=36.99 Aligned_cols=56 Identities=20% Similarity=0.245 Sum_probs=45.9
Q ss_pred eCCCccHHHHHHHHHHHhC-CCCcceEEEECCeeeCCCcCCcccc-CCCCCCEEEEEEe
Q psy8018 17 VSEDLELENFKAFCEVQSG-ISAQEIAIEFQGNALVDNKKTLKDY-NIQDGDLVLLKRV 73 (280)
Q Consensus 17 V~~~~tV~~LK~~Ie~e~g-ip~~~Q~Li~~Gk~L~d~~~tL~~~-gI~~gd~l~l~~~ 73 (280)
|+++++|.++++.+..... ..-....|.++|+.|.+. ..|++. |+++|..|.|+.+
T Consensus 1 v~~~d~v~dvrq~L~~~~~t~~~Tn~~L~~~g~~L~~~-~el~~i~~~~~~~~L~lve~ 58 (76)
T PF15044_consen 1 VSPTDTVQDVRQVLAESPETCYLTNFSLEHNGQRLDDF-VELSEIEGIKDGCVLELVEE 58 (76)
T ss_pred CChhhHHHHHHHHHHhCccccceeEEEEEECCCccCCc-hhhhhhhCCCCCcEEEEEec
Confidence 6789999999998876643 456778899999999764 678777 6999999999875
No 114
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=92.53 E-value=0.84 Score=42.31 Aligned_cols=100 Identities=15% Similarity=0.130 Sum_probs=59.7
Q ss_pred EEEEcCEEE------EEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceee----EEeEEEEEEEEECCeEEE-
Q psy8018 153 NCRVNGHPV------KAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQR----IIGRIHMVQIAIENDFLT- 221 (280)
Q Consensus 153 ~~~InG~~v------~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~----~~g~i~~~~i~IG~~~~~- 221 (280)
.++|||..+ .++||||++.+.++.+..+++--. +.... ...|.-. .........+.+++..+.
T Consensus 193 ~v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l~~~--~~~~~----~~~~~~~~~C~~~~~~P~~~f~f~g~~~~i 266 (317)
T cd05478 193 SVTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANIQSD--IGASQ----NQNGEMVVNCSSISSMPDVVFTINGVQYPL 266 (317)
T ss_pred EEEECCEEEccCCCCEEEECCCchhhhCCHHHHHHHHHH--hCCcc----ccCCcEEeCCcCcccCCcEEEEECCEEEEE
Confidence 357788765 589999999999999988765211 11000 0011100 001122234556665443
Q ss_pred ------------eEEEEecC-CCcccccchhhhhcceEEEEecccEEEee
Q psy8018 222 ------------TSFSVLED-QPMDMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 222 ------------~~~~Vl~~-~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
|-+.+... .....|||..||+.+-.+.|+.+++|-|.
T Consensus 267 ~~~~y~~~~~~~C~~~~~~~~~~~~~IlG~~fl~~~y~vfD~~~~~iG~A 316 (317)
T cd05478 267 PPSAYILQDQGSCTSGFQSMGLGELWILGDVFIRQYYSVFDRANNKVGLA 316 (317)
T ss_pred CHHHheecCCCEEeEEEEeCCCCCeEEechHHhcceEEEEeCCCCEEeec
Confidence 32222221 12458999999999999999999998774
No 115
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=92.30 E-value=1.2 Score=41.17 Aligned_cols=100 Identities=13% Similarity=0.174 Sum_probs=58.5
Q ss_pred EEEcCEEE-------EEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeE----EeEEEEEEEEECCeEEE-
Q psy8018 154 CRVNGHPV-------KAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRI----IGRIHMVQIAIENDFLT- 221 (280)
Q Consensus 154 ~~InG~~v-------~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~----~g~i~~~~i~IG~~~~~- 221 (280)
+.|||... .++||||.+.+.++.+..+++-- .+.... ...|.-.. ........+.+++..++
T Consensus 188 i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l~~--~~~~~~----~~~~~~~~~C~~~~~~p~l~~~f~g~~~~v 261 (318)
T cd05477 188 FQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTLMQ--SIGAQQ----DQYGQYVVNCNNIQNLPTLTFTINGVSFPL 261 (318)
T ss_pred EEECCEEecccCCCceeeECCCCccEECCHHHHHHHHH--HhCCcc----ccCCCEEEeCCccccCCcEEEEECCEEEEE
Confidence 46777643 58999999999999988877511 011110 00010000 00111234555554333
Q ss_pred ------------eEEEEec------CCCcccccchhhhhcceEEEEecccEEEeee
Q psy8018 222 ------------TSFSVLE------DQPMDMLLGLDMLKRHQVQIAIENDFLTTSF 259 (280)
Q Consensus 222 ------------~~~~Vl~------~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 259 (280)
|-+.+.+ ......|||..|||.+-.+.|+.++++-|..
T Consensus 262 ~~~~y~~~~~~~C~~~i~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~ig~a~ 317 (318)
T cd05477 262 PPSAYILQNNGYCTVGIEPTYLPSQNGQPLWILGDVFLRQYYSVYDLGNNQVGFAT 317 (318)
T ss_pred CHHHeEecCCCeEEEEEEecccCCCCCCceEEEcHHHhhheEEEEeCCCCEEeeee
Confidence 3233321 1123589999999999999999999988753
No 116
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=92.13 E-value=0.47 Score=42.85 Aligned_cols=76 Identities=16% Similarity=0.270 Sum_probs=52.9
Q ss_pred EEEEEEEc--CEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCe--EEE-eEE
Q psy8018 150 LYINCRVN--GHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIEND--FLT-TSF 224 (280)
Q Consensus 150 l~i~~~In--G~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~--~~~-~~~ 224 (280)
.+++++|+ .+++.+++|||++.+-+.. | .+ ...+|- +....|.+....+.+|+. .++ ..|
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~-----~--------~~-~~~Y~d-g~~~~G~~~~D~v~~g~~~~~~~~~~F 66 (265)
T cd05476 2 YLVTLSIGTPPQPFSLIVDTGSDLTWTQC-----C--------SY-EYSYGD-GSSTSGVLATETFTFGDSSVSVPNVAF 66 (265)
T ss_pred eEEEEecCCCCcceEEEecCCCCCEEEcC-----C--------ce-EeEeCC-CceeeeeEEEEEEEecCCCCccCCEEE
Confidence 46888887 6789999999999888752 2 22 123332 345688888889999986 433 456
Q ss_pred EEecCC------Ccccccchhh
Q psy8018 225 SVLEDQ------PMDMLLGLDM 240 (280)
Q Consensus 225 ~Vl~~~------~~d~ILG~D~ 240 (280)
.+.... ..|+||||.+
T Consensus 67 g~~~~~~~~~~~~~~GIlGLg~ 88 (265)
T cd05476 67 GCGTDNEGGSFGGADGILGLGR 88 (265)
T ss_pred EecccccCCccCCCCEEEECCC
Confidence 555432 5899999876
No 117
>PRK07440 hypothetical protein; Provisional
Probab=92.04 E-value=0.98 Score=33.04 Aligned_cols=56 Identities=21% Similarity=0.349 Sum_probs=43.9
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+. +++++..||.+|-. +.++++..-.+-+||..+.. ..-++.-+++||.|.+..
T Consensus 10 NG~~--~~~~~~~tl~~lL~----~l~~~~~~vav~~N~~iv~r--~~w~~~~L~~gD~IEIv~ 65 (70)
T PRK07440 10 NGET--RTCSSGTSLPDLLQ----QLGFNPRLVAVEYNGEILHR--QFWEQTQVQPGDRLEIVT 65 (70)
T ss_pred CCEE--EEcCCCCCHHHHHH----HcCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEE
Confidence 6664 56778889988765 45788888888999999973 356667799999999876
No 118
>PLN02799 Molybdopterin synthase sulfur carrier subunit
Probab=92.02 E-value=0.94 Score=33.69 Aligned_cols=66 Identities=11% Similarity=0.135 Sum_probs=45.1
Q ss_pred CEEEEEc------CCC-cEEEEEeCCCccHHHHHHHHHHHh-CCCC--cceEEEECCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 1 MKVTVTS------LDN-ECFLLDVSEDLELENFKAFCEVQS-GISA--QEIAIEFQGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 1 M~ItV~~------~~g-~~~~leV~~~~tV~~LK~~Ie~e~-gip~--~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
|+|+|+. ..| ...++++++..|+.+|++.+.... ++.. ..-.+..||+...++ +=+++||.|.+
T Consensus 2 m~i~V~~fa~~re~~g~~~~~~~~~~~~tv~~L~~~l~~~~p~l~~~~~~~~vavN~~~v~~~------~~l~dgDeVai 75 (82)
T PLN02799 2 VEIKVLFFARARELTGVSDMTLELPAGSTTADCLAELVAKFPSLEEVRSCCVLALNEEYTTES------AALKDGDELAI 75 (82)
T ss_pred eEEEEEehHHHHHHhCCCeEEEECCCCCcHHHHHHHHHHHChhHHHHhhCcEEEECCEEcCCC------cCcCCCCEEEE
Confidence 7888872 234 456788889999999999987664 1111 112356788876432 35899999998
Q ss_pred EE
Q psy8018 71 KR 72 (280)
Q Consensus 71 ~~ 72 (280)
..
T Consensus 76 ~P 77 (82)
T PLN02799 76 IP 77 (82)
T ss_pred eC
Confidence 65
No 119
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=91.97 E-value=1.4 Score=39.95 Aligned_cols=95 Identities=13% Similarity=0.187 Sum_probs=55.0
Q ss_pred EEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeE--EeEEEEEEEEECCeEEE-----------------
Q psy8018 161 VKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRI--IGRIHMVQIAIENDFLT----------------- 221 (280)
Q Consensus 161 v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~--~g~i~~~~i~IG~~~~~----------------- 221 (280)
..++||||++.+.++.+..+++-- .+...+... .+...... ... ....+.+++..+.
T Consensus 179 ~~~iiDSGt~~~~lP~~~~~~l~~--~~~~~~~~~-~~~~~~~C~~~~~-p~i~f~f~g~~~~i~~~~~~~~~~~~~~~~ 254 (295)
T cd05474 179 LPALLDSGTTLTYLPSDIVDAIAK--QLGATYDSD-EGLYVVDCDAKDD-GSLTFNFGGATISVPLSDLVLPASTDDGGD 254 (295)
T ss_pred ccEEECCCCccEeCCHHHHHHHHH--HhCCEEcCC-CcEEEEeCCCCCC-CEEEEEECCeEEEEEHHHhEeccccCCCCC
Confidence 579999999999999998887511 011111000 00000000 000 2334555553322
Q ss_pred --eEEEEecCCCcccccchhhhhcceEEEEecccEEEeee
Q psy8018 222 --TSFSVLEDQPMDMLLGLDMLKRHQVQIAIENDFLTTSF 259 (280)
Q Consensus 222 --~~~~Vl~~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 259 (280)
|-+.+.+...-..|||..||+.+-.+.|+.++++-|..
T Consensus 255 ~~C~~~i~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~a~ 294 (295)
T cd05474 255 GACYLGIQPSTSDYNILGDTFLRSAYVVYDLDNNEISLAQ 294 (295)
T ss_pred CCeEEEEEeCCCCcEEeChHHhhcEEEEEECCCCEEEeec
Confidence 32333332224589999999999999999999998754
No 120
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=91.96 E-value=1.1 Score=41.42 Aligned_cols=90 Identities=16% Similarity=0.322 Sum_probs=57.5
Q ss_pred EEEEEEEEc--CEEEEEEEcCCCcceecCH-HHHH-HcCCccccccc----e------eeeeecCceeeEEeEEEEEEEE
Q psy8018 149 MLYINCRVN--GHPVKAFIDSGAQTTIMSA-ACAA-RVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQIA 214 (280)
Q Consensus 149 ~l~i~~~In--G~~v~alVDTGAq~tiis~-~~A~-rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~i~ 214 (280)
..++++.|. ++++.++||||+..+-+.. .|.. .|+-....+.. + ..+.+|-|+ ..|.+....|.
T Consensus 3 ~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~~~Yg~Gs--~~G~~~~D~i~ 80 (318)
T cd05477 3 SYYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFSLQYGSGS--LTGIFGYDTVT 80 (318)
T ss_pred EEEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEEEEECCcE--EEEEEEeeEEE
Confidence 467888997 4889999999999988853 2321 34322122111 0 113345554 57888888999
Q ss_pred ECCeEEE-eEEEEecC--------CCcccccchhh
Q psy8018 215 IENDFLT-TSFSVLED--------QPMDMLLGLDM 240 (280)
Q Consensus 215 IG~~~~~-~~~~Vl~~--------~~~d~ILG~D~ 240 (280)
+|+..++ ..|.+... ...|+|||+-+
T Consensus 81 ~g~~~i~~~~Fg~~~~~~~~~~~~~~~~GilGLg~ 115 (318)
T cd05477 81 VQGIIITNQEFGLSETEPGTNFVYAQFDGILGLAY 115 (318)
T ss_pred ECCEEEcCEEEEEEEecccccccccceeeEeecCc
Confidence 9987665 45665542 23689999974
No 121
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=91.94 E-value=0.98 Score=41.98 Aligned_cols=91 Identities=15% Similarity=0.260 Sum_probs=57.8
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecC-HHHH--HHcCCccccccc----e------eeeeecCceeeEEeEEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMS-AACA--ARVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQ 212 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis-~~~A--~rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~ 212 (280)
...+++++|+ ++++++++|||+..+.+. ..|. ..|......+.. + ..+..|-|+ +.|.+....
T Consensus 9 ~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~D~ 86 (317)
T cd06098 9 AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTGS--ISGFFSQDS 86 (317)
T ss_pred CEEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCce--EEEEEEeeE
Confidence 4678899997 788999999999888774 4442 244432222111 0 113355554 578888889
Q ss_pred EEECCeEEE-eEEEEec--------CCCcccccchhh
Q psy8018 213 IAIENDFLT-TSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 213 i~IG~~~~~-~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
+.+|+..++ ..|.+.. ....|+||||-+
T Consensus 87 v~ig~~~v~~~~f~~~~~~~~~~~~~~~~dGilGLg~ 123 (317)
T cd06098 87 VTVGDLVVKNQVFIEATKEPGLTFLLAKFDGILGLGF 123 (317)
T ss_pred EEECCEEECCEEEEEEEecCCccccccccceeccccc
Confidence 999986554 3444332 125799999975
No 122
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=91.65 E-value=0.93 Score=34.54 Aligned_cols=56 Identities=11% Similarity=0.163 Sum_probs=43.3
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+.+ ++++..||.+|-+. .++++..-.+-+||..+.- ..-+++-+++||.|.+..
T Consensus 24 NG~~~--~~~~~~tl~~LL~~----l~~~~~~vAVevNg~iVpr--~~w~~t~L~egD~IEIv~ 79 (84)
T PRK06083 24 NDQSI--QVDISSSLAQIIAQ----LSLPELGCVFAINNQVVPR--SEWQSTVLSSGDAISLFQ 79 (84)
T ss_pred CCeEE--EcCCCCcHHHHHHH----cCCCCceEEEEECCEEeCH--HHcCcccCCCCCEEEEEE
Confidence 56544 55778888877664 4788877788899999963 467778899999999876
No 123
>PF09379 FERM_N: FERM N-terminal domain ; InterPro: IPR018979 This domain is the N-terminal ubiquitin-like structural domain of the FERM domain. The FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane []. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus of the majority of FERM-containing proteins [, ], which includes: Band 4.1, which links the spectrin-actin cytoskeleton of erythrocytes to the plasma membrane. Ezrin, a component of the undercoat of the microvilli plasma membrane. Moesin, which is probably involved in binding major cytoskeletal structures to the plasma membrane. Radixin, which is involved in the binding of the barbed end of actin filaments to the plasma membrane in the undercoat of the cell- to-cell adherens junction. Talin, a cytoskeletal protein concentrated in regions of cell-substratum contact and, in lymphocytes, of cell-cell contacts. Filopodin, a slime mold protein that binds actin and which is involved in the control of cell motility and chemotaxis. Merlin (or schwannomin). Protein NBL4. Unconventional myosins X, VIIa and XV, which are mutated in congenital deafness. Focal-adhesion kinases (FAKs), cytoplasmic protein tyrosine kinases involved in signalling through integrins. Janus tyrosine kinases (JAKs), cytoplasmic tyrosine kinases that are non-covalently associated with the cytoplasmic tails of receptors for cytokines or polypeptidic hormones. Non-receptor tyrosine-protein kinase TYK2. Protein-tyrosine phosphatases PTPN3 and PTPN4, enzyme that appear to act at junctions between the membrane and the cytoskeleton. Protein-tyrosine phosphatases PTPN14 and PTP-D1, PTP-RL10 and PTP2E. Caenorhabditis elegans protein phosphatase ptp-1. Ezrin, moesin, and radixin are highly related proteins (ERM protein family), but the other proteins in which the FERM domain is found do not share any region of similarity outside of this domain. ERM proteins are made of three domains, the FERM domain, a central helical domain and a C-terminal tail domain, which binds F-actin. The amino-acid sequence of the FERM domain is highly conserved among ERM proteins and is responsible for membrane association by direct binding to the cytoplasmic domain or tail of integral membrane proteins. ERM proteins are regulated by an intramolecular association of the FERM and C-terminal tail domains that masks their binding sites for other molecules. For cytoskeleton-membrane cross-linking, the dormant molecules becomes activated and the FERM domain attaches to the membrane by binding specific membrane proteins, while the last 34 residues of the tail bind actin filaments. Aside from binding to membranes, the activated FERM domain of ERM proteins can also bind the guanine nucleotide dissociation inhibitor of Rho GTPase (RhoDGI), which suggests that in addition to functioning as a cross-linker, ERM proteins may influence Rho signalling pathways. The crystal structure of the FERM domain reveals that it is composed of three structural modules (F1, F2, and F3) that together form a compact clover-shaped structure []. The FERM domain has also been called the amino-terminal domain, the 30kDa domain, 4.1N30, the membrane-cytoskeletal-linking domain, the ERM-like domain, the ezrin-like domain of the band 4.1 superfamily, the conserved N-terminal region, and the membrane attachment domain [].; PDB: 1EF1_B 1SGH_A 1E5W_A 2KC2_A 2KMA_A 3IVF_A 1H4R_B 3U8Z_A 1ISN_A 3BIN_A ....
Probab=91.29 E-value=1.7 Score=31.73 Aligned_cols=67 Identities=15% Similarity=0.134 Sum_probs=50.5
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC-cceEEEE----CCe--eeCCCcCCccccCCC--CCCEEEEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISA-QEIAIEF----QGN--ALVDNKKTLKDYNIQ--DGDLVLLKR 72 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~-~~Q~Li~----~Gk--~L~d~~~tL~~~gI~--~gd~l~l~~ 72 (280)
|..++|...+++|+++.|+.+|-..|....|+.- +-.-|.| +|. -|.. +++|.++... ...++++..
T Consensus 1 V~llD~~~~~~~v~~~~t~~~l~~~v~~~l~l~e~~~FgL~~~~~~~~~~~wL~~-~k~l~~q~~~~~~~~~l~frv 76 (80)
T PF09379_consen 1 VRLLDGTTKTFEVDPKTTGQDLLEQVCDKLGLKEKEYFGLQYQVDKDGEHHWLDL-DKKLKKQLKKNNPPFTLYFRV 76 (80)
T ss_dssp EEESSEEEEEEEEETTSBHHHHHHHHHHHHTTSSGGGEEEEE-EBTTSSEEEE-S-SSBGGGSTBTSSSSEEEEEEE
T ss_pred CCCcCCCcEEEEEcCCCcHHHHHHHHHHHcCCCCccEEEEEEeecCCCcceeccC-cccHHHHcCCCCCCEEEEEEE
Confidence 5678999999999999999999999999999964 4556777 332 2444 4899999887 444455544
No 124
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=91.22 E-value=0.28 Score=40.88 Aligned_cols=46 Identities=17% Similarity=0.338 Sum_probs=31.5
Q ss_pred EEEEEECCeEEEeEEEEecCC--CcccccchhhhhcceEEEEecccEE
Q psy8018 210 MVQIAIENDFLTTSFSVLEDQ--PMDMLLGLDMLKRHQVQIAIENDFL 255 (280)
Q Consensus 210 ~~~i~IG~~~~~~~~~Vl~~~--~~d~ILG~D~L~~~~~~ID~~~~~L 255 (280)
...+.+|+..++..|+..+-. .+++|||..-+.+.++.+|..+..+
T Consensus 87 ~~~~~lg~~~~~~e~tL~dR~~m~yp~LlGrR~~l~~~~lVD~s~~~~ 134 (138)
T PF05618_consen 87 ETTLCLGGKTWKIEFTLTDRSNMKYPMLLGRRNFLRGRFLVDVSRSFL 134 (138)
T ss_dssp EEEEEETTEEEEEEEEEE-S--SS-SEEE-HHHHHHTTEEEETT---T
T ss_pred EEEEEECCEEEEEEEEEcCCCcCcCCEEEEehHHhcCCEEECCChhhc
Confidence 348999999999999988733 5789999666667788888876543
No 125
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=91.22 E-value=0.7 Score=43.14 Aligned_cols=90 Identities=14% Similarity=0.072 Sum_probs=51.3
Q ss_pred EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCc-eeeEEeEEEEE--EEEECCeEEEeEEEEecCCCccccc
Q psy8018 160 PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVG-VQRIIGRIHMV--QIAIENDFLTTSFSVLEDQPMDMLL 236 (280)
Q Consensus 160 ~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG-~~~~~g~i~~~--~i~IG~~~~~~~~~Vl~~~~~d~IL 236 (280)
...|+||||.+.+.++.+..+++-=. + .. +....+ +.+.. +... -+...+. .|-+.... ..--.||
T Consensus 231 ~~~aivDSGTs~~~lp~~~~~~l~~~--~-P~---i~~~f~~g~~~~--i~p~~y~~~~~~~--~c~~~~~~-~~~~~IL 299 (326)
T cd06096 231 GLGMLVDSGSTLSHFPEDLYNKINNF--F-PT---ITIIFENNLKID--WKPSSYLYKKESF--WCKGGEKS-VSNKPIL 299 (326)
T ss_pred CCCEEEeCCCCcccCCHHHHHHHHhh--c-Cc---EEEEEcCCcEEE--ECHHHhccccCCc--eEEEEEec-CCCceEE
Confidence 34589999999999999998876311 1 01 111111 11100 0000 0111111 12222222 2234799
Q ss_pred chhhhhcceEEEEecccEEEeeee
Q psy8018 237 GLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 237 G~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
|-.|||.+-.+.|+.+++|-|...
T Consensus 300 G~~flr~~y~vFD~~~~riGfa~~ 323 (326)
T cd06096 300 GASFFKNKQIIFDLDNNRIGFVES 323 (326)
T ss_pred ChHHhcCcEEEEECcCCEEeeEcC
Confidence 999999999999999999988654
No 126
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=91.04 E-value=0.95 Score=41.95 Aligned_cols=88 Identities=18% Similarity=0.341 Sum_probs=54.1
Q ss_pred EEEEEEc--CEEEEEEEcCCCcceecC-HHHHH-HcCCccccccc----e------eeeeecCceeeEEeEEEEEEEEEC
Q psy8018 151 YINCRVN--GHPVKAFIDSGAQTTIMS-AACAA-RVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQIAIE 216 (280)
Q Consensus 151 ~i~~~In--G~~v~alVDTGAq~tiis-~~~A~-rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~i~IG 216 (280)
|++++|. ++++++++|||+..+-+. ..|.. .|.-....+.. + -.+.+|-|. ..|.+....+.||
T Consensus 2 ~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~g~--~~G~~~~D~v~ig 79 (316)
T cd05486 2 FGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAFSIQYGTGS--LTGIIGIDQVTVE 79 (316)
T ss_pred eEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEEEEEeCCcE--EEEEeeecEEEEC
Confidence 5778886 678999999999988774 44431 34322111110 0 113355553 5788888899999
Q ss_pred CeEEE-eEEEEe--------cCCCcccccchhh
Q psy8018 217 NDFLT-TSFSVL--------EDQPMDMLLGLDM 240 (280)
Q Consensus 217 ~~~~~-~~~~Vl--------~~~~~d~ILG~D~ 240 (280)
+..++ ..|.+. .+...|+||||-+
T Consensus 80 ~~~~~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 112 (316)
T cd05486 80 GITVQNQQFAESVSEPGSTFQDSEFDGILGLAY 112 (316)
T ss_pred CEEEcCEEEEEeeccCcccccccccceEeccCc
Confidence 86543 344432 1236799999964
No 127
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=91.01 E-value=1.6 Score=40.52 Aligned_cols=91 Identities=18% Similarity=0.321 Sum_probs=56.4
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecCH-HHHH-HcCCcccccc----ce------eeeeecCceeeEEeEEEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMSA-ACAA-RVNITRLIDT----RW------AGIAKGVGVQRIIGRIHMVQI 213 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis~-~~A~-rlgL~~~~d~----r~------~g~a~gvG~~~~~g~i~~~~i 213 (280)
...++++.|. .+++.++||||+..+.+.. .|.. .|.-....+. .+ ..+..|-|+ ..|.+....+
T Consensus 9 ~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~~C~~~~c~~~~~f~~~~Sst~~~~~~~~~~~yg~gs--~~G~~~~D~v 86 (317)
T cd05478 9 MEYYGTISIGTPPQDFTVIFDTGSSNLWVPSVYCSSQACSNHNRFNPRQSSTYQSTGQPLSIQYGTGS--MTGILGYDTV 86 (317)
T ss_pred CEEEEEEEeCCCCcEEEEEEeCCCccEEEecCCCCcccccccCcCCCCCCcceeeCCcEEEEEECCce--EEEEEeeeEE
Confidence 5678899996 6789999999999998853 3321 2321111111 01 113345554 5788888899
Q ss_pred EECCeEEE-eEEEEecC--------CCcccccchhh
Q psy8018 214 AIENDFLT-TSFSVLED--------QPMDMLLGLDM 240 (280)
Q Consensus 214 ~IG~~~~~-~~~~Vl~~--------~~~d~ILG~D~ 240 (280)
.+|+..++ ..|.+... ...|+|||+.+
T Consensus 87 ~ig~~~i~~~~fg~~~~~~~~~~~~~~~dGilGLg~ 122 (317)
T cd05478 87 QVGGISDTNQIFGLSETEPGSFFYYAPFDGILGLAY 122 (317)
T ss_pred EECCEEECCEEEEEEEecCccccccccccceeeecc
Confidence 99986554 34444331 14789999874
No 128
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=90.99 E-value=1.9 Score=40.38 Aligned_cols=26 Identities=12% Similarity=0.135 Sum_probs=23.6
Q ss_pred ccccchhhhhcceEEEEecccEEEee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
..|||..||+.+-.+.|++++++-|.
T Consensus 303 ~~IlG~~fl~~~y~vFD~~~~~ig~a 328 (329)
T cd05485 303 LWILGDVFIGKYYTEFDLGNNRVGFA 328 (329)
T ss_pred eEEEchHHhccceEEEeCCCCEEeec
Confidence 47999999999999999999998764
No 129
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=90.97 E-value=1.4 Score=40.89 Aligned_cols=91 Identities=16% Similarity=0.292 Sum_probs=57.4
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecC-HHHH-HHcCCccccccc----e------eeeeecCceeeEEeEEEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMS-AACA-ARVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQI 213 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis-~~~A-~rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~i 213 (280)
...++++.|+ ++++.+++|||++.+.+. ..|. ..|......+.. + ..+..|-|+ ..|.+....+
T Consensus 9 ~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~~~~y~~g~--~~G~~~~D~v 86 (320)
T cd05488 9 AQYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEFKIQYGSGS--LEGFVSQDTL 86 (320)
T ss_pred CEEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEEEEEECCce--EEEEEEEeEE
Confidence 4578999997 589999999999988884 3342 134332221110 0 113345453 5788888899
Q ss_pred EECCeEEE-eEEEEec--------CCCcccccchhh
Q psy8018 214 AIENDFLT-TSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 214 ~IG~~~~~-~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
.+|+..++ ..|.... ....|+||||.+
T Consensus 87 ~ig~~~~~~~~f~~a~~~~g~~~~~~~~dGilGLg~ 122 (320)
T cd05488 87 SIGDLTIKKQDFAEATSEPGLAFAFGKFDGILGLAY 122 (320)
T ss_pred EECCEEECCEEEEEEecCCCcceeeeeeceEEecCC
Confidence 99987654 3343332 124689999985
No 130
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=90.86 E-value=1.2 Score=41.45 Aligned_cols=91 Identities=14% Similarity=0.259 Sum_probs=57.3
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecC-HHHH---HHcCCccccccc----e------eeeeecCceeeEEeEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMS-AACA---ARVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMV 211 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis-~~~A---~rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~ 211 (280)
...++++.|. ++++.+++|||++.+-+. ..|. ..|+.....+.. + ..+.+|-|+ ..|.+...
T Consensus 5 ~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G~--~~G~~~~D 82 (325)
T cd05490 5 AQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFAIQYGSGS--LSGYLSQD 82 (325)
T ss_pred CEEEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEEEEECCcE--EEEEEeee
Confidence 3567899996 588999999999988773 4443 245432222111 1 113355553 57999989
Q ss_pred EEEECCeEEE-eEEEEecC--------CCcccccchhh
Q psy8018 212 QIAIENDFLT-TSFSVLED--------QPMDMLLGLDM 240 (280)
Q Consensus 212 ~i~IG~~~~~-~~~~Vl~~--------~~~d~ILG~D~ 240 (280)
.+.+|+..++ ..|.+... ...|+||||-+
T Consensus 83 ~v~~g~~~~~~~~Fg~~~~~~~~~~~~~~~dGilGLg~ 120 (325)
T cd05490 83 TVSIGGLQVEGQLFGEAVKQPGITFIAAKFDGILGMAY 120 (325)
T ss_pred EEEECCEEEcCEEEEEEeeccCCcccceeeeEEEecCC
Confidence 9999987654 34443321 24689999864
No 131
>PF14836 Ubiquitin_3: Ubiquitin-like domain; PDB: 3JYU_A 4A3O_B 3PPA_A 3T9L_A 4A3P_A 3PV1_A.
Probab=90.85 E-value=1.6 Score=33.65 Aligned_cols=63 Identities=16% Similarity=0.150 Sum_probs=46.1
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE----CCee-eCCCcCCccccCCCCCCEEEEEEe
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF----QGNA-LVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~----~Gk~-L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
....+...+..+||..++..+...+.| .++-||-- ++.. |.+.+.||+++||.+|-+|.+-.+
T Consensus 13 ~~~~t~~FSk~DTI~~v~~~~rklf~i-~~E~RLW~~~~~~~~e~L~~~~~Tv~da~L~~gQ~vliE~r 80 (88)
T PF14836_consen 13 QSVLTKQFSKTDTIGFVEKEMRKLFNI-QEETRLWNKYSENSYELLNNPEITVEDAGLYDGQVVLIEER 80 (88)
T ss_dssp CEEEEEEE-TTSBHHHHHHHHHHHCT--TS-EEEEEECTTTCEEEE--TTSBTTTTT--TTEEEEEEE-
T ss_pred ccHhHhhccccChHHHHHHHHHHHhCC-CccceehhccCCcchhhhCCCCccHHHccCcCCCEEEEEee
Confidence 457888999999999999999999999 66678853 3444 666668999999999999999876
No 132
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=90.82 E-value=1.3 Score=41.25 Aligned_cols=95 Identities=12% Similarity=0.109 Sum_probs=56.3
Q ss_pred EEEEcCEE-------EEEEEcCCCcceecCHHHHHHcCCccccccce----eeeeecCceeeEEeEEEEEEEEEC--CeE
Q psy8018 153 NCRVNGHP-------VKAFIDSGAQTTIMSAACAARVNITRLIDTRW----AGIAKGVGVQRIIGRIHMVQIAIE--NDF 219 (280)
Q Consensus 153 ~~~InG~~-------v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~----~g~a~gvG~~~~~g~i~~~~i~IG--~~~ 219 (280)
.++|||.. ..|+||||.+.+.++.+.++++. ..+++.- .-+.-..|+ ..+.|. +..
T Consensus 196 ~i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~--~~~~C~~~~~~P~i~f~f~g---------~~~~l~~~~yi 264 (317)
T cd06098 196 DVLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQIN--SAVDCNSLSSMPNVSFTIGG---------KTFELTPEQYI 264 (317)
T ss_pred eEEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhh--ccCCccccccCCcEEEEECC---------EEEEEChHHeE
Confidence 35778875 35899999999999999988764 2223221 001111111 112221 111
Q ss_pred --------EEeEEEEe--c---CCCcccccchhhhhcceEEEEecccEEEee
Q psy8018 220 --------LTTSFSVL--E---DQPMDMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 220 --------~~~~~~Vl--~---~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
-.|-+.+. + ..+...|||-.|||.+-.+.|+.+++|-|.
T Consensus 265 ~~~~~~~~~~C~~~~~~~~~~~~~~~~~IlGd~Flr~~y~VfD~~~~~iGfA 316 (317)
T cd06098 265 LKVGEGAAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNLRVGFA 316 (317)
T ss_pred EeecCCCCCEEeceEEECCCCCCCCCeEEechHHhcccEEEEeCCCCEEeec
Confidence 12432222 1 112347999999999999999999998764
No 133
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=90.65 E-value=0.98 Score=41.13 Aligned_cols=99 Identities=13% Similarity=0.136 Sum_probs=58.8
Q ss_pred EEEcCE------EEEEEEcCCCcceecCHHHHHHc----CCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEE--
Q psy8018 154 CRVNGH------PVKAFIDSGAQTTIMSAACAARV----NITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLT-- 221 (280)
Q Consensus 154 ~~InG~------~v~alVDTGAq~tiis~~~A~rl----gL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~-- 221 (280)
+.+++. ...|++|||+....++.+..+++ +..... . ......... .......+.+|+..++
T Consensus 187 i~i~~~~~~~~~~~~~~~Dtgt~~i~lp~~~~~~i~~~l~~~~~~--~--~~~~~c~~~---~~~p~l~f~~~~~~~~i~ 259 (317)
T PF00026_consen 187 ISIGGESVFSSSGQQAILDTGTSYIYLPRSIFDAIIKALGGSYSD--G--VYSVPCNST---DSLPDLTFTFGGVTFTIP 259 (317)
T ss_dssp EEETTEEEEEEEEEEEEEETTBSSEEEEHHHHHHHHHHHTTEEEC--S--EEEEETTGG---GGSEEEEEEETTEEEEEE
T ss_pred ccccccccccccceeeecccccccccccchhhHHHHhhhcccccc--e--eEEEecccc---cccceEEEeeCCEEEEec
Confidence 455555 36799999999999999877765 221110 0 000000000 0012223444443221
Q ss_pred ---------------eEEEEec----CCCcccccchhhhhcceEEEEecccEEEeee
Q psy8018 222 ---------------TSFSVLE----DQPMDMLLGLDMLKRHQVQIAIENDFLTTSF 259 (280)
Q Consensus 222 ---------------~~~~Vl~----~~~~d~ILG~D~L~~~~~~ID~~~~~L~i~~ 259 (280)
|-+.+.. ......+||..||+++-.+.|+.++++-|..
T Consensus 260 ~~~~~~~~~~~~~~~C~~~i~~~~~~~~~~~~iLG~~fl~~~y~vfD~~~~~ig~A~ 316 (317)
T PF00026_consen 260 PSDYIFKIEDGNGGYCYLGIQPMDSSDDSDDWILGSPFLRNYYVVFDYENNRIGFAQ 316 (317)
T ss_dssp HHHHEEEESSTTSSEEEESEEEESSTTSSSEEEEEHHHHTTEEEEEETTTTEEEEEE
T ss_pred chHhcccccccccceeEeeeecccccccCCceEecHHHhhceEEEEeCCCCEEEEec
Confidence 4333333 3356699999999999999999999998763
No 134
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=90.31 E-value=1.5 Score=41.08 Aligned_cols=91 Identities=15% Similarity=0.281 Sum_probs=57.8
Q ss_pred eEEEEEEEEc--CEEEEEEEcCCCcceecC-HHHH---HHcCCcccccc----ce------eeeeecCceeeEEeEEEEE
Q psy8018 148 IMLYINCRVN--GHPVKAFIDSGAQTTIMS-AACA---ARVNITRLIDT----RW------AGIAKGVGVQRIIGRIHMV 211 (280)
Q Consensus 148 ~~l~i~~~In--G~~v~alVDTGAq~tiis-~~~A---~rlgL~~~~d~----r~------~g~a~gvG~~~~~g~i~~~ 211 (280)
...++++.|. ++++++++|||+..+-+. ..|. ..|+.....+. .+ -.+.+|-|+ ..|.+...
T Consensus 10 ~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~--~~G~~~~D 87 (329)
T cd05485 10 AQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGS--LSGFLSTD 87 (329)
T ss_pred CeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCce--EEEEEecC
Confidence 5678999998 588999999999988775 3332 13543222211 11 012345554 57888888
Q ss_pred EEEECCeEEE-eEEEEec--------CCCcccccchhh
Q psy8018 212 QIAIENDFLT-TSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 212 ~i~IG~~~~~-~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
.+.+|+..++ ..|.+.. ....|+||||-+
T Consensus 88 ~v~ig~~~~~~~~fg~~~~~~~~~~~~~~~~GilGLg~ 125 (329)
T cd05485 88 TVSVGGVSVKGQTFAEAINEPGLTFVAAKFDGILGMGY 125 (329)
T ss_pred cEEECCEEECCEEEEEEEecCCccccccccceEEEcCC
Confidence 9999987654 3444332 124689999875
No 135
>PF10790 DUF2604: Protein of Unknown function (DUF2604); InterPro: IPR019726 This entry represents bacterial proteins with undetermined function.
Probab=90.22 E-value=1.3 Score=32.15 Aligned_cols=65 Identities=17% Similarity=0.258 Sum_probs=53.0
Q ss_pred CCCcEEEEEeCCCccHHHHHHHHHHHh---CCCCcceEEE-ECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 8 LDNECFLLDVSEDLELENFKAFCEVQS---GISAQEIAIE-FQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 8 ~~g~~~~leV~~~~tV~~LK~~Ie~e~---gip~~~Q~Li-~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.+|+...+|..+...+--+.+.--+++ |-|+++..|- -+|..|.- ++.+++||+..|-+++|..+
T Consensus 3 VNGqPv~VEANvnaPLh~v~akALe~sgNvgQP~ENWElkDe~G~vlD~-~kKveD~GftngvkLFLsLK 71 (76)
T PF10790_consen 3 VNGQPVQVEANVNAPLHPVRAKALEQSGNVGQPPENWELKDESGQVLDV-NKKVEDFGFTNGVKLFLSLK 71 (76)
T ss_pred eCCCceeeecCCCCcchHHHHHHHhhccccCCCcccceeeccCCcEeec-cchhhhccccccceEEEEee
Confidence 478899999999888877777655554 4689988886 47888864 48999999999999999886
No 136
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=89.56 E-value=1.5 Score=31.22 Aligned_cols=56 Identities=20% Similarity=0.327 Sum_probs=42.2
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+. +++++..|+.+|.+.+ +++++.-.+..||+.+.. ..-++.-+++||.|.+..
T Consensus 5 Ng~~--~~~~~~~tv~~ll~~l----~~~~~~i~V~vNg~~v~~--~~~~~~~L~~gD~V~ii~ 60 (65)
T cd00565 5 NGEP--REVEEGATLAELLEEL----GLDPRGVAVALNGEIVPR--SEWASTPLQDGDRIEIVT 60 (65)
T ss_pred CCeE--EEcCCCCCHHHHHHHc----CCCCCcEEEEECCEEcCH--HHcCceecCCCCEEEEEE
Confidence 4544 5667788999988775 577788888899999863 234455699999999876
No 137
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=89.22 E-value=1.1 Score=40.95 Aligned_cols=79 Identities=16% Similarity=0.320 Sum_probs=53.2
Q ss_pred EEEEEEEc--CEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCe-EEE-eEEE
Q psy8018 150 LYINCRVN--GHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIEND-FLT-TSFS 225 (280)
Q Consensus 150 l~i~~~In--G~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~-~~~-~~~~ 225 (280)
.++++.|+ .+++.+++|||++.+-+. | ..| ..+ .+.+|-| ..+.|.+....+.+|+. .++ ..|.
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~--c-~~c-------~~~-~i~Yg~G-s~~~G~~~~D~v~ig~~~~~~~~~Fg 69 (299)
T cd05472 2 YVVTVGLGTPARDQTVIVDTGSDLTWVQ--C-QPC-------CLY-QVSYGDG-SYTTGDLATDTLTLGSSDVVPGFAFG 69 (299)
T ss_pred eEEEEecCCCCcceEEEecCCCCccccc--C-CCC-------Cee-eeEeCCC-ceEEEEEEEEEEEeCCCCccCCEEEE
Confidence 46788887 678999999999888773 1 123 222 2345544 34568888889999976 443 4565
Q ss_pred EecCC-----Ccccccchhh
Q psy8018 226 VLEDQ-----PMDMLLGLDM 240 (280)
Q Consensus 226 Vl~~~-----~~d~ILG~D~ 240 (280)
+.... ..|+||||-+
T Consensus 70 ~~~~~~~~~~~~~GilGLg~ 89 (299)
T cd05472 70 CGHDNEGLFGGAAGLLGLGR 89 (299)
T ss_pred CCccCCCccCCCCEEEECCC
Confidence 55422 5789999964
No 138
>PTZ00165 aspartyl protease; Provisional
Probab=89.09 E-value=1.9 Score=43.13 Aligned_cols=99 Identities=18% Similarity=0.247 Sum_probs=60.3
Q ss_pred CCCcce--eeeEEEEEEEEcC--EEEEEEEcCCCcceec-CHHHHH-HcCCccccccc----e-----------eeeeec
Q psy8018 140 NPETFG--SVIMLYINCRVNG--HPVKAFIDSGAQTTIM-SAACAA-RVNITRLIDTR----W-----------AGIAKG 198 (280)
Q Consensus 140 ~Pe~~~--~~~~l~i~~~InG--~~v~alVDTGAq~tii-s~~~A~-rlgL~~~~d~r----~-----------~g~a~g 198 (280)
.||.+. .....+.++.|+. +++++++|||++.+-+ +..|.. .|......+.. + .-+.+|
T Consensus 109 ~~~~l~n~~d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~~~~C~~~~~yd~s~SSTy~~~~~~~~~~~~~i~YG 188 (482)
T PTZ00165 109 LQQDLLNFHNSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECKSGGCAPHRKFDPKKSSTYTKLKLGDESAETYIQYG 188 (482)
T ss_pred cceecccccCCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcCcccccccCCCCccccCCcEecCCCCccceEEEEeC
Confidence 444443 3457789999976 8899999999998877 444532 23222211110 0 012355
Q ss_pred CceeeEEeEEEEEEEEECCeEEE-eEEEEec--------CCCcccccchhh
Q psy8018 199 VGVQRIIGRIHMVQIAIENDFLT-TSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 199 vG~~~~~g~i~~~~i~IG~~~~~-~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
-|+ ..|.+....+.+|+..++ ..|.+.. ....|+||||-|
T Consensus 189 sGs--~~G~l~~DtV~ig~l~i~~q~FG~a~~~s~~~f~~~~~DGILGLg~ 237 (482)
T PTZ00165 189 TGE--CVLALGKDTVKIGGLKVKHQSIGLAIEESLHPFADLPFDGLVGLGF 237 (482)
T ss_pred CCc--EEEEEEEEEEEECCEEEccEEEEEEEeccccccccccccceeecCC
Confidence 553 457777789999987654 2343332 225799999987
No 139
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=89.01 E-value=3.6 Score=29.00 Aligned_cols=60 Identities=15% Similarity=0.279 Sum_probs=40.9
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+| +|+. +++++..|+.+|.+.+ +++ ..-.+.+||+.+.. ..-.+.-+++||.|.+..
T Consensus 1 m~i~v---Ng~~--~~~~~~~tl~~ll~~l----~~~-~~~~v~vN~~~v~~--~~~~~~~L~~gD~vei~~ 60 (65)
T PRK06944 1 MDIQL---NQQT--LSLPDGATVADALAAY----GAR-PPFAVAVNGDFVAR--TQHAARALAAGDRLDLVQ 60 (65)
T ss_pred CEEEE---CCEE--EECCCCCcHHHHHHhh----CCC-CCeEEEECCEEcCc--hhcccccCCCCCEEEEEe
Confidence 66666 5554 4667788999888765 443 23566789998853 123344599999999876
No 140
>cd01760 RBD Ubiquitin-like domain of RBD-like S/T kinases. The ras-binding domain (RBD) of the serine/threonine kinase raf is structurally quite similar to the beta-grasp fold of ubiquitin. A raf-like RBD is also present in RGS12 and other members of a family of GTPase activating proteins and TIAM1, a guanine nucleotide exchange protein.
Probab=88.82 E-value=2.7 Score=31.05 Aligned_cols=65 Identities=12% Similarity=0.110 Sum_probs=49.3
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC----eeeCCCcCCccccCCCCCCEEEEE
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG----NALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G----k~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
++|..++|+...+.|.|.+|+.++-+.+-+.-|+.++.-.|++.| +++.- -++.+.-.|+-|.+.
T Consensus 2 ~~V~LPng~~t~V~vrpg~ti~d~L~~~c~kr~l~~~~~~v~~~~~~~~~~~~~----~~d~~~L~~~El~Ve 70 (72)
T cd01760 2 CRVYLPNGQRTVVPVRPGMSVRDVLAKACKKRGLNPECCDVFLLGLDEKKPLDL----DTDSSSLAGEELEVE 70 (72)
T ss_pred EEEECcCCCeEEEEECCCCCHHHHHHHHHHHcCCCHHHEEEEEecCCCcCCcCc----hhhhhhhcCCEEEEE
Confidence 467789999999999999999999999999999999988887654 33321 223345556655553
No 141
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=88.57 E-value=1 Score=40.89 Aligned_cols=78 Identities=17% Similarity=0.136 Sum_probs=47.5
Q ss_pred EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCC--Ccccccc
Q psy8018 160 PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQ--PMDMLLG 237 (280)
Q Consensus 160 ~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~--~~d~ILG 237 (280)
...++||||++.+.++.+.++++-= .+.|... .. .-|...++|+-. ++.. .+..|||
T Consensus 198 ~~~~iiDSGTs~~~lP~~~~~~l~~------~l~g~~~----~~----------~~~~~~~~C~~~-~P~i~f~~~~ilG 256 (278)
T cd06097 198 GFSAIADTGTTLILLPDAIVEAYYS------QVPGAYY----DS----------EYGGWVFPCDTT-LPDLSFAVFSILG 256 (278)
T ss_pred CceEEeecCCchhcCCHHHHHHHHH------hCcCCcc----cC----------CCCEEEEECCCC-CCCEEEEEEEEEc
Confidence 4569999999999999888776620 0101000 00 001122333211 1111 0157999
Q ss_pred hhhhhcceEEEEecccEEEee
Q psy8018 238 LDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 238 ~D~L~~~~~~ID~~~~~L~i~ 258 (280)
-.|||.+=.+.|+.+++|-+.
T Consensus 257 d~fl~~~y~vfD~~~~~ig~A 277 (278)
T cd06097 257 DVFLKAQYVVFDVGGPKLGFA 277 (278)
T ss_pred chhhCceeEEEcCCCceeeec
Confidence 999999999999999988664
No 142
>COG2104 ThiS Sulfur transfer protein involved in thiamine biosynthesis [Coenzyme metabolism]
Probab=88.42 E-value=3.6 Score=30.12 Aligned_cols=63 Identities=17% Similarity=0.371 Sum_probs=47.2
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|.+++.. +|+ +++++...|+.+|-+. .|++++.-...+||.++.. .--++.-+++||.|.++.
T Consensus 1 ~~m~i~~-ng~--~~e~~~~~tv~dLL~~----l~~~~~~vav~vNg~iVpr--~~~~~~~l~~gD~ievv~ 63 (68)
T COG2104 1 MPMTIQL-NGK--EVEIAEGTTVADLLAQ----LGLNPEGVAVAVNGEIVPR--SQWADTILKEGDRIEVVR 63 (68)
T ss_pred CcEEEEE-CCE--EEEcCCCCcHHHHHHH----hCCCCceEEEEECCEEccc--hhhhhccccCCCEEEEEE
Confidence 4455544 343 5677787999998766 6788888889999999973 345667799999998875
No 143
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=88.41 E-value=4.1 Score=37.83 Aligned_cols=101 Identities=13% Similarity=0.119 Sum_probs=56.6
Q ss_pred EEEcCEE-----EEEEEcCCCcceecCHHHHHHcCCccccccce--eeeeecCceeeEEeEEEEEEEEECCeEEE-----
Q psy8018 154 CRVNGHP-----VKAFIDSGAQTTIMSAACAARVNITRLIDTRW--AGIAKGVGVQRIIGRIHMVQIAIENDFLT----- 221 (280)
Q Consensus 154 ~~InG~~-----v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~--~g~a~gvG~~~~~g~i~~~~i~IG~~~~~----- 221 (280)
++|+|.. ..++||||++.+.++.++++.+. ..+.... .+. .-+-... ........+.+|+..+.
T Consensus 194 i~vg~~~~~~~~~~~ivDSGtt~~~lp~~~~~~l~--~~~~~~~~~~~~-~~~~C~~-~~~~P~i~f~f~g~~~~i~~~~ 269 (320)
T cd05488 194 IGLGDEELELENTGAAIDTGTSLIALPSDLAEMLN--AEIGAKKSWNGQ-YTVDCSK-VDSLPDLTFNFDGYNFTLGPFD 269 (320)
T ss_pred EEECCEEeccCCCeEEEcCCcccccCCHHHHHHHH--HHhCCccccCCc-EEeeccc-cccCCCEEEEECCEEEEECHHH
Confidence 4556543 35899999999999999887752 1111000 000 0000000 00112234555554332
Q ss_pred --------eEEEEecC-----CCcccccchhhhhcceEEEEecccEEEee
Q psy8018 222 --------TSFSVLED-----QPMDMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 222 --------~~~~Vl~~-----~~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
|-+.+... .+...|||..||+.+-.+.|+.++++-|.
T Consensus 270 y~~~~~g~C~~~~~~~~~~~~~~~~~ilG~~fl~~~y~vfD~~~~~iG~a 319 (320)
T cd05488 270 YTLEVSGSCISAFTGMDFPEPVGPLAIVGDAFLRKYYSVYDLGNNAVGLA 319 (320)
T ss_pred heecCCCeEEEEEEECcCCCCCCCeEEEchHHhhheEEEEeCCCCEEeec
Confidence 33333221 12258999999999999999999988764
No 144
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=88.34 E-value=2 Score=30.50 Aligned_cols=56 Identities=18% Similarity=0.358 Sum_probs=41.8
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+|+.+ +++...|+.+|.+. .+++++.-.+..||+.+.. ..-+++-+++||.|.+..
T Consensus 4 Ng~~~--~~~~~~tv~~ll~~----l~~~~~~v~v~vN~~iv~~--~~~~~~~L~~gD~veii~ 59 (64)
T TIGR01683 4 NGEPV--EVEDGLTLAALLES----LGLDPRRVAVAVNGEIVPR--SEWDDTILKEGDRIEIVT 59 (64)
T ss_pred CCeEE--EcCCCCcHHHHHHH----cCCCCCeEEEEECCEEcCH--HHcCceecCCCCEEEEEE
Confidence 55544 55778899998876 4677777778899999853 234556799999999876
No 145
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=88.31 E-value=2.1 Score=40.57 Aligned_cols=87 Identities=15% Similarity=0.262 Sum_probs=53.0
Q ss_pred EEEEEEEEc--CEEEEEEEcCCCcceecCHH-HHHHcCCccccccc----e------eeeeecCceeeEEeEEEEEEEEE
Q psy8018 149 MLYINCRVN--GHPVKAFIDSGAQTTIMSAA-CAARVNITRLIDTR----W------AGIAKGVGVQRIIGRIHMVQIAI 215 (280)
Q Consensus 149 ~l~i~~~In--G~~v~alVDTGAq~tiis~~-~A~rlgL~~~~d~r----~------~g~a~gvG~~~~~g~i~~~~i~I 215 (280)
-.++++.|. ++++.+++|||++.+-+... |. ......+.. + -.+.+|-|. ..|.+....+.|
T Consensus 3 ~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~~~---~~~~~f~~~~SsT~~~~~~~~~i~Yg~Gs--~~G~~~~D~v~i 77 (364)
T cd05473 3 GYYIEMLIGTPPQKLNILVDTGSSNFAVAAAPHP---FIHTYFHRELSSTYRDLGKGVTVPYTQGS--WEGELGTDLVSI 77 (364)
T ss_pred ceEEEEEecCCCceEEEEEecCCcceEEEcCCCc---cccccCCchhCcCcccCCceEEEEECcce--EEEEEEEEEEEE
Confidence 357889996 68999999999988877433 32 111111110 0 123456553 469998889999
Q ss_pred CC---eEEEeEEEEec--------CCCcccccchhh
Q psy8018 216 EN---DFLTTSFSVLE--------DQPMDMLLGLDM 240 (280)
Q Consensus 216 G~---~~~~~~~~Vl~--------~~~~d~ILG~D~ 240 (280)
|+ ..+...+.... +...|+||||-+
T Consensus 78 g~~~~~~~~~~~~~~~~~~~~~~~~~~~dGIlGLg~ 113 (364)
T cd05473 78 PKGPNVTFRANIAAITESENFFLNGSNWEGILGLAY 113 (364)
T ss_pred CCCCccceEEeeEEEeccccceecccccceeeeecc
Confidence 85 23333333221 124699999975
No 146
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=88.03 E-value=5.2 Score=36.57 Aligned_cols=28 Identities=7% Similarity=-0.003 Sum_probs=25.0
Q ss_pred ccccchhhhhcceEEEEecccEEEeeee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
-.|||-.|||.+-.+.|+.+++|-+...
T Consensus 270 ~~ilG~~fl~~~~vvfD~~~~~igfa~~ 297 (299)
T cd05472 270 LSIIGNVQQQTFRVVYDVAGGRIGFAPG 297 (299)
T ss_pred CEEEchHHccceEEEEECCCCEEeEecC
Confidence 3799999999999999999999988654
No 147
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=87.90 E-value=2.6 Score=30.69 Aligned_cols=55 Identities=18% Similarity=0.143 Sum_probs=41.4
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCC----CCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGI----SAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gi----p~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
...+++++..||.+|.+.+....+- ....-.+..||+.... ++-+++||.|.+..
T Consensus 17 ~~~~~~~~~~tv~~ll~~l~~~~~~~~~~~~~~~~v~vNg~~v~~------~~~l~~gD~v~i~p 75 (80)
T cd00754 17 EEELELPEGATVGELLDALEARYPGLLEELLARVRIAVNGEYVRL------DTPLKDGDEVAIIP 75 (80)
T ss_pred eEEEECCCCCcHHHHHHHHHHHCchHHHhhhhcEEEEECCeEcCC------CcccCCCCEEEEeC
Confidence 4677888899999999999887532 2334566779998863 24599999999875
No 148
>PF11069 DUF2870: Protein of unknown function (DUF2870); InterPro: IPR021298 This is a eukaryotic family of proteins with unknown function.
Probab=87.83 E-value=1.5 Score=34.46 Aligned_cols=30 Identities=23% Similarity=0.346 Sum_probs=21.7
Q ss_pred EEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 42 AIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 42 ~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.|-|.||.|..+ ++|++| |.-++.--++.+
T Consensus 3 ~LW~aGK~l~~~-k~l~dy-~GkNEKtKiivK 32 (98)
T PF11069_consen 3 QLWWAGKELQRG-KKLSDY-IGKNEKTKIIVK 32 (98)
T ss_pred eEEeccccccCC-CcHHHh-cCCCcceeEEEE
Confidence 578999999765 899999 555554444443
No 149
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=87.71 E-value=1.6 Score=39.67 Aligned_cols=28 Identities=14% Similarity=-0.023 Sum_probs=24.6
Q ss_pred ccccchhhhhcceEEEEecccEEEeeee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
..|||-.||+.+-.+.|+.+++|-+...
T Consensus 244 ~~ilG~~~l~~~~~vfD~~~~riGfa~~ 271 (273)
T cd05475 244 TNIIGDISMQGLMVIYDNEKQQIGWVRS 271 (273)
T ss_pred eEEECceEEEeeEEEEECcCCEeCcccC
Confidence 4799999999999999999998877554
No 150
>PRK11840 bifunctional sulfur carrier protein/thiazole synthase protein; Provisional
Probab=87.15 E-value=2.9 Score=39.71 Aligned_cols=67 Identities=18% Similarity=0.275 Sum_probs=50.8
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEeccCCCC
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRVQVARTS 79 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~~~p~~s 79 (280)
|+|+| +|+. +++++..||.+|-.. .+++++.-.+.+||+.+.. ..-.++-+++||.|.+.. .+.|.|
T Consensus 1 M~I~V---NGk~--~el~e~~TL~dLL~~----L~i~~~~VAVeVNgeIVpr--~~w~~t~LkeGD~IEII~-~VgGGs 67 (326)
T PRK11840 1 MRIRL---NGEP--RQVPAGLTIAALLAE----LGLAPKKVAVERNLEIVPR--SEYGQVALEEGDELEIVH-FVGGGS 67 (326)
T ss_pred CEEEE---CCEE--EecCCCCcHHHHHHH----cCCCCCeEEEEECCEECCH--HHcCccccCCCCEEEEEE-EecCCC
Confidence 66666 5654 566778888887654 5889988899999999963 345667799999999987 356654
No 151
>cd06406 PB1_P67 A PB1 domain is present in p67 proteins which forms a signaling complex with p40, a crucial step for activation of NADPH oxidase during phagocytosis. PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes . A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The p67 proteins contain
Probab=87.02 E-value=2.4 Score=32.09 Aligned_cols=36 Identities=11% Similarity=0.254 Sum_probs=34.0
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG 47 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G 47 (280)
++.+.|+++.+..+|.++|.++.++|++.-+|.|.-
T Consensus 12 tIaIrvp~~~~y~~L~~ki~~kLkl~~e~i~LsYkd 47 (80)
T cd06406 12 TVAIQVARGLSYATLLQKISSKLELPAEHITLSYKS 47 (80)
T ss_pred EEEEEcCCCCCHHHHHHHHHHHhCCCchhcEEEecc
Confidence 888999999999999999999999999999999954
No 152
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=86.14 E-value=1.9 Score=38.33 Aligned_cols=81 Identities=11% Similarity=0.101 Sum_probs=48.2
Q ss_pred EEEEEEcCCCcceecCHHHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCcccccchh
Q psy8018 160 PVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPMDMLLGLD 239 (280)
Q Consensus 160 ~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~d~ILG~D 239 (280)
...+++|||...+.++.+..+.+--. +.... .. ........ ...+ ...-++.|.. ..|||..
T Consensus 202 ~~~~iiDsGt~~~~lp~~~~~~l~~~--~~~~~----~~---~~~~~~~~--~~~~-~~~p~i~f~f------~~ilG~~ 263 (283)
T cd05471 202 GGGAIVDSGTSLIYLPSSVYDAILKA--LGAAV----SS---SDGGYGVD--CSPC-DTLPDITFTF------LWILGDV 263 (283)
T ss_pred CcEEEEecCCCCEeCCHHHHHHHHHH--hCCcc----cc---cCCcEEEe--Cccc-CcCCCEEEEE------EEEccHh
Confidence 56799999999999999988876211 11111 00 00000000 0000 0000111211 7999999
Q ss_pred hhhcceEEEEecccEEEee
Q psy8018 240 MLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 240 ~L~~~~~~ID~~~~~L~i~ 258 (280)
||+.+-.+.|+.+++|-+.
T Consensus 264 fl~~~y~vfD~~~~~igfa 282 (283)
T cd05471 264 FLRNYYTVFDLDNNRIGFA 282 (283)
T ss_pred hhhheEEEEeCCCCEEeec
Confidence 9999999999999988764
No 153
>TIGR02958 sec_mycoba_snm4 secretion protein snm4. Members of this family are the 12-transmembrane domain protein snm4, where snm stands for secretion in mycocbacteria. This system acts on Mycobacterium tuberculosis related pair of virulence factors ESAT-6 and CFP-10 and on other homologs. The system is conserved in many Actinobacteria, including the non-pathogenic Mycobacterium smegmatis.
Probab=85.81 E-value=3.4 Score=40.95 Aligned_cols=70 Identities=19% Similarity=0.211 Sum_probs=52.0
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCC------CCcceEEE-ECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGI------SAQEIAIE-FQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gi------p~~~Q~Li-~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+|||...+ +...+-++.+..|.+|-.-+-.-.+= +...-.|. -+|.+|+.+ .||.+.||.|||+++|...
T Consensus 4 RVtV~~~~-~~~DlaLPa~~PvaellP~ll~~~~~~~~~~~~~~~w~L~r~gG~pL~~~-~sL~~~gV~DG~~L~L~p~ 80 (452)
T TIGR02958 4 RVTVLAGR-RAVDVALPADVPVAELIPDLVDLLDDRGAAELGAVRWALARAGGSPLDPD-ASLAEAGVRDGELLVLVPA 80 (452)
T ss_pred EEEEeeCC-eeeeeecCCCCcHHHHHHHHHHHhCcccccCCCCcceEEecCCCCCCCCC-CCHHHcCCCCCCeEEEeeC
Confidence 56777644 45778889999999998877665542 12233443 488899765 9999999999999999874
No 154
>KOG2086|consensus
Probab=85.41 E-value=1.8 Score=41.77 Aligned_cols=66 Identities=15% Similarity=0.320 Sum_probs=53.5
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHh-CCCCcceEEE--ECCeeeCCCcCCccccCCCCCCE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQS-GISAQEIAIE--FQGNALVDNKKTLKDYNIQDGDL 67 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~-gip~~~Q~Li--~~Gk~L~d~~~tL~~~gI~~gd~ 67 (280)
.|-|+..+|....+-+.-+.||.+++..|...- +-+...+.|+ |=-|+|.|++.||++.|+.+--+
T Consensus 307 sIQIRLanG~RlV~~fN~sHTv~DIR~fI~~aRp~~~~~~F~L~~~FPpk~l~D~sqTle~AgL~Nsvl 375 (380)
T KOG2086|consen 307 SIQIRLANGTRLVLKFNHSHTVSDIREFIDTARPGDSSTYFILMMAFPPKPLSDDSQTLEEAGLLNSVL 375 (380)
T ss_pred eEEEEecCCceeeeeccCcccHHHHHHHHHhcCCCCcCCceeeeecCCCcccCCcchhHHhccchhhhh
Confidence 367888899998889999999999999998764 4455567776 46788999999999999976543
No 155
>PRK01777 hypothetical protein; Validated
Probab=85.01 E-value=7 Score=30.42 Aligned_cols=62 Identities=19% Similarity=0.175 Sum_probs=43.1
Q ss_pred CEEEEEc--CC-CcEEEEEeCCCccHHHHHHHHHHHhCCCCc--c-----eEEEECCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 1 MKVTVTS--LD-NECFLLDVSEDLELENFKAFCEVQSGISAQ--E-----IAIEFQGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 1 M~ItV~~--~~-g~~~~leV~~~~tV~~LK~~Ie~e~gip~~--~-----Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
|+|+|.. ++ .....+++++..||.++-.. .||+.. + -.+.-+|+...-+ .-+++||.|-+
T Consensus 4 i~v~V~ya~~~~~~~~~l~vp~GtTv~dal~~----sgi~~~~pei~~~~~~vgI~Gk~v~~d------~~L~dGDRVeI 73 (95)
T PRK01777 4 IRVEVVYALPERQYLQRLTLQEGATVEEAIRA----SGLLELRTDIDLAKNKVGIYSRPAKLT------DVLRDGDRVEI 73 (95)
T ss_pred eEEEEEEECCCceEEEEEEcCCCCcHHHHHHH----cCCCccCcccccccceEEEeCeECCCC------CcCCCCCEEEE
Confidence 5778763 22 33467899999999998666 366554 2 3555678876433 36999999998
Q ss_pred EE
Q psy8018 71 KR 72 (280)
Q Consensus 71 ~~ 72 (280)
-+
T Consensus 74 yr 75 (95)
T PRK01777 74 YR 75 (95)
T ss_pred ec
Confidence 75
No 156
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=84.99 E-value=3.3 Score=38.56 Aligned_cols=91 Identities=15% Similarity=0.202 Sum_probs=55.7
Q ss_pred EEEEEEEEc--CEEEEEEEcCCCcceecCHHHHHHcCCcc--cccc----c----------------------eeeeeec
Q psy8018 149 MLYINCRVN--GHPVKAFIDSGAQTTIMSAACAARVNITR--LIDT----R----------------------WAGIAKG 198 (280)
Q Consensus 149 ~l~i~~~In--G~~v~alVDTGAq~tiis~~~A~rlgL~~--~~d~----r----------------------~~g~a~g 198 (280)
..++++.|. ++++.+++|||+..+-+.-.-+..|+... ..+. . .-.+..|
T Consensus 3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~i~Y~ 82 (326)
T cd06096 3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYSISYS 82 (326)
T ss_pred eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEEEEEC
Confidence 468899997 68899999999998888544334454321 0000 0 0112344
Q ss_pred CceeeEEeEEEEEEEEECCeEEE--------eEEEEec-------CCCcccccchhh
Q psy8018 199 VGVQRIIGRIHMVQIAIENDFLT--------TSFSVLE-------DQPMDMLLGLDM 240 (280)
Q Consensus 199 vG~~~~~g~i~~~~i~IG~~~~~--------~~~~Vl~-------~~~~d~ILG~D~ 240 (280)
-| ..+.|.+....+.+|+...+ ..|.+.. ....|+||||.+
T Consensus 83 ~g-s~~~G~~~~D~v~lg~~~~~~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~ 138 (326)
T cd06096 83 EG-SSISGFYFSDFVSFESYLNSNSEKESFKKIFGCHTHETNLFLTQQATGILGLSL 138 (326)
T ss_pred CC-CceeeEEEEEEEEeccCCCCccccccccEEeccCccccCcccccccceEEEccC
Confidence 44 35678888889999875432 1232221 124789999986
No 157
>smart00455 RBD Raf-like Ras-binding domain.
Probab=84.09 E-value=4.2 Score=29.78 Aligned_cols=48 Identities=17% Similarity=0.229 Sum_probs=42.1
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC--eee
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG--NAL 50 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G--k~L 50 (280)
..|..++|+...+.+.|..|+.++-+.+-++.|+.++.-.++..| ++|
T Consensus 2 ~~v~LP~~~~~~V~vrpg~tl~e~L~~~~~kr~l~~~~~~v~~~g~~k~l 51 (70)
T smart00455 2 CKVHLPDNQRTVVKVRPGKTVRDALAKALKKRGLNPECCVVRLRGEKKPL 51 (70)
T ss_pred eEEECCCCCEEEEEECCCCCHHHHHHHHHHHcCCCHHHEEEEEcCCCcce
Confidence 356678999999999999999999999999999999999998854 455
No 158
>smart00666 PB1 PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
Probab=83.69 E-value=5.5 Score=29.17 Aligned_cols=38 Identities=21% Similarity=0.420 Sum_probs=34.2
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ 46 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~ 46 (280)
+|.++.+.++++.|..+|+.+|...++.+.....|-|.
T Consensus 9 ~~~~~~~~~~~~~s~~dL~~~i~~~~~~~~~~~~l~Y~ 46 (81)
T smart00666 9 GGETRRLSVPRDISFEDLRSKVAKRFGLDNQSFTLKYQ 46 (81)
T ss_pred CCEEEEEEECCCCCHHHHHHHHHHHhCCCCCCeEEEEE
Confidence 77899999999999999999999999998777777775
No 159
>PF11620 GABP-alpha: GA-binding protein alpha chain; InterPro: IPR024668 GA-binding protein alpha is a transcription factor capable of interacting with purine rich repeats (GA repeats). This N-terminal domain found in the transcription factor GABP alpha consists of a five-stranded beta-sheet crossed by a distorted helix and has been termed OST domain. The surface of the GABP alpha OST domain contains two clusters of negatively-charged residues suggesting there are positively-charged partner proteins. The OST domain binds to the CH1 and CH3 domains of the co-activator histone acetyltransferase CBP/p300 [].; PDB: 2JUO_A.
Probab=83.66 E-value=4.2 Score=31.17 Aligned_cols=61 Identities=25% Similarity=0.269 Sum_probs=43.9
Q ss_pred cEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 11 ECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 11 ~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
..+...++=...+..||.+++...+++-+.-.++.....|..+ ++|-+-|++-.-++-+..
T Consensus 3 ~vI~q~mDI~epl~~Lk~lLe~Rl~~~L~~~~f~LQD~~L~~~-k~L~dQcVqgeGlVQlnv 63 (88)
T PF11620_consen 3 DVIMQHMDIREPLSTLKKLLERRLGISLSDYEFWLQDIQLEPH-KSLVDQCVQGEGLVQLNV 63 (88)
T ss_dssp EEEEEEEESSSBGGGHHHHSHHHH-S--SS-EEEETTEE--TT-SBTTTSS----SEEEEEE
T ss_pred ceEEEEEecCCcHHHHHHHHHHhhCCCcCCCeEEeccceecCC-ccHHHhhccccCEEEEEE
Confidence 3456677778899999999999999999999999988888765 899999999988888876
No 160
>KOG4250|consensus
Probab=83.30 E-value=1.8 Score=44.87 Aligned_cols=65 Identities=17% Similarity=0.169 Sum_probs=47.6
Q ss_pred cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeC-CCcCCccccCCCCCCEEEEEE
Q psy8018 7 SLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALV-DNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 7 ~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~-d~~~tL~~~gI~~gd~l~l~~ 72 (280)
..+...+.+-|+++.|+..+++.|+..+|||...|.|+|.|...- .++.+-.--| -+.++..+..
T Consensus 321 ~~~~~~~~~~~~~~ntl~~~~~~I~~~Tgipe~~qeLL~e~~~~h~~~~~Q~~~dg-~~~~l~l~~~ 386 (732)
T KOG4250|consen 321 MVQATSHEYYVHADNTLHSLIERISKQTGIPEGKQELLFEGGLSHLEDSAQCIPDG-LDSPLYLVSD 386 (732)
T ss_pred eccceEEEEecChhhhHHHHHHHHHHhhCCCCccceeeeecCccccCcccccCCCC-CCCceEEEec
Confidence 456778899999999999999999999999999999999875532 1223322334 4555555443
No 161
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=83.03 E-value=6 Score=35.13 Aligned_cols=90 Identities=14% Similarity=0.209 Sum_probs=52.9
Q ss_pred EEEEEEcCE--EEEEEEcCCCcceecCHHHHHHcCCccc--------------cccceeeeeecCceeeEEeEEEEEEEE
Q psy8018 151 YINCRVNGH--PVKAFIDSGAQTTIMSAACAARVNITRL--------------IDTRWAGIAKGVGVQRIIGRIHMVQIA 214 (280)
Q Consensus 151 ~i~~~InG~--~v~alVDTGAq~tiis~~~A~rlgL~~~--------------~d~r~~g~a~gvG~~~~~g~i~~~~i~ 214 (280)
++++.|+.- ++.+++|||+..+-+...-+..|..... ..........+-| ...|.+....+.
T Consensus 2 ~~~i~iGtp~q~~~l~~DTGS~~~wv~~~~c~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~Y~~g--~~~g~~~~D~v~ 79 (283)
T cd05471 2 YGEITIGTPPQKFSVIFDTGSSLLWVPSSNCTSCSCQKHPRFKYDSSKSSTYKDTGCTFSITYGDG--SVTGGLGTDTVT 79 (283)
T ss_pred EEEEEECCCCcEEEEEEeCCCCCEEEecCCCCccccccCCCCccCccCCceeecCCCEEEEEECCC--eEEEEEEEeEEE
Confidence 567788654 7999999999988885543333322110 0001112334444 345666677999
Q ss_pred ECCeEEE-eEEEEec-------CCCcccccchhhhh
Q psy8018 215 IENDFLT-TSFSVLE-------DQPMDMLLGLDMLK 242 (280)
Q Consensus 215 IG~~~~~-~~~~Vl~-------~~~~d~ILG~D~L~ 242 (280)
+|+...+ ..|.... ....|+||||..=.
T Consensus 80 ~~~~~~~~~~fg~~~~~~~~~~~~~~~GilGLg~~~ 115 (283)
T cd05471 80 IGGLTIPNQTFGCATSESGDFSSSGFDGILGLGFPS 115 (283)
T ss_pred ECCEEEeceEEEEEeccCCcccccccceEeecCCcc
Confidence 9986533 3444443 23688999987643
No 162
>cd01768 RA RA (Ras-associating) ubiquitin domain. The RA (Ras-associating) domain is structurally similar to ubiquitin and is present in one or two copies in a number of signalling molecules that bind and regulate a small GTPase called Ras or the Ras-related GTPases, Ral and Rap. RA-containing proteins include RalGDS, AF6, RIN1, RASSF1, SNX27, CYR1, STE50, and phospholipase C epsilon.
Probab=81.54 E-value=18 Score=26.75 Aligned_cols=49 Identities=16% Similarity=0.188 Sum_probs=38.1
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCC--CcceEEE--EC----CeeeCCCcCCcc
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGIS--AQEIAIE--FQ----GNALVDNKKTLK 58 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip--~~~Q~Li--~~----Gk~L~d~~~tL~ 58 (280)
+...++.|++++|..++-..+..+++++ +....|+ +. .+.|.+++..+.
T Consensus 12 ~~~kti~V~~~~t~~~Vi~~~l~k~~l~~~~~~y~L~ev~~~~~~er~L~~~e~pl~ 68 (87)
T cd01768 12 GTYKTLRVSKDTTAQDVIQQLLKKFGLDDDPEDYALVEVLGDGGLERLLLPDECPLQ 68 (87)
T ss_pred ccEEEEEECCCCCHHHHHHHHHHHhCCcCCcccEEEEEEECCceEEEEeCCCCChHH
Confidence 7778999999999999999999999997 4555554 22 346777666665
No 163
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=81.03 E-value=10 Score=35.20 Aligned_cols=26 Identities=12% Similarity=0.249 Sum_probs=23.6
Q ss_pred ccccchhhhhcceEEEEecccEEEee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
..|||..|||.+-.+.|+.++++-+.
T Consensus 299 ~~ilG~~flr~~y~vfD~~~~~IGfA 324 (326)
T cd05487 299 LWVLGATFIRKFYTEFDRQNNRIGFA 324 (326)
T ss_pred eEEEehHHhhccEEEEeCCCCEEeee
Confidence 47999999999999999999988775
No 164
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=81.02 E-value=5.1 Score=30.29 Aligned_cols=42 Identities=7% Similarity=0.157 Sum_probs=33.9
Q ss_pred EEE-cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC-cceEEEE
Q psy8018 4 TVT-SLDNECFLLDVSEDLELENFKAFCEVQSGISA-QEIAIEF 45 (280)
Q Consensus 4 tV~-~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~-~~Q~Li~ 45 (280)
+|| +-+|....+.++|+.+..+|++.|...+++.. ....|-|
T Consensus 2 ~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~~~~~f~LkY 45 (82)
T cd06407 2 RVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLDDMSAFDLKY 45 (82)
T ss_pred EEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCCCCCeeEEEE
Confidence 444 23788999999999999999999999999864 4556655
No 165
>cd06409 PB1_MUG70 The MUG70 protein is a product of the meiotically up-regulated gene 70 which has a role in meiosis and harbors a PB1 domain. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domains depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic amino acid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is
Probab=80.64 E-value=6.5 Score=30.14 Aligned_cols=43 Identities=9% Similarity=0.145 Sum_probs=36.9
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCc---ceEEEE
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQ---EIAIEF 45 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~---~Q~Li~ 45 (280)
..++++.|.++-+.+.|+.++.+|++.|....|+... .-.|.|
T Consensus 3 FK~~~~~GrvhRf~~~~s~~~~~L~~~I~~Rl~~d~~~~~~~~L~Y 48 (86)
T cd06409 3 FKFKDPKGRVHRFRLRPSESLEELRTLISQRLGDDDFETHLYALSY 48 (86)
T ss_pred EEeeCCCCCEEEEEecCCCCHHHHHHHHHHHhCCccccCCcccEEE
Confidence 3567889999999999999999999999999999873 555655
No 166
>PF03539 Spuma_A9PTase: Spumavirus aspartic protease (A9); InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=80.47 E-value=14 Score=31.27 Aligned_cols=77 Identities=19% Similarity=0.273 Sum_probs=40.7
Q ss_pred EcCEEEEEEEcCCCcceecCHHHHHHcCCccccccceeeee--ecCceeeEEeEEEEEEEEECCeEEEeEEEEecCCCcc
Q psy8018 156 VNGHPVKAFIDSGAQTTIMSAACAARVNITRLIDTRWAGIA--KGVGVQRIIGRIHMVQIAIENDFLTTSFSVLEDQPMD 233 (280)
Q Consensus 156 InG~~v~alVDTGAq~tiis~~~A~rlgL~~~~d~r~~g~a--~gvG~~~~~g~i~~~~i~IG~~~~~~~~~Vl~~~~~d 233 (280)
|-|...++.-||||+.|+|...+.+ +....|.. .-+-+ .....+.-+.++|.|..+.+...-- ++|
T Consensus 1 ikg~~l~~~wDsga~ITCiP~~fl~--------~E~Pi~~~~i~Tihg-~~~~~vYYl~fKi~grkv~aEVi~s---~~d 68 (163)
T PF03539_consen 1 IKGTKLKGHWDSGAQITCIPESFLE--------EEQPIGKTLIKTIHG-EKEQDVYYLTFKINGRKVEAEVIAS---PYD 68 (163)
T ss_dssp ETTEEEEEEE-TT-SSEEEEGGGTT--------T---SEEEEEE-SS--EEEEEEEEEEEEESS-EEEEEEEEE---SSS
T ss_pred CCCceeeEEecCCCeEEEccHHHhC--------ccccccceEEEEecC-ceeccEEEEEEEEcCeEEEEEEecC---ccc
Confidence 5688999999999999999988733 22221211 11111 2234455568899888766543322 344
Q ss_pred ccc----chhhhhcc
Q psy8018 234 MLL----GLDMLKRH 244 (280)
Q Consensus 234 ~IL----G~D~L~~~ 244 (280)
-+| -++|++..
T Consensus 69 y~li~p~diPw~~~~ 83 (163)
T PF03539_consen 69 YILISPSDIPWYKKK 83 (163)
T ss_dssp SEEE-TTT-HHHHS-
T ss_pred eEEEcccccccccCC
Confidence 322 35688763
No 167
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=80.17 E-value=5.9 Score=35.91 Aligned_cols=83 Identities=19% Similarity=0.262 Sum_probs=49.6
Q ss_pred EEEEEEEc--CEEEEEEEcCCCcceecCH-HHHHHcCCccccccceeeeeecCceeeEEeEEEEEEEEECC----eEE-E
Q psy8018 150 LYINCRVN--GHPVKAFIDSGAQTTIMSA-ACAARVNITRLIDTRWAGIAKGVGVQRIIGRIHMVQIAIEN----DFL-T 221 (280)
Q Consensus 150 l~i~~~In--G~~v~alVDTGAq~tiis~-~~A~rlgL~~~~d~r~~g~a~gvG~~~~~g~i~~~~i~IG~----~~~-~ 221 (280)
.++++.|+ .+++.+++|||++.+-+.- ..+..|+ ..+ .+.+|-| ..+.|.+-...+.++. ... .
T Consensus 3 Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~c~~~c~~c~------c~~-~i~Ygd~-~~~~G~~~~D~v~~~~~~~~~~~~~ 74 (273)
T cd05475 3 YYVTINIGNPPKPYFLDIDTGSDLTWLQCDAPCTGCQ------CDY-EIEYADG-GSSMGVLVTDIFSLKLTNGSRAKPR 74 (273)
T ss_pred eEEEEEcCCCCeeEEEEEccCCCceEEeCCCCCCCCc------Ccc-EeEeCCC-CceEEEEEEEEEEEeecCCCcccCC
Confidence 57888997 7889999999999998842 1112332 222 1334433 2456777777777753 121 1
Q ss_pred eEEEEe---------cCCCcccccchhh
Q psy8018 222 TSFSVL---------EDQPMDMLLGLDM 240 (280)
Q Consensus 222 ~~~~Vl---------~~~~~d~ILG~D~ 240 (280)
..|... .....|+||||-+
T Consensus 75 ~~Fgc~~~~~~~~~~~~~~~dGIlGLg~ 102 (273)
T cd05475 75 IAFGCGYDQQGPLLNPPPPTDGILGLGR 102 (273)
T ss_pred EEEEeeeccCCcccCCCccCCEEEECCC
Confidence 223322 1225789999964
No 168
>PF14451 Ub-Mut7C: Mut7-C ubiquitin
Probab=80.10 E-value=7.2 Score=29.46 Aligned_cols=53 Identities=19% Similarity=0.312 Sum_probs=42.1
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+..+.++.++..||.++-+. .|+|..+-.++. ||+...= +|-+++||.|.+..
T Consensus 22 ~~~~~~~~~~~~tvkd~IEs----LGVP~tEV~~i~vNG~~v~~------~~~~~~Gd~v~V~P 75 (81)
T PF14451_consen 22 GGPFTHPFDGGATVKDVIES----LGVPHTEVGLILVNGRPVDF------DYRLKDGDRVAVYP 75 (81)
T ss_pred CCceEEecCCCCcHHHHHHH----cCCChHHeEEEEECCEECCC------cccCCCCCEEEEEe
Confidence 45778899999999887554 799998887764 9998642 37799999999865
No 169
>PTZ00147 plasmepsin-1; Provisional
Probab=79.55 E-value=13 Score=36.80 Aligned_cols=29 Identities=10% Similarity=0.226 Sum_probs=26.5
Q ss_pred ccccchhhhhcceEEEEecccEEEeeeee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFSV 261 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~~ 261 (280)
..|||-.|||.+-.+.|+.++++-+...+
T Consensus 422 ~~ILGd~FLr~~YtVFD~~n~rIGfA~a~ 450 (453)
T PTZ00147 422 TFILGDPFMRKYFTVFDYDNHTVGFALAK 450 (453)
T ss_pred CEEECHHHhccEEEEEECCCCEEEEEEec
Confidence 47999999999999999999999998765
No 170
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=79.53 E-value=7.9 Score=35.86 Aligned_cols=26 Identities=19% Similarity=0.130 Sum_probs=23.3
Q ss_pred ccccchhhhhcceEEEEecccEEEee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
..|||-.|||.+-.+.|+.+++|-+.
T Consensus 299 ~~ilGd~flr~~y~vfD~~~~~IGfA 324 (325)
T cd05490 299 LWILGDVFIGRYYTVFDRDNDRVGFA 324 (325)
T ss_pred eEEEChHhheeeEEEEEcCCcEeecc
Confidence 47999999999999999999998764
No 171
>PF08825 E2_bind: E2 binding domain; InterPro: IPR014929 E1 and E2 enzymes play a central role in ubiquitin and ubiquitin-like protein transfer cascades. This is an E2 binding domain that is found on NEDD8 activating E1 enzyme. The protein resembles ubiquitin, and recruits the catalytic core of the E2 enzyme Ubc12 in a similar manner to that in which ubiquitin interacts with ubiquitin binding domains []. ; GO: 0005524 ATP binding, 0016881 acid-amino acid ligase activity, 0045116 protein neddylation; PDB: 3GZN_D 3DBL_F 1R4N_H 1R4M_D 2NVU_B 1TT5_D 3DBR_D 3DBH_H 1YOV_B 3FN1_A ....
Probab=79.50 E-value=3.2 Score=31.59 Aligned_cols=61 Identities=16% Similarity=0.268 Sum_probs=43.4
Q ss_pred EEeCCCccHHHHHHHHHHHhCC-------CCcceEEEECCee-e-----CCCcCCccccCCCCCCEEEEEEeccC
Q psy8018 15 LDVSEDLELENFKAFCEVQSGI-------SAQEIAIEFQGNA-L-----VDNKKTLKDYNIQDGDLVLLKRVQVA 76 (280)
Q Consensus 15 leV~~~~tV~~LK~~Ie~e~gi-------p~~~Q~Li~~Gk~-L-----~d~~~tL~~~gI~~gd~l~l~~~~~p 76 (280)
++|++++|+++|-..+.+...+ ..+.-.|++.+-+ | .+-+++|.+. +.+|+-|.+.-...|
T Consensus 1 i~v~~~~TL~~lid~L~~~~~~qlk~PSlt~~~k~LYm~~pp~Lee~Tr~NL~k~l~eL-~~~g~ei~VtD~~lp 74 (84)
T PF08825_consen 1 IEVSPSWTLQDLIDSLCEKPEFQLKKPSLTTANKTLYMQSPPSLEEATRPNLSKKLKEL-LSDGEEITVTDPTLP 74 (84)
T ss_dssp EEESTTSBSHHHHHHHHHSTTT--SS-EEESSEEEEEESSSHHHHHHTGGGGSSBTTTT-HHSSEEEEEEETTES
T ss_pred CCcCccchHHHHHHHHHhChhhhcCCCcccCCCceEEEeCCHHHHHHhhhhhhhhHHHH-hcCCCEEEEECCCCc
Confidence 6899999999999999887433 3344455554431 2 1334789999 999999999765444
No 172
>TIGR01682 moaD molybdopterin converting factor, subunit 1, non-archaeal. The C-terminal Gly-Gly of this model is critical to function.
Probab=78.93 E-value=14 Score=27.12 Aligned_cols=55 Identities=13% Similarity=0.239 Sum_probs=40.3
Q ss_pred EEEEEeCCC-ccHHHHHHHHHHHhC-C-C-CcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 12 CFLLDVSED-LELENFKAFCEVQSG-I-S-AQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 12 ~~~leV~~~-~tV~~LK~~Ie~e~g-i-p-~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
...++++++ .||.+|++.+.+... + . .....+..||+...+ +.-+++||.|.+..
T Consensus 17 ~~~~~~~~~~~tv~~L~~~L~~~~p~l~~~~~~~~v~vn~~~v~~------~~~l~dgDevai~P 75 (80)
T TIGR01682 17 EETLELPDESTTVGELKEHLAKEGPELAASRGQVMVAVNEEYVTD------DALLNEGDEVAFIP 75 (80)
T ss_pred eEEEECCCCCcCHHHHHHHHHHhCchhhhhccceEEEECCEEcCC------CcCcCCCCEEEEeC
Confidence 457888877 899999999988753 1 1 123456678888764 34699999999875
No 173
>PF00564 PB1: PB1 domain; InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=78.56 E-value=8.8 Score=28.09 Aligned_cols=38 Identities=5% Similarity=0.330 Sum_probs=32.9
Q ss_pred CCcEEE-EEeCCCccHHHHHHHHHHHhCCCCcceEEEEC
Q psy8018 9 DNECFL-LDVSEDLELENFKAFCEVQSGISAQEIAIEFQ 46 (280)
Q Consensus 9 ~g~~~~-leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~ 46 (280)
+|..+. +.++++.|..+|++.|+..++.+.....|.|.
T Consensus 9 ~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~~~~~~l~Y~ 47 (84)
T PF00564_consen 9 GGDIRRIISLPSDVSFDDLRSKIREKFGLLDEDFQLKYK 47 (84)
T ss_dssp TTEEEEEEEECSTSHHHHHHHHHHHHHTTSTSSEEEEEE
T ss_pred CCeeEEEEEcCCCCCHHHHHHHHHHHhCCCCccEEEEee
Confidence 555555 99999999999999999999999888888883
No 174
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=78.42 E-value=11 Score=34.76 Aligned_cols=26 Identities=12% Similarity=0.049 Sum_probs=23.0
Q ss_pred ccccchhhhhcceEEEEecccEEEee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
..|||-.|||.+-.+.|+.++++-+.
T Consensus 290 ~~ILGd~flr~~y~vfD~~~~~IGfA 315 (316)
T cd05486 290 LWILGDVFIRQYYSVFDRGNNRVGFA 315 (316)
T ss_pred eEEEchHHhcceEEEEeCCCCEeecc
Confidence 37999999999999999999987663
No 175
>PLN03146 aspartyl protease family protein; Provisional
Probab=78.15 E-value=9 Score=37.54 Aligned_cols=30 Identities=10% Similarity=-0.039 Sum_probs=26.6
Q ss_pred ccccchhhhhcceEEEEecccEEEeeeeec
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFSVL 262 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~~~ 262 (280)
..|||-.+++.+.++-|+++++|-|.....
T Consensus 399 ~~IlG~~~q~~~~vvyDl~~~~igFa~~~C 428 (431)
T PLN03146 399 IAIFGNLAQMNFLVGYDLESKTVSFKPTDC 428 (431)
T ss_pred ceEECeeeEeeEEEEEECCCCEEeeecCCc
Confidence 379999999999999999999999877654
No 176
>smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Probab=76.59 E-value=27 Score=29.57 Aligned_cols=71 Identities=18% Similarity=0.214 Sum_probs=51.8
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCc-ceEEEE-C--C---eeeCCCcCCccccCCC-CCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQ-EIAIEF-Q--G---NALVDNKKTLKDYNIQ-DGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~-~Q~Li~-~--G---k~L~d~~~tL~~~gI~-~gd~l~l~~~ 73 (280)
.+.|..++|....+.++++.|++++...+..+.|++.. ..-|.+ + + ..|.. .++|.+...+ ....+.+.++
T Consensus 5 ~~~V~l~dg~~~~~~~~~~~t~~ev~~~v~~~~~l~~~~~F~L~~~~~~~~~~~~l~~-~~~l~~~~~~~~~~~l~fr~r 83 (207)
T smart00295 5 VLKVYLLDGTTLEFEVDSSTTAEELLETVCRKLGIRESEYFGLQFEDPDEDLSHWLDP-AKTLLDQDVKSEPLTLYFRVK 83 (207)
T ss_pred EEEEEecCCCEEEEEECCCCCHHHHHHHHHHHhCCCccceeEEEEEcCCCCcCeeCCC-ccCHHHhcCCCCCcEEEEEEE
Confidence 46677899999999999999999999999999999642 233443 1 2 34543 3788888776 4556666665
No 177
>TIGR01687 moaD_arch MoaD family protein, archaeal. Members of this family appear to be archaeal versions of MoaD, subunit 1 of molybdopterin converting factor. This model has been split from the bacterial/eukaryotic equivalog model TIGR01682 because the presence of two members of this family in a substantial number of archaeal species suggests that roles might not be interchangeable.
Probab=74.94 E-value=31 Score=25.61 Aligned_cols=57 Identities=11% Similarity=0.091 Sum_probs=39.9
Q ss_pred cEEEEEeCCCccHHHHHHHHHHHhCC-----------CCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 11 ECFLLDVSEDLELENFKAFCEVQSGI-----------SAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 11 ~~~~leV~~~~tV~~LK~~Ie~e~gi-----------p~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
....++++ ..||.+|.+.+.+...- .-....+..||+....+ .. .-+++||.|.+..
T Consensus 16 ~~~~v~~~-~~tv~~l~~~l~~~~p~~~~~~l~~~~~~~~~~~v~vN~~~v~~~-~~---~~l~dgdev~i~P 83 (88)
T TIGR01687 16 KSEEIEIE-GKTVGDLLNELMARYPKEFSELFKEGLGLVPNVIILVNGRNVDWG-LG---TELKDGDVVAIFP 83 (88)
T ss_pred ceEEEEeC-CCCHHHHHHHHHHHCcHHHHHhCccCCcccccEEEEECCEecCcc-CC---CCCCCCCEEEEeC
Confidence 35677886 89999999999877531 01234566788887543 21 5699999999875
No 178
>COG5100 NPL4 Nuclear pore protein [Nuclear structure]
Probab=74.21 E-value=15 Score=36.00 Aligned_cols=70 Identities=19% Similarity=0.214 Sum_probs=52.7
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHH--hCCCCcceEEEEC----Ce---eeCCCcCCccccCCCCCCEEEEE
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQ--SGISAQEIAIEFQ----GN---ALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e--~gip~~~Q~Li~~----Gk---~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|-+.++...|+. .+++.++.+++.|-+++-.. .+.+|++..+.-+ |- .|+ ++|+.++|++.|++|.|.
T Consensus 1 Mi~rfRsk~G~~-Rve~qe~d~lg~l~~kll~~~~~n~~~e~~svc~~p~~qG~~~s~l~--dqt~~dlGL~hGqmLyl~ 77 (571)
T COG5100 1 MIFRFRSKEGQR-RVEVQESDVLGMLSPKLLAFFEVNYSPEQISVCSAPDGQGEIFSLLK--DQTPDDLGLRHGQMLYLE 77 (571)
T ss_pred CeEEEecCCCce-eeeccccchhhhhhHHHHhhhccCCCccceEEEeCCCCCceeeeccc--ccChhhhccccCcEEEEE
Confidence 677788877754 59999999999888877654 3556666666542 33 233 389999999999999998
Q ss_pred Ee
Q psy8018 72 RV 73 (280)
Q Consensus 72 ~~ 73 (280)
..
T Consensus 78 ys 79 (571)
T COG5100 78 YS 79 (571)
T ss_pred ec
Confidence 74
No 179
>PF12754 Blt1: Cell-cycle control medial ring component; InterPro: IPR024737 During size-dependent cell cycle transitions controlled by the ubiquitous cyclin-dependent kinase Cdk1, Blt1 has been shown to co-localise with Cdr2 in the medial interphase nodes, as well as with Mid1 which was previously shown to localise to similar interphase structures. Physical interactions between Blt1-Mid1, Blt1-Cdr2 and Cdr2-Mid1 were detected, indicating that medial cortical nodes are formed by the ordered, Cdr2-dependent assembly of multiple interacting proteins during interphase[].; PDB: 2LO0_A.
Probab=73.63 E-value=1.1 Score=42.15 Aligned_cols=43 Identities=14% Similarity=0.277 Sum_probs=0.0
Q ss_pred CccHHHHHHHHHH----------HhCCCCcceE-----EEECCeeeCCCcCCccccCCC
Q psy8018 20 DLELENFKAFCEV----------QSGISAQEIA-----IEFQGNALVDNKKTLKDYNIQ 63 (280)
Q Consensus 20 ~~tV~~LK~~Ie~----------e~gip~~~Q~-----Li~~Gk~L~d~~~tL~~~gI~ 63 (280)
+.||.++|..+++ ++++|.+..+ |+|+-|++.| ++||.+..-.
T Consensus 103 ttSv~dlk~~v~~rv~~~~~~~~~~~vp~dKik~~~~~lL~~kkPv~~-~ktl~e~l~~ 160 (309)
T PF12754_consen 103 TTSVQDLKDAVQQRVHPSQATYDETRVPLDKIKNFRCRLLYKKKPVGD-SKTLAEVLAD 160 (309)
T ss_dssp -----------------------------------------------------------
T ss_pred cCcHHHHHHHHHhhhcccccccccccCCHHHhhhhhhhheecCccCCC-cCcHHHHHhc
Confidence 6899999999999 9999999999 9999999965 4899887544
No 180
>KOG4583|consensus
Probab=73.56 E-value=1.7 Score=41.21 Aligned_cols=60 Identities=13% Similarity=0.195 Sum_probs=42.4
Q ss_pred EEEEcCCCcEEEE--EeCCCccHHHHHHHHHHHh-CCC-CcceEEEECCeeeCCCcCCccccCCC
Q psy8018 3 VTVTSLDNECFLL--DVSEDLELENFKAFCEVQS-GIS-AQEIAIEFQGNALVDNKKTLKDYNIQ 63 (280)
Q Consensus 3 ItV~~~~g~~~~l--eV~~~~tV~~LK~~Ie~e~-gip-~~~Q~Li~~Gk~L~d~~~tL~~~gI~ 63 (280)
+.||..+.+-..+ ..+-..||++||..++.-. +=| ...|||+|.||.|.| ...|++.=+|
T Consensus 12 lliks~Nq~y~dl~i~~dl~wtv~~Lk~hls~VyPskpl~~dqrliYsgkllld-~qcl~d~lrk 75 (391)
T KOG4583|consen 12 LLIKSPNQSYKDLSISLDLKWTVGDLKVHLSQVYPSKPLELDQRLIYSGKLLLD-HQCLTDWLRK 75 (391)
T ss_pred EEecCCCccccceeeehhhhhhHHHHhhhHhhcCCCCCchhhHHHHhhcccccc-chhHHHHHHH
Confidence 4566666554444 4556789999999988763 333 358999999999976 4777776443
No 181
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=73.52 E-value=8.1 Score=36.55 Aligned_cols=31 Identities=13% Similarity=0.159 Sum_probs=28.5
Q ss_pred cccchhhhhcceEEEEecccEEEeeeeecCC
Q psy8018 234 MLLGLDMLKRHQVQIAIENDFLTTSFSVLED 264 (280)
Q Consensus 234 ~ILG~D~L~~~~~~ID~~~~~L~i~~~~~~~ 264 (280)
.|||--|||.+-.+-|+.+++|-+....+..
T Consensus 319 ~ILG~~flr~~yvvfD~~~~rIGfa~~~C~~ 349 (364)
T cd05473 319 TVIGAVIMEGFYVVFDRANKRVGFAVSTCAE 349 (364)
T ss_pred eEEeeeeEcceEEEEECCCCEEeeEeccccc
Confidence 7999999999999999999999999887753
No 182
>PF14732 UAE_UbL: Ubiquitin/SUMO-activating enzyme ubiquitin-like domain; PDB: 1Y8Q_B 1Y8R_E 3KYD_B 3KYC_B.
Probab=73.28 E-value=4.3 Score=31.00 Aligned_cols=57 Identities=14% Similarity=0.265 Sum_probs=29.9
Q ss_pred EEeCC-CccHHHHHHHH-HHHhCCCCc----ceEEEECCee---eCCCcCCccccCCCCCCEEEEE
Q psy8018 15 LDVSE-DLELENFKAFC-EVQSGISAQ----EIAIEFQGNA---LVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 15 leV~~-~~tV~~LK~~I-e~e~gip~~----~Q~Li~~Gk~---L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
+.++. .+|+.+|-..| ..+.|.... .-.++|..-. =.+..++|+++||++|+.|.+.
T Consensus 2 v~~d~~~~TL~~lv~~Vlk~~Lg~~~P~v~~~~~ilyd~de~~~~~~l~k~L~elgi~~gs~L~v~ 67 (87)
T PF14732_consen 2 VKVDTKKMTLGDLVEKVLKKKLGMNEPDVSVGGTILYDSDEEEYDDNLPKKLSELGIVNGSILTVD 67 (87)
T ss_dssp EEE-TTT-BHHHHHHHCCCCCS--SSEEEEES-EEEE-SSSSSSTTCTTSBGGGGT--TT-EEEEE
T ss_pred EEEechhCcHHHHHHHHHHhccCCCCCEEEeCCCEEEcCCcchhhhcccCChhHcCCCCCCEEEEE
Confidence 45553 68999998864 556676542 2233343222 1123478999999999999774
No 183
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability
Probab=73.24 E-value=19 Score=34.39 Aligned_cols=26 Identities=15% Similarity=0.216 Sum_probs=23.7
Q ss_pred cccchhhhhcceEEEEecccEEEeee
Q psy8018 234 MLLGLDMLKRHQVQIAIENDFLTTSF 259 (280)
Q Consensus 234 ~ILG~D~L~~~~~~ID~~~~~L~i~~ 259 (280)
.|||--+++.+..+-|+.+++|-|..
T Consensus 335 ~IlG~~~~~~~~vvyD~~~~riGfa~ 360 (362)
T cd05489 335 VVIGGHQMEDNLLVFDLEKSRLGFSS 360 (362)
T ss_pred EEEeeheecceEEEEECCCCEeeccc
Confidence 58999999999999999999998764
No 184
>PTZ00165 aspartyl protease; Provisional
Probab=72.26 E-value=33 Score=34.27 Aligned_cols=32 Identities=6% Similarity=0.059 Sum_probs=28.3
Q ss_pred cccccchhhhhcceEEEEecccEEEeeeeecC
Q psy8018 232 MDMLLGLDMLKRHQVQIAIENDFLTTSFSVLE 263 (280)
Q Consensus 232 ~d~ILG~D~L~~~~~~ID~~~~~L~i~~~~~~ 263 (280)
...|||-.||+++-.+.|+.++++-+...+..
T Consensus 418 ~~~ILGd~Flr~yy~VFD~~n~rIGfA~a~~~ 449 (482)
T PTZ00165 418 PLFVLGNNFIRKYYSIFDRDHMMVGLVPAKHD 449 (482)
T ss_pred ceEEEchhhheeEEEEEeCCCCEEEEEeeccC
Confidence 34799999999999999999999999887654
No 185
>KOG2982|consensus
Probab=71.58 E-value=5.1 Score=38.10 Aligned_cols=59 Identities=17% Similarity=0.272 Sum_probs=47.9
Q ss_pred EEEeCCCccHHHHHHHHHHHhCCCCcceEEEE---CCeee----CCCcCCccccCCCCCCEEEEEE
Q psy8018 14 LLDVSEDLELENFKAFCEVQSGISAQEIAIEF---QGNAL----VDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 14 ~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~---~Gk~L----~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+.-|+-.-||.|||..++..-|+.+...+|+| +||.- .|..+.|-.|+|++||.+.+-.
T Consensus 351 s~~I~~~~TV~D~~~~Ld~~VGvk~trMkLf~L~eD~rt~~~ss~~~N~~L~~fkIedGDs~lvqe 416 (418)
T KOG2982|consen 351 SGLICMTRTVLDFMKILDPKVGVKFTRMKLFLLREDGRTDDFSSSDYNMPLHYFKIEDGDSFLVQE 416 (418)
T ss_pred ceEEEeehHHHHHHHHhccccccccceeEEEEEcccCccCCccccCCCCcceEEeccCCCEeeeec
Confidence 34455667999999999999999999999998 66653 2445789999999999988753
No 186
>cd05992 PB1 The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as a noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants.
Probab=71.56 E-value=11 Score=27.27 Aligned_cols=37 Identities=14% Similarity=0.269 Sum_probs=30.6
Q ss_pred CCcEEEEEeC-CCccHHHHHHHHHHHhCCCCcceEEEE
Q psy8018 9 DNECFLLDVS-EDLELENFKAFCEVQSGISAQEIAIEF 45 (280)
Q Consensus 9 ~g~~~~leV~-~~~tV~~LK~~Ie~e~gip~~~Q~Li~ 45 (280)
+|..+.+.++ ++.|..+|+++|...++.+.....|.|
T Consensus 8 ~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~~~~~~l~y 45 (81)
T cd05992 8 GGEIRRFVVVSRSISFEDLRSKIAEKFGLDAVSFKLKY 45 (81)
T ss_pred cCCCEEEEEecCCCCHHHHHHHHHHHhCCCCCcEEEEe
Confidence 4678888888 999999999999999999865555544
No 187
>PF02196 RBD: Raf-like Ras-binding domain; InterPro: IPR003116 This is the Ras-binding domain found in proteins related to Ras. It is found in association with the PE-bind and pkinase domains.; GO: 0005057 receptor signaling protein activity, 0007165 signal transduction; PDB: 1RFA_A 1C1Y_B 3KUD_B 1GUA_B 3KUC_B 2L05_A 3NY5_D 1RRB_A 1WFY_A 1WXM_A.
Probab=71.41 E-value=15 Score=26.76 Aligned_cols=49 Identities=8% Similarity=0.141 Sum_probs=37.4
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE--CCeeeC
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF--QGNALV 51 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~--~Gk~L~ 51 (280)
+.|..++|+...+.|.|..|+.+.-+.+-+.-|+.++.-.++. ..++|.
T Consensus 3 ~~v~LP~~q~t~V~vrpg~ti~d~L~~~~~kr~L~~~~~~V~~~~~~k~l~ 53 (71)
T PF02196_consen 3 CRVHLPNGQRTVVQVRPGMTIRDALSKACKKRGLNPECCDVRLVGEKKPLD 53 (71)
T ss_dssp EEEEETTTEEEEEEE-TTSBHHHHHHHHHHTTT--CCCEEEEEEEEEEEE-
T ss_pred EEEECCCCCEEEEEEcCCCCHHHHHHHHHHHcCCCHHHEEEEEcCCCcccc
Confidence 4466789999999999999999999999999999998777654 345554
No 188
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=70.84 E-value=28 Score=34.52 Aligned_cols=28 Identities=7% Similarity=0.227 Sum_probs=25.2
Q ss_pred ccccchhhhhcceEEEEecccEEEeeee
Q psy8018 233 DMLLGLDMLKRHQVQIAIENDFLTTSFS 260 (280)
Q Consensus 233 d~ILG~D~L~~~~~~ID~~~~~L~i~~~ 260 (280)
..|||-.|||.+-.+.|+.++++-+...
T Consensus 421 ~~ILGd~FLr~~Y~VFD~~n~rIGfA~a 448 (450)
T PTZ00013 421 TFILGDPFMRKYFTVFDYDKESVGFAIA 448 (450)
T ss_pred CEEECHHHhccEEEEEECCCCEEEEEEe
Confidence 4799999999999999999999988654
No 189
>cd06408 PB1_NoxR The PB1 domain is present in the Epichloe festucae NoxR protein (NADPH oxidase regulator), a key regulator of NADPH oxidase isoform, NoxA. NoxA is essential for growth control of the fungal endophyte in plant tissue in the process of symbiotic interaction between a fungi and its plant host. The Epichloe festucae p67(phox)-like regulator, NoxR, dispensable in culture but essential in plants for the symbiotic interaction. Plants infected with a noxR deletion mutant show severe stunting and premature senescence, whereas hyphae in the meristematic tissues show increased branching leading to increased fungal colonization of pseudostem and leaf blade tissue. The PB1 domain is a modular domain mediating specific protein-protein interactions which a play role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is
Probab=70.64 E-value=20 Score=27.55 Aligned_cols=36 Identities=14% Similarity=0.241 Sum_probs=31.4
Q ss_pred EEEEEcC-CCcEEEEEeCCCccHHHHHHHHHHHhCCC
Q psy8018 2 KVTVTSL-DNECFLLDVSEDLELENFKAFCEVQSGIS 37 (280)
Q Consensus 2 ~ItV~~~-~g~~~~leV~~~~tV~~LK~~Ie~e~gip 37 (280)
+|+||.. +|.+..+-|+++.+..+|.+.|...+++.
T Consensus 2 ~ikVKv~~~~Dv~~i~v~~~i~f~dL~~kIrdkf~~~ 38 (86)
T cd06408 2 KIRVKVHAQDDTRYIMIGPDTGFADFEDKIRDKFGFK 38 (86)
T ss_pred cEEEEEEecCcEEEEEcCCCCCHHHHHHHHHHHhCCC
Confidence 4666654 78899999999999999999999999984
No 190
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=69.66 E-value=51 Score=30.45 Aligned_cols=36 Identities=17% Similarity=0.385 Sum_probs=28.5
Q ss_pred eeEEEEEEEEcCEE--EEEEEcCCCcce---------ecCHHHHHHc
Q psy8018 147 VIMLYINCRVNGHP--VKAFIDSGAQTT---------IMSAACAARV 182 (280)
Q Consensus 147 ~~~l~i~~~InG~~--v~alVDTGAq~t---------iis~~~A~rl 182 (280)
.....+.+.++|+. ++|++|||-+.. +++.+.++++
T Consensus 155 ~~~~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~ 201 (293)
T PF03419_consen 155 DYLYPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL 201 (293)
T ss_pred cEEEEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence 35666888999995 689999998754 7788888877
No 191
>smart00314 RA Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.)
Probab=68.96 E-value=38 Score=25.18 Aligned_cols=51 Identities=16% Similarity=0.152 Sum_probs=38.5
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCC--cceEEE--E-CC--eeeCCCcCCccc
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISA--QEIAIE--F-QG--NALVDNKKTLKD 59 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~--~~Q~Li--~-~G--k~L~d~~~tL~~ 59 (280)
++...++.|++++|..++-..+..+++++. ..-.|+ + +| +.|.+++..|.-
T Consensus 14 ~~~~kti~v~~~tTa~~Vi~~~l~k~~l~~~~~~y~L~e~~~~~~er~L~~~e~Pl~~ 71 (90)
T smart00314 14 GGTYKTLRVSSRTTARDVIQQLLEKFHLTDDPEEYVLVEVLPDGKERVLPDDENPLQL 71 (90)
T ss_pred CCcEEEEEECCCCCHHHHHHHHHHHhCCCCCcccEEEEEEeCCcEEEEeCCCCcceEe
Confidence 477889999999999999999999999975 455554 3 45 567665555443
No 192
>cd01775 CYR1_RA Ubiquitin domain of CYR1 adenylate cyclase. CYR1 is a fungal adenylate cyclase with at least four domains, an N-terminal RA (Ras association) domain, a middle leucine-rich repeat domain, a catalytic domain. The N-terminal RA domain of CYR1 post-translationally modifies a small GTPase called Ras. The Ras-CYR1 pathway has been implicated in the transduction of a glucose-triggered signal to an intracellular environment where a protein phosphorylation cascade is initiated by cyclic AMP.
Probab=68.94 E-value=22 Score=27.86 Aligned_cols=65 Identities=9% Similarity=0.011 Sum_probs=45.7
Q ss_pred cCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC-cceEE-EECC---eeeCCCcCC-------ccccCCCCCCEEEEE
Q psy8018 7 SLDNECFLLDVSEDLELENFKAFCEVQSGISA-QEIAI-EFQG---NALVDNKKT-------LKDYNIQDGDLVLLK 71 (280)
Q Consensus 7 ~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~-~~Q~L-i~~G---k~L~d~~~t-------L~~~gI~~gd~l~l~ 71 (280)
-.++...++.++.+.||.++-..+..++.++. ++-+| +..| |+|...+++ |+..|-++.|-|...
T Consensus 9 r~D~Tf~Tls~~l~tTv~eli~~L~rK~~l~~~~ny~l~l~~~~l~RvL~p~ErPl~IqkrlL~q~GY~~~D~l~~l 85 (97)
T cd01775 9 RSDGTFTTLSCPLNTTVSELIPQLAKKFYLPSGGNYQLSLKKHDLSRVLRPTEKPLLIQKRLLLQVGYEERDRIEDI 85 (97)
T ss_pred ecCCcEEEEEcCCcCcHHHHHHHHHHhhcCCCCCCeEEEEEECCeeeecCCcCCcHHHHHHHHHHcCCCCCCcHHHh
Confidence 45787889999999999999999999998876 34444 3433 455433344 566677766665543
No 193
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=67.94 E-value=17 Score=27.62 Aligned_cols=35 Identities=20% Similarity=0.323 Sum_probs=30.3
Q ss_pred EEEE-cCCCcEEEEEeCC--CccHHHHHHHHHHHhCCC
Q psy8018 3 VTVT-SLDNECFLLDVSE--DLELENFKAFCEVQSGIS 37 (280)
Q Consensus 3 ItV~-~~~g~~~~leV~~--~~tV~~LK~~Ie~e~gip 37 (280)
|+|| +-+|.+..+.++| +.+.++|++.+...++++
T Consensus 1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~ 38 (81)
T cd06396 1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN 38 (81)
T ss_pred CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC
Confidence 4566 3478899999999 779999999999999999
No 194
>cd06411 PB1_p51 The PB1 domain is present in the p51 protein, a homolog of the p67 protein. p51 plays an important role in NADPH oxidase activation during phagosytosis. The PB1 domain is a modular domain mediating specific protein-protein interaction in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain mo
Probab=67.28 E-value=12 Score=28.13 Aligned_cols=35 Identities=11% Similarity=0.179 Sum_probs=32.1
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ 46 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~ 46 (280)
++.+.|.+..+..+|.++|+++...+++.-+|.|.
T Consensus 8 TVai~v~~g~~y~~L~~~ls~kL~l~~~~~~LSY~ 42 (78)
T cd06411 8 TVALRAPRGADVSSLRALLSQALPQQAQRGQLSYR 42 (78)
T ss_pred EEEEEccCCCCHHHHHHHHHHHhcCChhhcEEEec
Confidence 45678999999999999999999999999999994
No 195
>PF08337 Plexin_cytopl: Plexin cytoplasmic RasGAP domain; InterPro: IPR013548 This domain is found at C terminus of various plexins (e.g. P51805 from SWISSPROT). Plexins are receptors for semaphorins, and plexin signalling is important in pathfinding and patterning of both neurons and developing blood vessels [, ]. The cytoplasmic region, which has been called a SEX domain [], and is involved in downstream signalling pathways, by interaction with proteins such as Rac1, RhoD, Rnd1 and other plexins []. ; PDB: 3H6N_A 4E71_A 4E74_A 3IG3_A 2REX_C 2JPH_A 2R2O_A 3HM6_X 3SU8_X 3SUA_E ....
Probab=66.61 E-value=22 Score=36.09 Aligned_cols=65 Identities=25% Similarity=0.317 Sum_probs=44.0
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHH--hCCC------CcceEEEE--C--Ce-eeCCCc------------CCccccCCCC
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQ--SGIS------AQEIAIEF--Q--GN-ALVDNK------------KTLKDYNIQD 64 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e--~gip------~~~Q~Li~--~--Gk-~L~d~~------------~tL~~~gI~~ 64 (280)
+..+++.|-..+||.++|++|-.. .+.| +++.-|-+ + |+ +|.|.+ .||+.|||.|
T Consensus 201 ~~~i~VkVLdCDTItQVKeKiLDavyk~~p~S~rp~~~d~dLEwr~~~~~~~iL~D~D~ts~~~~~wkrLNTL~HY~V~d 280 (539)
T PF08337_consen 201 SEEIPVKVLDCDTITQVKEKILDAVYKNTPYSQRPRADDVDLEWRQGRGGRLILQDEDSTSKVEGGWKRLNTLAHYKVPD 280 (539)
T ss_dssp STCEEEEEETTSBHHHHHHHHHHHHTTTS-GGGS--GGGEEEEEEETTSEEEEESSSSTTSEEETTEEE--BHHHHT--T
T ss_pred CceEEEEEEecCcccHHHHHHHHHHHcCCCCCCCCCccccceeeecCCCCcccccCCCCCcccCCCceEeccHhhcCCCC
Confidence 456889999999999999998665 2443 34445533 2 33 455532 2699999999
Q ss_pred CCEEEEEEec
Q psy8018 65 GDLVLLKRVQ 74 (280)
Q Consensus 65 gd~l~l~~~~ 74 (280)
|+++-|+.++
T Consensus 281 ga~vaLv~k~ 290 (539)
T PF08337_consen 281 GATVALVPKQ 290 (539)
T ss_dssp TEEEEEEES-
T ss_pred CceEEEeecc
Confidence 9999999874
No 196
>PF02597 ThiS: ThiS family; InterPro: IPR003749 ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in Escherichia coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS []. MoaD, a protein involved in sulphur transfer during molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end.; PDB: 1JW9_D 1JWB_D 1JWA_D 3BII_D 1NVI_D 1FMA_D 1FM0_D 2QIE_G 2Q5W_D 2K5P_A ....
Probab=65.64 E-value=28 Score=24.80 Aligned_cols=58 Identities=17% Similarity=0.170 Sum_probs=44.5
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCC--CCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGI--SAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gi--p~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
...+++.+..||.+|.+.+.....- ....-.+..||+...+ .-.+.-+++||.|.+..
T Consensus 13 ~~~~~~~~~~tv~~ll~~l~~~~p~~~~~~~~~v~vN~~~v~~---~~~~~~l~~gD~V~i~p 72 (77)
T PF02597_consen 13 EEEIEVPEGSTVRDLLEALAERYPELALRDRVAVAVNGEIVPD---DGLDTPLKDGDEVAILP 72 (77)
T ss_dssp EEEEEESSTSBHHHHHHHHCHHTGGGHTTTTEEEEETTEEEGG---GTTTSBEETTEEEEEEE
T ss_pred CeEEecCCCCcHHHHHHHHHhhccccccCccEEEEECCEEcCC---ccCCcCcCCCCEEEEEC
Confidence 6678899999999999999877521 2256677889999875 23345689999999876
No 197
>cd01787 GRB7_RA RA (RAS-associated like) domain of Grb7. Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
Probab=63.50 E-value=23 Score=27.10 Aligned_cols=40 Identities=15% Similarity=0.103 Sum_probs=32.1
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCc-ceEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQ-EIAIE 44 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~-~Q~Li 44 (280)
|-..+|.+.+++|++++|+.++-.++..+.....+ .-.|+
T Consensus 7 v~~~Dg~sK~l~V~~~~Ta~dV~~~L~~K~h~~~~~~W~Lv 47 (85)
T cd01787 7 VYSEDGASKSLEVDERMTARDVCQLLVDKNHCQDDSSWTLV 47 (85)
T ss_pred EEecCCCeeEEEEcCCCcHHHHHHHHHHHhCCCCCCCeEEE
Confidence 44678999999999999999999999999877443 44443
No 198
>PF10209 DUF2340: Uncharacterized conserved protein (DUF2340); InterPro: IPR018794 This entry consists of small proteins of approximately 150 amino acids whose function is unknown.
Probab=63.48 E-value=21 Score=29.15 Aligned_cols=57 Identities=19% Similarity=0.317 Sum_probs=39.1
Q ss_pred EeCC-CccHHHHHHHHHHH----hCCCC------cceEEEEC----------------C-eee--CCCcCCccccCCCCC
Q psy8018 16 DVSE-DLELENFKAFCEVQ----SGISA------QEIAIEFQ----------------G-NAL--VDNKKTLKDYNIQDG 65 (280)
Q Consensus 16 eV~~-~~tV~~LK~~Ie~e----~gip~------~~Q~Li~~----------------G-k~L--~d~~~tL~~~gI~~g 65 (280)
.|+. +.|+.+|++.|.++ .|+|| +..++++. . -.| .++++||.++||.+.
T Consensus 21 ~vdL~~~Tv~~l~~~v~~~I~t~~~~~Pfrn~~yDtlKIy~~AHg~Kt~nLvInle~De~~iL~~~~~~~tL~~~gv~nE 100 (122)
T PF10209_consen 21 NVDLKDTTVKDLKEQVKQDIKTRPGLPPFRNVKYDTLKIYTKAHGSKTNNLVINLEDDEDWILDVSDDDKTLKELGVENE 100 (122)
T ss_pred cCCcccCcHHHHHHHHHHHHhcCCCCCCceeeecceEEEEecCCCCCcCCceeeccCCcceeeecCCCCCcHHHcCCCcc
Confidence 5776 89999999988776 45554 44455542 1 345 244588999999888
Q ss_pred CEEEEEE
Q psy8018 66 DLVLLKR 72 (280)
Q Consensus 66 d~l~l~~ 72 (280)
.-|-+..
T Consensus 101 TEiSfF~ 107 (122)
T PF10209_consen 101 TEISFFN 107 (122)
T ss_pred ceeeeeC
Confidence 8776654
No 199
>cd06410 PB1_UP2 Uncharacterized protein 2. The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions.
Probab=63.37 E-value=27 Score=27.26 Aligned_cols=40 Identities=8% Similarity=0.227 Sum_probs=34.5
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF 45 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~ 45 (280)
++..+|++.-+.|+.+.|..+|++.+.+.++++.. ..|-|
T Consensus 17 l~Y~GG~tr~i~V~r~~s~~el~~kl~~~~~~~~~-~~lky 56 (97)
T cd06410 17 LRYVGGETRIVSVDRSISFKELVSKLSELFGAGVV-VTLKY 56 (97)
T ss_pred EEEcCCceEEEEEcCCCCHHHHHHHHHHHhCCCCc-eEEEE
Confidence 34568999999999999999999999999999876 66655
No 200
>PF14541 TAXi_C: Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=61.53 E-value=28 Score=28.85 Aligned_cols=28 Identities=14% Similarity=0.046 Sum_probs=24.4
Q ss_pred CcccccchhhhhcceEEEEecccEEEee
Q psy8018 231 PMDMLLGLDMLKRHQVQIAIENDFLTTS 258 (280)
Q Consensus 231 ~~d~ILG~D~L~~~~~~ID~~~~~L~i~ 258 (280)
.--.+||.-.++.+.+..|+.+++|.|.
T Consensus 133 ~~~~viG~~~~~~~~v~fDl~~~~igF~ 160 (161)
T PF14541_consen 133 DGVSVIGNFQQQNYHVVFDLENGRIGFA 160 (161)
T ss_dssp SSSEEE-HHHCCTEEEEEETTTTEEEEE
T ss_pred CCcEEECHHHhcCcEEEEECCCCEEEEe
Confidence 4559999999999999999999999885
No 201
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=61.19 E-value=20 Score=33.30 Aligned_cols=35 Identities=17% Similarity=0.419 Sum_probs=28.0
Q ss_pred eEEEEEEEEcCEE--EEEEEcCCCc---------ceecCHHHHHHc
Q psy8018 148 IMLYINCRVNGHP--VKAFIDSGAQ---------TTIMSAACAARV 182 (280)
Q Consensus 148 ~~l~i~~~InG~~--v~alVDTGAq---------~tiis~~~A~rl 182 (280)
...-+.+.++|+. ++|++|||-+ +.+++.+.++.+
T Consensus 157 ~~~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~ 202 (288)
T TIGR02854 157 QIYELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSI 202 (288)
T ss_pred eEEEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhh
Confidence 5566888899994 7899999987 457788888777
No 202
>cd01817 RGS12_RBD Ubiquitin domain of RGS12 and RGS14. RGS12 (regulator of G signalling 12), and RGS14, are members of a family of GTPase-activating proteins (GAP's) specific for the G-alpha subunit, which act as key inhibitors of G-protein-mediated cell responses in eukaryotes. Their domain architecture includes tandem RBD domains as well as PDZ , PTB, and RGS, and GoLoco domains.
Probab=59.74 E-value=70 Score=23.79 Aligned_cols=65 Identities=17% Similarity=0.205 Sum_probs=47.3
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC--eeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG--NALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G--k~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
|..+||+...+.+.|.+||.+.-..+-+.-|++++.-.++.-| ++|.=+ .++.+-.|.-|.|..|
T Consensus 4 V~LPdg~~T~V~vrpG~ti~d~L~kllekRgl~~~~~~vf~~g~~k~l~~~----qD~~~L~~~El~vE~r 70 (73)
T cd01817 4 VILPDGSTTVVPTRPGESIRDLLSGLCEKRGINYAAVDLFLVGGDKPLVLD----QDSSVLAGQEVRLEKR 70 (73)
T ss_pred EECCCCCeEEEEecCCCCHHHHHHHHHHHcCCChhHEEEEEecCCcccccC----CccceeeccEEEEEEe
Confidence 4568999999999999999999999999999999888887655 344211 1234444555555443
No 203
>cd01818 TIAM1_RBD Ubiquitin domain of Tiam1 guanine nucleotide exchange factor. Tiam1 (T lymphoma invasion and metastasis 1) a guanine nucleotide exchange factor that activates Rac, is an important regulator of Rho GTPase functions in tumor cells including regulation of cell shape and invasiveness in epithelial cells and fibroblasts. TIAM1 has an RBD (Ras-binding domain) similar to that of Raf kinase as well as PH (pleckstrin homology), PDZ, and RhoGEF domains.
Probab=59.72 E-value=23 Score=26.59 Aligned_cols=41 Identities=17% Similarity=0.188 Sum_probs=36.2
Q ss_pred EEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE
Q psy8018 5 VTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF 45 (280)
Q Consensus 5 V~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~ 45 (280)
|..++|+..++-|.|++|++++-+..-...+..|..--|-.
T Consensus 4 V~lPn~~~~~v~vrp~~tv~dvLe~aCk~~~ldp~eh~Lrl 44 (77)
T cd01818 4 VCLPDNQPVLTYLRPGMSVEDFLESACKRKQLDPMEHYLRL 44 (77)
T ss_pred EECCCCceEEEEECCCCCHHHHHHHHHHhcCCChhHheeEE
Confidence 56689999999999999999999999999999998776643
No 204
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=59.71 E-value=10 Score=30.94 Aligned_cols=57 Identities=18% Similarity=0.309 Sum_probs=39.8
Q ss_pred EEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCcccc--CCC-CCCEEEEEE
Q psy8018 15 LDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDY--NIQ-DGDLVLLKR 72 (280)
Q Consensus 15 leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~--gI~-~gd~l~l~~ 72 (280)
+=|+.++||.+|...|....++++++.-|+.|+ .|...+.|+++. .-+ ++-.|.+..
T Consensus 45 llVP~d~tV~qF~~iIRkrl~l~~~k~flfVnn-~lp~~s~~mg~lYe~~KDeDGFLYi~Y 104 (121)
T PTZ00380 45 LALPRDATVAELEAAVRQALGTSAKKVTLAIEG-STPAVTATVGDIADACKRDDGFLYVSV 104 (121)
T ss_pred EEcCCCCcHHHHHHHHHHHcCCChhHEEEEECC-ccCCccchHHHHHHHhcCCCCeEEEEE
Confidence 369999999999999999999999984444466 444344677654 122 345566654
No 205
>PF02505 MCR_D: Methyl-coenzyme M reductase operon protein D; InterPro: IPR003901 Methyl-coenzyme M reductase (MCR) catalyses the reduction of methyl-coenzyme M (CH3-SCoM) and coenzyme B (HS-CoB) to methane and the corresponding heterosulphide CoM-S-S-CoB (2.8.4.1 from EC), the final step in methane biosynthesis. This reaction proceeds under anaerobic conditions by methanogenic Archaea [], and requires a nickel-porphinoid prosthetic group, coenzyme F430, which is in the EPR-detectable Ni(I) oxidation state in the active enzyme. Studies on a catalytically inactive enzyme aerobically co-crystallized with coenzyme M displayed a fully occupied coenzyme M-binding site with no alternate conformations. The binding of coenzyme M appears to induce specific conformational changes that suggests a molecular mechanism by which the enzyme ensures that methyl-coenzyme M enters the substrate channel prior to coenzyme B, as required by the active-site geometry []. MCR is a hexamer composed of 2 alpha, 2 beta, and 2 gamma subunits with two identical nickel porphinoid active sites, which form two long active site channels with F430 embedded at the bottom [, ]. Genes encoding the beta (mcrB) and gamma (mcrG) subunits of MCR are separated by two open reading frames coding for two proteins C and D [, ]. The function of proteins C and D is unknown. This entry represents protein D.; GO: 0015948 methanogenesis
Probab=58.51 E-value=19 Score=30.45 Aligned_cols=49 Identities=18% Similarity=0.368 Sum_probs=36.4
Q ss_pred CcEEEEEeCC-CccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCC
Q psy8018 10 NECFLLDVSE-DLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDG 65 (280)
Q Consensus 10 g~~~~leV~~-~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~g 65 (280)
|.. .++++. ...++.+++.|++-+-++.+ ++-|+-+... .|+++| +|-|
T Consensus 75 Gri-~lele~~~~~ie~I~~iCee~lpf~y~----i~~G~f~r~~-~TvtDY-~KyG 124 (153)
T PF02505_consen 75 GRI-ILELEDEEDVIEKIREICEEVLPFGYD----IKEGKFIRTK-PTVTDY-AKYG 124 (153)
T ss_pred eEE-EEEecCcHHHHHHHHHHHHHhCCCceE----eeeeEEeccC-Cchhhh-hhcC
Confidence 544 488888 89999999999998755543 2458888864 899998 4444
No 206
>PF06234 TmoB: Toluene-4-monooxygenase system protein B (TmoB); InterPro: IPR009355 This family consists of several Toluene-4-monooxygenase system protein B (TmoB) sequences. Pseudomonas mendocina KR1 metabolises toluene as a carbon source. The initial step of the pathway is hydroxylation of toluene to form p-cresol by a multicomponent toluene-4-monooxygenase (T4MO) system [].; PDB: 3N1Y_C 3RNG_C 3RNA_C 3RN9_C 3RNC_C 3N1X_C 3RNE_C 3RNF_C 3N1Z_C 3N20_C ....
Probab=57.75 E-value=51 Score=25.27 Aligned_cols=59 Identities=17% Similarity=0.219 Sum_probs=41.9
Q ss_pred EEEEeCCCccHHHHHHHHHHHh-CC--CC---cceEEEECC--eeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 13 FLLDVSEDLELENFKAFCEVQS-GI--SA---QEIAIEFQG--NALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e~-gi--p~---~~Q~Li~~G--k~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.-+.|+.++|+.++-+++.... |. ++ .-.++.++| +.+..+ .|+++.||++-+.|-+..
T Consensus 17 ~Lv~VDt~dTmdqVA~k~A~HsVGrRV~~~pg~~lrVr~~g~~~~~p~~-~tVaeagl~P~e~vev~~ 83 (85)
T PF06234_consen 17 QLVPVDTEDTMDQVAAKVAHHSVGRRVAPRPGAPLRVRRQGDTQPFPRS-MTVAEAGLQPMEWVEVRF 83 (85)
T ss_dssp EEEEEETT-BHHHHHHHHHTTTTTTSS---TTSEEEEEETTTSSEE-TT--BGGGHT--TTEEEEEEE
T ss_pred EEEEeCCCCcHHHHHHHHhhhhcceecCCCCCCEEEEEecCCCccCCCc-cEehhcCCCcceEEEEEE
Confidence 3578999999999999998763 32 22 356777899 889865 899999999999988764
No 207
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=57.61 E-value=58 Score=27.83 Aligned_cols=32 Identities=31% Similarity=0.556 Sum_probs=25.4
Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHhhhHHHHHH
Q psy8018 103 RQRMLTADPFDTEAQKLIAEEIKKSNIQANME 134 (280)
Q Consensus 103 ~~~~~~~dp~~~e~q~~i~e~i~q~~i~~~~~ 134 (280)
...++.+||||+++=+-..+...+..++...+
T Consensus 87 m~~Li~Ad~FDeaAvra~~~kma~~~~e~~v~ 118 (162)
T PRK12751 87 MHKLITADKFDEAAVRAQAEKMSQNQIERHVE 118 (162)
T ss_pred HHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 45678899999999999888888887764443
No 208
>PF12436 USP7_ICP0_bdg: ICP0-binding domain of Ubiquitin-specific protease 7; InterPro: IPR024729 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found in proteins of the peptidase C19 family, which contains contains ubiquitinyl hydrolases like ubiquitin-specific protease 7 (USP7). USP7 regulates the turnover of p53 [].; PDB: 2KVR_A 2YLM_A.
Probab=56.65 E-value=16 Score=33.08 Aligned_cols=59 Identities=15% Similarity=0.379 Sum_probs=41.8
Q ss_pred EEEeCCCccHHHHHHHHHHHhCCCCcceEEEEC----Cee--eCCCcCCccccCCCCCCEEEEEEe
Q psy8018 14 LLDVSEDLELENFKAFCEVQSGISAQEIAIEFQ----GNA--LVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 14 ~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~----Gk~--L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.+-|+.+.+|+++-..|.+..|+|++...++|. +++ + +.+.|++.+.+.+||+|...+.
T Consensus 88 h~~v~~~~~v~~l~~~i~~~~g~p~~t~l~lyEEi~~~~ie~i-~~~~t~~~~el~~GdIi~fQ~~ 152 (249)
T PF12436_consen 88 HVYVPKNDKVSELVPLINERAGLPPDTPLLLYEEIKPNMIEPI-DPNQTFEKAELQDGDIICFQRA 152 (249)
T ss_dssp EEEEETT-BGGGTHHHHHHHHT--TT--EEEEEEEETTEEEE---SSSBHHHTT--TTEEEEEEE-
T ss_pred EEEECCCCCHHHHHHHHHHHcCCCCCCceEEEEEeccceeeEc-CCCCchhhcccCCCCEEEEEec
Confidence 468999999999999999999999998888774 332 4 4568999999999999988764
No 209
>cd01782 AF6_RA_repeat1 Ubiquitin domain of AT-6, first repeat. The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
Probab=56.49 E-value=79 Score=25.40 Aligned_cols=57 Identities=14% Similarity=0.137 Sum_probs=40.2
Q ss_pred CEEEEEcCCCcE--EEEEeCCCccHHHHHHHHHHHhCCC-----CcceEEE--E-CC--eeeCCCcCCc
Q psy8018 1 MKVTVTSLDNEC--FLLDVSEDLELENFKAFCEVQSGIS-----AQEIAIE--F-QG--NALVDNKKTL 57 (280)
Q Consensus 1 M~ItV~~~~g~~--~~leV~~~~tV~~LK~~Ie~e~gip-----~~~Q~Li--~-~G--k~L~d~~~tL 57 (280)
|+.++...+++. ..|-|++++|..++.+.+-+++-+. +...-|+ + +| +.|.|++.+|
T Consensus 24 mrf~~qd~~~k~atK~VrVsS~~tt~eVI~~LLeKFk~d~~~~s~p~FALYevh~nGe~RKL~d~E~PL 92 (112)
T cd01782 24 MRFYFQDGGEKVATKCIRVSSTATTRDVIDTLSEKFRPDMRMLSNPTYSLYEVHENGEERRLLDDEKPL 92 (112)
T ss_pred EEEEEEcCCCcEEEEEEEEecCCCHHHHHHHHHHHhcccccccCCcceEEEEEecCCceEEcCCcCCCe
Confidence 667777766654 4689999999999999999988854 3455554 2 55 5676655553
No 210
>PF02824 TGS: TGS domain; InterPro: IPR004095 The TGS domain is present in a number of enzymes, for example, in threonyl-tRNA synthetase (ThrRS), GTPase, and guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase (SpoT) []. The TGS domain is also present at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochaete Borrelia burgdorferi). TGS is a small domain that consists of ~50 amino acid residues and is predicted to possess a predominantly beta-sheet structure. There is no direct information on the functions of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role []. ; PDB: 1WXQ_A 1NYR_B 1NYQ_B 2KMM_A 1WWT_A 1TKE_A 1TKG_A 1TJE_A 1QF6_A 1TKY_A ....
Probab=56.02 E-value=38 Score=23.65 Aligned_cols=59 Identities=17% Similarity=0.144 Sum_probs=39.7
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
|+|.+++|+... ++...|+.++-..|..+.+-.+ -.=..||+...-+ +-+++||+|.+.
T Consensus 1 I~v~lpdG~~~~--~~~g~T~~d~A~~I~~~l~~~~--~~A~Vng~~vdl~------~~L~~~d~v~ii 59 (60)
T PF02824_consen 1 IRVYLPDGSIKE--LPEGSTVLDVAYSIHSSLAKRA--VAAKVNGQLVDLD------HPLEDGDVVEII 59 (60)
T ss_dssp EEEEETTSCEEE--EETTBBHHHHHHHHSHHHHHCE--EEEEETTEEEETT------SBB-SSEEEEEE
T ss_pred CEEECCCCCeee--CCCCCCHHHHHHHHCHHHHhhe--eEEEEcCEECCCC------CCcCCCCEEEEE
Confidence 567778888765 7888899999999987754221 1223588776432 357888888764
No 211
>PF11834 DUF3354: Domain of unknown function (DUF3354); InterPro: IPR021789 Potassium channels take part in important processes of higher plants, including opening and closing of stomatal pores and leaf movement. Inward rectifying potassium (K(+)in) channels play an important role in turgor regulation and ion uptake in higher plants. All of them comprise, from their N-terminal to their C-terminal ends: a short hydrophilic region, a hydrophobic region structurally analogous and partially homologous to the transmembrane domain of voltage-gated animal channels from the Shaker superfamily, a putative cyclic nucleotide-binding domain, and a conserved C-terminal KHA domain. Between these last two regions, some of them (AKT1, AKT2 and SKT1) contain an ankyrin-repeat domain with six repeats homologous to those of human erythrocyte ankyrin. This entry represents the KHA domain which is unique to plant K(+)in channels. The KHA domain contains two high-homology blocks enriched for hydrophobic and acidic residues, respectively. The KHA domain is essential for interaction of plant K(+)in channels. The KHA domain mediates tetramerization and/or stabilisation of the heteromers [, , ].
Probab=54.27 E-value=16 Score=26.81 Aligned_cols=44 Identities=25% Similarity=0.312 Sum_probs=33.2
Q ss_pred ccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 21 LELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 21 ~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
.|+++|....++++|++ ..-.+.-+|-.+.|= . =|.|||.|+++
T Consensus 26 ~SleeLl~ia~~kfg~~-~~~v~~~dgaeIdDI-~-----~IRDgD~L~~~ 69 (69)
T PF11834_consen 26 DSLEELLKIASEKFGFS-ATKVLNEDGAEIDDI-D-----VIRDGDHLYLV 69 (69)
T ss_pred ccHHHHHHHHHHHhCCC-ceEEEcCCCCEEeEE-E-----EEEcCCEEEEC
Confidence 59999999999999997 445555667666542 1 38899998873
No 212
>PF00788 RA: Ras association (RalGDS/AF-6) domain; InterPro: IPR000159 Proteins with this domain are mostly RasGTP effectors and include guanine-nucleotide releasing factor in mammals []. This factor stimulates the dissociation of GDP from the Ras-related RALA and RALB GTPases, which allows GTP binding and activation of the GTPases. It interacts and acts as an effector molecule for R-ras, K-Ras and Rap []. The domain is also present in a number of other proteins among them the sexual differentiation protein in yeast that is essential for mating and meiosis and yeast adenylate cyclase. These proteins contain repeated leucine-rich (LRR) segments.; GO: 0007165 signal transduction; PDB: 3EC8_A 2C5L_D 2BYF_A 2CS4_A 3KH0_A 2B3A_A 1RAX_A 2RGF_A 1WGR_A 1WXA_A ....
Probab=53.13 E-value=57 Score=23.91 Aligned_cols=56 Identities=14% Similarity=0.127 Sum_probs=39.9
Q ss_pred EEEcCCCc----EEEEEeCCCccHHHHHHHHHHHhCC--CCcceEEE-E---C--CeeeCCCcCCccc
Q psy8018 4 TVTSLDNE----CFLLDVSEDLELENFKAFCEVQSGI--SAQEIAIE-F---Q--GNALVDNKKTLKD 59 (280)
Q Consensus 4 tV~~~~g~----~~~leV~~~~tV~~LK~~Ie~e~gi--p~~~Q~Li-~---~--Gk~L~d~~~tL~~ 59 (280)
.|-..++. ..++-|++++|+.++-..+-.++++ ++....|+ + . .+.|.+++..|.-
T Consensus 6 rVy~~~~~~~~~~k~i~v~~~tTa~evi~~~l~k~~l~~~~~~y~L~~~~~~~~~er~L~~~E~pl~i 73 (93)
T PF00788_consen 6 RVYDGDGSPGSTYKTIKVSSSTTAREVIEMALEKFGLAEDPSDYCLVEVEESGGEERPLDDDECPLQI 73 (93)
T ss_dssp EEEETTSSSCCSEEEEEEETTSBHHHHHHHHHHHTTTSSSGGGEEEEEEECTTTEEEEETTTSBHHHH
T ss_pred EEEcCCCCCCccEEEEEECCCCCHHHHHHHHHHHhCCCCCCCCEEEEEEEcCCCEEEEcCCCCchHHH
Confidence 34445555 8899999999999999999999999 45566774 2 2 2457665455433
No 213
>cd06398 PB1_Joka2 The PB1 domain is present in the Nicotiana plumbaginifolia Joka2 protein which interacts with sulfur stress inducible UP9 protein. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module
Probab=51.79 E-value=53 Score=25.28 Aligned_cols=42 Identities=12% Similarity=0.238 Sum_probs=31.3
Q ss_pred EEEc-CCCcEEEEEeC-----CCccHHHHHHHHHHHhCCCC-cceEEEE
Q psy8018 4 TVTS-LDNECFLLDVS-----EDLELENFKAFCEVQSGISA-QEIAIEF 45 (280)
Q Consensus 4 tV~~-~~g~~~~leV~-----~~~tV~~LK~~Ie~e~gip~-~~Q~Li~ 45 (280)
+||. -+|...-+.++ ++.+.++|+++|...+++|+ ....|.|
T Consensus 2 ~vKv~y~~~~rRf~l~~~~~~~d~~~~~L~~kI~~~f~l~~~~~~~l~Y 50 (91)
T cd06398 2 VVKVKYGGTLRRFTFPVAENQLDLNMDGLREKVEELFSLSPDADLSLTY 50 (91)
T ss_pred EEEEEeCCEEEEEEeccccccCCCCHHHHHHHHHHHhCCCCCCcEEEEE
Confidence 4442 35666666666 58999999999999999998 5556666
No 214
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=50.00 E-value=83 Score=27.09 Aligned_cols=54 Identities=22% Similarity=0.394 Sum_probs=35.4
Q ss_pred hHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCCHHHHHHHHHHHHhhhHHHHHHHh
Q psy8018 83 DDFARILRQQYEEREK----REQLRQRMLTADPFDTEAQKLIAEEIKKSNIQANMEAA 136 (280)
Q Consensus 83 ~~f~~~~~~~~~~~~~----~~~~~~~~~~~dp~~~e~q~~i~e~i~q~~i~~~~~~a 136 (280)
+.++++++....++.. +.++...++.+|+||..+-+-..+.+.+..++..++.+
T Consensus 57 qQmRdLm~~~r~~~~~~~~~er~amh~LI~ad~FDEaavra~a~kma~~~~e~~Vem~ 114 (166)
T PRK10363 57 QQMRDLMQQARHEQPPVNVSEMETMHRLVTAENFDENAVRAQAEKMAQEQVARQVEMA 114 (166)
T ss_pred HHHHHHHHHHHhcccccCHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566666554332111 12223467789999999999999999888888666644
No 215
>cd01776 Rin1_RA Ubiquitin domain of RIN1 RAS effector. Rin1_RA RIN1 is a RAS effector that binds with specificity and high affinity to activated RAS via its carboxy-terminal RA (RAS-associated) domain. RIN1 competes directly with RAF1 for RAS binding and is thought to divert signaling away from RAF and the MAPK pathway while also shunting RAS signals through alternate pathways. In addition, Rin1 and Rin2 are Rab5-binding proteins, binding preferentially to the GTP-bound form, that enhance the GDP-GTP exchange reaction on Rab5 that regulate the docking and fusion processes of endocytic vesicles. In addition to the RA domain, RIN1 and RIN2 have an SH2 (Src homology 2) domain, a proline-rich SH3 domain, and a Vps9 domain.
Probab=49.92 E-value=38 Score=25.91 Aligned_cols=44 Identities=14% Similarity=0.178 Sum_probs=33.9
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCC-CcceEEEE--CCe--eeCCC
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGIS-AQEIAIEF--QGN--ALVDN 53 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip-~~~Q~Li~--~Gk--~L~d~ 53 (280)
-...||-|.|++|.++|-.+|+.++.+. |+.-.|++ +|. .|.||
T Consensus 13 ct~KTL~V~P~~tt~~vc~lcA~Kf~V~qPe~y~LFl~vdg~~~qLadd 61 (87)
T cd01776 13 CTGKTLLVRPYITTEDVCQLCAEKFKVTQPEEYSLFLFVEETWQQLAPD 61 (87)
T ss_pred ceeeeeecCCCCcHHHHHHHHHHHhccCChhheeEEEEECCcEEEcCcc
Confidence 3456999999999999999999999984 66666653 443 46554
No 216
>TIGR03260 met_CoM_red_D methyl-coenzyme M reductase operon protein D. Members of this protein family are protein D, a non-structural protein, of the operon for methyl coenzyme M reductase, also called coenzyme-B sulfoethylthiotransferase (EC 2.8.4.1). That enzyme, with alpha, beta, and gamma subunits, catalyzes the last step in methanogenesis; it has several modified sites, so accessory proteins are expected. Several methanogens have encode two such enzymes, designated I and II; this model does not separate the isozymes. Proteins in this family are expressed at much lower levels than the methyl-coenzyme M reductase itself and associate and have been shown to form at least transient associations. The precise function is unknown.
Probab=49.67 E-value=32 Score=29.05 Aligned_cols=45 Identities=18% Similarity=0.314 Sum_probs=34.1
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCcccc
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDY 60 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~ 60 (280)
|..+ +++.....++.+++.|++-+-++.+ +.-|+-+... .|+++|
T Consensus 74 GrI~-le~~~~~~i~~I~eiC~e~~pF~y~----i~~g~f~r~~-~TvtDY 118 (150)
T TIGR03260 74 GRII-LELEDEDIVEEIEEICKEMLPFGYE----VRVGKFLRTK-PTVTDY 118 (150)
T ss_pred eEEE-EEecCHHHHHHHHHHHHhhCCCceE----eeeeeEeecC-Cchhhh
Confidence 5444 7888999999999999998766543 3457777764 799888
No 217
>PF12436 USP7_ICP0_bdg: ICP0-binding domain of Ubiquitin-specific protease 7; InterPro: IPR024729 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found in proteins of the peptidase C19 family, which contains contains ubiquitinyl hydrolases like ubiquitin-specific protease 7 (USP7). USP7 regulates the turnover of p53 [].; PDB: 2KVR_A 2YLM_A.
Probab=48.97 E-value=37 Score=30.81 Aligned_cols=43 Identities=9% Similarity=0.166 Sum_probs=31.6
Q ss_pred EEEEEc---CCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEE
Q psy8018 2 KVTVTS---LDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIE 44 (280)
Q Consensus 2 ~ItV~~---~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li 44 (280)
.|+++. .++..|+++++..+|-.+|-+.|.+..|++|+..+|.
T Consensus 178 ~V~f~~~~~~~~~~F~l~ls~~~tY~~la~~Va~~l~~dP~~lr~~ 223 (249)
T PF12436_consen 178 EVEFKPKDNPNDPEFTLWLSKKMTYDQLAEKVAEHLNVDPEHLRFF 223 (249)
T ss_dssp EEEEEETTSTT---EEEEEETT--HHHHHHHHHHHHTS-GGGEEEE
T ss_pred EEEEEECCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCChHHEEEE
Confidence 345553 4466899999999999999999999999999999986
No 218
>cd01766 Ufm1 Urm1-like ubiquitin domain. Ufm1 (ubiquitin-fold modifier 1) is a post-translational UBL (ubiquitin-like) modifier with a tertiary structure similar to that of ubiquitin. Ufm1 is initially expressed as a precursor which undergoes C-terminal cleavage to expose a conserved glycine residue that is required for the conjugation reactions involving Ufm1.
Probab=48.71 E-value=96 Score=23.23 Aligned_cols=61 Identities=18% Similarity=0.155 Sum_probs=46.3
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
...+.|+.+.....+-....+++.+|+..--++- +|-=+. ..+|-..+=++.|+-|-|..|
T Consensus 17 fkvlsVpE~aPftAvlkfaAEeFkv~~~TsAiiTndGvGIN-P~qtAGnvflkhgselrliPR 78 (82)
T cd01766 17 FKVLSVPESTPFTAVLKFAAEEFKVPAATSAIITNDGIGIN-PAQTAGNVFLKHGSELRLIPR 78 (82)
T ss_pred ceEEeccccCchHHHHHHHHHhcCCCccceeEEecCccccC-hhhcccceeeecCCEeeeccc
Confidence 3467999998899999999999999987666665 454443 457777777888888887654
No 219
>PRK11130 moaD molybdopterin synthase small subunit; Provisional
Probab=48.24 E-value=1.1e+02 Score=22.50 Aligned_cols=53 Identities=13% Similarity=0.170 Sum_probs=33.0
Q ss_pred EEEeCC-CccHHHHHHHHHHHhC-----CCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 14 LLDVSE-DLELENFKAFCEVQSG-----ISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 14 ~leV~~-~~tV~~LK~~Ie~e~g-----ip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.+++++ ..||.+|++.+..+.. ......+..-|++.... +.-+++||.|-+..
T Consensus 18 ~~~v~~~~~tv~~l~~~L~~~~~~~~~~~~~~~~~~aVN~~~~~~------~~~l~dgDeVai~P 76 (81)
T PRK11130 18 ALELAADFPTVEALRQHLAQKGDRWALALEDGKLLAAVNQTLVSF------DHPLTDGDEVAFFP 76 (81)
T ss_pred eEEecCCCCCHHHHHHHHHHhCccHHhhhcCCCEEEEECCEEcCC------CCCCCCCCEEEEeC
Confidence 355544 4799999999987752 11122234446654332 23599999998865
No 220
>cd01611 GABARAP Ubiquitin domain of GABA-receptor-associated protein. GABARAP (GABA-receptor-associated protein) belongs ot a large family of proteins that mediate intracellular membrane trafficking and/or fusion. GABARAP binds not only to GABA, type A but also to tubulin, gephrin, and ULK1. Orthologues of GABARAP include Gate-16 (golgi-associated ATPase enhancer), LC3 (microtubule-associated protein light chain 3), and ATG8 (autophagy protein 8). ATG8 is a ubiquitin-like protein that is conjugated to the membrane phospholipid, phosphatidylethanolamine as part of a ubiquitin-like conjugation system essential for autophagosome-formation.
Probab=47.13 E-value=27 Score=27.97 Aligned_cols=56 Identities=13% Similarity=0.202 Sum_probs=39.9
Q ss_pred EeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCcccc----CCCCCCEEEEEEe
Q psy8018 16 DVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDY----NIQDGDLVLLKRV 73 (280)
Q Consensus 16 eV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~----gI~~gd~l~l~~~ 73 (280)
-|+.+.||.+|...|....++++++-..+| |+ .+...+.++++. +- ++..|.|...
T Consensus 46 lVp~~~tv~~f~~~irk~l~l~~~~slfl~Vn~-~~p~~~~~~~~lY~~~kd-~DGfLyl~Ys 106 (112)
T cd01611 46 LVPSDLTVGQFVYIIRKRIQLRPEKALFLFVNN-SLPPTSATMSQLYEEHKD-EDGFLYMTYS 106 (112)
T ss_pred EecCCCCHHHHHHHHHHHhCCCccceEEEEECC-ccCCchhHHHHHHHHhCC-CCCEEEEEEe
Confidence 499999999999999999999887644444 55 454445677654 43 3557777653
No 221
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=46.79 E-value=90 Score=23.64 Aligned_cols=27 Identities=33% Similarity=0.426 Sum_probs=19.1
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHHHhh
Q psy8018 101 QLRQRMLTADPFDTEAQKLIAEEIKKS 127 (280)
Q Consensus 101 ~~~~~~~~~dp~~~e~q~~i~e~i~q~ 127 (280)
.+...++.++|+|++.=+.+.+.|.+.
T Consensus 73 ~~l~~ll~~~~~D~~~i~a~~~~~~~~ 99 (125)
T PF13801_consen 73 QELRALLAAPPPDEAAIEALLEEIREA 99 (125)
T ss_dssp HHHHHHHCCSSS-HHHHHHHHHHHHHH
T ss_pred HHHHHHHcCCCCCHHHHHHHHHHHHHH
Confidence 345667789999999877777777544
No 222
>PF03671 Ufm1: Ubiquitin fold modifier 1 protein; InterPro: IPR005375 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin-like molecules (UBLs) can be divided into two subclasses: type-1 UBLs, which ligate to target proteins in a manner similar, but not identical, to the ubiquitylation pathway, such as SUMO, NEDD8, and UCRP/ISG15, and type-2 UBLs (also called UDPs, ubiquitin-domain proteins), which contain ubiquitin-like structure embedded in a variety of different classes of large proteins with apparently distinct functions, such as Rad23, Elongin B, Scythe, Parkin, and HOIL-1. This entry represents Ufm1 (ubiquitin-fold modifier), which is a ubiquitin-like protein with structural similarities to ubiquitin [, ]. Ufm1 is one of a number of ubiquitin-like modifiers that conjugate to target proteins in cells through Uba5 (E1) and Ufc1 (E2). The Ufm1-system is conserved in metazoa and plants, suggesting it has a potential role in multicellular organisms []. Human Ufm1 is synthesized as a precursor consisting of 85 amino-acid residues. Prior to activation by Uba5, the extra amino acids at the C-terminal region of Ufm1 are removed to expose Gly, which is necessary for conjugation to target molecule(s). C-terminal processing of Ufm1 requires two specific cysteine peptidases (IPR012462 from INTERPRO): UfSP1 and UfSP2; both peptidases are also able to release Ufm1 from Ufm1-conjugated cellular proteins. UfSP2 is present in most, if not all, of multi-cellular organisms including plant, nematode, fly, and mammal, whereas UfSP1 is not present in plants and nematodes []. For further information on ubiquitin, please see Protein of the Month [].; PDB: 1J0G_A 1WXS_A 1L7Y_A.
Probab=46.25 E-value=1.2e+02 Score=22.59 Aligned_cols=58 Identities=16% Similarity=0.202 Sum_probs=41.0
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCccccCCCCCCEEEE
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDYNIQDGDLVLL 70 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~gI~~gd~l~l 70 (280)
...+.|+.+.....+-...++++.+|+..-.++- +|--+. ..+|-.+.=++.|+.|-|
T Consensus 17 ~kv~sVPE~apftaVlkfaAeeF~vp~~tsaiItndG~GIn-P~QTag~vflKhGseLrl 75 (76)
T PF03671_consen 17 YKVISVPEEAPFTAVLKFAAEEFKVPPATSAIITNDGVGIN-PQQTAGNVFLKHGSELRL 75 (76)
T ss_dssp EEEEEEETTSBHHHHHHHHHHHTTS-SSSEEEEESSS-EE--TTSBHHHHHHHT-SEEEE
T ss_pred ceEEecCCCCchHHHHHHHHHHcCCCCceEEEEecCCcccc-cchhhhhhHhhcCcEeee
Confidence 3467999999999999999999999998777775 455454 346766666777776654
No 223
>PF06487 SAP18: Sin3 associated polypeptide p18 (SAP18); InterPro: IPR010516 This family consists of several eukaryotic Sin3 associated polypeptide p18 (SAP18) sequences. SAP18 is known to be a component of the Sin3-containing complex, which is responsible for the repression of transcription via the modification of histone polypeptides []. SAP18 is also present in the ASAP complex which is thought to be involved in the regulation of splicing during the execution of programmed cell death [].; PDB: 2HDE_A 4A90_A 4A6Q_A 4A8X_C.
Probab=46.05 E-value=27 Score=28.41 Aligned_cols=59 Identities=17% Similarity=0.200 Sum_probs=36.3
Q ss_pred cEEEEEeCCCccHHHHHHHHHHHhCCCCc-------ceEEEE-----------------CCeeeCCCcCCccccCCCCCC
Q psy8018 11 ECFLLDVSEDLELENFKAFCEVQSGISAQ-------EIAIEF-----------------QGNALVDNKKTLKDYNIQDGD 66 (280)
Q Consensus 11 ~~~~leV~~~~tV~~LK~~Ie~e~gip~~-------~Q~Li~-----------------~Gk~L~d~~~tL~~~gI~~gd 66 (280)
....+-...+.|+.+|-.+|.+. .|+. ..+++| .|+...|+++||++++...||
T Consensus 37 ~elqIYtW~d~TLrEL~~Lik~~--~~~~r~~~tr~~F~~VypD~~~~r~~~kdlGsv~~g~~~~d~~kTL~~~~F~iGD 114 (120)
T PF06487_consen 37 NELQIYTWMDATLRELADLIKDV--NPPARRRGTRLSFRLVYPDTRSGRYVSKDLGSVVSGRKGPDDNKTLADLRFVIGD 114 (120)
T ss_dssp TEEEEEE-TT-BHHHHHHHHHHH---HHHHSTT-EEEEEEEEECTTTTCEEEEEEEEEETTB--TTTTSBCGGGT--TT-
T ss_pred CeeEEEEcccCCHHHHHHHHHHh--CcccCCCCCEEEEEEEeecCCCCceeeecCCeEECCCCCCCcccCHhhCCcccCC
Confidence 45667788999999999998873 1111 123333 344445778999999999999
Q ss_pred EEEEE
Q psy8018 67 LVLLK 71 (280)
Q Consensus 67 ~l~l~ 71 (280)
.|.+.
T Consensus 115 yidva 119 (120)
T PF06487_consen 115 YIDVA 119 (120)
T ss_dssp EEEEE
T ss_pred EEEEe
Confidence 99875
No 224
>PF00276 Ribosomal_L23: Ribosomal protein L23; InterPro: IPR013025 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This domain is found in both eukaryotic L25 and prokaryotic and eukaryotic L23 proteins.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3DF4_T 3DF2_T 4A1E_R 4A1A_R 4A1C_R 4A17_R 2HGU_W 2HGQ_W 1N88_A 2HGJ_W ....
Probab=44.61 E-value=58 Score=24.89 Aligned_cols=42 Identities=12% Similarity=0.256 Sum_probs=35.3
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE-EEECCeeeC
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIA-IEFQGNALV 51 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~-Li~~Gk~L~ 51 (280)
...+++.|++++|=.++|+.|+.-+|+++..-+ +.+.|+.-.
T Consensus 20 ~n~~tF~V~~~atK~~Ik~aie~iy~V~V~~Vnt~~~~gk~kR 62 (91)
T PF00276_consen 20 PNQYTFEVDPRATKTEIKEAIEKIYGVKVKKVNTMNYPGKKKR 62 (91)
T ss_dssp SSEEEEEETTTSTHHHHHHHHHHHHTSEEEEEEEEEETSEEEE
T ss_pred CCEEEEEEeCCCCHHHHHHHHHhhcCCCeeEEEEeEeCCCceE
Confidence 368899999999999999999999999986554 567887543
No 225
>PF02991 Atg8: Autophagy protein Atg8 ubiquitin like; InterPro: IPR004241 Autophagy is generally known as a process involved in the degradation of bulk cytoplasmic components that are non-specifically sequestered into an autophagosome, where they are sequestered into double-membrane vesicles and delivered to the degradative organelle, the lysosome/vacuole, for breakdown and eventual recycling of the resulting macromolecules. The yeast proteins are involved in the autophagosome, and Atg8 binds Atg19, via its N terminus and the C terminus of Atg19. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity [] Related proteins that belong to this group include the human ganglioside expression factor and a symbiosis-related fungal protein.; PDB: 3ECI_A 3D32_B 1GNU_A 1KM7_A 1KLV_A 1KOT_A 3DOW_A 1KJT_A 1V49_A 2ZJD_C ....
Probab=44.26 E-value=38 Score=26.76 Aligned_cols=54 Identities=13% Similarity=0.280 Sum_probs=36.3
Q ss_pred eCCCccHHHHHHHHHHHhCCCCcc-eEEEECCeeeCCCcCCcccc----CCCCCCEEEEEE
Q psy8018 17 VSEDLELENFKAFCEVQSGISAQE-IAIEFQGNALVDNKKTLKDY----NIQDGDLVLLKR 72 (280)
Q Consensus 17 V~~~~tV~~LK~~Ie~e~gip~~~-Q~Li~~Gk~L~d~~~tL~~~----gI~~gd~l~l~~ 72 (280)
|+.+.||.+|...|.....+++++ .-|+.|+ .|...+.|+++. . .++..|.+..
T Consensus 39 vp~~~tv~qf~~~ir~rl~l~~~~alfl~Vn~-~lp~~s~tm~elY~~~k-deDGFLY~~Y 97 (104)
T PF02991_consen 39 VPKDLTVGQFVYIIRKRLQLSPEQALFLFVNN-TLPSTSSTMGELYEKYK-DEDGFLYMTY 97 (104)
T ss_dssp EETTSBHHHHHHHHHHHTT--TTS-EEEEBTT-BESSTTSBHHHHHHHHB--TTSSEEEEE
T ss_pred EcCCCchhhHHHHhhhhhcCCCCceEEEEEcC-cccchhhHHHHHHHHhC-CCCCeEEEEe
Confidence 799999999999999999998864 3344466 565455787764 2 2455566554
No 226
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=42.84 E-value=91 Score=26.76 Aligned_cols=28 Identities=25% Similarity=0.431 Sum_probs=19.8
Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHhhhHH
Q psy8018 103 RQRMLTADPFDTEAQKLIAEEIKKSNIQ 130 (280)
Q Consensus 103 ~~~~~~~dp~~~e~q~~i~e~i~q~~i~ 130 (280)
...++.++|||++.=+.+.+.+.+...+
T Consensus 94 ~~~Ll~a~~FDeaavral~~~~~~~~~e 121 (170)
T PRK12750 94 VQALVLADDFDEAAANDLAKQMVEKQVE 121 (170)
T ss_pred HHHHHhcCCCCHHHHHHHHHHHHHHHHH
Confidence 4567889999999877776655554444
No 227
>PF14533 USP7_C2: Ubiquitin-specific protease C-terminal; PDB: 2YLM_A.
Probab=42.07 E-value=1.2e+02 Score=26.65 Aligned_cols=119 Identities=13% Similarity=0.179 Sum_probs=59.0
Q ss_pred cEEEEEeCCCccHHHHHHHHHHHhCCCCc---ceEE--EECCee---eCCCcCCccccCCCCCCEEEEEEeccCCCCCCh
Q psy8018 11 ECFLLDVSEDLELENFKAFCEVQSGISAQ---EIAI--EFQGNA---LVDNKKTLKDYNIQDGDLVLLKRVQVARTSNPS 82 (280)
Q Consensus 11 ~~~~leV~~~~tV~~LK~~Ie~e~gip~~---~Q~L--i~~Gk~---L~d~~~tL~~~gI~~gd~l~l~~~~~p~~s~~~ 82 (280)
+.+.+-|+.+.||.||...+..+.+++.+ ..+| +++||. +..+ .+|++. .+...+.+-. +|..
T Consensus 34 ~~~~~~vpk~~tV~Dll~~l~~k~~~~~~~~~~lrl~ev~~~ki~~~~~~d-~~i~~l--~~~~~~r~E~--ip~e---- 104 (213)
T PF14533_consen 34 QEYELLVPKTGTVSDLLEELQKKVGFSEEGTGKLRLWEVSNHKIYKILSED-EPISSL--NDYITLRIEE--IPEE---- 104 (213)
T ss_dssp -EEEE--BTT-BHHHHHHHHHTT----TT----EEEEEEETTEEEEEE-TT-SBGGGS----TTEEEEEE----GG----
T ss_pred eEEEEEECCCCCHHHHHHHHHHHcCCCcCCcCcEEEEEeECCEEEeecCCC-Cchhhc--cCcceeeeec--CChH----
Confidence 46788999999999999999999998765 4444 367776 4433 677776 4444443322 1110
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhhHHHHHHHhHhhCCCcceeeeEEEEEE-EEcCEEE
Q psy8018 83 DDFARILRQQYEEREKREQLRQRMLTADPFDTEAQKLIAEEIKKSNIQANMEAAMEYNPETFGSVIMLYINC-RVNGHPV 161 (280)
Q Consensus 83 ~~f~~~~~~~~~~~~~~~~~~~~~~~~dp~~~e~q~~i~e~i~q~~i~~~~~~a~e~~Pe~~~~~~~l~i~~-~InG~~v 161 (280)
+ ++.++.+ +.+ .+..|.+++-.. .-=|.|.
T Consensus 105 --------------------e---~~~~~~~-~~~-------------------------~li~V~hf~k~~~~~hGiPF 135 (213)
T PF14533_consen 105 --------------------E---LNLDDES-EGE-------------------------KLIPVFHFHKDPSRTHGIPF 135 (213)
T ss_dssp --------------------G---SS--TT---TE-------------------------EEEEEEEESSSTT-EEEEEE
T ss_pred --------------------H---hhccccc-ccc-------------------------eEEEEEEEecCccccCCCCE
Confidence 0 0000000 000 122222221110 3457899
Q ss_pred EEEEcCCCcceecCHHHHHHcCCccc
Q psy8018 162 KAFIDSGAQTTIMSAACAARVNITRL 187 (280)
Q Consensus 162 ~alVDTGAq~tiis~~~A~rlgL~~~ 187 (280)
.+.|--|=...=+-..+.+|+|++..
T Consensus 136 ~f~v~~gE~f~~tK~Rl~~rlgv~~k 161 (213)
T PF14533_consen 136 LFVVKPGETFSDTKERLQKRLGVSDK 161 (213)
T ss_dssp EEEEETT--HHHHHHHHHHHH---HH
T ss_pred EEEeeCCCcHHHHHHHHHHHhCCChh
Confidence 99999999888888999999999754
No 228
>KOG3391|consensus
Probab=41.91 E-value=27 Score=29.17 Aligned_cols=55 Identities=22% Similarity=0.294 Sum_probs=38.0
Q ss_pred CCCccHHHHHHHHHHH------hCCCCcceEEE-----------------ECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 18 SEDLELENFKAFCEVQ------SGISAQEIAIE-----------------FQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 18 ~~~~tV~~LK~~Ie~e------~gip~~~Q~Li-----------------~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.-++|+.+|-.+|.+- .|..- +...+ ..|+...||++||++++++-||.|.+...
T Consensus 59 W~datL~ELtsLvkevnpeaR~kgt~f-~fa~Vf~d~~~~~y~~RevG~t~~g~Kg~ddnktL~~~kf~iGD~lDVaI~ 136 (151)
T KOG3391|consen 59 WMDATLRELTSLVKEVNPEARKKGTSF-DFAVVFPDKKSPRYIVREVGTTCLGRKGIDDNKTLQQTKFEIGDYLDVAIT 136 (151)
T ss_pred hhhhhHHHHHHHHHHcCHHHhccCceE-EEEEEeccCCCCCceeeeecccccCcccCCccchhhhCCccccceEEEEec
Confidence 3468899998888762 12111 11222 23566667789999999999999999876
No 229
>KOG2689|consensus
Probab=41.83 E-value=78 Score=29.49 Aligned_cols=70 Identities=11% Similarity=0.242 Sum_probs=53.4
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCee---e--CCCcCCccccCCCCCCEEEEEE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNA---L--VDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~---L--~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
.|-|+.++|++.+.+..+..++.++...|+...|....--. ++.|-+ + .|..++|+.+|+-+-.+|.+..
T Consensus 212 rlQiRl~DG~Tl~~tF~a~E~L~~VR~wVd~n~~~~~~P~~-f~t~fPR~tf~edD~~KpLq~L~L~Psa~lil~~ 286 (290)
T KOG2689|consen 212 RLQIRLPDGQTLTQTFNARETLAAVRLWVDLNRGDGLDPYS-FHTGFPRVTFTEDDELKPLQELDLVPSAVLILEP 286 (290)
T ss_pred EEEEEcCCCCeeeeecCchhhHHHHHHHHHHhccCCCCCee-eecCCCceecccccccccHHHhccccchheeccc
Confidence 36688899999999999999999999999999877552222 233322 3 2445799999999998887743
No 230
>TIGR00691 spoT_relA (p)ppGpp synthetase, RelA/SpoT family. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species.
Probab=40.81 E-value=4.1e+02 Score=27.92 Aligned_cols=62 Identities=11% Similarity=0.195 Sum_probs=46.4
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
.|+|=|+.|+.+ .++...|+.||--.|..+.|.-...- .-||+...-+ +-+++||+|.+...
T Consensus 361 ~i~vfTPkG~~~--~lp~gst~~DfAy~ih~~~g~~~~~a--~vng~~v~l~------~~l~~gd~vei~t~ 422 (683)
T TIGR00691 361 EIYVFTPKGDVV--ELPSGSTPVDFAYAVHTDVGNKCTGA--KVNGKIVPLD------KELENGDVVEIITG 422 (683)
T ss_pred ceEEECCCCeEE--EcCCCCCHHHHHHHHhHHhHhceeEE--EECCEECCCC------ccCCCCCEEEEEeC
Confidence 367778888765 67899999999999988877544222 2588876533 56999999999875
No 231
>KOG2561|consensus
Probab=40.81 E-value=23 Score=35.22 Aligned_cols=56 Identities=16% Similarity=0.222 Sum_probs=46.7
Q ss_pred EeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 16 DVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 16 eV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
+.+-+.|=.++.+.|...+||+-+.-+.+-+||+|.-. +||.+-|++.+--+.+..
T Consensus 55 k~sL~i~Gselqa~iakklgi~enhvKci~~~Kils~~-ktlaeQglk~nq~~mv~~ 110 (568)
T KOG2561|consen 55 KCSLHITGSELQALIAKKLGIKENHVKCIINGKILSCR-KTLAEQGLKINQELMVAV 110 (568)
T ss_pred hcccccccHHHHHHHHHHcCCchhhhheeeccceeecc-cchhhhhhhhhhHHHHHh
Confidence 44445666789999999999999988999999999876 899999999887666544
No 232
>KOG2507|consensus
Probab=40.61 E-value=41 Score=33.24 Aligned_cols=72 Identities=11% Similarity=0.187 Sum_probs=60.4
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEE--EECCeee--CCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAI--EFQGNAL--VDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~L--i~~Gk~L--~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+|-|+.++|..|+=+.+.+.-+..++..+...-+|.....-| .|=-|.. .|.++||.++.+-+...|+|..+
T Consensus 316 rLqiRLPdGssfte~Fps~~vL~~vr~yvrq~~~i~~g~f~LatpyPRReft~eDy~KtllEl~L~psaalvvlpk 391 (506)
T KOG2507|consen 316 RLQIRLPDGSSFTEKFPSTSVLRMVRDYVRQNQTIGLGAFDLATPYPRREFTDEDYDKTLLELRLFPSAALVVLPK 391 (506)
T ss_pred EEEEecCCccchhhcCCcchHHHHHHHHHHhcccccccceeeccccccccccchhhhhhHHHhccCCcceEEEEec
Confidence 577889999999999999999999999999888887776665 3655665 35568999999999999999887
No 233
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=38.97 E-value=45 Score=26.62 Aligned_cols=36 Identities=61% Similarity=0.916 Sum_probs=27.0
Q ss_pred eEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 245 QVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 245 ~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
.+.+-+....+.+...+.+..++|.++|.|.|++|.
T Consensus 77 ~~~l~i~~~~~~~~~~Vl~~~~~d~ILG~d~L~~~~ 112 (124)
T cd05479 77 LAQVKIGNLFLPCSFTVLEDDDVDFLIGLDMLKRHQ 112 (124)
T ss_pred EEEEEECCEEeeeEEEEECCCCcCEEecHHHHHhCC
Confidence 345555666666666777766899999999999984
No 234
>PF08154 NLE: NLE (NUC135) domain; InterPro: IPR012972 This domain is located N-terminal to WD40 repeats(IPR001680 from INTERPRO). It is found in the microtubule-associated protein Q12024 from SWISSPROT [].
Probab=37.97 E-value=1.4e+02 Score=21.22 Aligned_cols=45 Identities=9% Similarity=0.149 Sum_probs=33.2
Q ss_pred EEEEeCCCccHHHHHHHHHHHh--CCCCcceEEEECCeeeCCCcCCcccc
Q psy8018 13 FLLDVSEDLELENFKAFCEVQS--GISAQEIAIEFQGNALVDNKKTLKDY 60 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e~--gip~~~Q~Li~~Gk~L~d~~~tL~~~ 60 (280)
-+++|+.+.|..+|-++|..-. .-.+-....+.+|..|. .+|+++
T Consensus 18 ~~~~VP~~~t~~~Ls~LvN~LL~~~~~~vpfdF~i~~~~lr---~sL~~~ 64 (65)
T PF08154_consen 18 TPISVPSNITRKELSELVNQLLDDEEEPVPFDFLINGEELR---TSLEEH 64 (65)
T ss_pred CCEEEeCCCCHHHHHHHHHHHhccCCCCCcEEEEECCEEee---chHHHh
Confidence 3689999999999999988776 33444556777998886 356553
No 235
>KOG1339|consensus
Probab=37.79 E-value=1.5e+02 Score=28.48 Aligned_cols=91 Identities=15% Similarity=0.183 Sum_probs=53.9
Q ss_pred eEEEEEEEEcCE--EEEEEEcCCCcceecCHHHHH-HcCC--ccccccc-----------------------------ee
Q psy8018 148 IMLYINCRVNGH--PVKAFIDSGAQTTIMSAACAA-RVNI--TRLIDTR-----------------------------WA 193 (280)
Q Consensus 148 ~~l~i~~~InG~--~v~alVDTGAq~tiis~~~A~-rlgL--~~~~d~r-----------------------------~~ 193 (280)
.-+|+++.|.-= ++.+++|||+...-+.-.... .|.- .+..+.. +
T Consensus 45 ~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c~~~C~~~~~~~f~p~~SSt~~~~~c~~~~c~~~~~~~~~~~~C~y- 123 (398)
T KOG1339|consen 45 GEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPCSSACYSQHNPIFDPSASSTYKSVGCSSPRCKSLPQSCSPNSSCPY- 123 (398)
T ss_pred cccEEEEecCCCCeeeEEEEeCCCCceeeccccccccccccCCCccCccccccccccCCCCccccccccCcccCCcCce-
Confidence 367888888544 588999999999988654333 4532 1101110 1
Q ss_pred eeeecCceeeEEeEEEEEEEEECCe--------EEEeEEEEec---C-CCcccccchhh
Q psy8018 194 GIAKGVGVQRIIGRIHMVQIAIEND--------FLTTSFSVLE---D-QPMDMLLGLDM 240 (280)
Q Consensus 194 g~a~gvG~~~~~g~i~~~~i~IG~~--------~~~~~~~Vl~---~-~~~d~ILG~D~ 240 (280)
.+.+|-|+ ...|.+....+.+++. .|-|...-.. . ...|+||||-|
T Consensus 124 ~i~Ygd~~-~~~G~l~~Dtv~~~~~~~~~~~~~~FGc~~~~~g~~~~~~~~dGIlGLg~ 181 (398)
T KOG1339|consen 124 SIQYGDGS-STSGYLATDTVTFGGTTSLPVPNQTFGCGTNNPGSFGLFAAFDGILGLGR 181 (398)
T ss_pred EEEeCCCC-ceeEEEEEEEEEEccccccccccEEEEeeecCccccccccccceEeecCC
Confidence 12344433 5678888888888872 2322221110 1 35899999875
No 236
>PF09269 DUF1967: Domain of unknown function (DUF1967); InterPro: IPR015349 The Obg family comprises a group of ancient P-loop small G proteins (GTPases) belonging to the TRAFAC (for translation factors) class and can be subdivided into several distinct protein subfamilies []. OBG GTPases have been found in both prokaryotes and eukaryotes []. The structure of the OBG GTPase from Thermus thermophilus has been determined []. This entry represents a C-terminal domain found in certain OBG GTPases. This domain contains a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. It is predominantly found in the bacterial GTP-binding protein Obg, and is functionally uncharacterised. ; GO: 0000166 nucleotide binding; PDB: 1UDX_A.
Probab=37.01 E-value=19 Score=26.14 Aligned_cols=16 Identities=31% Similarity=0.555 Sum_probs=11.0
Q ss_pred CCccccCCCCCCEEEE
Q psy8018 55 KTLKDYNIQDGDLVLL 70 (280)
Q Consensus 55 ~tL~~~gI~~gd~l~l 70 (280)
+.|.+.|+++||+|.+
T Consensus 47 ~~L~~~G~~~GD~V~I 62 (69)
T PF09269_consen 47 KALRKAGAKEGDTVRI 62 (69)
T ss_dssp HHHHTTT--TT-EEEE
T ss_pred HHHHHcCCCCCCEEEE
Confidence 5688899999999976
No 237
>cd01777 SNX27_RA Ubiquitin domain of SNX27 (sorting nexin protein 27). SNX27_RA SNX27 (sorting nexin protein 27) belongs to a large family of endosome-localized proteins related to sorting nexin1 which is implicated in regulating membrane traffic. The domain architecture of SNX27 includes an amino-terminal PDZ domain, a PX (PhoX homologous) domain, and a carboxy-terminal RA (RAS-associated) domain.
Probab=36.66 E-value=79 Score=24.33 Aligned_cols=41 Identities=7% Similarity=0.135 Sum_probs=36.3
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIA 42 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~ 42 (280)
++-|-.++|...++.|..+++.+++-+.+..+.|+|.+-..
T Consensus 3 ~L~V~Lpdg~~i~V~v~~s~~a~~Vleav~~kl~L~~e~~~ 43 (87)
T cd01777 3 ELRIALPDKATVTVRVRKNATTDQVYQALVAKAGMDSYTQN 43 (87)
T ss_pred EEEEEccCCCEEEEEEEEcccHHHHHHHHHHHhCCCHHHHh
Confidence 35566789999999999999999999999999999987664
No 238
>PF14543 TAXi_N: Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=36.37 E-value=41 Score=28.18 Aligned_cols=25 Identities=20% Similarity=0.545 Sum_probs=19.1
Q ss_pred EEEEEEEcC--EEEEEEEcCCCcceec
Q psy8018 150 LYINCRVNG--HPVKAFIDSGAQTTIM 174 (280)
Q Consensus 150 l~i~~~InG--~~v~alVDTGAq~tii 174 (280)
+++++.|+- +++.++||||++.+-+
T Consensus 1 Y~~~~~iGtP~~~~~lvvDtgs~l~W~ 27 (164)
T PF14543_consen 1 YYVSVSIGTPPQPFSLVVDTGSDLTWV 27 (164)
T ss_dssp EEEEEECTCTTEEEEEEEETT-SSEEE
T ss_pred CEEEEEeCCCCceEEEEEECCCCceEE
Confidence 367788866 5789999999988775
No 239
>TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence.
Probab=34.65 E-value=25 Score=25.57 Aligned_cols=17 Identities=29% Similarity=0.487 Sum_probs=14.9
Q ss_pred CCccccCCCCCCEEEEE
Q psy8018 55 KTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 55 ~tL~~~gI~~gd~l~l~ 71 (280)
+.|.+.|+++||+|.+-
T Consensus 47 ~~L~~~G~~~GD~V~Ig 63 (69)
T TIGR03595 47 DALRKAGAKDGDTVRIG 63 (69)
T ss_pred HHHHHcCCCCCCEEEEc
Confidence 57999999999999863
No 240
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=34.06 E-value=72 Score=23.85 Aligned_cols=37 Identities=32% Similarity=0.579 Sum_probs=27.6
Q ss_pred cceEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 243 RHQVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 243 ~~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
...+.+.+.........-+.+..|++ |+|.|.|+++.
T Consensus 60 ~~~~~v~~~~~~~~~~~~v~~~~~~~-ILG~D~L~~~~ 96 (100)
T PF00077_consen 60 STTVEVKIGGKEFNHTFLVVPDLPMN-ILGRDFLKKLN 96 (100)
T ss_dssp EEEEEEEETTEEEEEEEEESSTCSSE-EEEHHHHTTTT
T ss_pred EEEEEEEEECccceEEEEecCCCCCC-EeChhHHHHcC
Confidence 34566666777777666666678899 99999999863
No 241
>PRK10872 relA (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional
Probab=34.00 E-value=6.1e+02 Score=27.07 Aligned_cols=62 Identities=16% Similarity=0.221 Sum_probs=48.1
Q ss_pred EEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEEe
Q psy8018 2 KVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKRV 73 (280)
Q Consensus 2 ~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~~ 73 (280)
+|+|=|++|+ .+.++...|+-||--.|..+.|.-..-- ..||+...-+ |-+++||+|.+...
T Consensus 405 ~V~VfTPkG~--~~~Lp~gaT~lDfAy~iHt~iG~~~~gA--kvng~~v~l~------~~L~~GD~VeIits 466 (743)
T PRK10872 405 RVYVFTPKGD--VVDLPAGSTPLDFAYHIHSDVGHRCIGA--KIGGRIVPFT------YQLQMGDQIEIITQ 466 (743)
T ss_pred eEEEECCCCC--eEEcCCCCcHHHHHHHHhHHHHhhceEE--EECCEECCCC------cCCCCCCEEEEEeC
Confidence 4788899998 4678899999999999988877644322 3688876533 57999999999876
No 242
>cd01764 Urm1 Urm1-like ubuitin domain. Urm1 (Ubiquitin-Related Modifier1) The Urm1 fold, like those of two closely related proteins MoaD (molybdopterin synthase) and ThiS (sulfur carrier protein), is similar to that of ubiquitin although there is little or no sequence similarity. The C-terminal glycines of Urm1 are conjugated to an E1-like protein Uba4 as part of a novel conjugation system in yeast. The Urm1 fold is found only in eukaryotes.
Probab=34.00 E-value=1.1e+02 Score=23.53 Aligned_cols=56 Identities=20% Similarity=0.377 Sum_probs=33.3
Q ss_pred EEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CC------eeeCCCc--CCc--cccCCCCCCEEEEEE
Q psy8018 15 LDVSEDLELENFKAFCEVQSGISAQEIAIEF-QG------NALVDNK--KTL--KDYNIQDGDLVLLKR 72 (280)
Q Consensus 15 leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~G------k~L~d~~--~tL--~~~gI~~gd~l~l~~ 72 (280)
+++++..||.+|-+.+.... |...-.|+. +| .+|-++. +.| .++-+++||.|.+..
T Consensus 23 ~~~~~~~tV~dll~~L~~~~--~~~~~~lf~~~g~lr~~i~VlvN~~di~~l~g~~t~L~dgD~v~i~P 89 (94)
T cd01764 23 LDGEKPVTVGDLLDYVASNL--LEERPDLFIEGGSVRPGIIVLINDTDWELLGEEDYILEDGDHVVFIS 89 (94)
T ss_pred ccCCCCCcHHHHHHHHHHhC--chhhhhhEecCCcccCCEEEEECCccccccCCcccCCCCcCEEEEEC
Confidence 44446779999999988775 333333332 22 1222221 223 356799999999875
No 243
>PRK10455 periplasmic protein; Reviewed
Probab=33.26 E-value=2.5e+02 Score=23.88 Aligned_cols=48 Identities=10% Similarity=0.254 Sum_probs=27.1
Q ss_pred hHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhhHH
Q psy8018 83 DDFARILRQQYEER----EKREQLRQRMLTADPFDTEAQKLIAEEIKKSNIQ 130 (280)
Q Consensus 83 ~~f~~~~~~~~~~~----~~~~~~~~~~~~~dp~~~e~q~~i~e~i~q~~i~ 130 (280)
..++++++...... ....++...++.+||||.+.=+...+.+.+...+
T Consensus 63 qqir~im~~~r~~~~~~~~~~r~~l~~li~ad~FDeaavra~~~k~~~~~~~ 114 (161)
T PRK10455 63 QQIRDIMKAQRDQMKRPPLEERRAMHDIIASDTFDKAKAEAQITKMEAQRKA 114 (161)
T ss_pred HHHHHHHHHHHHhhccccHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHH
Confidence 34556655543321 1122234567789999998766666655555444
No 244
>PRK05738 rplW 50S ribosomal protein L23; Reviewed
Probab=33.04 E-value=73 Score=24.49 Aligned_cols=40 Identities=18% Similarity=0.383 Sum_probs=34.1
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE-EEECCee
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIA-IEFQGNA 49 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~-Li~~Gk~ 49 (280)
...+++.|+++.|=.++|+.+|.-+|+++..-. +.+.|+.
T Consensus 20 ~n~~~F~V~~~a~K~eIK~aie~lf~VkV~~VnT~~~~gk~ 60 (92)
T PRK05738 20 QNKYVFEVAPDATKPEIKAAVEKLFGVKVESVNTLNVKGKT 60 (92)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHcCCceeEEEEEEeCCce
Confidence 578999999999999999999999999986554 5566665
No 245
>cd06397 PB1_UP1 Uncharacterized protein 1. The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions.
Probab=32.90 E-value=1.2e+02 Score=23.06 Aligned_cols=37 Identities=5% Similarity=0.030 Sum_probs=31.7
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEE
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIAIEF 45 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~ 45 (280)
+|.+.-+.++.+-|.+.|+++|+.-+++|+...-+.|
T Consensus 8 ~g~~RRf~~~~~pt~~~L~~kl~~Lf~lp~~~~~vtY 44 (82)
T cd06397 8 LGDTRRIVFPDIPTWEALASKLENLYNLPEIKVGVTY 44 (82)
T ss_pred CCceEEEecCCCccHHHHHHHHHHHhCCChhHeEEEE
Confidence 5667777788889999999999999999998777766
No 246
>cd01778 RASSF1_RA Ubiquitin-like domain of RASSF1 tumour supproessor protein. RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
Probab=32.80 E-value=2.4e+02 Score=22.07 Aligned_cols=51 Identities=14% Similarity=0.085 Sum_probs=37.4
Q ss_pred cCCCcEEEEEeCCCccHHHHHHHHHHHhCCC--CcceEEE-E---CC----eeeCCCcCCc
Q psy8018 7 SLDNECFLLDVSEDLELENFKAFCEVQSGIS--AQEIAIE-F---QG----NALVDNKKTL 57 (280)
Q Consensus 7 ~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip--~~~Q~Li-~---~G----k~L~d~~~tL 57 (280)
.+.+....|-|++++|..++-..+-.++.|- |...-|+ . .| +.|.|++.+|
T Consensus 13 lp~~s~k~v~IsS~tTt~eVI~~LL~KF~v~~nP~kFALYe~~h~~ge~~~rkL~d~E~PL 73 (96)
T cd01778 13 LPKDTAKHLHISSKTTVREVIEALLKKFLVVDNPRKFALFEREHRTGQVYLRKLSDDECPL 73 (96)
T ss_pred ccCCceeEEEEecCCcHHHHHHHHHHhheeccCCcceEEEEEEecCCcEEEEECCCCCCCe
Confidence 4678888999999999999999998888874 3344454 2 45 5577665554
No 247
>PF03931 Skp1_POZ: Skp1 family, tetramerisation domain; InterPro: IPR016073 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a POZ domain with a core structure consisting of beta(2)/alpha(2)/beta(2)/alpha(2) in two layers, alpha/beta. This domain is found at the N-terminal of SKP1 proteins [] as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. ; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 1LM8_C 2XAI_E 1VCB_E 3ZRC_K 3ZRF_E 3DCG_B 2C9W_C 1LQB_B 2IZV_C 1HV2_A ....
Probab=32.18 E-value=34 Score=23.96 Aligned_cols=32 Identities=22% Similarity=0.207 Sum_probs=24.3
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHH
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEV 32 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~ 32 (280)
|.|++.+.+|+.|.++.+.-.--.-|+.+++.
T Consensus 1 ~~v~L~SsDg~~f~V~~~~a~~S~~i~~ml~~ 32 (62)
T PF03931_consen 1 MYVKLVSSDGQEFEVSREAAKQSKTIKNMLED 32 (62)
T ss_dssp -EEEEEETTSEEEEEEHHHHTTSHHHHHHHHC
T ss_pred CEEEEEcCCCCEEEeeHHHHHHhHHHHHHHhh
Confidence 78999999999999987665555666777764
No 248
>PF03658 Ub-RnfH: RnfH family Ubiquitin; InterPro: IPR005346 This is a small family of proteins of unknown function.; PDB: 2HJ1_B.
Probab=31.81 E-value=1.9e+02 Score=22.08 Aligned_cols=66 Identities=20% Similarity=0.255 Sum_probs=36.4
Q ss_pred CEEEEE--cCCCc-EEEEEeCCCccHHHHHH---HHHHHhCCCCcceEEEECCeeeCCCcCCccccCCCCCCEEEEEE
Q psy8018 1 MKVTVT--SLDNE-CFLLDVSEDLELENFKA---FCEVQSGISAQEIAIEFQGNALVDNKKTLKDYNIQDGDLVLLKR 72 (280)
Q Consensus 1 M~ItV~--~~~g~-~~~leV~~~~tV~~LK~---~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~gI~~gd~l~l~~ 72 (280)
|+|+|. .++.+ .++++|++.+||.+=-+ +++.--++..+..++=-=||...-+ .-+++||-|.+-+
T Consensus 1 i~VeV~yA~p~~q~~~~l~vp~GtTv~~Ai~~Sgi~~~~p~idl~~~~vGIfGk~~~~d------~~L~~GDRVEIYR 72 (84)
T PF03658_consen 1 IRVEVAYALPERQVILTLEVPEGTTVAQAIEASGILEQFPEIDLEKNKVGIFGKLVKLD------TVLRDGDRVEIYR 72 (84)
T ss_dssp EEEEEEEEETTCEEEEEEEEETT-BHHHHHHHHTHHHH-TT--TTTSEEEEEE-S--TT-------B--TT-EEEEE-
T ss_pred CEEEEEEECCCeEEEEEEECCCcCcHHHHHHHcCchhhCcccCcccceeeeeeeEcCCC------CcCCCCCEEEEec
Confidence 678886 34433 45899999999987555 3444456777778873336655433 3588999998754
No 249
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=30.63 E-value=67 Score=26.30 Aligned_cols=37 Identities=30% Similarity=0.523 Sum_probs=30.6
Q ss_pred ceEEEEecccEEEeeeeecCCCccceeeccchhhccC
Q psy8018 244 HQVQIAIENDFLTTSFSVLEDQPMDMLLGLDMLKRHQ 280 (280)
Q Consensus 244 ~~~~ID~~~~~L~i~~~~~~~~~~~~~~~~~~~~~~~ 280 (280)
..|.+.+....+....-+++-..+|.++|.|-|++|.
T Consensus 81 ~~~~~~i~g~~~~~dl~vl~l~~~DvILGm~WL~~~~ 117 (135)
T PF08284_consen 81 PDVPLSIQGHEFVVDLLVLDLGGYDVILGMDWLKKHN 117 (135)
T ss_pred eeEEEEECCeEEEeeeEEecccceeeEeccchHHhCC
Confidence 4577777777777777778877899999999999983
No 250
>PF02192 PI3K_p85B: PI3-kinase family, p85-binding domain; InterPro: IPR003113 This is the region of the p110 phosphatidylinositol 3-kinase (PI3-Kinase) that binds the p85 subunit.; GO: 0046934 phosphatidylinositol-4,5-bisphosphate 3-kinase activity, 0007165 signal transduction, 0005942 phosphatidylinositol 3-kinase complex; PDB: 3HIZ_A 3HHM_A 2RD0_A 4A55_A 2Y3A_A 2V1Y_A.
Probab=30.16 E-value=64 Score=24.20 Aligned_cols=21 Identities=29% Similarity=0.180 Sum_probs=17.6
Q ss_pred EEEEeCCCccHHHHHHHHHHH
Q psy8018 13 FLLDVSEDLELENFKAFCEVQ 33 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e 33 (280)
++++|+.+.|+.++|+.+-.+
T Consensus 2 i~l~~~~~~Tl~~iK~~lw~~ 22 (78)
T PF02192_consen 2 IPLRVSRDATLSEIKEELWEE 22 (78)
T ss_dssp EEEEEETT-BHHHHHHHHHHH
T ss_pred eEEEccCcCcHHHHHHHHHHH
Confidence 578999999999999987666
No 251
>PF14847 Ras_bdg_2: Ras-binding domain of Byr2; PDB: 1I35_A 1K8R_B.
Probab=29.98 E-value=2.4e+02 Score=22.29 Aligned_cols=36 Identities=14% Similarity=0.113 Sum_probs=28.6
Q ss_pred EEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC
Q psy8018 3 VTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISA 38 (280)
Q Consensus 3 ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~ 38 (280)
|+|-..+|++.+|.|..-.+-.++|..+=.++|++.
T Consensus 3 i~~I~~dG~tk~VNV~~c~~a~eI~~rvLKKfg~~~ 38 (105)
T PF14847_consen 3 IRFILEDGSTKTVNVSGCFNAQEIKRRVLKKFGLPE 38 (105)
T ss_dssp EEEEETTTEEEEEE--S--HHHHHHHHHHHHHTSS-
T ss_pred EEEECCCCcEEEEEECCCCCHHHHHHHHHHHcCCcc
Confidence 567778999999999999999999999999999987
No 252
>KOG4572|consensus
Probab=28.36 E-value=86 Score=33.68 Aligned_cols=64 Identities=13% Similarity=0.255 Sum_probs=46.7
Q ss_pred CCcEEEEEeCC-CccHHHHHHHHHHHhCCCCcceEEEECCeeeCCCcCCccccC-C-CCCCEEEEEE
Q psy8018 9 DNECFLLDVSE-DLELENFKAFCEVQSGISAQEIAIEFQGNALVDNKKTLKDYN-I-QDGDLVLLKR 72 (280)
Q Consensus 9 ~g~~~~leV~~-~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L~d~~~tL~~~g-I-~~gd~l~l~~ 72 (280)
.|+..+++... ..|+.+||..|+.+.|+....|.++-+|-.-.+.++.|.+|. . .+-.-|++..
T Consensus 3 rGqaltFDleaetqT~adLk~aiqke~~~aIq~~tfl~egGecmaadkrl~e~StaGTdTnPiffFn 69 (1424)
T KOG4572|consen 3 RGQALTFDLEAETQTFADLKDAIQKEVGHAIQDLTFLDEGGECMAADKRLAEISTAGTDTNPIFFFN 69 (1424)
T ss_pred CCceeEEeecceeehHHHHHHHHHHHhchhhceeeeeecCCcCcccccchhhhccccCCCCceEEee
Confidence 47888888875 679999999999999999999998876655444447777775 2 2334455553
No 253
>cd01612 APG12_C Ubiquitin-like domain of APG12. APG12_C The carboxy-terminal ubiquitin-like domain of APG12. Autophagy is a process in which cytoplasmic components are delivered to the lysosome/vacuole for degradation. Autophagy requires a ubiquitin-like protein conjugation system, in which APG12 is covalently bound to APG5.
Probab=27.74 E-value=79 Score=24.07 Aligned_cols=59 Identities=5% Similarity=0.118 Sum_probs=40.0
Q ss_pred EEeCCCccHHHHHHHHHHHhCCCCcceEEEE-CCeeeCCCcCCcccc--CCCCCCEEEEEEe
Q psy8018 15 LDVSEDLELENFKAFCEVQSGISAQEIAIEF-QGNALVDNKKTLKDY--NIQDGDLVLLKRV 73 (280)
Q Consensus 15 leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~-~Gk~L~d~~~tL~~~--gI~~gd~l~l~~~ 73 (280)
+-|+++.|++++...|....++++++-..+| |..-+...+.++++. .-.++..|.+...
T Consensus 20 flv~~~~tv~~~~~~lrk~L~l~~~~slflyvnn~f~p~~d~~~g~LY~~~~~dGfLyi~Ys 81 (87)
T cd01612 20 FKISATQSFQAVIDFLRKRLKLKASDSLFLYINNSFAPSPDENVGNLYRCFGTNGELIVSYC 81 (87)
T ss_pred EEeCCCCCHHHHHHHHHHHhCCCccCeEEEEECCccCCCchhHHHHHHHhcCCCCEEEEEEe
Confidence 4599999999999999999999877644444 543243333565442 1256778877653
No 254
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=27.66 E-value=1e+02 Score=22.99 Aligned_cols=33 Identities=9% Similarity=0.089 Sum_probs=29.4
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIA 42 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~ 42 (280)
...+++.|+++.+=.++|..+|.-+|+.+..-.
T Consensus 14 ~n~y~F~V~~~anK~eIK~avE~lf~VkV~~Vn 46 (77)
T TIGR03636 14 ENKLTFIVDRKATKGDIKRAVEKLFDVKVEKVN 46 (77)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHhCCceEEEE
Confidence 368999999999999999999999999876554
No 255
>PF13856 Gifsy-2: ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=27.62 E-value=64 Score=24.60 Aligned_cols=18 Identities=28% Similarity=0.497 Sum_probs=15.7
Q ss_pred EEEEEcCEEEEEEEcCCC
Q psy8018 152 INCRVNGHPVKAFIDSGA 169 (280)
Q Consensus 152 i~~~InG~~v~alVDTGA 169 (280)
-.++|||+++++++|..-
T Consensus 22 ~~~~InG~~~~~v~d~~~ 39 (95)
T PF13856_consen 22 EEHTINGKPYRAVFDEPE 39 (95)
T ss_dssp EEEEETTEEEEEEEGGGT
T ss_pred CeEEECCEEEEEEECCch
Confidence 568999999999999864
No 256
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=27.02 E-value=1.1e+02 Score=23.27 Aligned_cols=33 Identities=9% Similarity=0.082 Sum_probs=29.7
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQEIA 42 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~ 42 (280)
...+++.|+++.+=.++|..||.-+|+++..-.
T Consensus 21 ~n~y~F~V~~~anK~eIK~AvE~lf~VkV~~Vn 53 (84)
T PRK14548 21 ENKLTFIVDRRATKPDIKRAVEELFDVKVEKVN 53 (84)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHhCCceEEEE
Confidence 468999999999999999999999999886554
No 257
>PF04126 Cyclophil_like: Cyclophilin-like; InterPro: IPR007256 Proteins of this family have no known function.; PDB: 2KA0_A 1ZX8_C 2NNZ_A.
Probab=26.20 E-value=59 Score=26.15 Aligned_cols=29 Identities=17% Similarity=0.240 Sum_probs=24.6
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHH
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFC 30 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~I 30 (280)
|+|.|+. +++.+..++..+.|..+|.+++
T Consensus 1 mkI~i~i-~~~~~~a~L~d~~ta~~~~~~L 29 (120)
T PF04126_consen 1 MKIKITI-GGQEIEAELNDSPTARAFAAQL 29 (120)
T ss_dssp EEEEEEE-TTEEEEEEEETTHHHHHHHHC-
T ss_pred CeEEEEE-CCEEEEEEECCCHHHHHHHHhC
Confidence 8888887 5889999999999999988875
No 258
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=25.24 E-value=83 Score=26.46 Aligned_cols=42 Identities=21% Similarity=0.249 Sum_probs=33.5
Q ss_pred HHHHHhHhhCCCcceeee-EEEEEEEEcCEEEEEEEcCCCcceec
Q psy8018 131 ANMEAAMEYNPETFGSVI-MLYINCRVNGHPVKAFIDSGAQTTIM 174 (280)
Q Consensus 131 ~~~~~a~e~~Pe~~~~~~-~l~i~~~InG~~v~alVDTGAq~tii 174 (280)
++++.+-+ .||.+..++ -+|++.+|.+. =+++||-|+.+.+=
T Consensus 47 ~~lk~~~~-g~E~LVpvGag~fv~~kv~~~-~kviV~iGsg~~ae 89 (145)
T COG1730 47 ENLKGAGE-GKEVLVPVGAGLFVKAKVKDM-DKVIVSIGSGYYAE 89 (145)
T ss_pred HHHHhcCC-CceEEEEcCCCceEEEEeccC-ceEEEEcCCceeee
Confidence 45555555 899888774 78999999887 78999999998874
No 259
>KOG3048|consensus
Probab=25.11 E-value=47 Score=28.00 Aligned_cols=52 Identities=21% Similarity=0.185 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHhhhHHHHHHHhHhhCCCcc---e-eeeEEEEEEEEcCEEEEEEEcCCCcceec
Q psy8018 115 EAQKLIAEEIKKSNIQANMEAAMEYNPETF---G-SVIMLYINCRVNGHPVKAFIDSGAQTTIM 174 (280)
Q Consensus 115 e~q~~i~e~i~q~~i~~~~~~a~e~~Pe~~---~-~~~~l~i~~~InG~~v~alVDTGAq~tii 174 (280)
-+|.+-.|.|.-=| +..-.||.- . -+..+||+.+++. +-+++||-|-.+-|=
T Consensus 41 ~aq~k~~~~~~aln-------~~~~~~eGk~~LVPLTsSlYVPGkl~d-~~k~lVDIGTGYyVE 96 (153)
T KOG3048|consen 41 GAQTKYEESIAALN-------DVQAANEGKKLLVPLTSSLYVPGKLSD-NSKFLVDIGTGYYVE 96 (153)
T ss_pred HHHHHHHHHHHHHh-------hcccCCCCCeEEEecccceeccceecc-ccceeEeccCceEEe
Confidence 45666666654332 233444432 2 3468999999998 889999999877663
No 260
>PF12685 SpoIIIAH: SpoIIIAH-like protein; InterPro: IPR024232 Stage III sporulation protein AH (SpoIIIAH) is a protein that is involved in forespore engulfment. It forms a channel with SpoIIIAH that is open on the forespore end and closed (or gated) on the mother cell end. This allows sigma-E-directed gene expression in the mother-cell compartment of the sporangium to trigger the activation of sigma-G forespore-specific gene expression by a pathway of intercellular signaling. This family of proteins is found in bacteria, archaea and eukaryotes and so must have a wider function than in sporulation. Proteins in this family are typically between 174 and 223 amino acids in length.; PDB: 3UZ0_A 3TUF_A.
Probab=23.93 E-value=4.8e+02 Score=22.56 Aligned_cols=56 Identities=13% Similarity=0.173 Sum_probs=32.7
Q ss_pred HHHHhcCCCC-----HHHHHHHHHHHHhhhHHHHHHHhHh--hCCCcceeeeEEEEEEEEcCEEEEEEEcCCC
Q psy8018 104 QRMLTADPFD-----TEAQKLIAEEIKKSNIQANMEAAME--YNPETFGSVIMLYINCRVNGHPVKAFIDSGA 169 (280)
Q Consensus 104 ~~~~~~dp~~-----~e~q~~i~e~i~q~~i~~~~~~a~e--~~Pe~~~~~~~l~i~~~InG~~v~alVDTGA 169 (280)
..++ +||.. .++|.++.+......-+.++|..+. .++. +=|.+.+-.++++|-+.-
T Consensus 107 ~~ii-~~~~~s~~~k~~A~~~~~~l~~~~~kE~~iE~llkakGf~d---------avv~~~~~~v~VvV~~~~ 169 (196)
T PF12685_consen 107 KEII-NNENASEEEKKEAQDKLLELTEKMEKEMEIENLLKAKGFED---------AVVFIEDDSVDVVVKADK 169 (196)
T ss_dssp HHHH-T-TTS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTS-SE---------EEEE-SSSEEEEEEE-S-
T ss_pred HHHH-hCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc---------eEEEeeCCEEEEEEeCCC
Confidence 3344 67654 3456667777776666667776663 4555 236777778888887764
No 261
>PF14533 USP7_C2: Ubiquitin-specific protease C-terminal; PDB: 2YLM_A.
Probab=23.53 E-value=75 Score=28.08 Aligned_cols=30 Identities=20% Similarity=0.203 Sum_probs=22.9
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHhCCCCc
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQSGISAQ 39 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~gip~~ 39 (280)
|..|.+-|.|+.|..++|..|+..+|+|-.
T Consensus 132 GiPF~f~v~~gE~f~~tK~Rl~~rlgv~~k 161 (213)
T PF14533_consen 132 GIPFLFVVKPGETFSDTKERLQKRLGVSDK 161 (213)
T ss_dssp EEEEEEEEETT--HHHHHHHHHHHH---HH
T ss_pred CCCEEEEeeCCCcHHHHHHHHHHHhCCChh
Confidence 778889999999999999999999999863
No 262
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=23.45 E-value=2.1e+02 Score=26.97 Aligned_cols=19 Identities=21% Similarity=0.275 Sum_probs=15.4
Q ss_pred cCCCCHHHHHHHHHHHHhh
Q psy8018 109 ADPFDTEAQKLIAEEIKKS 127 (280)
Q Consensus 109 ~dp~~~e~q~~i~e~i~q~ 127 (280)
..-+|||.|++++|.=++.
T Consensus 299 ~~~lspeeQrK~eeKe~kk 317 (321)
T PF07946_consen 299 LSKLSPEEQRKYEEKERKK 317 (321)
T ss_pred HhcCCHHHHHHHHHHHHHH
Confidence 5678999999999976554
No 263
>cd01666 TGS_DRG_C TGS_DRG_C: DRG (developmentally regulated GTP-binding protein) represents a family of GTP-binding proteins that includes two members, DRG1 and DRG2. DRG1 and DRG2 have a C-terminal TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) with a predominantly beta-sheet structure. The function of TGS is unknown but its presence in two types of regulatory proteins (the DRG GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
Probab=23.11 E-value=2.4e+02 Score=20.78 Aligned_cols=53 Identities=8% Similarity=-0.049 Sum_probs=33.8
Q ss_pred EEEEeCCCccHHHHHHHHHHHhCCCCcceEE-----EECCeeeCCCcCCccccCCCCCCEEEEE
Q psy8018 13 FLLDVSEDLELENFKAFCEVQSGISAQEIAI-----EFQGNALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~L-----i~~Gk~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
-++.++...||+++-..|..+.+-...--.. -++|+...-+ +-+++||+|.+.
T Consensus 17 ~~liL~~GaTV~D~a~~iH~di~~~f~~A~v~g~s~~~~gq~Vgl~------~~L~d~DvVeI~ 74 (75)
T cd01666 17 EPVILRRGSTVEDVCNKIHKDLVKQFKYALVWGSSVKHSPQRVGLD------HVLEDEDVVQIV 74 (75)
T ss_pred CCEEECCCCCHHHHHHHHHHHHHHhCCeeEEeccCCcCCCeECCCC------CEecCCCEEEEe
Confidence 4678899999999999988653211111111 1567665432 458889988764
No 264
>PF00031 Cystatin: Cystatin domain; InterPro: IPR000010 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The cystatins are cysteine proteinase inhibitors belonging to MEROPS inhibitor family I25, clan IH [, , ]. They mainly inhibit peptidases belonging to peptidase families C1 (papain family) and C13 (legumain family). The cystatin family includes: The Type 1 cystatins, which are intracellular cystatins that are present in the cytosol of many cell types, but can also appear in body fluids at significant concentrations. They are single-chain polypeptides of about 100 residues, which have neither disulphide bonds nor carbohydrate side chains. The Type 2 cystatins, which are mainly extracellular secreted polypeptides synthesised with a 19-28 residue signal peptide. They are broadly distributed and found in most body fluids. The Type 3 cystatins, which are multidomain proteins. The mammalian representatives of this group are the kininogens. There are three different kininogens in mammals: H- (high molecular mass, IPR002395 from INTERPRO) and L- (low molecular mass) kininogen which are found in a number of species, and T-kininogen that is found only in rat. Unclassified cystatins. These are cystatin-like proteins found in a range of organisms: plant phytocystatins, fetuin in mammals, insect cystatins and a puff adder venom cystatin which inhibits metalloproteases of the MEROPS peptidase family M12 (astacin/adamalysin). Also a number of the cystatins-like proteins have been shown to be devoid of inhibitory activity. All true cystatins inhibit cysteine peptidases of the papain family (MEROPS peptidase family C1), and some also inhibit legumain family enzymes (MEROPS peptidase family C13). These peptidases play key roles in physiological processes, such as intracellular protein degradation (cathepsins B, H and L), are pivotal in the remodelling of bone (cathepsin K), and may be important in the control of antigen presentation (cathepsin S, mammalian legumain). Moreover, the activities of such peptidases are increased in pathophysiological conditions, such as cancer metastasis and inflammation. Additionally, such peptidases are essential for several pathogenic parasites and bacteria. Thus in animals cystatins not only have capacity to regulate normal body processes and perhaps cause disease when down-regulated, but in other organisms may also participate in defence against biotic and abiotic stress. ; GO: 0004869 cysteine-type endopeptidase inhibitor activity; PDB: 3L0R_B 2W9P_K 2W9Q_A 3S67_A 3QRD_D 1R4C_G 3GAX_A 1TIJ_B 1G96_A 3NX0_A ....
Probab=22.87 E-value=1.7e+02 Score=21.54 Aligned_cols=23 Identities=35% Similarity=0.341 Sum_probs=18.7
Q ss_pred hcCCCCHHHHHHHHHHHHhhhHH
Q psy8018 108 TADPFDTEAQKLIAEEIKKSNIQ 130 (280)
Q Consensus 108 ~~dp~~~e~q~~i~e~i~q~~i~ 130 (280)
..||.||+.|..+...+.+-|-.
T Consensus 5 ~~~~~dp~v~~~~~~al~~~N~~ 27 (94)
T PF00031_consen 5 PVDPNDPEVQEAAEFALDKFNEQ 27 (94)
T ss_dssp EECTTSHHHHHHHHHHHHHHHHH
T ss_pred cCCCCCHHHHHHHHHHHHHHHHh
Confidence 36899999999999988876543
No 265
>PF07929 PRiA4_ORF3: Plasmid pRiA4b ORF-3-like protein; InterPro: IPR012912 Members of this family are similar to the protein product of ORF-3 (Q44206 from SWISSPROT) found on plasmid pRiA4 in the bacterium Agrobacterium rhizogenes. This plasmid is responsible for tumourigenesis at wound sites of plants infected by this bacterium, but the ORF-3 product does not seem to be involved in the pathogenetic process []. Other proteins found in this family are annotated as being putative TnpR resolvases (Q9LCU7 from SWISSPROT, Q50439 from SWISSPROT), but no further evidence was found to back this. Moreover, another member of this family is described as a probable lexA repressor (Q7UEI4 from SWISSPROT) and in fact carries a LexA DNA binding domain (IPR006199 from INTERPRO), but no references were found to expand on this. ; PDB: 2I1S_A.
Probab=22.76 E-value=2.6e+02 Score=23.59 Aligned_cols=60 Identities=15% Similarity=0.226 Sum_probs=35.0
Q ss_pred EEEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECC------------------eeeCCCcCCccccCCCCCCEEEEE
Q psy8018 12 CFLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQG------------------NALVDNKKTLKDYNIQDGDLVLLK 71 (280)
Q Consensus 12 ~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~G------------------k~L~d~~~tL~~~gI~~gd~l~l~ 71 (280)
...+.|+.+.|+.+|=..|+..+|..-....-++-+ .......-+|.++....|+.+...
T Consensus 19 wRri~Vp~~~tl~~Lh~~Iq~afgw~~~HL~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~~~~~~Y~ 96 (179)
T PF07929_consen 19 WRRIEVPADITLADLHEVIQAAFGWDDDHLYEFFIGGERYGIPDEDGMDFSEGDEIKDASEVKLGELLLEEGDKFTYV 96 (179)
T ss_dssp EEEEEEETT-BHHHHHHHHHHHTT----S-EEEEEE-TTTSSESSS---------EEETTT-BHHHC-BTTC-EEEEE
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCcCCCEeEEEEECCCccccccccccccccCCCcceeeeEEhhhhccCcCCEEEEE
Confidence 457999999999999999999999876544333221 122122356777777888877654
No 266
>COG0089 RplW Ribosomal protein L23 [Translation, ribosomal structure and biogenesis]
Probab=22.63 E-value=1.3e+02 Score=23.47 Aligned_cols=40 Identities=18% Similarity=0.312 Sum_probs=32.7
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE-EEECCe
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIA-IEFQGN 48 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~-Li~~Gk 48 (280)
....+++.|+++.|=.++|+.+|+-+|+-+..-. |+..|+
T Consensus 20 ~~nk~vF~V~~~AtK~~IK~AvE~lF~VkV~kVNTl~~k~k 60 (94)
T COG0089 20 KENKYVFIVDPDATKPEIKAAVEELFGVKVEKVNTLNTKGK 60 (94)
T ss_pred hCCEEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEEEeCCc
Confidence 3578999999999999999999999999776544 445554
No 267
>smart00144 PI3K_rbd PI3-kinase family, Ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding RA domains (unpublished observation).
Probab=22.36 E-value=3.8e+02 Score=20.91 Aligned_cols=62 Identities=13% Similarity=0.150 Sum_probs=39.5
Q ss_pred CcEEEEEeCCCccHHHHHHHHHHHh------CCCCc-ceEEEECCee--eCCCcCCcccc-----CCCCCCEEEEEE
Q psy8018 10 NECFLLDVSEDLELENFKAFCEVQS------GISAQ-EIAIEFQGNA--LVDNKKTLKDY-----NIQDGDLVLLKR 72 (280)
Q Consensus 10 g~~~~leV~~~~tV~~LK~~Ie~e~------gip~~-~Q~Li~~Gk~--L~d~~~tL~~~-----gI~~gd~l~l~~ 72 (280)
...+++.+++++|+.+|.+.+-.+. .-+++ +-.|--.|+. |..+ ..|.+| .++.|..++|+.
T Consensus 28 ~~~~t~~v~~~~~p~~li~~~l~k~~~~~~~~~~~~~dyvLkV~G~~Eyl~~~-~~L~~~~yIr~cl~~~~~~~L~L 103 (108)
T smart00144 28 QQTKTLKVNPNCTPDSVLAQAFTKMLSLHDQVDPTSEDYILKVCGRDEYLLGD-HPLGSFEYIRNCLKNGREPHLVL 103 (108)
T ss_pred ceeEEEEECCCCCHHHHHHHHHHHHHhccccccCCCCcEEEEecCcEEEEeCC-eeeechHHHHHHHhcCCCceEEE
Confidence 5679999999999999999766551 11222 4555556654 4332 566665 255566666654
No 268
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=21.86 E-value=1.2e+02 Score=22.90 Aligned_cols=35 Identities=9% Similarity=0.164 Sum_probs=29.2
Q ss_pred EEEEEEcCEEEEEEEcCCC--cceecCHHHHHHcCCc
Q psy8018 151 YINCRVNGHPVKAFIDSGA--QTTIMSAACAARVNIT 185 (280)
Q Consensus 151 ~i~~~InG~~v~alVDTGA--q~tiis~~~A~rlgL~ 185 (280)
.|+++.||..+++.++..+ +..=+...++++|+|.
T Consensus 2 ~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~ 38 (81)
T cd06396 2 NLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN 38 (81)
T ss_pred EEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC
Confidence 5789999999999999966 4445568899999987
No 269
>KOG3439|consensus
Probab=21.72 E-value=1.8e+02 Score=23.52 Aligned_cols=38 Identities=8% Similarity=0.113 Sum_probs=32.9
Q ss_pred EEEEeCCCccHHHHHHHHHHHhCCCCcceEEEECCeee
Q psy8018 13 FLLDVSEDLELENFKAFCEVQSGISAQEIAIEFQGNAL 50 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~Li~~Gk~L 50 (280)
....|+++.|++.+--.|....++++.+|..+|=..-.
T Consensus 47 ~k~~i~~t~tfa~vi~Flkk~Lkl~as~slflYVN~sF 84 (116)
T KOG3439|consen 47 SKFKINPTQTFAKVILFLKKFLKLQASDSLFLYVNNSF 84 (116)
T ss_pred ceEEeCcchhhHHHHHHHHHHhCCcccCeEEEEEcCcc
Confidence 35689999999999999999999999999988855554
No 270
>CHL00030 rpl23 ribosomal protein L23
Probab=21.72 E-value=1.5e+02 Score=22.97 Aligned_cols=40 Identities=13% Similarity=0.249 Sum_probs=32.5
Q ss_pred CCcEEEEEeCCCccHHHHHHHHHHHhCCCCcceE-EEECCe
Q psy8018 9 DNECFLLDVSEDLELENFKAFCEVQSGISAQEIA-IEFQGN 48 (280)
Q Consensus 9 ~g~~~~leV~~~~tV~~LK~~Ie~e~gip~~~Q~-Li~~Gk 48 (280)
....++++|+++.|=.++|+.||.-+|+.+..-. +...|+
T Consensus 18 e~n~y~F~V~~~anK~eIK~avE~lf~VkV~~VNt~~~~~k 58 (93)
T CHL00030 18 EKNQYTFDVDSGSTKTEIKHWIELFFGVKVIAVNSHRLPRK 58 (93)
T ss_pred HCCEEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEEEcCCC
Confidence 4578999999999999999999999999776543 344554
No 271
>COG2080 CoxS Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]
Probab=21.37 E-value=1.3e+02 Score=25.67 Aligned_cols=47 Identities=23% Similarity=0.325 Sum_probs=30.6
Q ss_pred CEEEEEcCCCcEEEEEeCCCccHHHHHHHHHHHhCCCC----------cceEEEECCeeeC
Q psy8018 1 MKVTVTSLDNECFLLDVSEDLELENFKAFCEVQSGISA----------QEIAIEFQGNALV 51 (280)
Q Consensus 1 M~ItV~~~~g~~~~leV~~~~tV~~LK~~Ie~e~gip~----------~~Q~Li~~Gk~L~ 51 (280)
|.|+++ .||+.+.++++|.+++.++ +.++.+..- -.--++.+|+...
T Consensus 2 ~~i~lt-vNG~~~~~~~~p~~~Ll~~---LRd~l~ltgtk~GC~~g~CGACtVlvDG~~v~ 58 (156)
T COG2080 2 MPITLT-VNGEPVELDVDPRTPLLDV---LRDELGLTGTKKGCGHGQCGACTVLVDGEAVN 58 (156)
T ss_pred CcEEEE-ECCeEEEEEeCCCChHHHH---HHHhcCCCCcCCCCCCccCCceEEEECCeEeh
Confidence 345554 4999999999999998776 333333322 1235667887765
No 272
>PRK15326 type III secretion system needle complex protein PrgI; Provisional
Probab=21.07 E-value=3.7e+02 Score=20.31 Aligned_cols=32 Identities=19% Similarity=0.216 Sum_probs=23.5
Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHhhhHHHHHH
Q psy8018 103 RQRMLTADPFDTEAQKLIAEEIKKSNIQANME 134 (280)
Q Consensus 103 ~~~~~~~dp~~~e~q~~i~e~i~q~~i~~~~~ 134 (280)
-+.-++.+|.||..=+.+...+..=++-.|++
T Consensus 30 Al~~l~~~pdnP~~LA~~Qa~l~eyn~~RNaQ 61 (80)
T PRK15326 30 ALDKLAAKPSDPALLAAYQSKLSEYNLYRNAQ 61 (80)
T ss_pred HHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 34556899999988777777776666666655
No 273
>smart00143 PI3K_p85B PI3-kinase family, p85-binding domain. Region of p110 PI3K that binds the p85 subunit.
Probab=20.72 E-value=1.2e+02 Score=22.91 Aligned_cols=21 Identities=24% Similarity=0.088 Sum_probs=18.4
Q ss_pred EEEEeCCCccHHHHHHHHHHH
Q psy8018 13 FLLDVSEDLELENFKAFCEVQ 33 (280)
Q Consensus 13 ~~leV~~~~tV~~LK~~Ie~e 33 (280)
.+++|+.+.|+.++|+.+-++
T Consensus 2 i~l~v~~~aTl~~IK~~lw~~ 22 (78)
T smart00143 2 VTLRVLREATLSTIKHELFKQ 22 (78)
T ss_pred eeEEccccccHHHHHHHHHHH
Confidence 478999999999999988665
Done!