Query psy15482
Match_columns 302
No_of_seqs 264 out of 1156
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 22:14:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15482.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15482hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0012|consensus 100.0 7.5E-67 1.6E-71 490.0 20.9 287 1-287 1-307 (380)
2 PF09668 Asp_protease: Asparty 100.0 7.9E-41 1.7E-45 277.4 7.1 96 192-287 1-96 (124)
3 cd05480 NRIP_C NRIP_C; putativ 100.0 1.6E-31 3.5E-36 211.6 6.7 70 218-287 1-72 (103)
4 KOG0010|consensus 99.9 2.9E-24 6.2E-29 210.0 10.5 190 1-221 16-253 (493)
5 cd05479 RP_DDI RP_DDI; retrope 99.8 5.9E-20 1.3E-24 152.4 8.2 86 201-286 2-87 (124)
6 cd01807 GDX_N ubiquitin-like d 99.8 4.2E-19 9.1E-24 134.4 9.0 73 1-75 1-73 (74)
7 cd01793 Fubi Fubi ubiquitin-li 99.7 4.2E-18 9.2E-23 128.9 8.5 71 1-75 1-71 (74)
8 cd01797 NIRF_N amino-terminal 99.7 6.9E-18 1.5E-22 129.6 8.8 74 1-76 1-76 (78)
9 cd01805 RAD23_N Ubiquitin-like 99.7 1.9E-17 4.1E-22 125.6 9.6 74 1-76 1-76 (77)
10 TIGR00601 rad23 UV excision re 99.7 7.9E-17 1.7E-21 156.5 15.9 73 1-75 1-76 (378)
11 PTZ00044 ubiquitin; Provisiona 99.7 2.1E-17 4.7E-22 125.1 9.0 73 1-75 1-73 (76)
12 cd01791 Ubl5 UBL5 ubiquitin-li 99.7 1.8E-17 3.8E-22 125.8 7.8 71 1-73 2-72 (73)
13 cd01796 DDI1_N DNA damage indu 99.7 3.6E-17 7.7E-22 123.2 7.6 69 3-72 1-70 (71)
14 cd01798 parkin_N amino-termina 99.7 4.3E-17 9.2E-22 122.0 7.8 70 3-74 1-70 (70)
15 cd01802 AN1_N ubiquitin-like d 99.7 5.1E-17 1.1E-21 131.2 8.7 73 1-75 28-100 (103)
16 cd01810 ISG15_repeat2 ISG15 ub 99.7 5.6E-17 1.2E-21 122.8 8.0 71 3-75 1-71 (74)
17 cd01806 Nedd8 Nebb8-like ubiq 99.7 1.1E-16 2.3E-21 120.6 9.4 73 1-75 1-73 (76)
18 cd01803 Ubiquitin Ubiquitin. U 99.7 1.2E-16 2.5E-21 120.4 8.9 73 1-75 1-73 (76)
19 cd01809 Scythe_N Ubiquitin-lik 99.7 1.4E-16 3E-21 118.8 9.0 72 1-74 1-72 (72)
20 cd01804 midnolin_N Ubiquitin-l 99.7 1.8E-16 4E-21 121.5 8.5 72 1-75 2-73 (78)
21 cd01794 DC_UbP_C dendritic cel 99.7 2.8E-16 6E-21 118.3 7.4 68 4-73 2-69 (70)
22 cd01792 ISG15_repeat1 ISG15 ub 99.7 3.5E-16 7.5E-21 120.3 7.9 73 1-75 3-77 (80)
23 PF00240 ubiquitin: Ubiquitin 99.6 8.2E-16 1.8E-20 114.1 8.8 68 6-75 1-68 (69)
24 cd01808 hPLIC_N Ubiquitin-like 99.6 7.6E-16 1.7E-20 115.6 8.4 71 1-74 1-71 (71)
25 cd01812 BAG1_N Ubiquitin-like 99.6 2E-15 4.3E-20 112.5 8.3 70 1-73 1-70 (71)
26 cd01790 Herp_N Homocysteine-re 99.6 1.8E-15 3.8E-20 116.5 7.1 71 1-73 2-78 (79)
27 KOG0011|consensus 99.6 1.3E-14 2.7E-19 136.5 14.3 74 1-76 1-76 (340)
28 cd01800 SF3a120_C Ubiquitin-li 99.6 4E-15 8.8E-20 113.3 8.0 66 9-75 5-70 (76)
29 cd01813 UBP_N UBP ubiquitin pr 99.6 6.1E-15 1.3E-19 112.1 8.5 70 1-73 1-73 (74)
30 KOG0003|consensus 99.5 1.5E-15 3.3E-20 121.5 0.4 74 1-76 1-74 (128)
31 smart00213 UBQ Ubiquitin homol 99.5 3.6E-14 7.8E-19 102.6 7.5 64 1-67 1-64 (64)
32 KOG0005|consensus 99.5 1.5E-14 3.2E-19 104.3 4.6 70 1-72 1-70 (70)
33 cd01799 Hoil1_N Ubiquitin-like 99.5 6.3E-14 1.4E-18 106.9 7.7 70 2-72 2-73 (75)
34 cd01763 Sumo Small ubiquitin-r 99.4 9E-13 1.9E-17 103.0 9.2 73 1-75 12-84 (87)
35 cd01815 BMSC_UbP_N Ubiquitin-l 99.4 2.3E-13 5E-18 103.6 5.3 57 17-74 15-75 (75)
36 KOG0004|consensus 99.4 1.7E-13 3.7E-18 116.6 4.1 74 1-76 1-74 (156)
37 cd01769 UBL Ubiquitin-like dom 99.3 9.9E-12 2.1E-16 90.9 7.6 67 5-73 2-68 (69)
38 cd01795 USP48_C USP ubiquitin- 99.3 7.2E-12 1.6E-16 99.5 7.0 63 13-75 16-78 (107)
39 PF11976 Rad60-SLD: Ubiquitin- 99.2 5.6E-11 1.2E-15 88.9 7.7 70 1-72 1-71 (72)
40 cd01789 Alp11_N Ubiquitin-like 99.2 1.8E-10 3.9E-15 89.6 9.9 74 2-75 3-82 (84)
41 cd01814 NTGP5 Ubiquitin-like N 99.1 1.1E-10 2.4E-15 95.0 5.6 73 3-77 7-93 (113)
42 TIGR02281 clan_AA_DTGA clan AA 99.0 1.1E-09 2.4E-14 90.7 7.9 77 213-290 9-85 (121)
43 cd05483 retropepsin_like_bacte 98.9 3E-09 6.5E-14 82.1 7.2 71 215-287 2-72 (96)
44 PF14560 Ubiquitin_2: Ubiquiti 98.9 5.4E-09 1.2E-13 81.4 8.6 75 2-76 3-85 (87)
45 cd05484 retropepsin_like_LTR_2 98.8 8.3E-09 1.8E-13 80.6 7.2 66 217-285 2-69 (91)
46 PF13650 Asp_protease_2: Aspar 98.8 8.3E-09 1.8E-13 78.8 6.9 69 218-287 1-69 (90)
47 PF13975 gag-asp_proteas: gag- 98.8 7.9E-09 1.7E-13 77.9 6.4 65 212-276 5-70 (72)
48 PLN02560 enoyl-CoA reductase 98.8 1.1E-08 2.4E-13 97.4 8.4 72 1-74 1-83 (308)
49 KOG4248|consensus 98.8 6.2E-09 1.4E-13 109.8 6.8 73 2-77 4-76 (1143)
50 KOG0001|consensus 98.8 8.1E-08 1.7E-12 69.6 9.3 71 3-75 2-72 (75)
51 cd06095 RP_RTVL_H_like Retrope 98.7 4.8E-08 1E-12 75.8 5.9 64 218-287 1-66 (86)
52 cd01788 ElonginB Ubiquitin-lik 98.6 1.5E-07 3.3E-12 76.7 7.4 74 1-75 1-81 (119)
53 cd01801 Tsc13_N Ubiquitin-like 98.6 1.4E-07 3.1E-12 71.9 6.6 56 16-72 16-75 (77)
54 PF13881 Rad60-SLD_2: Ubiquiti 98.5 9.7E-07 2.1E-11 72.3 9.9 73 3-77 5-91 (111)
55 PF11543 UN_NPL4: Nuclear pore 98.5 2.7E-07 5.8E-12 71.3 6.0 71 1-73 5-79 (80)
56 cd00196 UBQ Ubiquitin-like pro 98.5 8.6E-07 1.9E-11 60.7 7.5 64 9-73 5-68 (69)
57 COG3577 Predicted aspartyl pro 98.2 2.8E-06 6E-11 76.1 7.3 78 214-294 104-183 (215)
58 PF00077 RVP: Retroviral aspar 98.2 2.2E-06 4.8E-11 67.5 4.7 68 215-287 5-73 (100)
59 KOG1872|consensus 98.1 5.8E-06 1.3E-10 81.7 7.7 74 4-78 5-79 (473)
60 PF08284 RVP_2: Retroviral asp 98.0 1.5E-05 3.3E-10 67.3 6.3 69 216-286 22-92 (135)
61 cd05481 retropepsin_like_LTR_1 97.9 2.4E-05 5.1E-10 61.9 5.6 64 219-285 2-71 (93)
62 TIGR03698 clan_AA_DTGF clan AA 97.8 4.3E-05 9.3E-10 61.9 6.5 66 217-286 1-73 (107)
63 KOG0006|consensus 97.8 3.6E-05 7.8E-10 72.9 6.1 72 1-73 1-74 (446)
64 PF08817 YukD: WXG100 protein 97.6 0.00012 2.7E-09 55.9 5.7 69 2-72 4-79 (79)
65 cd00303 retropepsin_like Retro 97.6 0.00021 4.4E-09 51.4 6.5 33 218-250 1-33 (92)
66 KOG4495|consensus 97.5 0.00017 3.7E-09 57.2 4.4 63 1-64 1-65 (110)
67 cd01811 OASL_repeat1 2'-5' oli 97.4 0.0013 2.9E-08 50.0 8.0 72 1-75 1-77 (80)
68 KOG3493|consensus 97.2 0.00012 2.5E-09 54.1 1.2 68 3-72 4-71 (73)
69 COG5417 Uncharacterized small 97.0 0.0042 9.2E-08 47.2 7.2 69 2-71 6-80 (81)
70 KOG1769|consensus 96.9 0.0067 1.5E-07 48.5 8.6 72 2-74 20-92 (99)
71 PF00789 UBX: UBX domain; Int 96.9 0.0068 1.5E-07 46.1 8.3 70 2-72 8-81 (82)
72 PF09280 XPC-binding: XPC-bind 96.9 0.002 4.3E-08 47.0 4.7 46 111-157 4-55 (59)
73 PF10302 DUF2407: DUF2407 ubiq 96.7 0.0045 9.9E-08 49.5 5.8 58 2-61 2-64 (97)
74 KOG1639|consensus 96.6 0.0044 9.5E-08 57.3 6.1 73 1-74 1-79 (297)
75 KOG3206|consensus 96.6 0.0065 1.4E-07 54.7 7.0 75 2-76 3-83 (234)
76 smart00166 UBX Domain present 96.6 0.015 3.2E-07 44.4 7.8 70 2-72 6-79 (80)
77 cd05482 HIV_retropepsin_like R 96.5 0.0043 9.4E-08 48.7 4.7 26 219-244 2-27 (87)
78 PF12384 Peptidase_A2B: Ty3 tr 96.5 0.009 1.9E-07 52.2 6.7 49 216-264 35-85 (177)
79 PF11470 TUG-UBL1: GLUT4 regul 96.4 0.013 2.9E-07 43.5 6.7 62 9-71 4-65 (65)
80 cd01767 UBX UBX (ubiquitin reg 96.4 0.023 4.9E-07 43.0 8.2 69 2-73 4-76 (77)
81 PF05585 DUF1758: Putative pep 96.3 0.0024 5.2E-08 55.0 2.3 29 225-253 11-39 (164)
82 cd01770 p47_UBX p47-like ubiqu 96.1 0.036 7.8E-07 42.5 7.6 67 2-70 6-75 (79)
83 PF14453 ThiS-like: ThiS-like 95.9 0.036 7.7E-07 40.2 6.3 56 1-74 1-56 (57)
84 cd01772 SAKS1_UBX SAKS1-like U 95.8 0.062 1.3E-06 41.1 8.0 69 2-72 6-78 (79)
85 PF13019 Telomere_Sde2: Telome 95.7 0.066 1.4E-06 46.7 8.4 66 1-67 1-73 (162)
86 PRK06437 hypothetical protein; 95.1 0.16 3.5E-06 37.7 7.7 60 4-74 4-63 (67)
87 cd01774 Faf1_like2_UBX Faf1 ik 95.1 0.18 3.9E-06 39.3 8.2 69 2-72 6-83 (85)
88 cd01771 Faf1_UBX Faf1 UBX doma 94.7 0.26 5.7E-06 37.9 8.1 70 2-73 6-79 (80)
89 PRK08364 sulfur carrier protei 94.4 0.28 6.2E-06 36.5 7.6 51 13-73 15-65 (70)
90 cd01773 Faf1_like1_UBX Faf1 ik 94.4 0.37 8E-06 37.5 8.4 71 2-74 7-81 (82)
91 KOG0013|consensus 94.3 0.079 1.7E-06 48.0 5.1 64 10-74 155-218 (231)
92 PRK06488 sulfur carrier protei 94.1 0.33 7.1E-06 35.4 7.3 60 1-73 1-60 (65)
93 PRK05863 sulfur carrier protei 94.0 0.25 5.4E-06 36.3 6.6 61 1-74 1-61 (65)
94 PF15044 CLU_N: Mitochondrial 93.9 0.12 2.6E-06 39.4 4.8 56 18-74 1-58 (76)
95 COG5550 Predicted aspartyl pro 93.4 0.29 6.2E-06 40.9 6.6 52 230-285 30-82 (125)
96 PRK05659 sulfur carrier protei 93.0 0.57 1.2E-05 34.0 7.0 62 1-74 1-62 (66)
97 COG5227 SMT3 Ubiquitin-like pr 92.5 0.51 1.1E-05 37.4 6.4 63 9-72 32-94 (103)
98 PRK08053 sulfur carrier protei 92.1 1.1 2.4E-05 32.8 7.5 62 1-74 1-62 (66)
99 PRK07696 sulfur carrier protei 91.9 0.97 2.1E-05 33.4 7.1 62 1-74 1-63 (67)
100 PRK07440 hypothetical protein; 91.7 0.9 1.9E-05 34.0 6.8 57 10-74 10-66 (70)
101 PF14836 Ubiquitin_3: Ubiquiti 91.3 1.1 2.5E-05 35.2 7.2 66 10-76 12-82 (88)
102 cd00565 ThiS ThiaminS ubiquiti 90.9 0.82 1.8E-05 33.3 5.8 57 10-74 5-61 (65)
103 PRK06944 sulfur carrier protei 90.7 1.9 4.1E-05 31.1 7.6 60 1-73 1-60 (65)
104 PRK06083 sulfur carrier protei 90.7 1.1 2.4E-05 34.8 6.6 57 10-74 24-80 (84)
105 TIGR02958 sec_mycoba_snm4 secr 90.3 1.7 3.7E-05 43.8 9.3 72 2-76 4-82 (452)
106 cd00754 MoaD Ubiquitin domain 90.0 1.5 3.2E-05 32.7 6.7 55 13-73 17-75 (80)
107 PLN02799 Molybdopterin synthas 89.9 1.3 2.8E-05 33.6 6.4 67 1-73 2-77 (82)
108 TIGR01683 thiS thiamine biosyn 89.9 1.2 2.7E-05 32.3 6.0 57 10-74 4-60 (64)
109 smart00666 PB1 PB1 domain. Pho 89.5 1.3 2.8E-05 33.2 6.1 45 3-47 2-46 (81)
110 cd06406 PB1_P67 A PB1 domain i 89.1 1.5 3.2E-05 34.0 6.1 36 13-48 12-47 (80)
111 COG2104 ThiS Sulfur transfer p 88.1 4.5 9.9E-05 30.2 8.0 64 1-74 1-64 (68)
112 PF09379 FERM_N: FERM N-termin 87.3 4.8 0.0001 29.9 7.9 66 5-72 1-75 (80)
113 KOG4583|consensus 87.2 0.3 6.5E-06 47.2 1.4 58 14-72 24-85 (391)
114 PF11620 GABP-alpha: GA-bindin 86.6 2 4.4E-05 33.6 5.5 63 12-75 3-65 (88)
115 cd06094 RP_Saci_like RP_Saci_l 85.9 0.75 1.6E-05 36.3 2.8 52 226-284 9-62 (89)
116 cd06407 PB1_NLP A PB1 domain i 85.9 4 8.6E-05 31.6 6.8 44 1-46 1-45 (82)
117 smart00727 STI1 Heat shock cha 85.2 1.1 2.3E-05 29.6 3.0 20 122-141 20-39 (41)
118 PF00564 PB1: PB1 domain; Int 84.9 4 8.7E-05 30.5 6.5 45 3-47 2-47 (84)
119 PRK01777 hypothetical protein; 84.9 7.3 0.00016 31.0 8.1 63 1-73 4-75 (95)
120 KOG4250|consensus 82.9 2.2 4.8E-05 45.1 5.5 69 3-72 316-385 (732)
121 PRK11840 bifunctional sulfur c 82.9 4.2 9.1E-05 39.4 7.1 64 1-76 1-64 (326)
122 cd06411 PB1_p51 The PB1 domain 82.8 3.1 6.7E-05 32.1 4.9 42 3-47 1-42 (78)
123 cd06409 PB1_MUG70 The MUG70 pr 82.5 8.7 0.00019 30.1 7.5 42 4-46 4-48 (86)
124 KOG2982|consensus 81.9 3.2 7E-05 40.2 5.8 59 15-73 351-416 (418)
125 PF10790 DUF2604: Protein of U 80.9 6.8 0.00015 29.2 6.0 66 10-76 4-73 (76)
126 TIGR01682 moaD molybdopterin c 80.6 11 0.00023 28.3 7.3 55 13-73 17-75 (80)
127 cd05992 PB1 The PB1 domain is 80.3 4.3 9.4E-05 30.1 5.0 44 4-47 2-46 (81)
128 PF11834 DUF3354: Domain of un 79.4 4.8 0.0001 30.2 4.9 44 22-72 26-69 (69)
129 TIGR01687 moaD_arch MoaD famil 79.3 15 0.00033 27.9 7.9 57 12-73 16-83 (88)
130 cd01760 RBD Ubiquitin-like dom 78.7 12 0.00025 28.3 6.8 45 3-48 2-46 (72)
131 PF12382 Peptidase_A2E: Retrot 78.6 2.9 6.3E-05 33.9 3.7 55 227-284 48-104 (137)
132 cd06408 PB1_NoxR The PB1 domai 75.2 13 0.00028 29.2 6.4 44 2-46 2-45 (86)
133 cd01768 RA RA (Ras-associating 74.6 31 0.00066 26.0 8.4 49 11-59 12-68 (87)
134 smart00295 B41 Band 4.1 homolo 74.5 25 0.00055 30.3 9.0 72 2-75 5-84 (207)
135 KOG2086|consensus 73.6 7.4 0.00016 38.4 5.7 65 3-68 308-375 (380)
136 cd06396 PB1_NBR1 The PB1 domai 73.3 11 0.00023 29.3 5.4 46 3-54 1-48 (81)
137 TIGR02854 spore_II_GA sigma-E 70.7 10 0.00022 35.9 5.9 35 215-249 158-203 (288)
138 PF08825 E2_bind: E2 binding d 70.5 6.6 0.00014 30.5 3.8 58 16-74 1-71 (84)
139 smart00455 RBD Raf-like Ras-bi 69.5 15 0.00033 27.3 5.5 47 4-51 3-51 (70)
140 PF12754 Blt1: Cell-cycle cont 69.5 1.5 3.3E-05 42.0 0.0 43 21-64 103-160 (309)
141 PF14451 Ub-Mut7C: Mut7-C ubiq 69.0 22 0.00047 27.5 6.4 55 11-75 22-77 (81)
142 PF02597 ThiS: ThiS family; I 68.8 17 0.00038 26.5 5.7 58 13-73 13-72 (77)
143 PF08337 Plexin_cytopl: Plexin 67.6 14 0.00031 38.2 6.5 67 10-76 200-291 (539)
144 smart00727 STI1 Heat shock cha 67.0 8.5 0.00018 25.2 3.3 9 147-155 23-31 (41)
145 PF14732 UAE_UbL: Ubiquitin/SU 66.4 6.9 0.00015 30.5 3.2 52 21-72 8-67 (87)
146 PF02505 MCR_D: Methyl-coenzym 66.2 12 0.00026 32.4 4.8 55 2-66 69-124 (153)
147 PRK08507 prephenate dehydrogen 66.1 15 0.00032 34.0 5.9 47 111-157 217-267 (275)
148 cd01775 CYR1_RA Ubiquitin doma 65.1 28 0.0006 28.0 6.3 66 9-74 10-87 (97)
149 PF02153 PDH: Prephenate dehyd 64.7 12 0.00026 34.5 5.0 13 114-126 211-223 (258)
150 cd05470 pepsin_retropepsin_lik 64.5 11 0.00023 29.3 4.0 65 218-286 1-82 (109)
151 PF08968 DUF1885: Domain of un 64.3 4.4 9.4E-05 33.9 1.7 46 255-301 67-114 (130)
152 TIGR03260 met_CoM_red_D methyl 64.3 13 0.00029 32.0 4.7 51 2-61 68-118 (150)
153 PF03539 Spuma_A9PTase: Spumav 63.3 3.7 8.1E-05 35.4 1.2 57 222-279 1-62 (163)
154 PRK12751 cpxP periplasmic stre 62.4 41 0.00089 29.4 7.5 26 171-196 89-114 (162)
155 PTZ00380 microtubule-associate 62.1 8.9 0.00019 32.0 3.2 59 16-75 45-106 (121)
156 PF00026 Asp: Eukaryotic aspar 61.6 9.1 0.0002 35.2 3.6 71 217-287 3-86 (317)
157 cd06398 PB1_Joka2 The PB1 doma 61.0 31 0.00067 27.2 6.0 44 4-47 2-51 (91)
158 smart00314 RA Ras association 60.9 54 0.0012 24.8 7.3 47 11-57 15-68 (90)
159 PRK07417 arogenate dehydrogena 60.5 23 0.0005 32.8 6.1 46 112-157 220-269 (279)
160 PF02160 Peptidase_A3: Caulifl 60.4 5.9 0.00013 35.9 2.0 67 215-284 4-77 (201)
161 PF03419 Peptidase_U4: Sporula 59.2 9.7 0.00021 35.9 3.4 34 215-248 157-201 (293)
162 cd06410 PB1_UP2 Uncharacterize 57.3 39 0.00085 26.9 6.0 37 9-46 20-56 (97)
163 PF14841 FliG_M: FliG middle d 57.1 82 0.0018 23.8 8.9 69 114-195 1-71 (79)
164 PRK07502 cyclohexadienyl dehyd 57.0 29 0.00063 32.6 6.2 47 111-157 230-280 (307)
165 PF14533 USP7_C2: Ubiquitin-sp 56.3 64 0.0014 29.1 8.0 58 12-72 34-99 (213)
166 PRK14806 bifunctional cyclohex 55.7 17 0.00036 38.6 4.7 50 111-160 226-279 (735)
167 cd06097 Aspergillopepsin_like 54.9 19 0.00042 33.0 4.6 70 217-287 2-86 (278)
168 KOG0010|consensus 54.5 36 0.00078 34.8 6.6 65 117-192 147-217 (493)
169 PF02196 RBD: Raf-like Ras-bin 53.2 61 0.0013 24.1 6.2 48 4-52 4-53 (71)
170 cd01766 Ufm1 Urm1-like ubiquit 53.0 82 0.0018 24.2 6.7 59 15-74 19-78 (82)
171 KOG2561|consensus 52.7 11 0.00023 38.2 2.6 59 15-74 53-111 (568)
172 PF12436 USP7_ICP0_bdg: ICP0-b 51.6 17 0.00037 33.6 3.6 60 15-75 88-153 (249)
173 PF14327 CSTF2_hinge: Hinge do 51.5 17 0.00037 28.1 3.0 27 117-143 37-63 (84)
174 COG5100 NPL4 Nuclear pore prot 51.4 52 0.0011 33.1 6.9 70 1-74 1-79 (571)
175 PRK11130 moaD molybdopterin sy 51.2 95 0.0021 23.3 7.2 52 16-73 19-76 (81)
176 cd06098 phytepsin Phytepsin, a 51.0 28 0.00061 32.7 5.1 71 215-287 10-95 (317)
177 smart00144 PI3K_rbd PI3-kinase 50.7 93 0.002 25.0 7.4 71 3-74 20-104 (108)
178 cd01782 AF6_RA_repeat1 Ubiquit 49.6 91 0.002 25.6 7.0 56 1-57 24-91 (112)
179 PF07319 DnaI_N: Primosomal pr 48.9 26 0.00056 27.5 3.8 43 114-156 20-62 (94)
180 COG0287 TyrA Prephenate dehydr 48.5 38 0.00083 32.0 5.5 45 112-156 222-270 (279)
181 cd06397 PB1_UP1 Uncharacterize 47.9 47 0.001 25.9 4.8 42 5-46 3-44 (82)
182 PRK11199 tyrA bifunctional cho 46.6 36 0.00078 33.3 5.2 46 112-157 297-345 (374)
183 KOG4572|consensus 45.9 1.2E+02 0.0026 33.3 8.9 64 10-73 3-69 (1424)
184 cd01787 GRB7_RA RA (RAS-associ 45.8 60 0.0013 25.4 5.3 63 9-71 10-82 (85)
185 PF00276 Ribosomal_L23: Riboso 45.3 43 0.00094 26.2 4.5 41 11-51 20-61 (91)
186 PRK08655 prephenate dehydrogen 45.3 39 0.00085 33.8 5.3 46 112-157 218-266 (437)
187 PF10209 DUF2340: Uncharacteri 44.8 45 0.00098 27.9 4.7 57 17-73 21-107 (122)
188 KOG0011|consensus 44.1 57 0.0012 31.8 5.9 45 112-157 220-270 (340)
189 PRK06545 prephenate dehydrogen 43.7 58 0.0013 31.5 6.1 45 112-156 222-270 (359)
190 PF12436 USP7_ICP0_bdg: ICP0-b 42.3 51 0.0011 30.5 5.2 43 3-45 179-223 (249)
191 PF06234 TmoB: Toluene-4-monoo 42.2 1.3E+02 0.0028 23.6 6.6 59 14-73 17-83 (85)
192 PF00788 RA: Ras association ( 41.9 92 0.002 23.2 5.8 46 13-58 18-71 (93)
193 PHA00431 internal virion prote 41.8 1.4E+02 0.0031 31.5 8.7 104 115-225 82-207 (746)
194 cd01612 APG12_C Ubiquitin-like 41.0 42 0.00091 26.2 3.8 59 17-75 21-82 (87)
195 cd05490 Cathepsin_D2 Cathepsin 40.8 50 0.0011 31.0 5.1 70 216-287 7-92 (325)
196 cd01818 TIAM1_RBD Ubiquitin do 40.5 61 0.0013 24.9 4.4 38 9-46 7-44 (77)
197 PRK10363 cpxP periplasmic repr 39.5 1.7E+02 0.0036 25.8 7.6 22 175-196 87-108 (166)
198 PF11069 DUF2870: Protein of u 39.4 66 0.0014 25.9 4.7 31 43-75 3-33 (98)
199 PRK12750 cpxP periplasmic repr 39.2 2.1E+02 0.0045 25.0 8.3 27 170-196 95-121 (170)
200 PF03658 Ub-RnfH: RnfH family 39.1 1.3E+02 0.0029 23.4 6.3 67 1-73 1-72 (84)
201 PF03671 Ufm1: Ubiquitin fold 38.8 97 0.0021 23.7 5.2 57 14-71 18-75 (76)
202 cd01611 GABARAP Ubiquitin doma 38.7 42 0.00092 27.4 3.6 57 17-75 46-107 (112)
203 cd01764 Urm1 Urm1-like ubuitin 38.5 87 0.0019 24.6 5.3 60 12-73 17-89 (94)
204 PF02991 Atg8: Autophagy prote 36.2 53 0.0011 26.5 3.8 56 18-75 39-99 (104)
205 PF09269 DUF1967: Domain of un 35.8 17 0.00037 27.0 0.8 16 56-71 47-62 (69)
206 PRK11861 bifunctional prephena 35.6 88 0.0019 33.1 6.3 48 111-158 160-211 (673)
207 cd05474 SAP_like SAPs, pepsin- 35.6 1E+02 0.0023 28.1 6.2 55 216-287 3-59 (295)
208 PF04126 Cyclophil_like: Cyclo 35.4 38 0.00082 27.8 2.9 32 1-34 1-32 (120)
209 KOG2689|consensus 35.3 1E+02 0.0022 29.4 5.9 68 3-72 213-285 (290)
210 PF05823 Gp-FAR-1: Nematode fa 33.7 81 0.0018 27.1 4.8 46 126-192 43-88 (154)
211 TIGR03595 Obg_CgtA_exten Obg f 33.6 21 0.00045 26.6 0.9 17 56-72 47-63 (69)
212 cd01817 RGS12_RBD Ubiquitin do 33.6 2.1E+02 0.0047 21.7 7.4 62 9-74 7-70 (73)
213 cd05488 Proteinase_A_fungi Fun 33.0 63 0.0014 30.4 4.4 71 215-287 10-94 (320)
214 KOG3391|consensus 32.9 51 0.0011 28.2 3.2 58 19-77 59-139 (151)
215 PF13801 Metal_resist: Heavy-m 32.9 2.3E+02 0.0049 21.7 8.3 42 151-192 57-98 (125)
216 PF06487 SAP18: Sin3 associate 32.6 58 0.0013 27.0 3.5 59 12-72 37-119 (120)
217 PRK05738 rplW 50S ribosomal pr 32.0 69 0.0015 25.2 3.7 40 11-50 20-60 (92)
218 cd05477 gastricsin Gastricsins 30.7 77 0.0017 29.6 4.5 70 216-287 4-87 (318)
219 cd05478 pepsin_A Pepsin A, asp 30.7 57 0.0012 30.5 3.7 29 220-248 194-228 (317)
220 PF01606 Arteri_env: Arterivir 30.1 36 0.00078 30.3 2.0 38 199-239 118-157 (214)
221 PTZ00013 plasmepsin 4 (PM4); P 29.9 1.1E+02 0.0025 30.8 5.8 72 214-287 137-222 (450)
222 PRK10872 relA (p)ppGpp synthet 29.5 2.1E+02 0.0046 31.0 8.0 64 2-76 405-468 (743)
223 PF14847 Ras_bdg_2: Ras-bindin 29.1 2.5E+02 0.0054 22.7 6.6 45 3-48 3-50 (105)
224 PTZ00147 plasmepsin-1; Provisi 28.5 1.1E+02 0.0025 30.8 5.5 72 214-287 138-223 (453)
225 TIGR03636 L23_arch archaeal ri 28.3 1E+02 0.0022 23.6 3.9 34 11-44 14-47 (77)
226 PRK10455 periplasmic protein; 27.7 3.8E+02 0.0083 23.2 8.0 12 177-188 129-140 (161)
227 PRK14548 50S ribosomal protein 27.6 90 0.002 24.3 3.6 34 11-44 21-54 (84)
228 PF02192 PI3K_p85B: PI3-kinase 27.6 71 0.0015 24.5 3.0 21 14-34 2-22 (78)
229 KOG2507|consensus 27.3 1.1E+02 0.0023 31.1 4.9 75 2-77 316-394 (506)
230 COG3159 Uncharacterized protei 26.9 2.7E+02 0.006 25.5 7.1 28 113-144 4-31 (218)
231 cd05476 pepsin_A_like_plant Ch 26.8 1.4E+02 0.0031 27.1 5.5 53 217-284 3-57 (265)
232 cd05487 renin_like Renin stimu 26.3 1.1E+02 0.0023 28.9 4.6 69 216-286 9-93 (326)
233 cd05472 cnd41_like Chloroplast 26.0 69 0.0015 29.6 3.3 38 209-248 146-193 (299)
234 PF07946 DUF1682: Protein of u 25.9 2.7E+02 0.0059 26.6 7.4 20 175-194 299-318 (321)
235 PF08783 DWNN: DWNN domain; I 25.9 3E+02 0.0064 20.9 7.2 34 4-37 2-36 (74)
236 PF10454 DUF2458: Protein of u 25.5 2.6E+02 0.0057 24.0 6.5 10 134-143 20-29 (150)
237 cd05472 cnd41_like Chloroplast 25.5 1.5E+02 0.0032 27.4 5.4 57 216-284 2-60 (299)
238 COG4055 McrD Methyl coenzyme M 25.4 1.7E+02 0.0036 25.4 5.1 56 3-68 78-134 (165)
239 PF00794 PI3K_rbd: PI3-kinase 25.3 1.6E+02 0.0035 23.1 4.9 59 2-61 18-84 (106)
240 COG2080 CoxS Aerobic-type carb 25.3 1.1E+02 0.0023 26.8 4.0 26 1-28 2-27 (156)
241 cd05477 gastricsin Gastricsins 24.6 85 0.0018 29.4 3.6 28 220-247 188-222 (318)
242 cd06096 Plasmepsin_5 Plasmepsi 24.6 1.4E+02 0.0031 28.0 5.2 72 215-287 3-104 (326)
243 cd05485 Cathepsin_D_like Cathe 24.3 89 0.0019 29.5 3.8 71 214-286 10-96 (329)
244 cd06404 PB1_aPKC PB1 domain is 24.2 1.7E+02 0.0036 22.9 4.5 44 3-46 1-45 (83)
245 cd01776 Rin1_RA Ubiquitin doma 24.1 1.2E+02 0.0027 23.7 3.7 37 9-45 10-48 (87)
246 cd05486 Cathespin_E Cathepsin 23.9 85 0.0018 29.3 3.5 29 220-248 186-220 (316)
247 PF13856 Gifsy-2: ATP-binding 23.7 81 0.0018 24.5 2.8 19 218-236 22-40 (95)
248 COG0089 RplW Ribosomal protein 23.5 1E+02 0.0022 24.6 3.3 39 11-49 21-60 (94)
249 cd01666 TGS_DRG_C TGS_DRG_C: 23.4 2.1E+02 0.0045 21.6 4.9 53 14-72 17-74 (75)
250 PF09280 XPC-binding: XPC-bind 23.3 91 0.002 22.6 2.8 20 134-153 18-41 (59)
251 cd06098 phytepsin Phytepsin, a 23.3 1E+02 0.0022 28.9 3.9 30 219-248 196-232 (317)
252 PF02824 TGS: TGS domain; Int 23.2 2.1E+02 0.0045 20.3 4.7 59 3-72 1-59 (60)
253 TIGR02867 spore_II_P stage II 23.2 1.4E+02 0.003 26.9 4.5 101 199-299 73-190 (196)
254 PF05618 Zn_protease: Putative 23.2 46 0.00099 28.2 1.4 22 223-244 13-34 (138)
255 TIGR00691 spoT_relA (p)ppGpp s 23.0 2.9E+02 0.0063 29.5 7.6 63 2-75 361-423 (683)
256 TIGR02812 fadR_gamma fatty aci 22.9 2.6E+02 0.0056 24.9 6.3 34 174-211 201-234 (235)
257 cd05473 beta_secretase_like Be 22.9 91 0.002 29.9 3.6 20 228-247 213-232 (364)
258 PF12053 DUF3534: Domain of un 22.9 4.9E+02 0.011 22.4 8.6 72 1-76 1-82 (145)
259 PF08154 NLE: NLE (NUC135) dom 22.5 3E+02 0.0065 19.9 5.5 43 15-60 19-63 (65)
260 PF14533 USP7_C2: Ubiquitin-sp 22.4 83 0.0018 28.3 3.0 31 11-41 132-162 (213)
261 cd05486 Cathespin_E Cathepsin 22.3 1.3E+02 0.0028 28.2 4.4 68 217-286 2-83 (316)
262 PRK10963 hypothetical protein; 22.2 3.7E+02 0.008 24.3 7.2 27 114-144 4-30 (223)
263 cd01777 SNX27_RA Ubiquitin dom 21.8 1.7E+02 0.0038 23.0 4.2 41 3-44 4-44 (87)
264 cd06097 Aspergillopepsin_like 21.7 83 0.0018 28.8 2.9 23 226-248 198-220 (278)
265 CHL00030 rpl23 ribosomal prote 21.7 1.4E+02 0.0029 23.7 3.7 40 10-49 18-58 (93)
266 KOG1464|consensus 21.0 1.9E+02 0.0042 28.0 5.2 32 122-158 298-329 (440)
267 KOG3483|consensus 20.8 4E+02 0.0087 20.6 6.8 60 15-74 30-89 (94)
268 cd01783 DAGK_delta_RA Ubiquiti 20.6 2.5E+02 0.0055 22.5 5.0 28 14-41 19-46 (97)
No 1
>KOG0012|consensus
Probab=100.00 E-value=7.5e-67 Score=490.01 Aligned_cols=287 Identities=42% Similarity=0.618 Sum_probs=251.4
Q ss_pred CEEEEEeCC-CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCC-cccccccCCCCCCEEEEEEecCCC
Q psy15482 1 MKVTVSTQH-TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDN-KKTLKTHGIQDGDLVLLKHLGSYH 78 (302)
Q Consensus 1 MqItV~~~~-~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~-~~TL~~~gI~dgd~l~L~~~~~~~ 78 (302)
|.+||.+.. ....++++|+.+..+.+|+++++.++|++.+..-|+||++++.++ +.+|.++|+++||+|.+..+...+
T Consensus 1 M~~tvs~~l~~~~~~~i~v~~dg~L~nl~aL~~~d~g~~~~~~~li~n~~~l~s~~s~~l~Q~g~~~~dsl~lr~ks~d~ 80 (380)
T KOG0012|consen 1 MSLTVSVALNFEKKFPIPVTTDGELNNLAALCWKDTGIVYDPSDLIYNPRPLVSNESQGLTQIGLKDGDSLALRCKSSDP 80 (380)
T ss_pred CeEEEEEEecceeeeccccccccchhhHHHHHHHHhCcccchhhcccCCCccccchhhhhhhcccccceeEeccCCCCCC
Confidence 889998832 567789999999999999999999999999999999999999765 678999999999999998776543
Q ss_pred --CCCCCCCCCCCCCCcccCCCCCCC---CCC------------CCCCCC-CCCHHHHHHHHhcCHHHHHHHHhcChhHH
Q psy15482 79 --ATPAAVIPNFDFSGIVVPSASSSS---SNG------------QSSNSL-SDDPATVRQLLLDNPKEIALLQQNNPRLA 140 (302)
Q Consensus 79 --~~~~~~~p~~d~s~~~~p~~s~s~---~~s------------~~~~~~-~~npe~iR~~~l~NP~~~~~L~~~NPeLa 140 (302)
.-+.+..+..||+++.+|..+... .++ +...+. .++|+.+|++++++|..+..+++.+|.|+
T Consensus 81 r~~v~~~~~~~~dFat~A~~~fs~q~a~~~~gaq~~rg~~~~di~~a~~~~ldsp~~~Rq~~la~pf~L~~~r~~lp~La 160 (380)
T KOG0012|consen 81 RPIVPIQVRLISDFATIAVPMFSSQRARQLQGAQRTRGRLQTDIPEASSLSLDSPATFRQALLAMPFFLHLDRAYLPPLA 160 (380)
T ss_pred CccccccceehhcccccccccccchhccccccccccccccccccccccccCcCCHHHHHHHHhcCchhhhhchhhcCccc
Confidence 111233344567666665432111 000 011223 59999999999999999999999999999
Q ss_pred HHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHhhHHHHHHHHHHhCCCCcceeeEEEEEE
Q psy15482 141 DALLNGSPEDFERILRQQYEEREKREQLRLRMLNAHPFDTEAQKLIAEEIKKNNIEANMEAAMEYNPETFGTVVMLYINC 220 (302)
Q Consensus 141 ~al~~~d~e~f~~~lr~~~~~~~~~e~~~~~~l~~DP~d~e~Q~~i~e~i~q~~i~~n~~~A~E~~PE~f~~v~mlyi~~ 220 (302)
.++..+|.+.|.+.+++++..+..++..+.+++++||||+|+|+||+|.|+|++|+|||.+|+||+||.|+.|.||||||
T Consensus 161 ~~l~~g~~~k~~~~~~~~q~d~~rr~~~~~rl~eanPfd~E~q~rIee~irq~~i~eq~~~ai~~~pe~f~~v~ML~iN~ 240 (380)
T KOG0012|consen 161 ALLVLGDLEKFDRTLKEMQKDYQRRSVHQRRLLEANPFDLEAQRRIEEKIRQNAIDEQMSHAIEYHPEDFTQVTMLYINC 240 (380)
T ss_pred hhhcccchhhhhhhhhhhccccchhhhhhHHHHhcCCcchhhhhhhhHHHHHHHHHHHHHHhhhcCccccccceEEEEEE
Confidence 99999999999999998777777778888899999999999999999999999999999999999999999999999999
Q ss_pred EEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccccc
Q psy15482 221 KVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 221 ~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~ 287 (302)
+|||+|||||||||||+||||.+|||||||.+|||+||.|+|+|||+.||+|+||+++++||+.|++
T Consensus 241 ~ing~~VKAfVDsGaq~timS~~Caer~gL~rlid~r~~g~a~gvg~~ki~g~Ih~~~lki~~~~l~ 307 (380)
T KOG0012|consen 241 EINGVPVKAFVDSGAQTTIMSAACAERCGLNRLIDKRFQGEARGVGTEKILGRIHQAQLKIEDLYLP 307 (380)
T ss_pred EECCEEEEEEEcccchhhhhhHHHHHHhChHHHhhhhhhccccCCCcccccceeEEEEEEeccEeec
Confidence 9999999999999999999999999999999999999999999999999999999999999999986
No 2
>PF09668 Asp_protease: Aspartyl protease; InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=100.00 E-value=7.9e-41 Score=277.35 Aligned_cols=96 Identities=60% Similarity=0.990 Sum_probs=59.7
Q ss_pred HhhHHHHHHHHHHhCCCCcceeeEEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceee
Q psy15482 192 KNNIEANMEAAMEYNPETFGTVVMLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKIL 271 (302)
Q Consensus 192 q~~i~~n~~~A~E~~PE~f~~v~mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~ 271 (302)
|+||+|||++||||+||.|++|.||||+|+|||++||||||||||+||||.+||+||||++++|+||+|+++|||+++++
T Consensus 1 q~~i~~~~~~a~e~~PE~f~~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~~lid~r~~g~a~GvG~~~i~ 80 (124)
T PF09668_consen 1 QENIDENLENAMEHSPESFGQVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLMRLIDKRFAGVAKGVGTQKIL 80 (124)
T ss_dssp -------------------------EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGGGGEEGGG-EE-------EEE
T ss_pred ChhHHHHHHHHHHhCcHhhcCcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCChhhccccccccccCCCcCcee
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eEEEEeeeeecccccc
Q psy15482 272 GRIHMAIAELTNFNGD 287 (302)
Q Consensus 272 G~i~~~~~~ig~~~~~ 287 (302)
||||.++++||+.++.
T Consensus 81 G~Ih~~~l~ig~~~~~ 96 (124)
T PF09668_consen 81 GRIHSVQLKIGGLFFP 96 (124)
T ss_dssp EEEEEEEEEETTEEEE
T ss_pred EEEEEEEEEECCEEEE
Confidence 9999999999998864
No 3
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.97 E-value=1.6e-31 Score=211.57 Aligned_cols=70 Identities=21% Similarity=0.466 Sum_probs=67.1
Q ss_pred EEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhc-cccceecccCc-ceeeeEEEEeeeeecccccc
Q psy15482 218 INCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDT-RWAGIAKGVGV-QKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 218 i~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~-~~~~~a~gvg~-~~i~G~i~~~~~~ig~~~~~ 287 (302)
|||+|||+|||||||||||+||||++|||||||+|++|+ ||+|+|+|||+ .||+||||.++|+||+.+++
T Consensus 1 vnCk~nG~~vkAfVDsGaQ~timS~~caercgL~r~v~~~r~~g~A~gvgt~~kiiGrih~~~ikig~~~~~ 72 (103)
T cd05480 1 VSCQCAGKELRALVDTGCQYNLISAACLDRLGLKERVLKAKAEEEAPSLPTSVKVIGQIERLVLQLGQLTVE 72 (103)
T ss_pred CceeECCEEEEEEEecCCchhhcCHHHHHHcChHhhhhhccccccccCCCcceeEeeEEEEEEEEeCCEEee
Confidence 689999999999999999999999999999999999987 99999999998 69999999999999998874
No 4
>KOG0010|consensus
Probab=99.91 E-value=2.9e-24 Score=209.98 Aligned_cols=190 Identities=27% Similarity=0.409 Sum_probs=140.1
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecCCCCC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGSYHAT 80 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~~~~~ 80 (302)
++|+||+ .++ ++.+.|..+.||.+||++|...+++|+++++|||.||+|+|+ +||..|||+||.||||+++....++
T Consensus 16 irV~Vkt-~~d-k~~~~V~~~ssV~qlKE~I~~~f~a~~dqlvLIfaGrILKD~-dTL~~~gI~Dg~TvHLVik~~~~~~ 92 (493)
T KOG0010|consen 16 IRVTVKT-PKD-KYEVNVASDSSVLQLKELIAQRFGAPPDQLVLIYAGRILKDD-DTLKQYGIQDGHTVHLVIKSQPRPT 92 (493)
T ss_pred eEEEEec-CCc-ceeEecccchHHHHHHHHHHHhcCCChhHeeeeecCccccCh-hhHHHcCCCCCcEEEEEeccCCCCC
Confidence 3688888 555 999999999999999999999999999999999999999997 9999999999999999998775432
Q ss_pred C----C----CCCCC----------CCCCCcccCCCCCC--CC---------C-------------------CCCCCCCC
Q psy15482 81 P----A----AVIPN----------FDFSGIVVPSASSS--SS---------N-------------------GQSSNSLS 112 (302)
Q Consensus 81 ~----~----~~~p~----------~d~s~~~~p~~s~s--~~---------~-------------------s~~~~~~~ 112 (302)
+ + .+.|+ ..|+-...-+..+. .. . .|..++++
T Consensus 93 ~~~t~ap~~t~~~P~~~st~~~a~~~p~~l~~~~~~~g~~~~~~~~~~F~~l~~~~q~~~~snpe~~~~~m~nP~vq~ll 172 (493)
T KOG0010|consen 93 GTATSAPSSTASTPNNISTGRSASSNPFSLLTVGGFAGLSSLGLFAAMFGELQSQMQNQLLSNPEALRQMMENPIVQSLL 172 (493)
T ss_pred CccccccccccCCCCCCCCcccccCCcccccccccccccccCCcchhhcccccccchhccccCHHHHHHhhhChHHHHHh
Confidence 2 1 11110 11110000000000 00 0 02235689
Q ss_pred CCHHHHHHHHhcCHHHHHHHHhcChhHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy15482 113 DDPATVRQLLLDNPKEIALLQQNNPRLADALLNGSPEDFERILRQQYEEREKREQLRLRMLNAHPFDTEAQKLIAEEIKK 192 (302)
Q Consensus 113 ~npe~iR~~~l~NP~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~~~~~~~~~e~~~~~~l~~DP~d~e~Q~~i~e~i~q 192 (302)
+||+++|+++.+||+|++.| ++|||+.+.+ +||+.+||.+ +-.++++..+..|.+.|-
T Consensus 173 ~Npd~mrq~I~anPqmq~lm-~~npei~h~l--n~p~i~rQtl----e~arNP~m~qemmrn~d~--------------- 230 (493)
T KOG0010|consen 173 NNPDLMRQLIMANPQMQDLM-QRNPEIGHLL--NNPLILRQTL----ESARNPEMMQEMMRNQDR--------------- 230 (493)
T ss_pred cChHHHHHHHhcCHHHHHHH-hhCCcchhhh--cChHHHHHHH----HhccCHHHHHHHHhhccc---------------
Confidence 99999999999999999888 9999999988 8999999988 334455544444444441
Q ss_pred hhHHHHHHHHHHhCCCCcceeeEEEEEEE
Q psy15482 193 NNIEANMEAAMEYNPETFGTVVMLYINCK 221 (302)
Q Consensus 193 ~~i~~n~~~A~E~~PE~f~~v~mlyi~~~ 221 (302)
--+.+|-.|+.|.-..++|.++.
T Consensus 231 ------a~SnlesiPgG~n~l~~my~diq 253 (493)
T KOG0010|consen 231 ------AMSNLESIPGGYNALRRMYTDIQ 253 (493)
T ss_pred ------cccChhcCccHHHHHHHHhhhcc
Confidence 12667889999999999998864
No 5
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.81 E-value=5.9e-20 Score=152.38 Aligned_cols=86 Identities=64% Similarity=0.996 Sum_probs=82.6
Q ss_pred HHHHhCCCCcceeeEEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeee
Q psy15482 201 AAMEYNPETFGTVVMLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAE 280 (302)
Q Consensus 201 ~A~E~~PE~f~~v~mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ 280 (302)
+|+|++||.|..+.++|++|+|||++++|+|||||+.|+||++||+||||....+.++.+.+.|+|+.+..|+++.+.++
T Consensus 2 ~~~~~~~~~~~~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~~~~~l~ 81 (124)
T cd05479 2 NAMEHHPESFGKVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRIHLAQVK 81 (124)
T ss_pred chhhcCcchhceeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEEEEEEEE
Confidence 58999999999999999999999999999999999999999999999999988888899999999999999999999999
Q ss_pred eccccc
Q psy15482 281 LTNFNG 286 (302)
Q Consensus 281 ig~~~~ 286 (302)
||++..
T Consensus 82 i~~~~~ 87 (124)
T cd05479 82 IGNLFL 87 (124)
T ss_pred ECCEEe
Confidence 999875
No 6
>cd01807 GDX_N ubiquitin-like domain of GDX. GDX contains an N-terminal ubiquitin-like domain as well as an uncharacterized c-terminal domain. The function of GDX is unknown.
Probab=99.79 E-value=4.2e-19 Score=134.35 Aligned_cols=73 Identities=23% Similarity=0.228 Sum_probs=70.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|++++++|++++||++||+.|+.++|+|+++|+|+|+|++|.|+ .||++|||++|++|+|+.++
T Consensus 1 m~i~vk~-~~G~~~~l~v~~~~tV~~lK~~i~~~~gi~~~~q~L~~~G~~L~d~-~~L~~~~i~~~~~l~l~~~~ 73 (74)
T cd01807 1 MFLTVKL-LQGRECSLQVSEKESVSTLKKLVSEHLNVPEEQQRLLFKGKALADD-KRLSDYSIGPNAKLNLVVRP 73 (74)
T ss_pred CEEEEEe-CCCCEEEEEECCCCcHHHHHHHHHHHHCCCHHHeEEEECCEECCCC-CCHHHCCCCCCCEEEEEEcC
Confidence 8999999 9999999999999999999999999999999999999999999997 99999999999999998774
No 7
>cd01793 Fubi Fubi ubiquitin-like protein. Fubi is a ubiquitin-like protein encoded by the fau gene which has an N-terminal ubiquitin-like domain (also referred to as FUBI) fused to the ribosomal protein S30. Fubi is thought to be a tumor suppressor protein and the FUBI domain may act as a substitute or an inhibitor of ubiquitin or one of ubiquitin's close relatives UCRP, FAT10, and Nedd8.
Probab=99.75 E-value=4.2e-18 Score=128.94 Aligned_cols=71 Identities=20% Similarity=0.254 Sum_probs=67.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+||+ +++++++|++++||++||++|+.++|+|+++|+|+|+|++|.|+ +||++|||+++++|||+.+.
T Consensus 1 mqi~vk~---~~~~~l~v~~~~tV~~lK~~i~~~~gip~~~q~Li~~Gk~L~D~-~tL~~~~i~~~~tl~l~~~l 71 (74)
T cd01793 1 MQLFVRA---QNTHTLEVTGQETVSDIKAHVAGLEGIDVEDQVLLLAGVPLEDD-ATLGQCGVEELCTLEVAGRL 71 (74)
T ss_pred CEEEEEC---CCEEEEEECCcCcHHHHHHHHHhhhCCCHHHEEEEECCeECCCC-CCHHHcCCCCCCEEEEEEec
Confidence 8999998 36899999999999999999999999999999999999999997 99999999999999998765
No 8
>cd01797 NIRF_N amino-terminal ubiquitin-like domain of Np95 and NIRF. NIRF_N This CD represents the amino-terminal ubiquitin-like domain of a family of nuclear proteins that includes Np95 and NIRF (Np95/ICBP90-like RING finger) protein. Both Np95 and NIRF have a domain architecture consisting of a ubiquitin-like domain, a PHD finger, a YDG/SRA domain, Rb-binding motifs and a RING finger domain. Both Np95 and NIRF are ubiquitin ligases that ubiquitinate PCNP (PEST-containing nuclear proteins). While Np95 is capable of binding histones, NIRF is involved in cell cycle regulation.
Probab=99.74 E-value=6.9e-18 Score=129.57 Aligned_cols=74 Identities=16% Similarity=0.139 Sum_probs=69.7
Q ss_pred CEEEEEeCCCCeE-EEEE-eCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNEC-FLLD-VSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~-~~le-V~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|+|+|++ .+|+. ++++ ++++.||++||++|++.+|+|+++|+|+|+|+.|+|+ .||++|||++|++|+|+.+..
T Consensus 1 M~I~vk~-~~G~~~~~l~~v~~~~TV~~lK~~i~~~~gi~~~~QrLi~~Gk~L~D~-~tL~~y~i~~~~~i~l~~~~~ 76 (78)
T cd01797 1 MWIQVRT-MDGKETRTVDSLSRLTKVEELREKIQELFNVEPECQRLFYRGKQMEDG-HTLFDYNVGLNDIIQLLVRQD 76 (78)
T ss_pred CEEEEEc-CCCCEEEEeeccCCcCcHHHHHHHHHHHhCCCHHHeEEEeCCEECCCC-CCHHHcCCCCCCEEEEEEecC
Confidence 9999999 88886 7895 8999999999999999999999999999999999996 999999999999999998764
No 9
>cd01805 RAD23_N Ubiquitin-like domain of RAD23. RAD23 belongs to a family of adaptor molecules having affinity for both the proteasome and ubiquitinylated proteins and thought to shuttle these ubiquitinylated proteins to the proteasome for destruction. RAD23 interacts with ubiquitin through its C-terminal ubiquitin-associated domains (UBA) and with the proteasome through its N-terminal ubiquitin-like domain (UBL).
Probab=99.73 E-value=1.9e-17 Score=125.57 Aligned_cols=74 Identities=26% Similarity=0.428 Sum_probs=70.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCC--CCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGY--PAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gI--p~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|+|+|++ .+|+.++++|++++||.+||+.|+..+|+ |+++|+|+|+|++|.|+ .+|++|||++|++|++..+.+
T Consensus 1 m~i~vk~-~~g~~~~l~v~~~~TV~~lK~~i~~~~~i~~~~~~q~L~~~G~~L~d~-~~L~~~~i~~~~~i~~~~~~~ 76 (77)
T cd01805 1 MKITFKT-LKQQTFPIEVDPDDTVAELKEKIEEEKGCDYPPEQQKLIYSGKILKDD-TTLEEYKIDEKDFVVVMVSKP 76 (77)
T ss_pred CEEEEEe-CCCCEEEEEECCCCcHHHHHHHHHHhhCCCCChhHeEEEECCEEccCC-CCHHHcCCCCCCEEEEEEecC
Confidence 9999999 99999999999999999999999999999 99999999999999986 899999999999999987753
No 10
>TIGR00601 rad23 UV excision repair protein Rad23. All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=99.73 E-value=7.9e-17 Score=156.54 Aligned_cols=73 Identities=26% Similarity=0.461 Sum_probs=69.3
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhC---CCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSG---YPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~g---Ip~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+||||+ .+|++|+|+|++++||.+||.+|+.+.| +|+++|+|+|+||+|+|+ +||++|||++|++|+++.++
T Consensus 1 MkItVKt-l~g~~~~IeV~~~~TV~dLK~kI~~~~g~~~ip~~~QkLIy~GkiL~Dd-~tL~dy~I~e~~~Ivvmv~k 76 (378)
T TIGR00601 1 MTLTFKT-LQQQKFKIDMEPDETVKELKEKIEAEQGKDAYPVAQQKLIYSGKILSDD-KTVREYKIKEKDFVVVMVSK 76 (378)
T ss_pred CEEEEEe-CCCCEEEEEeCCcChHHHHHHHHHHhhCCCCCChhHeEEEECCEECCCC-CcHHHcCCCCCCEEEEEecc
Confidence 9999999 9999999999999999999999999998 999999999999999997 89999999999998877554
No 11
>PTZ00044 ubiquitin; Provisional
Probab=99.72 E-value=2.1e-17 Score=125.08 Aligned_cols=73 Identities=15% Similarity=0.219 Sum_probs=70.5
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|++++++|++++||++||++|+..+|+|+++|+|+|+|++|.|+ .+|++|||++|++|+|..+.
T Consensus 1 m~i~vk~-~~G~~~~l~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d~-~~l~~~~i~~~~~i~l~~~~ 73 (76)
T PTZ00044 1 MQILIKT-LTGKKQSFNFEPDNTVQQVKMALQEKEGIDVKQIRLIYSGKQMSDD-LKLSDYKVVPGSTIHMVLQL 73 (76)
T ss_pred CEEEEEe-CCCCEEEEEECCCCcHHHHHHHHHHHHCCCHHHeEEEECCEEccCC-CcHHHcCCCCCCEEEEEEEc
Confidence 8999999 9999999999999999999999999999999999999999999986 89999999999999998764
No 12
>cd01791 Ubl5 UBL5 ubiquitin-like modifier. UBL5 (also known as HUB1) is a ubiquitin-like modifier that is both widely expressed and highly phylogenetically conserved. At the C-terminal end of the ubiquitin-like fold of UBL5 is a di-tyrosine motif followed by a single variable residue instead of the characteristic di-glycine found in all other ubiquitin-like modifiers. ULB5 interacts with a cyclin-like kinase called CLK4 but not with other cyclin-like kinase family members.
Probab=99.72 E-value=1.8e-17 Score=125.85 Aligned_cols=71 Identities=17% Similarity=0.218 Sum_probs=67.9
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|+|+|++ ..|+.+.++|+|+.||++||+.|+...|+|+++|+|+|.|++|.|+ +||++|||++|++|||-.
T Consensus 2 ~~i~vkt-~~Gk~~~~~v~~~~TV~~LK~~I~~~~~~~~~~qrLi~~Gk~L~D~-~tL~~ygi~~~stv~l~~ 72 (73)
T cd01791 2 IEVVCND-RLGKKVRVKCNPDDTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKDH-ISLGDYEIHDGMNLELYY 72 (73)
T ss_pred EEEEEEC-CCCCEEEEEeCCCCcHHHHHHHHHHHhCCChHHEEEEeCCcCCCCC-CCHHHcCCCCCCEEEEEe
Confidence 7899999 8999999999999999999999999999999999999999999996 899999999999999853
No 13
>cd01796 DDI1_N DNA damage inducible protein 1 ubiquitin-like domain. DDI1_N DDI1 (DNA damage inducible protein 1) has an amino-terminal ubiquitin-like domain, an retroviral protease-like (RVP-like) domain, and a UBA (ubiquitin-associated) domain. This CD represents the amino-terminal ubiquitin-like domain of DDI1.
Probab=99.70 E-value=3.6e-17 Score=123.20 Aligned_cols=69 Identities=51% Similarity=0.806 Sum_probs=65.2
Q ss_pred EEEEeCC-CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 3 VTVSTQH-TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 3 ItV~~~~-~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
|||++ . +|++++++|++++||++||++|+.++|+|+++|+|+|+|++|.|++.+|++|||++|++|+|.
T Consensus 1 l~v~~-~~~g~~~~l~v~~~~TV~~lK~~I~~~~gip~~~q~Li~~Gk~L~D~~~~L~~~gi~~~~~l~l~ 70 (71)
T cd01796 1 ITVYT-ARSETTFSLDVDPDLELENFKALCEAESGIPASQQQLIYNGRELVDNKRLLALYGVKDGDLVVLR 70 (71)
T ss_pred CEEEE-CCCCCEEEEEECCcCCHHHHHHHHHHHhCCCHHHeEEEECCeEccCCcccHHHcCCCCCCEEEEe
Confidence 68999 6 899999999999999999999999999999999999999999886578999999999999985
No 14
>cd01798 parkin_N amino-terminal ubiquitin-like of parkin protein. parkin_N parkin protein is a RING-type E3 ubiquitin ligase with an amino-terminal ubiquitin-like (Ubl) domain and an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain. Naturally occurring mutations in parkin are thought to cause the disease AR_JP (autosomal-recessive juvenile parkinsonism). Parkin binds the Rpn10 subunit of 26S proteasomes through its Ubl domain.
Probab=99.70 E-value=4.3e-17 Score=121.98 Aligned_cols=70 Identities=20% Similarity=0.387 Sum_probs=67.0
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|++ ..|++++++|++++||.+||++|+.++|+|+++|+|+|+|++|.|+ .||++|||++|++|||+.|
T Consensus 1 i~vk~-~~g~~~~~~v~~~~tV~~lK~~i~~~~gi~~~~q~Li~~G~~L~d~-~~l~~~~i~~~stl~l~~~ 70 (70)
T cd01798 1 VYVRT-NTGHTFPVEVDPDTDIKQLKEVVAKRQGVPPDQLRVIFAGKELRNT-TTIQECDLGQQSILHAVRR 70 (70)
T ss_pred CEEEc-CCCCEEEEEECCCChHHHHHHHHHHHHCCCHHHeEEEECCeECCCC-CcHHHcCCCCCCEEEEEeC
Confidence 58899 8999999999999999999999999999999999999999999996 9999999999999999864
No 15
>cd01802 AN1_N ubiquitin-like domain of AN1. AN1 (also known as ANUBL1 and RSD-7) is ubiquitin-like protein with a testis-specific expression in rats that has an N-terminal ubiquitin-like domain and a C-terminal zinc-binding domain. Unlike ubiquitin polyproteins and most ubiquitin fusion proteins, the N-terminal ubiquitin-like domain of An1 does not undergo proteolytic processing. The function of AN1 is unknown.
Probab=99.70 E-value=5.1e-17 Score=131.17 Aligned_cols=73 Identities=25% Similarity=0.276 Sum_probs=70.5
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|++++++|++++||.+||++|+...|||+++|+|+|+|++|+|+ .+|++|||++|++|+|+.+.
T Consensus 28 M~I~Vk~-l~G~~~~leV~~~~TV~~lK~kI~~~~gip~~~QrLi~~Gk~L~D~-~tL~dy~I~~~stL~l~~~l 100 (103)
T cd01802 28 MELFIET-LTGTCFELRVSPFETVISVKAKIQRLEGIPVAQQHLIWNNMELEDE-YCLNDYNISEGCTLKLVLAM 100 (103)
T ss_pred EEEEEEc-CCCCEEEEEeCCCCcHHHHHHHHHHHhCCChHHEEEEECCEECCCC-CcHHHcCCCCCCEEEEEEec
Confidence 8999999 9999999999999999999999999999999999999999999997 99999999999999998764
No 16
>cd01810 ISG15_repeat2 ISG15 ubiquitin-like protein, second repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains and becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.70 E-value=5.6e-17 Score=122.79 Aligned_cols=71 Identities=15% Similarity=0.244 Sum_probs=68.1
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|++ ..|++++++|+++.||++||+.|+...|+|+++|+|+|+|++|.|+ +||++|||++|++|+|..+.
T Consensus 1 i~vk~-~~g~~~~l~v~~~~tV~~lK~~I~~~~gi~~~~q~L~~~G~~L~D~-~tL~~~~i~~~~tl~l~~~l 71 (74)
T cd01810 1 ILVRN-DKGRSSIYEVQLTQTVATLKQQVSQRERVQADQFWLSFEGRPMEDE-HPLGEYGLKPGCTVFMNLRL 71 (74)
T ss_pred CEEEC-CCCCEEEEEECCcChHHHHHHHHHHHhCCCHHHeEEEECCEECCCC-CCHHHcCCCCCCEEEEEEEc
Confidence 68999 9999999999999999999999999999999999999999999997 99999999999999998764
No 17
>cd01806 Nedd8 Nebb8-like ubiquitin protein. Nedd8 (also known as Rub1) has a single conserved ubiquitin-like domain that is part of a protein modification pathway similar to that of ubiquitin. Nedd8 modifies a family of molecular scaffold proteins called cullins that are responsible for assembling the ROC1/Rbx1 RING-based E3 ubiquitin ligases, of which several play a direct role in tumorigenesis.
Probab=99.69 E-value=1.1e-16 Score=120.64 Aligned_cols=73 Identities=22% Similarity=0.315 Sum_probs=70.3
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|+.+.++|+++.||++||++|+..+|+|++.|+|+|+|+.|.|+ +||++|||++|++|+++.+.
T Consensus 1 m~i~v~~-~~g~~~~~~v~~~~tv~~lK~~i~~~~g~~~~~qrL~~~g~~L~d~-~tl~~~~i~~g~~i~l~~~~ 73 (76)
T cd01806 1 MLIKVKT-LTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLIYSGKQMNDD-KTAADYKLEGGSVLHLVLAL 73 (76)
T ss_pred CEEEEEe-CCCCEEEEEECCCCCHHHHHHHHhHhhCCChhhEEEEECCeEccCC-CCHHHcCCCCCCEEEEEEEc
Confidence 8999999 9999999999999999999999999999999999999999999986 89999999999999998864
No 18
>cd01803 Ubiquitin Ubiquitin. Ubiquitin (includes Ubq/RPL40e and Ubq/RPS27a fusions as well as homopolymeric multiubiquitin protein chains)
Probab=99.69 E-value=1.2e-16 Score=120.39 Aligned_cols=73 Identities=29% Similarity=0.337 Sum_probs=70.4
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|+.+.++|++++||++||+.|+..+|+|++.|+|+|+|+.|.|+ ++|++|||++|++|+|+.+.
T Consensus 1 m~i~v~~-~~g~~~~~~v~~~~tV~~lK~~i~~~~g~~~~~q~L~~~g~~L~d~-~~L~~~~i~~~~~i~l~~~~ 73 (76)
T cd01803 1 MQIFVKT-LTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLSDYNIQKESTLHLVLRL 73 (76)
T ss_pred CEEEEEc-CCCCEEEEEECCcCcHHHHHHHHHHHhCCCHHHeEEEECCEECCCC-CcHHHcCCCCCCEEEEEEEc
Confidence 8999999 8999999999999999999999999999999999999999999996 89999999999999998874
No 19
>cd01809 Scythe_N Ubiquitin-like domain of Scythe protein. Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
Probab=99.69 E-value=1.4e-16 Score=118.79 Aligned_cols=72 Identities=21% Similarity=0.399 Sum_probs=69.1
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+|++ .+|++++++|+++.||.+||+.|+..+|+|++.|+|+|+|++|.|+ .+|++|||++|++|+|+.+
T Consensus 1 i~i~vk~-~~g~~~~~~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~L~d~-~~L~~~~i~~~~~l~l~~~ 72 (72)
T cd01809 1 IEIKVKT-LDSQTHTFTVEEEITVLDLKEKIAEEVGIPVEQQRLIYSGRVLKDD-ETLSEYKVEDGHTIHLVKR 72 (72)
T ss_pred CEEEEEe-CCCCEEEEEECCCCcHHHHHHHHHHHHCcCHHHeEEEECCEECCCc-CcHHHCCCCCCCEEEEEeC
Confidence 8999999 9999999999999999999999999999999999999999999996 8999999999999999764
No 20
>cd01804 midnolin_N Ubiquitin-like domain of midnolin. midnolin_N Midnolin (midbrain nucleolar protein) is expressed in the nucleolus and is thought to regulate genes involved in neurogenesis. Midnolin contains an amino-terminal ubiquitin-like domain.
Probab=99.67 E-value=1.8e-16 Score=121.48 Aligned_cols=72 Identities=22% Similarity=0.291 Sum_probs=68.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ ..|+.++++|+++.||++||..|+.+.|+|+++|+|+|+|++|.|+ +|++|||++|++|+|+...
T Consensus 2 m~I~Vk~-~~G~~~~l~v~~~~TV~~LK~~I~~~~~~~~~~qrL~~~Gk~L~d~--~L~~~gi~~~~~i~l~~~~ 73 (78)
T cd01804 2 MNLNIHS-TTGTRFDLSVPPDETVEGLKKRISQRLKVPKERLALLHRETRLSSG--KLQDLGLGDGSKLTLVPTV 73 (78)
T ss_pred eEEEEEE-CCCCEEEEEECCcCHHHHHHHHHHHHhCCChHHEEEEECCcCCCCC--cHHHcCCCCCCEEEEEeec
Confidence 8999999 8899999999999999999999999999999999999999999884 8999999999999998764
No 21
>cd01794 DC_UbP_C dendritic cell derived ubiquitin-like protein. DC_UbP (dendritic cell derived ubiquitin-like protein) is a ubiquitin-like protein from human dendritic cells that is expressed in the mitochondrion. The ubiquitin-like domain of this protein is found at the C-terminus and lacks the canonical gly-gly motif of ubiquitin required for ubiquitinization. DC_UbP is expressed in tumor cells but not in normal human adult tissue suggesting a role for DC_UbP in tumorogenesis.
Probab=99.65 E-value=2.8e-16 Score=118.29 Aligned_cols=68 Identities=24% Similarity=0.196 Sum_probs=64.3
Q ss_pred EEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 4 TVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
.|+. .+|+.++++|++++||.+||++|+..+|+|+++|+|+|+|++|+|+ .+|++|||++|++|||..
T Consensus 2 ~vk~-~~G~~~~l~v~~~~TV~~lK~~I~~~~gi~~~~q~Li~~G~~L~D~-~~l~~~~i~~~~tv~~~~ 69 (70)
T cd01794 2 KVRL-STGKDVKLSVSSKDTVGQLKKQLQAAEGVDPCCQRWFFSGKLLTDK-TRLQETKIQKDYVVQVIV 69 (70)
T ss_pred eEEc-CCCCEEEEEECCcChHHHHHHHHHHHhCCCHHHeEEEECCeECCCC-CCHHHcCCCCCCEEEEEe
Confidence 4677 8899999999999999999999999999999999999999999997 999999999999999864
No 22
>cd01792 ISG15_repeat1 ISG15 ubiquitin-like protein, first repeat of 2. ISG15 is a ubiquitin-like protein containing two ubiquitin homology domains that becomes conjugated to a variety of proteins when cells are treated with type I interferon or lipopolysaccharide. Although ISG15 has properties similar to those of other ubiquitin-like molecules, it is a unique member of the ubiquitin-like superfamily, whose expression and conjugation to target proteins are tightly regulated by specific signaling pathways, indicating it may have specialized functions in the immune system.
Probab=99.65 E-value=3.5e-16 Score=120.32 Aligned_cols=73 Identities=22% Similarity=0.308 Sum_probs=69.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE--EECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI--EFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L--i~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|+.++++|+++.||.+||..|+..+|+|+++|+| +|+|++|.|+ +||++|||++|++|+|+.+.
T Consensus 3 ~~i~Vk~-~~G~~~~~~v~~~~TV~~lK~~I~~~~~i~~~~qrL~~~~~G~~L~D~-~tL~~~gi~~gs~l~l~~~~ 77 (80)
T cd01792 3 WDLKVKM-LGGNEFLVSLRDSMTVSELKQQIAQKIGVPAFQQRLAHLDSREVLQDG-VPLVSQGLGPGSTVLLVVQN 77 (80)
T ss_pred eEEEEEe-CCCCEEEEEcCCCCcHHHHHHHHHHHhCCCHHHEEEEeccCCCCCCCC-CCHHHcCCCCCCEEEEEEEc
Confidence 7899999 999999999999999999999999999999999999 8999999996 89999999999999998875
No 23
>PF00240 ubiquitin: Ubiquitin family; InterPro: IPR000626 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin is a protein of 76 amino acid residues, found in all eukaryotic cells and whose sequence is extremely well conserved from protozoan to vertebrates. Ubiquitin acts through its post-translational attachment (ubiquitinylation) to other proteins, where these modifications alter the function, location or trafficking of the protein, or targets it for destruction by the 26S proteasome []. The terminal glycine in the C-terminal 4-residue tail of ubiquitin can form an isopeptide bond with a lysine residue in the target protein, or with a lysine in another ubiquitin molecule to form a ubiquitin chain that attaches itself to a target protein. Ubiquitin has seven lysine residues, any one of which can be used to link ubiquitin molecules together, resulting in different structures that alter the target protein in different ways. It appears that Lys(11)-, Lys(29) and Lys(48)-linked poly-ubiquitin chains target the protein to the proteasome for degradation, while mono-ubiquitinylated and Lys(6)- or Lys(63)-linked poly-ubiquitin chains signal reversible modifications in protein activity, location or trafficking []. For example, Lys(63)-linked poly-ubiquitinylation is known to be involved in DNA damage tolerance, inflammatory response, protein trafficking and signal transduction through kinase activation []. In addition, the length of the ubiquitin chain alters the fate of the target protein. Regulatory proteins such as transcription factors and histones are frequent targets of ubquitinylation [].; GO: 0005515 protein binding; PDB: 2DZI_A 2XEW_E 3NOB_E 2KWU_B 2Y5B_F 3PHD_G 2KWV_B 2KOX_A 2XK5_B 3NHE_B ....
Probab=99.64 E-value=8.2e-16 Score=114.06 Aligned_cols=68 Identities=28% Similarity=0.426 Sum_probs=64.0
Q ss_pred EeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 6 STQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 6 ~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+ .+|+.|+++|+++.||.+||..|+.++|+|++.|+|+|+|+.|.|+ .||++|||++|++|+|+.++
T Consensus 1 k~-~~g~~~~~~v~~~~tV~~lK~~i~~~~~~~~~~~~L~~~G~~L~d~-~tL~~~~i~~~~~I~l~~k~ 68 (69)
T PF00240_consen 1 KT-LSGKTFTLEVDPDDTVADLKQKIAEETGIPPEQQRLIYNGKELDDD-KTLSDYGIKDGSTIHLVIKP 68 (69)
T ss_dssp EE-TTSEEEEEEEETTSBHHHHHHHHHHHHTSTGGGEEEEETTEEESTT-SBTGGGTTSTTEEEEEEESS
T ss_pred CC-CCCcEEEEEECCCCCHHHhhhhcccccccccccceeeeeeecccCc-CcHHHcCCCCCCEEEEEEec
Confidence 45 7899999999999999999999999999999999999999999885 99999999999999998764
No 24
>cd01808 hPLIC_N Ubiquitin-like domain of hPLIC-1 and hPLIC2. hPLIC-1 and hPLIC-2 (human homologs of the yeast ubiquitin-like Dsk2 protein) are type2 UBL's (ubiquitin-like) proteins that are thought to serve as adaptors that link the ubiquitination machinery to the proteasome. The hPLIC's have an N-terminal UBL domain that binds the S5a subunit of the proteasome and a C-terminal UBA (ubiquitin-associated) domain that binds a ubiquitylated protein.
Probab=99.64 E-value=7.6e-16 Score=115.58 Aligned_cols=71 Identities=30% Similarity=0.428 Sum_probs=66.4
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+|++ .+|. ++++|+++.||.+||+.|+..+|+|+++|+|+|+|++|.|+ +||++|||++|++|||+.+
T Consensus 1 ~~i~vk~-~~g~-~~l~v~~~~TV~~lK~~I~~~~~i~~~~~~Li~~Gk~L~d~-~tL~~~~i~~~stl~l~~~ 71 (71)
T cd01808 1 IKVTVKT-PKDK-EEIEIAEDASVKDFKEAVSKKFKANQEQLVLIFAGKILKDT-DTLTQHNIKDGLTVHLVIK 71 (71)
T ss_pred CEEEEEc-CCCC-EEEEECCCChHHHHHHHHHHHhCCCHHHEEEEECCeEcCCC-CcHHHcCCCCCCEEEEEEC
Confidence 6899999 7886 58999999999999999999999999999999999999996 8999999999999999753
No 25
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein. This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=99.61 E-value=2e-15 Score=112.45 Aligned_cols=70 Identities=24% Similarity=0.397 Sum_probs=65.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
++|+|++ . |+.++++|++++||++||+.|+..+|+|++.|+|+|+|+.|.|+ ++|++|||++|++|+++.
T Consensus 1 i~i~vk~-~-g~~~~i~v~~~~tv~~lK~~i~~~~gi~~~~q~L~~~g~~l~d~-~~L~~~~i~~g~~l~v~~ 70 (71)
T cd01812 1 IRVRVKH-G-GESHDLSISSQATFGDLKKMLAPVTGVEPRDQKLIFKGKERDDA-ETLDMSGVKDGSKVMLLE 70 (71)
T ss_pred CEEEEEE-C-CEEEEEEECCCCcHHHHHHHHHHhhCCChHHeEEeeCCcccCcc-CcHHHcCCCCCCEEEEec
Confidence 5789998 4 99999999999999999999999999999999999999999986 899999999999999863
No 26
>cd01790 Herp_N Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein. Herp (Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like domain protein) , is an integral membrane protein that is induced by the endoplasmic reticulum (ER) stress response pathway and is involved in improving the balance of folding capacity and protein loads in the ER. Herp has an N-terminal ubiquitin-like domain that is involved in Herp degradation, but is not necessary for its enhancement of amyloid beta-protein generation.
Probab=99.60 E-value=1.8e-15 Score=116.46 Aligned_cols=71 Identities=11% Similarity=0.110 Sum_probs=62.9
Q ss_pred CEEEEEeCCCCeE--EEEEeCCCccHHHHHHHHHhhhC--CCCCCeEEEECCeecCCCcccccccC--CCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNEC--FLLDVSEDLELENFKAFCEVQSG--YPAHEIAIEFQGVLLQDNKKTLKTHG--IQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~--~~leV~~~~TV~dLK~~Ie~e~g--Ip~~~Q~Li~~Gk~L~D~~~TL~~~g--I~dgd~l~L~~ 73 (302)
+.|+||+ ++++. |++++++++||.+||+.|+...+ .|+++|+|||+||+|+|+ .||++|+ +.+|.+|||+-
T Consensus 2 i~l~IK~-~~~~~~~~~ve~~~~~TV~~lK~~i~~~~~~~~~~~~QrLIy~GKiLkD~-~tL~~~~~~~~~~~tiHLV~ 78 (79)
T cd01790 2 VTLLIKS-PNQKYEDQTVSCFLNWTVGELKTHLSRVYPSKPLEQDQRLIYSGKLLPDH-LKLRDVLRKQDEYHMVHLVC 78 (79)
T ss_pred eEEEEEC-CCCCeEEEEEecCCcChHHHHHHHHHHhcCCCCChhHeEEEEcCeeccch-hhHHHHhhcccCCceEEEEe
Confidence 4689999 88888 56666999999999999999874 558999999999999996 9999997 99999999974
No 27
>KOG0011|consensus
Probab=99.60 E-value=1.3e-14 Score=136.47 Aligned_cols=74 Identities=23% Similarity=0.420 Sum_probs=70.1
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhC--CCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSG--YPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~g--Ip~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|+||||+ +.+..|+++|.|+.||.++|..|+...| +|++.|+|||+||+|+|+ .|+++|+|++++.|++...+.
T Consensus 1 m~lt~Kt-L~q~~F~iev~Pe~tV~evK~kIet~~g~dyP~~~QkLIy~GkiL~D~-~tv~Eykv~E~~fiVvMlsK~ 76 (340)
T KOG0011|consen 1 MKLTVKT-LKQQTFTIEVKPEDTVVEVKKKIETEKGPDYPAEQQKLIYSGKILKDE-TTVGEYKVKEKKFIVVMLSKD 76 (340)
T ss_pred CeeEeee-ccCceeEeecCcchhHHHHHHHHHhccCCCCchhhheeeecceeccCC-cchhhhccccCceEEEEEecC
Confidence 8999999 9999999999999999999999999999 999999999999999997 999999999999988775554
No 28
>cd01800 SF3a120_C Ubiquitin-like domain of Mammalian splicing factor SF3a_120. SF3a120_C Mammalian splicing factor SF3a consists of three subunits of 60, 66, and 120 kDa and functions early during pre-mRNA splicing by converting the U2 snRNP to its active form. The 120kDa subunit (SF3a120) has a carboxy-terminal ubiquitin-like domain and two SWAP (suppressor-of-white-apricot) domains, referred to collectively as the SURP module, at its amino-terminus.
Probab=99.59 E-value=4e-15 Score=113.29 Aligned_cols=66 Identities=14% Similarity=0.322 Sum_probs=63.3
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.+|++++++|++++||++||+.|+..+|||++.|+|+|+|+.|+|+ +||++|||++|++|+|+.+.
T Consensus 5 l~g~~~~l~v~~~~TV~~lK~~i~~~~gip~~~q~L~~~G~~L~d~-~tL~~~~i~~g~~l~v~~~~ 70 (76)
T cd01800 5 LNGQMLNFTLQLSDPVSVLKVKIHEETGMPAGKQKLQYEGIFIKDS-NSLAYYNLANGTIIHLQLKE 70 (76)
T ss_pred cCCeEEEEEECCCCcHHHHHHHHHHHHCCCHHHEEEEECCEEcCCC-CcHHHcCCCCCCEEEEEEec
Confidence 5899999999999999999999999999999999999999999996 99999999999999998875
No 29
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=99.59 E-value=6.1e-15 Score=112.15 Aligned_cols=70 Identities=10% Similarity=0.152 Sum_probs=65.2
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE---CCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF---QGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~---~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|.|+|+. +|+.|+|+|+++.||++||+.|+..+|+|+++|+|+| .|+.|.|+ .+|++|||++|+.|+|+.
T Consensus 1 ~~i~vk~--~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~~~QKLi~~~~~Gk~l~D~-~~L~~~~i~~g~~i~lmG 73 (74)
T cd01813 1 VPVIVKW--GGQEYSVTTLSEDTVLDLKQFIKTLTGVLPERQKLLGLKVKGKPAEDD-VKISALKLKPNTKIMMMG 73 (74)
T ss_pred CEEEEEE--CCEEEEEEECCCCCHHHHHHHHHHHHCCCHHHEEEEeecccCCcCCCC-cCHHHcCCCCCCEEEEEe
Confidence 5677776 8999999999999999999999999999999999997 99999986 999999999999999874
No 30
>KOG0003|consensus
Probab=99.52 E-value=1.5e-15 Score=121.45 Aligned_cols=74 Identities=27% Similarity=0.325 Sum_probs=70.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|++++++ ..|++++|+|+|+.||.+||++|..++|||+++|+|+|+|++|+|. .||++|||+..||||++.+..
T Consensus 1 ~~~~~~~-~~GKT~~le~EpS~ti~~vKA~i~~~~Gi~~~~~~L~~~~k~LED~-~Tla~Y~i~~~~Tl~~~~rL~ 74 (128)
T KOG0003|consen 1 MQIFVKT-LTGKTITLEVEPSDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLADYNIQKESTLHLVLRLR 74 (128)
T ss_pred CcEEEEE-eeCceEEEEecccchHHHHHHHhccccCCCHHHHHHHhcccccccC-CcccccCccchhhhhhhHHHh
Confidence 7899999 9999999999999999999999999999999999999999999996 999999999999999988763
No 31
>smart00213 UBQ Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression
Probab=99.52 E-value=3.6e-14 Score=102.63 Aligned_cols=64 Identities=27% Similarity=0.433 Sum_probs=60.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGD 67 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd 67 (302)
|+|+|++ .+ ..+.++|++++||++||+.|+..+|+|++.|+|+|+|+.|.|+ +||++|||++|+
T Consensus 1 ~~i~vk~-~~-~~~~~~v~~~~tv~~lk~~i~~~~~~~~~~~~L~~~g~~L~d~-~tL~~~~i~~~~ 64 (64)
T smart00213 1 IELTVKT-LD-GTITLEVKPSDTVSELKEKIAELTGIPVEQQRLIYKGKVLEDD-RTLADYNIQDGS 64 (64)
T ss_pred CEEEEEE-CC-ceEEEEECCCCcHHHHHHHHHHHHCCCHHHEEEEECCEECCCC-CCHHHcCCcCCC
Confidence 8999999 66 6899999999999999999999999999999999999999986 999999999986
No 32
>KOG0005|consensus
Probab=99.51 E-value=1.5e-14 Score=104.28 Aligned_cols=70 Identities=23% Similarity=0.352 Sum_probs=67.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
|.|.|++ ++|+.+.++++|+++|+.+|+.|+++.||||.+|+|+|.||++.|+ +|-++|++.-|++||++
T Consensus 1 m~iKvkt-Lt~KeIeidIep~DkverIKErvEEkeGIPp~qqrli~~gkqm~DD-~tA~~Y~~~~GSVlHlv 70 (70)
T KOG0005|consen 1 MLIKVKT-LTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLIYAGKQMNDD-KTAAHYNLLGGSVLHLV 70 (70)
T ss_pred CeeeEee-eccceEEEeeCcchHHHHHHHHhhhhcCCCchhhhhhhcccccccc-ccHHHhhhccceeEeeC
Confidence 8899999 9999999999999999999999999999999999999999999997 99999999999999984
No 33
>cd01799 Hoil1_N Ubiquitin-like domain of HOIL1. HOIL1_N HOIL-1 (heme-oxidized IRP2 ubiquitin ligase-1) is an E3 ubiquitin-protein ligase that recognizes heme-oxidized IRP2 (iron regulatory protein2) and is thought to affect the turnover of oxidatively damaged proteins. Hoil-1 has an amino-terminal ubiquitin-like domain as well as an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain.
Probab=99.50 E-value=6.3e-14 Score=106.92 Aligned_cols=70 Identities=26% Similarity=0.399 Sum_probs=61.5
Q ss_pred EEEEEeC-CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCC-CCCEEEEE
Q psy15482 2 KVTVSTQ-HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQ-DGDLVLLK 72 (302)
Q Consensus 2 qItV~~~-~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~-dgd~l~L~ 72 (302)
.+.|..+ ..|.+++++|++++||++||++|+..+|||+++|+| |+|+.|.|+.+||++|||+ +||+|+|-
T Consensus 2 ~~~~~~~~~~~~t~~l~v~~~~TV~~lK~kI~~~~gip~~~QrL-~~G~~L~dD~~tL~~ygi~~~g~~~~l~ 73 (75)
T cd01799 2 NVSVEDAQSHTVTIWLTVRPDMTVAQLKDKVFLDYGFPPAVQRW-VIGQRLARDQETLYSHGIRTNGDSAFLY 73 (75)
T ss_pred EEEEeccccCCCeEEEEECCCCcHHHHHHHHHHHHCcCHHHEEE-EcCCeeCCCcCCHHHcCCCCCCCEEEEE
Confidence 3455544 468889999999999999999999999999999999 9999996555899999999 88999985
No 34
>cd01763 Sumo Small ubiquitin-related modifier (SUMO). Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
Probab=99.42 E-value=9e-13 Score=103.04 Aligned_cols=73 Identities=16% Similarity=0.304 Sum_probs=68.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
|+|+|++ .+|+.+.+.|.+++++..||..++...|+|+++|+|+|+|+.|.++ .|+++|||++||+|+++.+.
T Consensus 12 i~I~v~~-~~g~~~~~~v~~~~~l~~l~~~y~~~~gi~~~~~rf~f~G~~L~~~-~T~~~l~m~d~d~I~v~l~l 84 (87)
T cd01763 12 INLKVKG-QDGNEVFFKIKRSTPLKKLMEAYCQRQGLSMNSVRFLFDGQRIRDN-QTPDDLGMEDGDEIEVMLEQ 84 (87)
T ss_pred EEEEEEC-CCCCEEEEEEcCCCHHHHHHHHHHHHhCCCccceEEEECCeECCCC-CCHHHcCCCCCCEEEEEEec
Confidence 5788888 7899999999999999999999999999999999999999999996 89999999999999998764
No 35
>cd01815 BMSC_UbP_N Ubiquitin-like domain of BMSC-UbP. BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain.
Probab=99.42 E-value=2.3e-13 Score=103.62 Aligned_cols=57 Identities=28% Similarity=0.392 Sum_probs=50.8
Q ss_pred EeCC-CccHHHHHHHHHhhh--CCC-CCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 17 DVSE-DLELENFKAFCEVQS--GYP-AHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 17 eV~~-~~TV~dLK~~Ie~e~--gIp-~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+| ++||++||++|+.++ |++ +++|+|+|+||+|+|+ +||++|||++|++|||+++
T Consensus 15 ~~~~~~~TV~~LK~kI~~~~~egi~~~dqQrLIy~GKiL~D~-~TL~dygI~~gstlhLv~~ 75 (75)
T cd01815 15 DVSPGGYQVSTLKQLIAAQLPDSLPDPELIDLIHCGRKLKDD-QTLDFYGIQSGSTIHILRK 75 (75)
T ss_pred CcCCccCcHHHHHHHHHHhhccCCCChHHeEEEeCCcCCCCC-CcHHHcCCCCCCEEEEEeC
Confidence 4554 789999999999995 575 9999999999999997 9999999999999999874
No 36
>KOG0004|consensus
Probab=99.40 E-value=1.7e-13 Score=116.56 Aligned_cols=74 Identities=27% Similarity=0.337 Sum_probs=70.9
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|+|+|++ +.+++++++|+++.||.++|+.|+..+|||+++|+|+|.|++|.|. .||+||+|+.-++|+|+.+..
T Consensus 1 m~ifVk~-l~~kti~~eve~~~ti~~~Kakiq~~egIp~dqqrlifag~qLedg-rtlSDY~Iqkestl~l~l~l~ 74 (156)
T KOG0004|consen 1 MQIFVKT-LTGKTITLEVEANDTIDNVKAKIQDKEGIPPDQQRLIFAGKQLEDG-RTLSDYNIQKESTLHLVLRLR 74 (156)
T ss_pred Cccchhh-ccccceeeeecccccHHHHHHhhhcccCCCchhhhhhhhhcccccC-CccccccccccceEEEEEEec
Confidence 8999999 9999999999999999999999999999999999999999999997 999999999999999987643
No 37
>cd01769 UBL Ubiquitin-like domain of UBL. UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
Probab=99.29 E-value=9.9e-12 Score=90.94 Aligned_cols=67 Identities=36% Similarity=0.507 Sum_probs=62.3
Q ss_pred EEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 5 VSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 5 V~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|+. .+|..+.+++++++||++||.+|+..+|+|++.|+|+|+|+.|.|+ .+|++||+++|+.|++..
T Consensus 2 v~~-~~~~~~~~~~~~~~ti~~lK~~i~~~~~~~~~~~~l~~~g~~l~d~-~~l~~~~v~~~~~i~v~~ 68 (69)
T cd01769 2 VKT-LTGKTFELEVSPDDTVAELKAKIAAKEGVPPEQQRLIYAGKILKDD-KTLSDYGIQDGSTLHLVL 68 (69)
T ss_pred eEc-cCCCEEEEEECCCChHHHHHHHHHHHHCcChHHEEEEECCcCCCCc-CCHHHCCCCCCCEEEEEE
Confidence 566 6799999999999999999999999999999999999999999886 899999999999999864
No 38
>cd01795 USP48_C USP ubiquitin-specific protease. The USP (ubiquitin-specific protease) family is one of at least seven deubiquitylating enzyme (DUB) families capable of deconjugating ubiquitin and ubiquitin-like adducts. While the USP's have a conserved catalytic core domain, they differ in their domain architectures. This subfamily, which includes USP31, and USP48, has a carboxy-terminal ubiquitin-like domain in addition to a DUSP (domain of ubiquitin-specific proteases) domain,
Probab=99.29 E-value=7.2e-12 Score=99.53 Aligned_cols=63 Identities=24% Similarity=0.227 Sum_probs=59.4
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
..+++|+++.||.+||.+|...+++|+.+|+|+|+|+.|.|+++||++|||..|++|+|..+.
T Consensus 16 ~~~L~V~~~~TVg~LK~lImQ~f~V~P~dQkL~~dG~~L~DDsrTLssyGv~sgSvl~Llide 78 (107)
T cd01795 16 EKALLVSANQTLKELKIQIMHAFSVAPFDQNLSIDGKILSDDCATLGTLGVIPESVILLKADE 78 (107)
T ss_pred CceEEeCccccHHHHHHHHHHHhcCCcccceeeecCceeccCCccHHhcCCCCCCEEEEEecC
Confidence 347899999999999999999999999999999999999999999999999999999998764
No 39
>PF11976 Rad60-SLD: Ubiquitin-2 like Rad60 SUMO-like; InterPro: IPR022617 This entry includes small ubiquitin-related modifier (SUMO) proteins. SUMOs are small proteins that are covalently attached to lysines as post-translational modifications and are used to control multiple cellular process including signal transduction, nuclear transport and DNA replication and repair []. Unlike ubiquitin, they are not involved in protein degradation. This entry also contains the C-terminal Rad60 DNA repair protein SUMO-like domain.; PDB: 3RD2_A 2JXX_A 3RCZ_A 3GOE_A 3A4S_D 3A4R_B 2IO1_D 1U4A_A 2K1F_A 1WZ0_A ....
Probab=99.20 E-value=5.6e-11 Score=88.88 Aligned_cols=70 Identities=21% Similarity=0.340 Sum_probs=65.0
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC-CCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA-HEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~-~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
|+|+|++ .+|+.+.+.|.+++++..|+.......|+|+ +..+|+|+|+.|.++ +|+++|||++||+|.+.
T Consensus 1 I~i~v~~-~~~~~~~~~v~~~~~~~~l~~~~~~~~~i~~~~~~~l~fdG~~L~~~-~T~~~~~ied~d~Idv~ 71 (72)
T PF11976_consen 1 ITIKVRS-QDGKEIKFKVKPTTTVSKLIEKYCEKKGIPPEESIRLIFDGKRLDPN-DTPEDLGIEDGDTIDVI 71 (72)
T ss_dssp EEEEEEE-TTSEEEEEEEETTSCCHHHHHHHHHHHTTTT-TTEEEEETTEEE-TT-SCHHHHT-STTEEEEEE
T ss_pred CEEEEEe-CCCCEEEEEECCCCcHHHHHHHHHHhhCCCccceEEEEECCEEcCCC-CCHHHCCCCCCCEEEEE
Confidence 6899999 8899999999999999999999999999999 999999999999886 89999999999999985
No 40
>cd01789 Alp11_N Ubiquitin-like domain of Alp11 tubulin-folding cofactor B. Alp11, also known as tubulin-folding cofactor B, is one of at least three proteins required for the proper folding of tubulins prior to their incorporation into microtubules. These cofactors are necessary for the biogenesis of microtubules and for cell viability. Alp11 has three domains including an N-terminal ubiquitin-like domain (represented by this CD) which executes the essential function, a central coiled-coil domain necessary for maintenance of cellular alpha-tubulin levels, and a C-terminal CLIP-170 domain is required for efficient binding to alpha-tubulin.
Probab=99.18 E-value=1.8e-10 Score=89.59 Aligned_cols=74 Identities=22% Similarity=0.346 Sum_probs=61.0
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE-ECCe-----ecCCCcccccccCCCCCCEEEEEEec
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE-FQGV-----LLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li-~~Gk-----~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.|+|+...+....+..+++++||.+||++|+..+|+|++.|+|. |.|+ .|.|+.++|+.||+++|..||++-..
T Consensus 3 ~v~i~~~~~~~~~ekr~~~~~Tv~~lK~kl~~~~G~~~~~mrL~l~~~~~~~~~~l~~d~~~L~~y~~~dg~~IhVvD~~ 82 (84)
T cd01789 3 TVNITSSADSFSFEKKYSRGLTIAELKKKLELVVGTPASSMRLQLFDGDDKLVSKLDDDDALLGSYPVDDGCRIHVIDVS 82 (84)
T ss_pred EEEEEeCCCceeeeEecCCCCcHHHHHHHHHHHHCCCccceEEEEEcCCCCeEeecCCCccEeeeccCCCCCEEEEEeCC
Confidence 46677633333344459999999999999999999999999994 8988 67666699999999999999998654
No 41
>cd01814 NTGP5 Ubiquitin-like NTGP5 and ATGP4. NTGP5 and ATGP4 are plant-specific isoprenylated GTP-binding proteins with a single fold that resembles ubiquitin. The function of these proteins is unknown.
Probab=99.11 E-value=1.1e-10 Score=95.02 Aligned_cols=73 Identities=11% Similarity=0.008 Sum_probs=60.7
Q ss_pred EEEEeCCCCeEE-EEEeCCCccHHHHHHHHHhhh-----CCC--CCCeEEEECCeecCCCcccccccC------CCCCCE
Q psy15482 3 VTVSTQHTNECF-LLDVSEDLELENFKAFCEVQS-----GYP--AHEIAIEFQGVLLQDNKKTLKTHG------IQDGDL 68 (302)
Q Consensus 3 ItV~~~~~g~~~-~leV~~~~TV~dLK~~Ie~e~-----gIp--~~~Q~Li~~Gk~L~D~~~TL~~~g------I~dgd~ 68 (302)
|.++. .+|.-+ +..+++++||.+||+.|+++. |+| +++|+|||.||+|.|+ +||++|+ +....|
T Consensus 7 ~kfrl-~dg~digp~~~~~sdTV~~lKekI~~~~p~~ke~~P~~~~~qKLIysGKiLeD~-~TL~d~~~p~g~~~~~~~T 84 (113)
T cd01814 7 IKFRL-YDGSDIGPKRYPAATTVDFLKERVVSQWPKDKEVGPKTVNEVKLISAGKILENS-KTVGECRSPVGDIAGGVIT 84 (113)
T ss_pred EEEEc-cCCCccCccccChhhHHHHHHHHHHHhcccccccCCCCHHHeEEEeCCeecCCC-CcHHHhCCcccccCCCceE
Confidence 45555 556444 788999999999999999555 455 9999999999999997 9999999 777899
Q ss_pred EEEEEecCC
Q psy15482 69 VLLKHLGSY 77 (302)
Q Consensus 69 l~L~~~~~~ 77 (302)
+||+.+.+.
T Consensus 85 mHvvlr~~~ 93 (113)
T cd01814 85 MHVVVQPPL 93 (113)
T ss_pred EEEEecCCC
Confidence 999988654
No 42
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.01 E-value=1.1e-09 Score=90.68 Aligned_cols=77 Identities=16% Similarity=0.160 Sum_probs=59.7
Q ss_pred eeEEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccccchhH
Q psy15482 213 VVMLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNGDKKK 290 (302)
Q Consensus 213 v~mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~~~~ 290 (302)
--++|++++|||+++.++|||||++|++|+++|+++||..--. .+.....+++......+++...|+||++.+..-.
T Consensus 9 ~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~-~~~~~~~ta~G~~~~~~~~l~~l~iG~~~~~nv~ 85 (121)
T TIGR02281 9 DGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRL-GYTVTVSTANGQIKAARVTLDRVAIGGIVVNDVD 85 (121)
T ss_pred CCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccC-CceEEEEeCCCcEEEEEEEeCEEEECCEEEeCcE
Confidence 3578999999999999999999999999999999999975321 2333445555444466778888999999876433
No 43
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.93 E-value=3e-09 Score=82.09 Aligned_cols=71 Identities=21% Similarity=0.299 Sum_probs=58.4
Q ss_pred EEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccccc
Q psy15482 215 MLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 215 mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~ 287 (302)
++||+|+|||.+++++|||||..++++.++|++|++. ..........+++.....++.+...++||++.+.
T Consensus 2 ~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~--~~~~~~~~~~~~~G~~~~~~~~~~~i~ig~~~~~ 72 (96)
T cd05483 2 HFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLP--LTLGGKVTVQTANGRVRAARVRLDSLQIGGITLR 72 (96)
T ss_pred cEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCC--ccCCCcEEEEecCCCccceEEEcceEEECCcEEe
Confidence 6899999999999999999999999999999999982 2222344667777777777777889999998654
No 44
>PF14560 Ubiquitin_2: Ubiquitin-like domain; PDB: 1WJN_A 2KJ6_A 2KJR_A 1V6E_A 1T0Y_A.
Probab=98.93 E-value=5.4e-09 Score=81.42 Aligned_cols=75 Identities=17% Similarity=0.336 Sum_probs=59.3
Q ss_pred EEEEEeCCC-CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC----C---eecCCCcccccccCCCCCCEEEEEE
Q psy15482 2 KVTVSTQHT-NECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ----G---VLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 2 qItV~~~~~-g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~----G---k~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
+|+|+.... +......+++++||.+||.+|+..+|+|++.|+|.+. | ..+.|+.++|..||+++|+.||+.-
T Consensus 3 ~l~It~~~~~~~~~ekr~~~~~Tv~eLK~kl~~~~Gi~~~~m~L~l~~~~~~~~~~~~~dd~~~L~~y~~~dg~~i~V~D 82 (87)
T PF14560_consen 3 KLFITSSNSKQRSVEKRFPKSITVSELKQKLEKLTGIPPSDMRLQLKSDKDDSKIEELDDDDATLGSYGIKDGMRIHVVD 82 (87)
T ss_dssp EEEEEESSSSSSEEEEEEETTSBHHHHHHHHHHHHTS-TTTEEEEEE-TSSSSEEEESSGSSSBCCHHT-STTEEEEEEE
T ss_pred EEEEEeCCCCCeeEEEEcCCCCCHHHHHHHHHHHhCCCcccEEEEEEecCCCccccccCCCccEeecCCCCCCCEEEEEe
Confidence 578887432 3577899999999999999999999999999999875 2 2345566999999999999999987
Q ss_pred ecC
Q psy15482 74 LGS 76 (302)
Q Consensus 74 ~~~ 76 (302)
..+
T Consensus 83 ~~p 85 (87)
T PF14560_consen 83 TNP 85 (87)
T ss_dssp -T-
T ss_pred CCC
Confidence 653
No 45
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=98.85 E-value=8.3e-09 Score=80.59 Aligned_cols=66 Identities=21% Similarity=0.264 Sum_probs=55.4
Q ss_pred EEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccC--cceeeeEEEEeeeeecccc
Q psy15482 217 YINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVG--VQKILGRIHMAIAELTNFN 285 (302)
Q Consensus 217 yi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg--~~~i~G~i~~~~~~ig~~~ 285 (302)
|+.++|||++++++|||||..|+||.+.+.+.|+..+.+. .-..+|.+ ..++.|++ .+.+++|+..
T Consensus 2 ~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~--~~~v~~a~G~~~~~~G~~-~~~v~~~~~~ 69 (91)
T cd05484 2 TVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPT--KKRLRTATGTKLSVLGQI-LVTVKYGGKT 69 (91)
T ss_pred EEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccc--cEEEEecCCCEeeEeEEE-EEEEEECCEE
Confidence 8999999999999999999999999999999999875444 33555555 35679999 7999999863
No 46
>PF13650 Asp_protease_2: Aspartyl protease
Probab=98.84 E-value=8.3e-09 Score=78.78 Aligned_cols=69 Identities=22% Similarity=0.286 Sum_probs=53.5
Q ss_pred EEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccccc
Q psy15482 218 INCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 218 i~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~ 287 (302)
|+++|||++++|+|||||..++|++++|+++|+...-.. ......|.|......+.....++||++.+.
T Consensus 1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~i~ig~~~~~ 69 (90)
T PF13650_consen 1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKS-VPISVSGAGGSVTVYRGRVDSITIGGITLK 69 (90)
T ss_pred CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCc-eeEEEEeCCCCEEEEEEEEEEEEECCEEEE
Confidence 689999999999999999999999999999999653322 123456777555555555558999997764
No 47
>PF13975 gag-asp_proteas: gag-polyprotein putative aspartyl protease
Probab=98.84 E-value=7.9e-09 Score=77.87 Aligned_cols=65 Identities=22% Similarity=0.249 Sum_probs=50.9
Q ss_pred eeeEEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhc-cccceecccCcceeeeEEEE
Q psy15482 212 TVVMLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDT-RWAGIAKGVGVQKILGRIHM 276 (302)
Q Consensus 212 ~v~mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~-~~~~~a~gvg~~~i~G~i~~ 276 (302)
.-..+|+.|.|||+.++|+|||||..++||.+.|+|+|+...... +..-....=...++.|.++.
T Consensus 5 ~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~~g~~~~ 70 (72)
T PF13975_consen 5 DPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEIRGVAEN 70 (72)
T ss_pred cCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCccccceEEEe
Confidence 446789999999999999999999999999999999999986665 43333333234566777664
No 48
>PLN02560 enoyl-CoA reductase
Probab=98.82 E-value=1.1e-08 Score=97.43 Aligned_cols=72 Identities=28% Similarity=0.418 Sum_probs=63.2
Q ss_pred CEEEEEeCCCCeEE---EEEeCCCccHHHHHHHHHhhhCC-CCCCeEEEEC---C----eecCCCcccccccCCCCCCEE
Q psy15482 1 MKVTVSTQHTNECF---LLDVSEDLELENFKAFCEVQSGY-PAHEIAIEFQ---G----VLLQDNKKTLKTHGIQDGDLV 69 (302)
Q Consensus 1 MqItV~~~~~g~~~---~leV~~~~TV~dLK~~Ie~e~gI-p~~~Q~Li~~---G----k~L~D~~~TL~~~gI~dgd~l 69 (302)
|+|+|+. .+|+.+ +|||+++.||+|||..|+++.++ ++++|+|++. | +.|+|+ +||+++|+++|++|
T Consensus 1 M~I~Vk~-~~Gk~i~~~~lev~~~aTV~dLK~~Isk~~~~~~~~RqRL~~~~~~gk~~g~~L~d~-ktL~d~gv~~gstL 78 (308)
T PLN02560 1 MKVTVVS-RSGREIIKGGLEVPDSATVADLKKAIHKRKKKYYPSRQRLTLPLPPGKTRPTVLDDS-KSLKDYGLGDGGTV 78 (308)
T ss_pred CEEEEEc-CCCCeecceeEEcCCCCcHHHHHHHHHHHcCCCChhheEEEEecCCCCcCccccCCC-CCHHhcCCCCCceE
Confidence 8999997 777776 89999999999999999999986 8999999983 4 377876 89999999999999
Q ss_pred EEEEe
Q psy15482 70 LLKHL 74 (302)
Q Consensus 70 ~L~~~ 74 (302)
++.--
T Consensus 79 y~kDL 83 (308)
T PLN02560 79 VFKDL 83 (308)
T ss_pred EEEeC
Confidence 88654
No 49
>KOG4248|consensus
Probab=98.81 E-value=6.2e-09 Score=109.80 Aligned_cols=73 Identities=21% Similarity=0.349 Sum_probs=69.3
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecCC
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGSY 77 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~~ 77 (302)
.|+||+ ++..+.++.|...+||.+||.+|..+.+|+.+.|||||.||+|.|+ +++.+||| ||-+|||+-|+++
T Consensus 4 ~v~vkt-ld~r~~t~~ig~q~ti~~~~d~~r~~~ni~s~~qr~i~~grvl~~~-k~vq~~~v-dgk~~hlverppp 76 (1143)
T KOG4248|consen 4 NVLVKT-LDSRTRTFIIGAQMTIKEFKDHIRASVNIPSEKQRLIYQGRVLQDD-KKVQEYNV-DGKVIHLVERPPP 76 (1143)
T ss_pred ceeeee-cccceeEEEechHHHHHHHHHHHHHhcccccccceeeecceeeccc-hhhhhccC-CCeEEEeeccCCC
Confidence 378899 9999999999999999999999999999999999999999999997 99999999 9999999998665
No 50
>KOG0001|consensus
Probab=98.75 E-value=8.1e-08 Score=69.56 Aligned_cols=71 Identities=28% Similarity=0.318 Sum_probs=67.0
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
+++.+ ..|+.+.+++.+..++..+|..|+...|+|..+|+|.+.|+.|.|+ .+|.+|+|..++++++..+.
T Consensus 2 ~~~~~-~~gk~~~~~~~~~~~i~~~k~~i~~~~~~~~~~q~~~~~~~~l~d~-~~l~~~~i~~~~~~~l~~~~ 72 (75)
T KOG0001|consen 2 IFVKT-LDGKTITLEVSPSDTIEVVKAKIRDKEGIPVDQQRLIFGGKPLEDG-RTLADYNIQEGSTLHLVLSL 72 (75)
T ss_pred EEEEe-cCCCEEEEEecCCCHHHHHHHHHHhhcCCCCeeEEEEECCEECcCC-CcHHHhCCCCCCEEEEEEec
Confidence 57778 8999999999999999999999999999999999999999999997 99999999999999998764
No 51
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where
Probab=98.65 E-value=4.8e-08 Score=75.82 Aligned_cols=64 Identities=16% Similarity=0.284 Sum_probs=48.9
Q ss_pred EEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcce--eeeEEEEeeeeecccccc
Q psy15482 218 INCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQK--ILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 218 i~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~--i~G~i~~~~~~ig~~~~~ 287 (302)
|.+.|||+++.++|||||+.|+++...|+++ . ..+.....+|+|... -+.+.-. .+++|+.++.
T Consensus 1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~---~--~~~~~~~v~gagG~~~~~v~~~~~-~v~vg~~~~~ 66 (86)
T cd06095 1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQ---E--LSTTSVLIRGVSGQSQQPVTTYRT-LVDLGGHTVS 66 (86)
T ss_pred CEEEECCEEEEEEEECCCCeEEECHHHhhhc---c--CCCCcEEEEeCCCcccccEEEeee-EEEECCEEEE
Confidence 4689999999999999999999999999998 2 233567888999765 2222211 6888888665
No 52
>cd01788 ElonginB Ubiquitin-like domain of Elongin B. Elongin B is part of an E3 ubiquitin ligase complex called VEC that activates ubiquitylation by the E2 ubiquitin-conjugating enzyme Ubc5. VEC is composed of von Hippel-Lindau tumor suppressor protein (pVHL), elongin C, cullin 2, NEDD8, and Rbx1. ElonginB binds elonginC to form the elonginBC complex which is a positive regulator of RNA polymerase II elongation factor Elongin A. The BC complex then binds VHL (von Hippel-Lindau) tumour suppressor protein to form a VCB ternary complex. Elongin B has a ubiquitin-llike domain.
Probab=98.60 E-value=1.5e-07 Score=76.66 Aligned_cols=74 Identities=23% Similarity=0.226 Sum_probs=63.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCC-------CCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGI-------QDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI-------~dgd~l~L~~ 73 (302)
|-+|+...-...++-+++.++.||.+||.+|+.-...|+++|+|+..+.+|.|+ +||++||+ +...+|-|..
T Consensus 1 MdvFlmIrR~KTTiF~dakes~tVlelK~~iegI~k~pp~dQrL~kd~qvLeD~-kTL~d~g~t~~~akaq~pA~vgLa~ 79 (119)
T cd01788 1 MDVFLMIRRHKTTIFTDAKESTTVYELKRIVEGILKRPPEDQRLYKDDQLLDDG-KTLGDCGFTSQTARPQAPATVGLAF 79 (119)
T ss_pred CceEEEEEecceEEEeecCCcccHHHHHHHHHHHhcCChhHheeecCceeeccc-ccHHHcCccccccccCCCCeEEEEE
Confidence 556666534566678999999999999999999999999999999888888885 99999999 7788888888
Q ss_pred ec
Q psy15482 74 LG 75 (302)
Q Consensus 74 ~~ 75 (302)
+.
T Consensus 80 r~ 81 (119)
T cd01788 80 RS 81 (119)
T ss_pred ec
Confidence 75
No 53
>cd01801 Tsc13_N Ubiquitin-like domain of Tsc13. Tsc13_N N-terminal domain of Tsc13. Tsc13 is an enoyl reductase involved in elongation of long chain fatty acids that localizes to the endoplasmic reticulum and is highly enriched in a novel structure marking nuclear-vacuolar junctions.
Probab=98.59 E-value=1.4e-07 Score=71.87 Aligned_cols=56 Identities=23% Similarity=0.374 Sum_probs=49.0
Q ss_pred EEe-CCCccHHHHHHHHHhhhC-CCCCCeEEE--ECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 16 LDV-SEDLELENFKAFCEVQSG-YPAHEIAIE--FQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 16 leV-~~~~TV~dLK~~Ie~e~g-Ip~~~Q~Li--~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
+++ +++.||++||..|+...+ ++++.|+|. +.|+.|.|+ +||++|||++|++|++.
T Consensus 16 ~~~~~~~aTV~dlk~~i~~~~~~~~~~Rqrl~~~~~g~~L~d~-~tL~~~gv~~g~~lyvK 75 (77)
T cd01801 16 LKVSSGDATIADLKKLIAKSSPQLTVNRQSLRLEPKGKSLKDD-DTLVDLGVGAGATLYVR 75 (77)
T ss_pred cccCCCCccHHHHHHHHHHHcCCCCcceeEEEeCCCCcccCCc-ccHhhcCCCCCCEEEEe
Confidence 444 488999999999999976 589999995 899999986 89999999999999874
No 54
>PF13881 Rad60-SLD_2: Ubiquitin-2 like Rad60 SUMO-like; PDB: 1SE9_A 1WGH_A 2GOW_A.
Probab=98.50 E-value=9.7e-07 Score=72.31 Aligned_cols=73 Identities=14% Similarity=0.174 Sum_probs=53.9
Q ss_pred EEEEeCCCCe-EEEEEeCCCccHHHHHHHHHhhh-------CCCCCCeEEEECCeecCCCcccccccCCCCCC------E
Q psy15482 3 VTVSTQHTNE-CFLLDVSEDLELENFKAFCEVQS-------GYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGD------L 68 (302)
Q Consensus 3 ItV~~~~~g~-~~~leV~~~~TV~dLK~~Ie~e~-------gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd------~ 68 (302)
|.++. .+|. +.++..++++||.+||+.|-.+. -..++..||||.||.|.|+ +||+++++.-|+ +
T Consensus 5 lkf~l-~~G~d~~~~~~~~~~TV~~lKe~i~~~WP~d~~~~p~s~~~lRLI~~GriL~d~-~tL~~~~~~~~~~~~~~~v 82 (111)
T PF13881_consen 5 LKFRL-ADGKDIGPFRFDPSTTVADLKERIWAEWPEDWEERPKSPSDLRLIYAGRILEDN-KTLSDCRLPSGETPGGPTV 82 (111)
T ss_dssp EEEEE-TTS-EEEEEEE-TTSBHHHHHHHHHHSSSTTSSSTT-SGGGEEEEETTEEE-SS-SBTGGGT--TTSETT--EE
T ss_pred EEEEE-eCCCcccccccCccChHHHHHHHHHHHCccccccCCCChhhEEEEeCCeecCCc-CcHHHhCCCCCCCCCCCEE
Confidence 45555 5777 78999999999999999999754 1245788999999999986 999999999888 5
Q ss_pred EEEEEecCC
Q psy15482 69 VLLKHLGSY 77 (302)
Q Consensus 69 l~L~~~~~~ 77 (302)
+||+.+.+.
T Consensus 83 mHlvvrp~~ 91 (111)
T PF13881_consen 83 MHLVVRPNA 91 (111)
T ss_dssp EEEEE-SSS
T ss_pred EEEEecCCC
Confidence 777777543
No 55
>PF11543 UN_NPL4: Nuclear pore localisation protein NPL4; InterPro: IPR024682 Npl4, along with Ufd1, forms the heterodimer adaptor complex UN, which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a N-terminal ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert []. This entry represents the ubiquitin-like domain.; PDB: 2PJH_A 1WF9_A.
Probab=98.49 E-value=2.7e-07 Score=71.29 Aligned_cols=71 Identities=24% Similarity=0.327 Sum_probs=42.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC---Ceec-CCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ---GVLL-QDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~---Gk~L-~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|-|.|++ .+-.+-|+|++++|+.+|+++|++.+++|...|.|+.+ ...| .++.+||+++||+.||+|+|..
T Consensus 5 milRvrS--~dG~~Rie~~~~~t~~~L~~kI~~~l~~~~~~~~L~~~~~~~~~l~s~~~~tl~~lglkHGdmlyL~~ 79 (80)
T PF11543_consen 5 MILRVRS--KDGMKRIEVSPSSTLSDLKEKISEQLSIPDSSQSLSKDRNNKEELKSSDSKTLSSLGLKHGDMLYLKP 79 (80)
T ss_dssp -EEEEE---SSEEEEEEE-TTSBHHHHHHHHHHHS---TTT---BSSGGGGGCSSS-TT-CCCCT---TT-EEE---
T ss_pred EEEEEEC--CCCCEEEEcCCcccHHHHHHHHHHHcCCCCcceEEEecCCCCcccccCCcCCHHHcCCCCccEEEEec
Confidence 4456666 55577899999999999999999999999999998643 2234 2445899999999999999853
No 56
>cd00196 UBQ Ubiquitin-like proteins. Ubiquitin homologs; Includes ubiquitin and ubiquitin-like proteins. Ubiquitin-mediated proteolysis is part of the regulated turnover of proteins required for controlling cell cycle progression. Other family members are protein modifiers that perform a wide range of functions. Ubiquitination usually results in a covalent bond between the C-terminus of ubiquitin and the epsilon-amino group of a substrate lysine. The three-step mechanism requires an activating enzyme (E1) that forms a thiol ester with the C-terminal carboxy group, a conjugating enzyme (E2) that transiently carries the activated ubiquitin molecule as a thiol ester, and a ligase (E3) that transfers the activated ubiquitin from the E2 to the substrate lysine residue. In poly-ubiquitination, ubiquitin itself is the substrate.
Probab=98.47 E-value=8.6e-07 Score=60.65 Aligned_cols=64 Identities=25% Similarity=0.365 Sum_probs=58.4
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
.++....+.++++.|+.+||..|..+.|++++.+.|+++|..+.+. ..+.++++.+|+.|++..
T Consensus 5 ~~~~~~~~~~~~~~tv~~l~~~i~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~i~~~~ 68 (69)
T cd00196 5 NDGKTVELLVPSGTTVADLKEKLAKKLGLPPEQQRLLVNGKILPDS-LTLEDYGLQDGDELVLVP 68 (69)
T ss_pred cCCCEEEEEcCCCCcHHHHHHHHHHHHCcChHHeEEEECCeECCCC-CcHHHcCCCCCCEEEEEe
Confidence 3577888999999999999999999999999999999999999886 677899999999999864
No 57
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=98.25 E-value=2.8e-06 Score=76.13 Aligned_cols=78 Identities=22% Similarity=0.249 Sum_probs=60.8
Q ss_pred eEEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhh-hhcc-ccceecccCcceeeeEEEEeeeeecccccchhHH
Q psy15482 214 VMLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRL-LDTR-WAGIAKGVGVQKILGRIHMAIAELTNFNGDKKKA 291 (302)
Q Consensus 214 ~mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~-~d~~-~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~~~~~ 291 (302)
-+.+++..|||++|+++|||||.+..++.+.|+|+||..- .|.. --..|.| +.---+|..-.++||++++.-=+|
T Consensus 104 GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG---~~~AA~V~Ld~v~IG~I~~~nV~A 180 (215)
T COG3577 104 GHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANG---RARAAPVTLDRVQIGGIRVKNVDA 180 (215)
T ss_pred CcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCC---ccccceEEeeeEEEccEEEcCchh
Confidence 3799999999999999999999999999999999999853 2222 2233444 122457888889999999887777
Q ss_pred HHH
Q psy15482 292 LAA 294 (302)
Q Consensus 292 ~~~ 294 (302)
|-+
T Consensus 181 ~V~ 183 (215)
T COG3577 181 MVA 183 (215)
T ss_pred hee
Confidence 644
No 58
>PF00077 RVP: Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026; InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=98.18 E-value=2.2e-06 Score=67.45 Aligned_cols=68 Identities=25% Similarity=0.345 Sum_probs=52.9
Q ss_pred EEEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcc-eeeeEEEEeeeeecccccc
Q psy15482 215 MLYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQ-KILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 215 mlyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~-~i~G~i~~~~~~ig~~~~~ 287 (302)
.-|+.++|||.+++|+|||||..|+|+.++....+.. .......+|+|.. ++.|.. .+.+++++....
T Consensus 5 rp~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~----~~~~~~v~~~~g~~~~~~~~-~~~v~~~~~~~~ 73 (100)
T PF00077_consen 5 RPYITVKINGKKIKALLDTGADVSIISEKDWKKLGPP----PKTSITVRGAGGSSSILGST-TVEVKIGGKEFN 73 (100)
T ss_dssp SSEEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE----EEEEEEEEETTEEEEEEEEE-EEEEEETTEEEE
T ss_pred CceEEEeECCEEEEEEEecCCCcceeccccccccccc----ccCCceeccCCCcceeeeEE-EEEEEEECccce
Confidence 4589999999999999999999999999988777655 2334566788854 456665 778888876544
No 59
>KOG1872|consensus
Probab=98.14 E-value=5.8e-06 Score=81.71 Aligned_cols=74 Identities=16% Similarity=0.187 Sum_probs=67.2
Q ss_pred EEEeCCCCeEEEEE-eCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecCCC
Q psy15482 4 TVSTQHTNECFLLD-VSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGSYH 78 (302)
Q Consensus 4 tV~~~~~g~~~~le-V~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~~~ 78 (302)
+|..+..|+.|+++ ++.+.|+..||+.+...+|+||+.|++.+.|+.+.|+ -.+...+|++|.+|+|.....+.
T Consensus 5 ~v~VKW~gk~y~v~~l~~d~t~~vlKaqlf~LTgV~PeRQKv~vKGg~a~dd-~~~~al~iKpn~~lmMmGt~e~~ 79 (473)
T KOG1872|consen 5 TVIVKWGGKKYPVETLSTDETPSVLKAQLFALTGVPPERQKVMVKGGLAKDD-VDWGALQIKPNETLMMMGTAEAG 79 (473)
T ss_pred eEeeeecCccccceeccCCCchHHHHHHHHHhcCCCccceeEEEeccccccc-ccccccccCCCCEEEeecccccc
Confidence 45555799999998 9999999999999999999999999999999999986 68999999999999999887654
No 60
>PF08284 RVP_2: Retroviral aspartyl protease; InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases.
Probab=97.98 E-value=1.5e-05 Score=67.29 Aligned_cols=69 Identities=17% Similarity=0.170 Sum_probs=52.6
Q ss_pred EEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhh-hhccccceecccC-cceeeeEEEEeeeeeccccc
Q psy15482 216 LYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRL-LDTRWAGIAKGVG-VQKILGRIHMAIAELTNFNG 286 (302)
Q Consensus 216 lyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~-~d~~~~~~a~gvg-~~~i~G~i~~~~~~ig~~~~ 286 (302)
+-..+.|+++++.+||||||..++||.++|.++||.-. ++..+ -+. +.| .....++++.+.+.+++...
T Consensus 22 i~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~-~V~-~~g~~~~~~~~~~~~~~~i~g~~~ 92 (135)
T PF08284_consen 22 ITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPI-VVS-APGGSINCEGVCPDVPLSIQGHEF 92 (135)
T ss_pred EEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCee-EEe-cccccccccceeeeEEEEECCeEE
Confidence 34568899999999999999999999999999998743 34433 222 233 44557888899999988654
No 61
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=97.90 E-value=2.4e-05 Score=61.94 Aligned_cols=64 Identities=20% Similarity=0.216 Sum_probs=52.5
Q ss_pred EEEEcC-eeEEEEEcCCCccccCCHHHHHHcC---chhhhhccccceecccC--cceeeeEEEEeeeeecccc
Q psy15482 219 NCKVNG-YPVKAFIDSGAQTTIMSLACAERVN---IMRLLDTRWAGIAKGVG--VQKILGRIHMAIAELTNFN 285 (302)
Q Consensus 219 ~~~ing-~~v~a~vDsGaq~~i~s~~~a~~~g---l~~~~d~~~~~~a~gvg--~~~i~G~i~~~~~~ig~~~ 285 (302)
.++||| +++++.|||||+.|+||.+...+.| ..+|--+.+ ..++-+ ...+.|. +.+.+++|+.+
T Consensus 2 ~~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~--~L~~~~g~~~~~~G~-~~~~v~~~~~~ 71 (93)
T cd05481 2 DMKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPV--RLTAYGGSTIPVEGG-VKLKCRYRNPK 71 (93)
T ss_pred ceEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCe--EEEeeCCCEeeeeEE-EEEEEEECCcE
Confidence 578999 9999999999999999999999998 677777765 444444 4456888 57889999876
No 62
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=97.85 E-value=4.3e-05 Score=61.93 Aligned_cols=66 Identities=17% Similarity=0.308 Sum_probs=47.9
Q ss_pred EEEEEEcC------eeEEEEEcCCCcccc-CCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeeccccc
Q psy15482 217 YINCKVNG------YPVKAFIDSGAQTTI-MSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNG 286 (302)
Q Consensus 217 yi~~~ing------~~v~a~vDsGaq~~i-~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~ 286 (302)
|+++++.+ .+|+|+|||||...+ ++.+.|+++||... .......|.|. ...-++....+++|+.+.
T Consensus 1 ~~~v~~~~p~~~~~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~-~~~~~~tA~G~---~~~~~v~~~~v~igg~~~ 73 (107)
T TIGR03698 1 TLDVELSNPKNPEFMEVRALVDTGFSGFLLVPPDIVNKLGLPEL-DQRRVYLADGR---EVLTDVAKASIIINGLEI 73 (107)
T ss_pred CEEEEEeCCCCCCceEEEEEEECCCCeEEecCHHHHHHcCCCcc-cCcEEEecCCc---EEEEEEEEEEEEECCEEE
Confidence 46677755 499999999999887 99999999999874 33334455552 334456677777777654
No 63
>KOG0006|consensus
Probab=97.81 E-value=3.6e-05 Score=72.93 Aligned_cols=72 Identities=18% Similarity=0.313 Sum_probs=61.6
Q ss_pred CEEEEEeCC--CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQH--TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~--~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|-+.|.... ....++|+|+.+.+|.+||+.++...|+|+++.++||.||.|.|+ .|++.+.+..-+.+|++.
T Consensus 1 m~~lvqf~~~~~~h~l~v~v~~~t~I~~lke~Vak~~gvp~D~L~viFaGKeLs~~-ttv~~cDL~qqs~~hi~~ 74 (446)
T KOG0006|consen 1 MIVLVQFNKTGSSHGLPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELSND-TTVQNCDLSQQSATHIML 74 (446)
T ss_pred CeEEEEeCCccccCceeEEEecCCCHHHHHHHHHHhhCCChhheEEEEeccccccC-ceeecccccccchhhhhc
Confidence 667777632 344568999999999999999999999999999999999999996 899988888888888764
No 64
>PF08817 YukD: WXG100 protein secretion system (Wss), protein YukD; InterPro: IPR014921 YukD is a bacterial protein that adopts a ubiquitin-like fold []. Ubiquitin covalently binds to protein and flags them for protein degradation, however conjugation assays have indicated that YukD lacks the capacity for covalent bond formation with other proteins []. ; PDB: 2BPS_B.
Probab=97.63 E-value=0.00012 Score=55.91 Aligned_cols=69 Identities=23% Similarity=0.383 Sum_probs=47.6
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCC------eEEE-ECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHE------IAIE-FQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~------Q~Li-~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
+|||.. ..+..+.+.++.+.+|.+|...|-...+.+... -+|. -+|.+|.++ .||+++||.|||+|+|.
T Consensus 4 rVtv~~-~~~~~~Dl~lP~~vpv~~li~~l~~~~~~~~~~~~~~~~~~L~~~~g~~L~~~-~tL~~~gV~dGd~L~L~ 79 (79)
T PF08817_consen 4 RVTVDA-GNGRQVDLALPADVPVAELIPELVELLGLPGDDPPGHGQWVLARAGGRPLDPD-QTLADAGVRDGDVLVLR 79 (79)
T ss_dssp EEEEE--TT--EEEEEEETTSBTTHHHHHHHHHS---S---TT-E-EEEG-GGTEEEETT-SBCGGGT--TT-EEEE-
T ss_pred EEEEEc-CCCcEEEEEcCCCCcHHHHHHHHHHHhCCccCCCCCcceEEEEecCCcccCCc-CcHhHcCCCCCCEEEeC
Confidence 355555 336899999999999999999999988764432 3555 679999886 99999999999999874
No 65
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=97.62 E-value=0.00021 Score=51.42 Aligned_cols=33 Identities=36% Similarity=0.695 Sum_probs=31.1
Q ss_pred EEEEEcCeeEEEEEcCCCccccCCHHHHHHcCc
Q psy15482 218 INCKVNGYPVKAFIDSGAQTTIMSLACAERVNI 250 (302)
Q Consensus 218 i~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl 250 (302)
|.|.+||.++++++|+||..++|+..++++.++
T Consensus 1 ~~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~ 33 (92)
T cd00303 1 LKGKINGVPVRALVDSGASVNFISESLAKKLGL 33 (92)
T ss_pred CEEEECCEEEEEEEcCCCcccccCHHHHHHcCC
Confidence 578899999999999999999999999999887
No 66
>KOG4495|consensus
Probab=97.45 E-value=0.00017 Score=57.23 Aligned_cols=63 Identities=24% Similarity=0.227 Sum_probs=50.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE-CC-eecCCCcccccccCCC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF-QG-VLLQDNKKTLKTHGIQ 64 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~-~G-k~L~D~~~TL~~~gI~ 64 (302)
|.++++..-...++-++.+++.||-+||.+++.-..-|++.|+|+. .. ..|.| .+||+++|..
T Consensus 1 ~~~f~~VrR~kttif~da~es~tV~elK~~l~gi~~~Pvn~qrL~kmd~eqlL~D-~ktL~d~gft 65 (110)
T KOG4495|consen 1 MDVFLRVRRHKTTIFTDAKESSTVFELKRKLEGILKRPVNEQRLYKMDTEQLLDD-GKTLGDCGFT 65 (110)
T ss_pred CceeeeeeecceeEEeecCccccHHHHHHHHHHHHhCCCcchheeecCHHHHhhc-cchhhhcccc
Confidence 4556555335566779999999999999999999999999999997 44 45566 5999999764
No 67
>cd01811 OASL_repeat1 2'-5' oligoadenylate synthetase-like protein, repeat 1 of 2. OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Probab=97.36 E-value=0.0013 Score=49.96 Aligned_cols=72 Identities=19% Similarity=0.164 Sum_probs=60.4
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC----Cee-cCCCcccccccCCCCCCEEEEEEec
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ----GVL-LQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~----Gk~-L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.||||+- .....+++-|+|..+|..+|+.|....|++- .|+|.|. -++ |.+. .||++|||=.+-.|.|.-..
T Consensus 1 iqVtV~q-~g~~dl~l~vnPy~pI~k~K~kI~~~~~~~g-~qrLsfQepgg~rqlL~s~-~sLA~yGiFs~~~i~lleT~ 77 (80)
T cd01811 1 IQVTVEQ-TGYSDWILRVNPYSPIRKIKEKIRRSRNCSG-LQRLSFQEPGGERQLLSSR-KSLADYGIFSKTNICLLETF 77 (80)
T ss_pred CEEEeee-cCCCceEEEeCCcchHHHHHHHHHHhhCccc-ceEEEeecCCccccccccc-ccHhhhcceeccEEEEEecC
Confidence 3788988 8888899999999999999999999999998 9999982 233 4564 89999999888777776543
No 68
>KOG3493|consensus
Probab=97.25 E-value=0.00012 Score=54.13 Aligned_cols=68 Identities=19% Similarity=0.241 Sum_probs=58.0
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
+++.. --|+..-+-..+++||+|+|.+|+.++|..++..+|---+.+++|. -+|++|.|++|--+.|-
T Consensus 4 v~~nD-rLGKKVRvKCn~dDtiGD~KKliaaQtGT~~~kivl~k~~~i~kd~-I~L~dyeihdg~~lely 71 (73)
T KOG3493|consen 4 VVLND-RLGKKVRVKCNTDDTIGDLKKLIAAQTGTRPEKIVLKKWYTIFKDH-ITLSDYEIHDGMNLELY 71 (73)
T ss_pred ehhhh-hcCceEEEEeCCcccccCHHHHHHHhhCCChhHhHHHhhhhhhhcc-cceeeEEeccCccEEEe
Confidence 34444 4588889999999999999999999999999999998667777886 89999999999877663
No 69
>COG5417 Uncharacterized small protein [Function unknown]
Probab=96.96 E-value=0.0042 Score=47.21 Aligned_cols=69 Identities=20% Similarity=0.256 Sum_probs=55.8
Q ss_pred EEEEEeC-CCCeEEEEEeCCCccHHHHHHHHHhhhCCC-----CCCeEEEECCeecCCCcccccccCCCCCCEEEE
Q psy15482 2 KVTVSTQ-HTNECFLLDVSEDLELENFKAFCEVQSGYP-----AHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 2 qItV~~~-~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp-----~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L 71 (302)
+|||-.. -+|.+|.|.++.-.++..|-..+.+...|. -...+..-+++.|.++ ..|.+|||.+||.|.+
T Consensus 6 kVTvD~t~y~g~~yDLrl~d~~pikklIdivwe~~kis~~~reg~~Ikv~nKa~llsgd-~kL~d~~IadGD~Lei 80 (81)
T COG5417 6 KVTVDFTNYNGGTYDLRLPDYLPIKKLIDIVWESLKISIFDREGTQIKVMNKAQLLSGD-DKLIDYQIADGDILEI 80 (81)
T ss_pred EEEEEeEecCCceEEEeccccchHHHHHHHHHHHhhccccccCCCEEEEeccceEecCC-ceEEeccccCCCEEEe
Confidence 4554322 579999999999999999998888765543 2557888899999886 8999999999999875
No 70
>KOG1769|consensus
Probab=96.94 E-value=0.0067 Score=48.54 Aligned_cols=72 Identities=11% Similarity=0.263 Sum_probs=63.2
Q ss_pred EEEEEeC-CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 2 KVTVSTQ-HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 2 qItV~~~-~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+++.. -++....+-|..++.+.-|+..-....|++.++.+++|+|+.+.+. +|-.+++.++||.|.+..-
T Consensus 20 hi~LKV~gqd~~~~~Fkikr~t~LkKLM~aYc~r~Gl~~~s~RFlFdG~rI~~~-~TP~~L~mEd~D~Iev~~~ 92 (99)
T KOG1769|consen 20 HINLKVKGQDGSVVVFKIKRHTPLKKLMKAYCERQGLSMNSLRFLFDGQRIRET-HTPADLEMEDGDEIEVVQE 92 (99)
T ss_pred eEEEEEecCCCCEEEEEeecCChHHHHHHHHHHHcCCccceEEEEECCcCcCCC-CChhhhCCcCCcEEEEEee
Confidence 5666553 4566668899999999999999999999999999999999999986 8999999999999998754
No 71
>PF00789 UBX: UBX domain; InterPro: IPR001012 The UBX domain is found in ubiquitin-regulatory proteins, which are members of the ubiquitination pathway, as well as a number of other proteins including FAF-1 (FAS-associated factor 1), the human Rep-8 reproduction protein and several hypothetical proteins from yeast. The function of the UBX domain is not known although the fragment of avian FAF-1 containing the UBX domain causes apoptosis of transfected cells.; GO: 0005515 protein binding; PDB: 3QX1_A 1H8C_A 3QCA_B 3QQ8_B 3QC8_B 3R3M_A 3QWZ_B 1S3S_H 1JRU_A 1I42_A ....
Probab=96.92 E-value=0.0068 Score=46.15 Aligned_cols=70 Identities=19% Similarity=0.266 Sum_probs=60.1
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCC-eEEE--ECCeecCCCc-ccccccCCCCCCEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHE-IAIE--FQGVLLQDNK-KTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~-Q~Li--~~Gk~L~D~~-~TL~~~gI~dgd~l~L~ 72 (302)
+|-|+. .+|..+.-...++.||.+|...|......+... ..|+ |-.+.|.+++ +||+++|+..+.+|+|.
T Consensus 8 ~I~vRl-pdG~~l~~~F~~~~tl~~l~~~v~~~~~~~~~~~f~L~~~~Pr~~l~~~~~~tl~e~~l~p~~~l~v~ 81 (82)
T PF00789_consen 8 RIQVRL-PDGSRLQRRFPKSDTLQDLYDFVESQLFSPEESDFELITAFPRRELTDEDSKTLEEAGLLPSATLIVE 81 (82)
T ss_dssp EEEEEE-TTSTEEEEEEETTSBHHHHHHHHHHHHHCTTTSSEEEEESSSTEECCSTTTSBTCCCTTSSCEEEEEE
T ss_pred EEEEEC-CCCCEEEEEECCcchHHHHHHHHHHhcCCCCCccEEEEeCCCCcCCCccccccHHHhcCCCCeEEEEE
Confidence 467888 889999999999999999999999998887765 7776 6678886654 79999999999999885
No 72
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=96.86 E-value=0.002 Score=47.01 Aligned_cols=46 Identities=37% Similarity=0.678 Sum_probs=37.1
Q ss_pred CCCCHH--HHHHHHhcCH----HHHHHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 111 LSDDPA--TVRQLLLDNP----KEIALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 111 ~~~npe--~iR~~~l~NP----~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
+.++|. .+|+++.+|| .++++|.++||+|++.|. +|++.|-+++..
T Consensus 4 Lr~~Pqf~~lR~~vq~NP~lL~~lLqql~~~nP~l~q~I~-~n~e~Fl~ll~~ 55 (59)
T PF09280_consen 4 LRNNPQFQQLRQLVQQNPQLLPPLLQQLGQSNPQLLQLIQ-QNPEEFLRLLNE 55 (59)
T ss_dssp GTTSHHHHHHHHHHHC-GGGHHHHHHHHHCCSHHHHHHHH-HTHHHHHHHHHS
T ss_pred HHcChHHHHHHHHHHHCHHHHHHHHHHHhccCHHHHHHHH-HCHHHHHHHHcC
Confidence 345565 7899999998 567778999999999997 589999998853
No 73
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=96.66 E-value=0.0045 Score=49.53 Aligned_cols=58 Identities=16% Similarity=0.221 Sum_probs=43.0
Q ss_pred EEEEEeCCC-CeEEEEEeC--CCccHHHHHHHHHhhhC--CCCCCeEEEECCeecCCCccccccc
Q psy15482 2 KVTVSTQHT-NECFLLDVS--EDLELENFKAFCEVQSG--YPAHEIAIEFQGVLLQDNKKTLKTH 61 (302)
Q Consensus 2 qItV~~~~~-g~~~~leV~--~~~TV~dLK~~Ie~e~g--Ip~~~Q~Li~~Gk~L~D~~~TL~~~ 61 (302)
.|+|+. .+ ---.+|+|+ .+.||..||.+|....+ ..-..+||||+||.|.|. ..|+..
T Consensus 2 ~l~IRF-s~sipDl~L~I~~~~~~Tv~~LK~lIR~~~p~~~s~~rLRlI~~Gr~L~d~-t~l~~~ 64 (97)
T PF10302_consen 2 YLTIRF-SDSIPDLPLDIPSPNTTTVAWLKQLIRERLPPEPSRRRLRLIYAGRLLNDH-TDLSSE 64 (97)
T ss_pred eEEEEE-CCCCCCceeecCCCCcccHHHHHHHHHhhcCCCCccccEEeeecCcccCcc-chhhhh
Confidence 367776 32 122567777 88999999999999973 445678999999999986 555544
No 74
>KOG1639|consensus
Probab=96.62 E-value=0.0044 Score=57.30 Aligned_cols=73 Identities=21% Similarity=0.306 Sum_probs=57.3
Q ss_pred CEEEEEeCCCCeEEE-EEeCCCccHHHHH-HHHHhhhCCCCCCeEEEE----CCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFL-LDVSEDLELENFK-AFCEVQSGYPAHEIAIEF----QGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~-leV~~~~TV~dLK-~~Ie~e~gIp~~~Q~Li~----~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|.||+...+.|...+ .+.+...|+.|++ +....+..+.+..+|+.+ +|++|.|+ .+|++||...|++|.+..-
T Consensus 1 m~It~~srs~~~~~~~~~~s~~~ti~d~~~~~~~~~~k~~~~~~r~tlr~e~kgkpl~~~-s~l~e~~~~s~~~i~vKDL 79 (297)
T KOG1639|consen 1 MEITIASRSKGLRIKEKDLSGSETIDDLLKAISAKNLKITPYRIRLTLRVEPKGKPLIDN-SKLQEYGDGSGATIYVKDL 79 (297)
T ss_pred CceeeeccCCCceeeeecCCCCCcHHHHHHHHHHhhhccCccchhheeeccCCCccccch-hHHHHhccCCCCEEEEecc
Confidence 889998744555655 7888899999999 555666778886665554 79999997 7999999999999887543
No 75
>KOG3206|consensus
Probab=96.62 E-value=0.0065 Score=54.72 Aligned_cols=75 Identities=21% Similarity=0.298 Sum_probs=60.9
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE-ECC-----eecCCCcccccccCCCCCCEEEEEEec
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE-FQG-----VLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li-~~G-----k~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
+|.|++.++.-......++++||.+||.+++..+|.+++.+.|. |.| -.|.|+++.|..|+..||-.||++-..
T Consensus 3 ~v~Iss~~~~~~~Ekr~~~~ltl~q~K~KLe~~~G~~~~~M~l~l~~~~d~~~~~lsn~d~~lg~~~~~Dg~rihviD~~ 82 (234)
T KOG3206|consen 3 RVVISSSLNDFRTEKRLSNSLTLAQFKDKLELLTGTEAESMELELYDGDDKKVSALSNEDADLGFYKVEDGLRIHVIDSN 82 (234)
T ss_pred EEEEecccccchhhhhcCCcCcHHHHHhhhhhhhCCCccceEEEEEcCCCceeeeccCCcccccccCCCCceEEEEEecC
Confidence 45666633444456778999999999999999999999999996 655 457777789999999999999998765
Q ss_pred C
Q psy15482 76 S 76 (302)
Q Consensus 76 ~ 76 (302)
+
T Consensus 83 ~ 83 (234)
T KOG3206|consen 83 A 83 (234)
T ss_pred c
Confidence 3
No 76
>smart00166 UBX Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p.
Probab=96.56 E-value=0.015 Score=44.38 Aligned_cols=70 Identities=10% Similarity=0.152 Sum_probs=58.2
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeecCCC--cccccccCCCCCCEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLLQDN--KKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L~D~--~~TL~~~gI~dgd~l~L~ 72 (302)
+|-|+. .+|..+.....++.||.+|.+.|....+.+.....|+ |-.+.+.++ ++||.++|+..+.+|+|.
T Consensus 6 ~I~iRl-PdG~ri~~~F~~~~tl~~v~~~v~~~~~~~~~~f~L~t~~Prk~l~~~d~~~tL~e~gL~p~~~l~v~ 79 (80)
T smart00166 6 RLQIRL-PDGSRLVRRFPSSDTLRTVYEFVSAALTDGNDPFTLNSPFPRRTFTKDDYSKTLLELALLPSSTLVLE 79 (80)
T ss_pred EEEEEc-CCCCEEEEEeCCCCcHHHHHHHHHHcccCCCCCEEEEeCCCCcCCccccccCCHHHCCCCCceEEEEe
Confidence 477888 8999999999999999999999977677776777886 567778653 479999999999998874
No 77
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=96.52 E-value=0.0043 Score=48.67 Aligned_cols=26 Identities=23% Similarity=0.405 Sum_probs=23.4
Q ss_pred EEEEcCeeEEEEEcCCCccccCCHHH
Q psy15482 219 NCKVNGYPVKAFIDSGAQTTIMSLAC 244 (302)
Q Consensus 219 ~~~ing~~v~a~vDsGaq~~i~s~~~ 244 (302)
.++|+|+++.|++||||..||++...
T Consensus 2 ~~~i~g~~~~~llDTGAd~Tvi~~~~ 27 (87)
T cd05482 2 TLYINGKLFEGLLDTGADVSIIAEND 27 (87)
T ss_pred EEEECCEEEEEEEccCCCCeEEcccc
Confidence 57899999999999999999999743
No 78
>PF12384 Peptidase_A2B: Ty3 transposon peptidase; InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=96.46 E-value=0.009 Score=52.18 Aligned_cols=49 Identities=22% Similarity=0.443 Sum_probs=40.7
Q ss_pred EEEEEEEcCeeEEEEEcCCCccccCCHHHHHHcCchhhhh--ccccceecc
Q psy15482 216 LYINCKVNGYPVKAFIDSGAQTTIMSLACAERVNIMRLLD--TRWAGIAKG 264 (302)
Q Consensus 216 lyi~~~ing~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d--~~~~~~a~g 264 (302)
+-+++.|+|.+|++|+|||+-+++|++..+++++|..+-- .||+|...|
T Consensus 35 ~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs~ 85 (177)
T PF12384_consen 35 AIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVSG 85 (177)
T ss_pred EEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeeccC
Confidence 4567889999999999999999999999999999997543 356665554
No 79
>PF11470 TUG-UBL1: GLUT4 regulating protein TUG; InterPro: IPR021569 TUG is a GLUT4 regulating protein and functions to retain membrane vesicles containing GLUT4 intracellularly. TUG releases the GLUT4 containing vesicles to the cellular exocytic machinery in response to insulin stimulation which allows translocation to the plasma membrane []. TUG has an N-terminal ubiquitin-like domain (UBL1) which in similar proteins appears to participate in protein-protein interactions []. The region does have a area of negative electrostatic potential and increased backbone motility which leads to suggestions of a potential protein-protein interaction site []. ; PDB: 2AL3_A.
Probab=96.44 E-value=0.013 Score=43.48 Aligned_cols=62 Identities=10% Similarity=0.085 Sum_probs=45.5
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L 71 (302)
.++..+.+.|.|++++.++-+..=.++|+++++-.|.|++|+|+-. -++.-.|+-+|..|.|
T Consensus 4 ~~~rr~~vkvtp~~~l~~VL~eac~k~~l~~~~~~L~h~~k~ldls-lp~R~snL~n~akLeL 65 (65)
T PF11470_consen 4 YNFRRFKVKVTPNTTLNQVLEEACKKFGLDPSSYDLKHNNKPLDLS-LPFRLSNLPNNAKLEL 65 (65)
T ss_dssp TTS-EEEE---TTSBHHHHHHHHHHHTT--GGG-EEEETTEEESSS--BHHHH---SS-EEEE
T ss_pred cCCcEEEEEECCCCCHHHHHHHHHHHcCCCccceEEEECCEEeccc-cceeecCCCCCCEEeC
Confidence 4788999999999999999999999999999999999999999664 8999999999998865
No 80
>cd01767 UBX UBX (ubiquitin regulatory X) domain. The UBX (ubiquitin regulatory X) domain has a beta-grasp fold that is structurally quite similar to ubiquitin although UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Most UBX-containing proteins including p47, FAF1, and SAKS1 (Y33K) also contain a UBA (ubiquitin-associated) domain and are thought to serve as adaptor molecules that shuttle proteins to the proteasome for degradation.
Probab=96.44 E-value=0.023 Score=42.95 Aligned_cols=69 Identities=17% Similarity=0.292 Sum_probs=54.2
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeecCC--CcccccccCCCCCCEEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLLQD--NKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L~D--~~~TL~~~gI~dgd~l~L~~ 73 (302)
+|-|+. .+|+.+......+.||.+|.+.|.....- .....|+ |-.+.+.+ .+.||+++|+. ++++.+..
T Consensus 4 ~i~iRl-pdG~~~~~~F~~~~tl~~l~~fv~~~~~~-~~~f~L~t~~Pr~~~~~~~~~~TL~e~gL~-~s~~~~~~ 76 (77)
T cd01767 4 KIQIRL-PDGKRLEQRFNSTHKLSDVRDFVESNGPP-AEPFTLMTSFPRRVLTDLDYELTLQEAGLV-NEVVFQRL 76 (77)
T ss_pred EEEEEc-CCCCEEEEEeCCCCCHHHHHHHHHHcCCC-CCCEEEEeCCCCccCCCCCccCcHHHcCCc-cceEEEEe
Confidence 467888 88999999999999999999999877543 4556676 55677865 56899999999 56666543
No 81
>PF05585 DUF1758: Putative peptidase (DUF1758); InterPro: IPR008737 This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases.
Probab=96.29 E-value=0.0024 Score=55.01 Aligned_cols=29 Identities=24% Similarity=0.443 Sum_probs=26.4
Q ss_pred eeEEEEEcCCCccccCCHHHHHHcCchhh
Q psy15482 225 YPVKAFIDSGAQTTIMSLACAERVNIMRL 253 (302)
Q Consensus 225 ~~v~a~vDsGaq~~i~s~~~a~~~gl~~~ 253 (302)
++++||+|||+|.|+|+.++|+++||...
T Consensus 11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~ 39 (164)
T PF05585_consen 11 VEARALLDSGSQRSFITESLANKLNLPGT 39 (164)
T ss_pred EEEEEEEecCCchhHHhHHHHHHhCCCCC
Confidence 45889999999999999999999999853
No 82
>cd01770 p47_UBX p47-like ubiquitin domain. p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=96.07 E-value=0.036 Score=42.54 Aligned_cols=67 Identities=12% Similarity=0.186 Sum_probs=53.9
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCC-CCCeEEE--ECCeecCCCcccccccCCCCCCEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYP-AHEIAIE--FQGVLLQDNKKTLKTHGIQDGDLVL 70 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp-~~~Q~Li--~~Gk~L~D~~~TL~~~gI~dgd~l~ 70 (302)
+|-|+. .+|+.+......+.||.+|.+.|..+.+-+ .....|. |=.|.|.|++.||+++|+.+ +.|.
T Consensus 6 ~iqiRl-pdG~r~~~rF~~~~tv~~l~~~v~~~~~~~~~~~f~L~t~fP~k~l~~~~~Tl~eagL~~-s~v~ 75 (79)
T cd01770 6 SIQIRL-ADGKRLVQKFNSSHRVSDVRDFIVNARPEFAARPFTLMTAFPVKELSDESLTLKEANLLN-AVIV 75 (79)
T ss_pred EEEEEC-CCCCEEEEEeCCCCcHHHHHHHHHHhCCCCCCCCEEEecCCCCcccCCCCCcHHHCCCcC-cEEE
Confidence 467888 899999999999999999999999876432 3556776 66888977779999999995 4443
No 83
>PF14453 ThiS-like: ThiS-like ubiquitin
Probab=95.89 E-value=0.036 Score=40.21 Aligned_cols=56 Identities=18% Similarity=0.315 Sum_probs=43.5
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+|.. + .+++..+.|+.+||..+..+. =.++|||-+.+++ +-+++||.|++..|
T Consensus 1 M~I~vN~----k--~~~~~~~~tl~~lr~~~k~~~------DI~I~NGF~~~~d------~~L~e~D~v~~Ikk 56 (57)
T PF14453_consen 1 MKIKVNE----K--EIETEENTTLFELRKESKPDA------DIVILNGFPTKED------IELKEGDEVFLIKK 56 (57)
T ss_pred CEEEECC----E--EEEcCCCcCHHHHHHhhCCCC------CEEEEcCcccCCc------cccCCCCEEEEEeC
Confidence 6777643 2 578889999999999876533 3789999998764 46889999998775
No 84
>cd01772 SAKS1_UBX SAKS1-like UBX domain. SAKS1 (SAPK-substrate-1), also known as Y33K, is a widely expressed protein containing N-terminal UBA (ubiquitin-associated) and C-terminal UBX (ubiqiutin-like) domains that was identified as a substrate of stress-activated protein kinases (SAPKs). SAKS1 is related evolutionarily to two other UBA/UBX-containing proteins, p47 and Faf1. The UBA and UBX domains of SAKS1 bind ubiquitin tetramers and valosin-containing protein (VCP), respectively suggesting a role for SAKS1 as an adaptor that directs VCP to polyubiquitinated proteins facilitating its destruction by the proteasome. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=95.84 E-value=0.062 Score=41.08 Aligned_cols=69 Identities=12% Similarity=0.284 Sum_probs=55.6
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeecCC--CcccccccCCCCCCEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLLQD--NKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L~D--~~~TL~~~gI~dgd~l~L~ 72 (302)
+|-|+. .+|+.+.-..+.+.|+.+|.+.|+...+.+ ....|+ |--|.+.+ .++||+++|+.+..+|+|.
T Consensus 6 ~i~iRl-p~G~~~~~~F~~~~tl~~v~~fV~~~~~~~-~~f~L~t~fPrk~~~~~d~~~TL~elgL~Psa~L~v~ 78 (79)
T cd01772 6 RIQIRL-LDGTTLKQTFKAREQLAAVRLFVELNTGNG-GPFTLMTPFPRKVFTEDDMEKPLQELGLVPSAVLIVT 78 (79)
T ss_pred EEEEEC-CCCCEEEEEeCCCChHHHHHHHHHHcCCCC-CCEEEEeCCCCeECCcccccCCHHHCCCCCceEEEEe
Confidence 467888 889999889999999999999999765543 446665 66788853 3589999999999998874
No 85
>PF13019 Telomere_Sde2: Telomere stability and silencing
Probab=95.68 E-value=0.066 Score=46.71 Aligned_cols=66 Identities=14% Similarity=0.120 Sum_probs=50.9
Q ss_pred CEEEEEeCCCC----eEEEEEeCCCccHHHHHHHHHhhhCCCCCCe-EEEE-CCeec-CCCcccccccCCCCCC
Q psy15482 1 MKVTVSTQHTN----ECFLLDVSEDLELENFKAFCEVQSGYPAHEI-AIEF-QGVLL-QDNKKTLKTHGIQDGD 67 (302)
Q Consensus 1 MqItV~~~~~g----~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q-~Li~-~Gk~L-~D~~~TL~~~gI~dgd 67 (302)
|+|+|++ .+| .++.+.+++++||.+|+..|....++|...| .|++ .|+.| .+++..++++.-.+++
T Consensus 1 i~Vlvss-~~g~~lp~tl~~~lp~~ttv~dL~~~l~~~~~~~~~~~~~L~~~~n~~l~~~~~~~~s~l~~~~~~ 73 (162)
T PF13019_consen 1 INVLVSS-FDGLTLPPTLSLSLPSTTTVSDLKDRLSERLPIPSSSQLYLTTNSNGQLSPSSDIPLSSLLSSSQD 73 (162)
T ss_pred CeEEEec-CCCCCCCCeEEeeCCCCCcHHHHHHHHHhhcCCCccceeEEEEeCCCeeCCCccccHHhhccCcCC
Confidence 6899999 888 5889999999999999999999999999884 4555 34455 3333567777655544
No 86
>PRK06437 hypothetical protein; Provisional
Probab=95.08 E-value=0.16 Score=37.67 Aligned_cols=60 Identities=13% Similarity=0.253 Sum_probs=46.5
Q ss_pred EEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 4 TVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
.|+. ..++..++++++..||.+|-+. .|+++....+..||+++.. ++-+++||.|.++.-
T Consensus 4 ~~~v-~g~~~~~~~i~~~~tv~dLL~~----Lgi~~~~vaV~vNg~iv~~------~~~L~dgD~Veiv~~ 63 (67)
T PRK06437 4 MIRV-KGHINKTIEIDHELTVNDIIKD----LGLDEEEYVVIVNGSPVLE------DHNVKKEDDVLILEV 63 (67)
T ss_pred eEEe-cCCcceEEEcCCCCcHHHHHHH----cCCCCccEEEEECCEECCC------ceEcCCCCEEEEEec
Confidence 4444 3455577888999999998755 4889998999999999963 446899999998753
No 87
>cd01774 Faf1_like2_UBX Faf1 ike-2 UBX domain. Faf1_like2 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=95.05 E-value=0.18 Score=39.29 Aligned_cols=69 Identities=12% Similarity=0.073 Sum_probs=55.8
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC--eecC-------CCcccccccCCCCCCEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG--VLLQ-------DNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G--k~L~-------D~~~TL~~~gI~dgd~l~L~ 72 (302)
+|-|+. .+|....-....+.||++|...|.. .+..+....|+.+= |.+. |.+.||++.|+....+|++.
T Consensus 6 ~I~iRl-p~G~Rl~rrF~~~~tl~~l~~fv~~-~~~~~~~f~L~t~FPrr~~~~~~~~~~~~~~TL~eaGL~~s~~L~V~ 83 (85)
T cd01774 6 KIVFKL-PNGTRVERRFLFTQSLRVIHDFLFS-LKETPEKFQIVTNFPRRVLPCLPSEGDPPPPTLLEAGLSNSEVLFVQ 83 (85)
T ss_pred EEEEEC-CCCCEEEEEeCCCCcHHHHHHHHHh-CCCCCCcEEEecCCCCccccccccccCcCCCCHHHcCCCCccEEEEe
Confidence 567888 8899988888999999999999964 45566788888755 7785 33479999999998888774
No 88
>cd01771 Faf1_UBX Faf1 UBX domain. Faf1 (fas-associated factor1) is a nucleolar protein that was first identified as an interaction partner of the death receptor Fas. Faf1 contains N-terminal UAS (ubiquitin-associated) and C-terminal UBX (ubiquitin-like) domains and is closely related to other UBA/UBX-containing proteins like p47, Rep8 and SAKS1. Faf1 is thought to be involved in 18S rRNA synthesis and/or 40S ribosomal subunit assembly. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=94.65 E-value=0.26 Score=37.91 Aligned_cols=70 Identities=21% Similarity=0.281 Sum_probs=57.8
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeecC--CCcccccccCCCCCCEEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLLQ--DNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L~--D~~~TL~~~gI~dgd~l~L~~ 73 (302)
+|-|+. .+|....-....++++.+|-..|..+ |.+....+|+ |=-|.+. |.+.||.+.|+....+|+|.-
T Consensus 6 ~i~iRl-P~G~r~~rrF~~t~~L~~l~~fv~~~-~~~~~~f~L~t~fPRk~~~~~d~~~TL~e~gL~p~~~L~Vee 79 (80)
T cd01771 6 KLRVRT-PSGDFLERRFLGDTPLQVLLNFVASK-GYPIDEYKLLSSWPRRDLTQLDPNFTLLELKLYPQETLILEE 79 (80)
T ss_pred EEEEEC-CCCCEEEEEeCCCCcHHHHHHHHHhc-CCCCCCEEEecCCCCCCCcCCCCCCcHHHcCCCCCcEEEEEc
Confidence 567788 88999888999999999999999875 7777888887 5567774 444799999999999998753
No 89
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=94.38 E-value=0.28 Score=36.50 Aligned_cols=51 Identities=12% Similarity=0.235 Sum_probs=41.1
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
...+++++.+||.+|.+.+ +++...-.+..||..+.. +.-+++||.|.+..
T Consensus 15 ~~~~~~~~~~tv~~ll~~l----~~~~~~v~v~vNg~iv~~------~~~l~~gD~Veii~ 65 (70)
T PRK08364 15 EKEIEWRKGMKVADILRAV----GFNTESAIAKVNGKVALE------DDPVKDGDYVEVIP 65 (70)
T ss_pred ceEEEcCCCCcHHHHHHHc----CCCCccEEEEECCEECCC------CcCcCCCCEEEEEc
Confidence 5678889999999988765 677777788899999854 34589999998865
No 90
>cd01773 Faf1_like1_UBX Faf1 ike-1 UBX domain. Faf1_like1 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Probab=94.36 E-value=0.37 Score=37.46 Aligned_cols=71 Identities=17% Similarity=0.277 Sum_probs=59.2
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeecC--CCcccccccCCCCCCEEEEEEe
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLLQ--DNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L~--D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|-|+. .+|+...-....+.++.+|-..|.. .|.+++...|+ |==|++. |.+.||+++|+..-.+|+|.-|
T Consensus 7 ~i~vRl-P~G~r~~rrF~~~~~L~~v~~fv~~-~g~~~~~f~L~t~FPRr~~~~~d~~~TL~e~GL~P~~~LfVq~r 81 (82)
T cd01773 7 RLMLRY-PDGKREQIALPEQAKLLALVRHVQS-KGYPNERFELLTNFPRRKLSHLDYDITLQEAGLCPQETVFVQER 81 (82)
T ss_pred EEEEEC-CCCCEEEEEeCCCCcHHHHHHHHHh-cCCCCCCEEEecCCCCcccCCcccCCCHHHcCCCCCcEEEEecC
Confidence 578888 8999998889999999999999998 57788888887 5456663 3347999999999999998644
No 91
>KOG0013|consensus
Probab=94.28 E-value=0.079 Score=47.95 Aligned_cols=64 Identities=23% Similarity=0.274 Sum_probs=56.6
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+++.+.+.+..-.|+.++|..+.+..|+++-.|+++|+|++|.|. ..|++|+|..|..-++-..
T Consensus 155 T~~d~~lta~~~Dtv~eik~~L~Aaeg~D~~sQrif~Sg~~l~dk-t~LeEc~iekg~rYvlqvi 218 (231)
T KOG0013|consen 155 TREDFWLTAPHYDTVGEIKRALRAAEGVDPLSQRIFFSGGVLVDK-TDLEECKIEKGQRYVLQVI 218 (231)
T ss_pred hhhheeecccCcCcHHHHHHHHHHhhccchhhheeeccCCceecc-ccceeeeecCCCEEEEEEE
Confidence 577788899999999999999999999999999999999999985 7899999999966554433
No 92
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=94.09 E-value=0.33 Score=35.43 Aligned_cols=60 Identities=20% Similarity=0.251 Sum_probs=44.0
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|+|+| +|+.+.+ + ..|+.+|...+ ++++....+-+||..+.. ....+.-+++||.|.+..
T Consensus 1 m~i~~----Ng~~~~~--~-~~tl~~Ll~~l----~~~~~~vavavN~~iv~~--~~~~~~~L~dgD~Ieiv~ 60 (65)
T PRK06488 1 MKLFV----NGETLQT--E-ATTLALLLAEL----DYEGNWLATAVNGELVHK--EARAQFVLHEGDRIEILS 60 (65)
T ss_pred CEEEE----CCeEEEc--C-cCcHHHHHHHc----CCCCCeEEEEECCEEcCH--HHcCccccCCCCEEEEEE
Confidence 55555 5666665 3 46898888765 677766677899999864 366777899999999875
No 93
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=94.02 E-value=0.25 Score=36.27 Aligned_cols=61 Identities=23% Similarity=0.335 Sum_probs=45.1
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+| +|+.+ ++++..|+.+|-.. .+++.....+.+||..+... ..+.+ +++||.|.++.-
T Consensus 1 m~i~v----NG~~~--~~~~~~tl~~ll~~----l~~~~~~vav~~N~~iv~r~--~~~~~-L~~gD~ieIv~~ 61 (65)
T PRK05863 1 MIVVV----NEEQV--EVDEQTTVAALLDS----LGFPEKGIAVAVDWSVLPRS--DWATK-LRDGARLEVVTA 61 (65)
T ss_pred CEEEE----CCEEE--EcCCCCcHHHHHHH----cCCCCCcEEEEECCcCcChh--Hhhhh-cCCCCEEEEEee
Confidence 56665 45544 45577888887554 58999999999999988653 44556 999999998754
No 94
>PF15044 CLU_N: Mitochondrial function, CLU-N-term
Probab=93.87 E-value=0.12 Score=39.41 Aligned_cols=56 Identities=21% Similarity=0.246 Sum_probs=46.3
Q ss_pred eCCCccHHHHHHHHHhhhC-CCCCCeEEEECCeecCCCccccccc-CCCCCCEEEEEEe
Q psy15482 18 VSEDLELENFKAFCEVQSG-YPAHEIAIEFQGVLLQDNKKTLKTH-GIQDGDLVLLKHL 74 (302)
Q Consensus 18 V~~~~TV~dLK~~Ie~e~g-Ip~~~Q~Li~~Gk~L~D~~~TL~~~-gI~dgd~l~L~~~ 74 (302)
|+++++|.|++..+..... ..-....|.|+|+.|++. ..|+++ |+++|..|.|+..
T Consensus 1 v~~~d~v~dvrq~L~~~~~t~~~Tn~~L~~~g~~L~~~-~el~~i~~~~~~~~L~lve~ 58 (76)
T PF15044_consen 1 VSPTDTVQDVRQVLAESPETCYLTNFSLEHNGQRLDDF-VELSEIEGIKDGCVLELVEE 58 (76)
T ss_pred CChhhHHHHHHHHHHhCccccceeEEEEEECCCccCCc-hhhhhhhCCCCCcEEEEEec
Confidence 6789999999998887644 556788999999999775 677777 6999999999865
No 95
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=93.44 E-value=0.29 Score=40.90 Aligned_cols=52 Identities=15% Similarity=0.245 Sum_probs=40.5
Q ss_pred EEcCCCc-cccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccc
Q psy15482 230 FIDSGAQ-TTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFN 285 (302)
Q Consensus 230 ~vDsGaq-~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~ 285 (302)
+||||+. +.+++++.|++.|+..+-..|..-.+.| +..=+||.+-++|++.+
T Consensus 30 LiDTGFtg~lvlp~~vaek~~~~~~~~~~~~~a~~~----~v~t~V~~~~iki~g~e 82 (125)
T COG5550 30 LIDTGFTGYLVLPPQVAEKLGLPLFSTIRIVLADGG----VVKTSVALATIKIDGVE 82 (125)
T ss_pred EEecCCceeEEeCHHHHHhcCCCccCChhhhhhcCC----EEEEEEEEEEEEECCEE
Confidence 7999999 9999999999999998744444444444 66777888888887744
No 96
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=92.98 E-value=0.57 Score=34.03 Aligned_cols=62 Identities=18% Similarity=0.389 Sum_probs=45.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+| +|+ +++++...||.+|-.. .|++...-.+.+||..+.. ..-.+.-+++||.|.+..-
T Consensus 1 m~i~v----NG~--~~~~~~~~tl~~lL~~----l~~~~~~vav~vNg~iv~r--~~~~~~~l~~gD~vei~~~ 62 (66)
T PRK05659 1 MNIQL----NGE--PRELPDGESVAALLAR----EGLAGRRVAVEVNGEIVPR--SQHASTALREGDVVEIVHA 62 (66)
T ss_pred CEEEE----CCe--EEEcCCCCCHHHHHHh----cCCCCCeEEEEECCeEeCH--HHcCcccCCCCCEEEEEEE
Confidence 56655 444 3567778898887654 5889988899999998864 2345556999999998754
No 97
>COG5227 SMT3 Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones]
Probab=92.48 E-value=0.51 Score=37.35 Aligned_cols=63 Identities=10% Similarity=0.173 Sum_probs=56.2
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
-++..+-+.|..+++..-|........|=..++.|+.|+|+.+.-+ +|-.+++..++|.|..+
T Consensus 32 qd~telfFkiKktT~f~klm~af~~rqGK~m~slRfL~dG~rI~~d-qTP~dldmEdnd~iEav 94 (103)
T COG5227 32 QDGTELFFKIKKTTTFKKLMDAFSRRQGKNMSSLRFLFDGKRIDLD-QTPGDLDMEDNDEIEAV 94 (103)
T ss_pred CCCCEEEEEEeccchHHHHHHHHHHHhCcCcceeEEEEcceecCCC-CChhhcCCccchHHHHH
Confidence 4566677889999999999999999999999999999999999876 89999999999988654
No 98
>PRK08053 sulfur carrier protein ThiS; Provisional
Probab=92.06 E-value=1.1 Score=32.83 Aligned_cols=62 Identities=15% Similarity=0.250 Sum_probs=44.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+| +|+. ++++...||.+|... .+++.....+-.||..+.. ..-++.-+++||.|.+..-
T Consensus 1 m~i~v----Ng~~--~~~~~~~tl~~ll~~----l~~~~~~vaVavN~~iv~r--~~w~~~~L~~gD~Ieii~~ 62 (66)
T PRK08053 1 MQILF----NDQP--MQCAAGQTVHELLEQ----LNQLQPGAALAINQQIIPR--EQWAQHIVQDGDQILLFQV 62 (66)
T ss_pred CEEEE----CCeE--EEcCCCCCHHHHHHH----cCCCCCcEEEEECCEEeCh--HHcCccccCCCCEEEEEEE
Confidence 66666 4454 455777899988765 4566666778899999863 3456667999999998753
No 99
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=91.89 E-value=0.97 Score=33.42 Aligned_cols=62 Identities=15% Similarity=0.257 Sum_probs=45.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCC-ccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSED-LELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~-~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|+| +|+.+ +++.. .||.+|-+ ..++++..-.+-+||..+.. ..-+++-+++||.|.++.-
T Consensus 1 m~I~v----NG~~~--~~~~~~~tv~~lL~----~l~~~~~~vav~vN~~iv~r--~~w~~~~L~~gD~iEIv~~ 63 (67)
T PRK07696 1 MNLKI----NGNQI--EVPESVKTVAELLT----HLELDNKIVVVERNKDILQK--DDHTDTSVFDGDQIEIVTF 63 (67)
T ss_pred CEEEE----CCEEE--EcCCCcccHHHHHH----HcCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEEE
Confidence 55655 45544 55555 67887765 35788888888899999965 3677778999999998754
No 100
>PRK07440 hypothetical protein; Provisional
Probab=91.74 E-value=0.9 Score=33.96 Aligned_cols=57 Identities=21% Similarity=0.374 Sum_probs=44.8
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+. ++++...||.+|-. +.++++....+-+||..+.- ..-.+.-+++||.|.++.-
T Consensus 10 NG~~--~~~~~~~tl~~lL~----~l~~~~~~vav~~N~~iv~r--~~w~~~~L~~gD~IEIv~~ 66 (70)
T PRK07440 10 NGET--RTCSSGTSLPDLLQ----QLGFNPRLVAVEYNGEILHR--QFWEQTQVQPGDRLEIVTI 66 (70)
T ss_pred CCEE--EEcCCCCCHHHHHH----HcCCCCCeEEEEECCEEeCH--HHcCceecCCCCEEEEEEE
Confidence 5664 56677899998765 45788888889999999964 3567778999999998754
No 101
>PF14836 Ubiquitin_3: Ubiquitin-like domain; PDB: 3JYU_A 4A3O_B 3PPA_A 3T9L_A 4A3P_A 3PV1_A.
Probab=91.28 E-value=1.1 Score=35.23 Aligned_cols=66 Identities=14% Similarity=0.125 Sum_probs=47.4
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE----CCee-cCCCcccccccCCCCCCEEEEEEecC
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF----QGVL-LQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~----~Gk~-L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
-+..++...+..+||..++..+.+.+.| ..+-||.- ++.. |.+.+.||++.||.+|-+|.+-.+..
T Consensus 12 l~~~~t~~FSk~DTI~~v~~~~rklf~i-~~E~RLW~~~~~~~~e~L~~~~~Tv~da~L~~gQ~vliE~rn~ 82 (88)
T PF14836_consen 12 LQSVLTKQFSKTDTIGFVEKEMRKLFNI-QEETRLWNKYSENSYELLNNPEITVEDAGLYDGQVVLIEERNE 82 (88)
T ss_dssp CCEEEEEEE-TTSBHHHHHHHHHHHCT--TS-EEEEEECTTTCEEEE--TTSBTTTTT--TTEEEEEEE--T
T ss_pred cccHhHhhccccChHHHHHHHHHHHhCC-CccceehhccCCcchhhhCCCCccHHHccCcCCCEEEEEeecc
Confidence 3567888999999999999999999999 66789963 3444 56655799999999999999987753
No 102
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=90.89 E-value=0.82 Score=33.27 Aligned_cols=57 Identities=19% Similarity=0.300 Sum_probs=42.9
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+. ++++...||.+|.+.+ +++.+...+.+||+.+.. ..-++.-+++||.|.+..-
T Consensus 5 Ng~~--~~~~~~~tv~~ll~~l----~~~~~~i~V~vNg~~v~~--~~~~~~~L~~gD~V~ii~~ 61 (65)
T cd00565 5 NGEP--REVEEGATLAELLEEL----GLDPRGVAVALNGEIVPR--SEWASTPLQDGDRIEIVTA 61 (65)
T ss_pred CCeE--EEcCCCCCHHHHHHHc----CCCCCcEEEEECCEEcCH--HHcCceecCCCCEEEEEEe
Confidence 4444 5566788999988775 478888888899999864 2445567999999998753
No 103
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=90.74 E-value=1.9 Score=31.11 Aligned_cols=60 Identities=15% Similarity=0.270 Sum_probs=40.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|+|+| +|+. +++++..||.+|.+.+ +++ ....+-+||..+... .-.+.-+++||.|.+..
T Consensus 1 m~i~v----Ng~~--~~~~~~~tl~~ll~~l----~~~-~~~~v~vN~~~v~~~--~~~~~~L~~gD~vei~~ 60 (65)
T PRK06944 1 MDIQL----NQQT--LSLPDGATVADALAAY----GAR-PPFAVAVNGDFVART--QHAARALAAGDRLDLVQ 60 (65)
T ss_pred CEEEE----CCEE--EECCCCCcHHHHHHhh----CCC-CCeEEEECCEEcCch--hcccccCCCCCEEEEEe
Confidence 55555 4444 5667788999988765 343 345677899988531 33444589999999875
No 104
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=90.66 E-value=1.1 Score=34.84 Aligned_cols=57 Identities=12% Similarity=0.191 Sum_probs=44.2
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+. .++++..||.+|-.. .++++..-.+-+||..+.- ..-+++-+++||.|.++.-
T Consensus 24 NG~~--~~~~~~~tl~~LL~~----l~~~~~~vAVevNg~iVpr--~~w~~t~L~egD~IEIv~~ 80 (84)
T PRK06083 24 NDQS--IQVDISSSLAQIIAQ----LSLPELGCVFAINNQVVPR--SEWQSTVLSSGDAISLFQA 80 (84)
T ss_pred CCeE--EEcCCCCcHHHHHHH----cCCCCceEEEEECCEEeCH--HHcCcccCCCCCEEEEEEE
Confidence 5554 455677888887664 4788888888899999964 4677888999999998753
No 105
>TIGR02958 sec_mycoba_snm4 secretion protein snm4. Members of this family are the 12-transmembrane domain protein snm4, where snm stands for secretion in mycocbacteria. This system acts on Mycobacterium tuberculosis related pair of virulence factors ESAT-6 and CFP-10 and on other homologs. The system is conserved in many Actinobacteria, including the non-pathogenic Mycobacterium smegmatis.
Probab=90.26 E-value=1.7 Score=43.85 Aligned_cols=72 Identities=19% Similarity=0.230 Sum_probs=53.7
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCC------CCCCeEEE-ECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGY------PAHEIAIE-FQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gI------p~~~Q~Li-~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|||.. ......+-++.+..|.+|---|-...|= +...-.|. -+|.+|+.+ .||.+.||.|||+++|+..
T Consensus 4 RVtV~~--~~~~~DlaLPa~~PvaellP~ll~~~~~~~~~~~~~~~w~L~r~gG~pL~~~-~sL~~~gV~DG~~L~L~p~ 80 (452)
T TIGR02958 4 RVTVLA--GRRAVDVALPADVPVAELIPDLVDLLDDRGAAELGAVRWALARAGGSPLDPD-ASLAEAGVRDGELLVLVPA 80 (452)
T ss_pred EEEEee--CCeeeeeecCCCCcHHHHHHHHHHHhCcccccCCCCcceEEecCCCCCCCCC-CCHHHcCCCCCCeEEEeeC
Confidence 567776 3455788899999999998877666543 12223443 488899875 9999999999999999875
Q ss_pred cC
Q psy15482 75 GS 76 (302)
Q Consensus 75 ~~ 76 (302)
..
T Consensus 81 ~~ 82 (452)
T TIGR02958 81 SA 82 (452)
T ss_pred CC
Confidence 43
No 106
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=90.00 E-value=1.5 Score=32.65 Aligned_cols=55 Identities=16% Similarity=0.134 Sum_probs=41.9
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCC----CCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGY----PAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gI----p~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
...+++++..||.+|.+.+....+- ......+..||+.... +.-|++||.|.+..
T Consensus 17 ~~~~~~~~~~tv~~ll~~l~~~~~~~~~~~~~~~~v~vNg~~v~~------~~~l~~gD~v~i~p 75 (80)
T cd00754 17 EEELELPEGATVGELLDALEARYPGLLEELLARVRIAVNGEYVRL------DTPLKDGDEVAIIP 75 (80)
T ss_pred eEEEECCCCCcHHHHHHHHHHHCchHHHhhhhcEEEEECCeEcCC------CcccCCCCEEEEeC
Confidence 4577888899999999999887542 3345567789998863 34599999999864
No 107
>PLN02799 Molybdopterin synthase sulfur carrier subunit
Probab=89.93 E-value=1.3 Score=33.58 Aligned_cols=67 Identities=10% Similarity=0.093 Sum_probs=44.7
Q ss_pred CEEEEEeC-----CCC-eEEEEEeCCCccHHHHHHHHHhhh-CCCC--CCeEEEECCeecCCCcccccccCCCCCCEEEE
Q psy15482 1 MKVTVSTQ-----HTN-ECFLLDVSEDLELENFKAFCEVQS-GYPA--HEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 1 MqItV~~~-----~~g-~~~~leV~~~~TV~dLK~~Ie~e~-gIp~--~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L 71 (302)
|+|+|+.- ..| ...+++++...||.+|+..+.... ++.. ..-.+..||+...+ +.-+++||.|.+
T Consensus 2 m~i~V~~fa~~re~~g~~~~~~~~~~~~tv~~L~~~l~~~~p~l~~~~~~~~vavN~~~v~~------~~~l~dgDeVai 75 (82)
T PLN02799 2 VEIKVLFFARARELTGVSDMTLELPAGSTTADCLAELVAKFPSLEEVRSCCVLALNEEYTTE------SAALKDGDELAI 75 (82)
T ss_pred eEEEEEehHHHHHHhCCCeEEEECCCCCcHHHHHHHHHHHChhHHHHhhCcEEEECCEEcCC------CcCcCCCCEEEE
Confidence 67888741 123 456788888999999999887664 1111 11245678887643 235899999988
Q ss_pred EE
Q psy15482 72 KH 73 (302)
Q Consensus 72 ~~ 73 (302)
..
T Consensus 76 ~P 77 (82)
T PLN02799 76 IP 77 (82)
T ss_pred eC
Confidence 64
No 108
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=89.91 E-value=1.2 Score=32.26 Aligned_cols=57 Identities=16% Similarity=0.324 Sum_probs=42.4
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
+|+. ++++...||.+|...+ ++++....+.+||+.+.. ..-.++-+++||.|.+..-
T Consensus 4 Ng~~--~~~~~~~tv~~ll~~l----~~~~~~v~v~vN~~iv~~--~~~~~~~L~~gD~veii~~ 60 (64)
T TIGR01683 4 NGEP--VEVEDGLTLAALLESL----GLDPRRVAVAVNGEIVPR--SEWDDTILKEGDRIEIVTF 60 (64)
T ss_pred CCeE--EEcCCCCcHHHHHHHc----CCCCCeEEEEECCEEcCH--HHcCceecCCCCEEEEEEe
Confidence 4555 4556778999988764 577777888899999853 2455667999999998753
No 109
>smart00666 PB1 PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
Probab=89.51 E-value=1.3 Score=33.21 Aligned_cols=45 Identities=20% Similarity=0.342 Sum_probs=38.1
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ 47 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~ 47 (302)
++||...++.++.+.++++.|..+|+.+|...++++.....|.|.
T Consensus 2 ~~vK~~~~~~~~~~~~~~~~s~~dL~~~i~~~~~~~~~~~~l~Y~ 46 (81)
T smart00666 2 VDVKLRYGGETRRLSVPRDISFEDLRSKVAKRFGLDNQSFTLKYQ 46 (81)
T ss_pred ccEEEEECCEEEEEEECCCCCHHHHHHHHHHHhCCCCCCeEEEEE
Confidence 345554478899999999999999999999999998877888775
No 110
>cd06406 PB1_P67 A PB1 domain is present in p67 proteins which forms a signaling complex with p40, a crucial step for activation of NADPH oxidase during phagocytosis. PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes . A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The p67 proteins contain
Probab=89.10 E-value=1.5 Score=33.98 Aligned_cols=36 Identities=14% Similarity=0.282 Sum_probs=34.1
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG 48 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G 48 (302)
++.+.|+++.+..+|..+|..+.++|++...|.|.-
T Consensus 12 tIaIrvp~~~~y~~L~~ki~~kLkl~~e~i~LsYkd 47 (80)
T cd06406 12 TVAIQVARGLSYATLLQKISSKLELPAEHITLSYKS 47 (80)
T ss_pred EEEEEcCCCCCHHHHHHHHHHHhCCCchhcEEEecc
Confidence 889999999999999999999999999999999954
No 111
>COG2104 ThiS Sulfur transfer protein involved in thiamine biosynthesis [Coenzyme metabolism]
Probab=88.07 E-value=4.5 Score=30.21 Aligned_cols=64 Identities=14% Similarity=0.307 Sum_probs=48.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|.++|.. +|+ +++++...|+.||-+. .|+++..-.+.+||.++.. .--++.-+++||.|.++.-
T Consensus 1 ~~m~i~~--ng~--~~e~~~~~tv~dLL~~----l~~~~~~vav~vNg~iVpr--~~~~~~~l~~gD~ievv~~ 64 (68)
T COG2104 1 MPMTIQL--NGK--EVEIAEGTTVADLLAQ----LGLNPEGVAVAVNGEIVPR--SQWADTILKEGDRIEVVRV 64 (68)
T ss_pred CcEEEEE--CCE--EEEcCCCCcHHHHHHH----hCCCCceEEEEECCEEccc--hhhhhccccCCCEEEEEEe
Confidence 5566666 455 4677777999998765 5788888899999999974 2456677999999988754
No 112
>PF09379 FERM_N: FERM N-terminal domain ; InterPro: IPR018979 This domain is the N-terminal ubiquitin-like structural domain of the FERM domain. The FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane []. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus of the majority of FERM-containing proteins [, ], which includes: Band 4.1, which links the spectrin-actin cytoskeleton of erythrocytes to the plasma membrane. Ezrin, a component of the undercoat of the microvilli plasma membrane. Moesin, which is probably involved in binding major cytoskeletal structures to the plasma membrane. Radixin, which is involved in the binding of the barbed end of actin filaments to the plasma membrane in the undercoat of the cell- to-cell adherens junction. Talin, a cytoskeletal protein concentrated in regions of cell-substratum contact and, in lymphocytes, of cell-cell contacts. Filopodin, a slime mold protein that binds actin and which is involved in the control of cell motility and chemotaxis. Merlin (or schwannomin). Protein NBL4. Unconventional myosins X, VIIa and XV, which are mutated in congenital deafness. Focal-adhesion kinases (FAKs), cytoplasmic protein tyrosine kinases involved in signalling through integrins. Janus tyrosine kinases (JAKs), cytoplasmic tyrosine kinases that are non-covalently associated with the cytoplasmic tails of receptors for cytokines or polypeptidic hormones. Non-receptor tyrosine-protein kinase TYK2. Protein-tyrosine phosphatases PTPN3 and PTPN4, enzyme that appear to act at junctions between the membrane and the cytoskeleton. Protein-tyrosine phosphatases PTPN14 and PTP-D1, PTP-RL10 and PTP2E. Caenorhabditis elegans protein phosphatase ptp-1. Ezrin, moesin, and radixin are highly related proteins (ERM protein family), but the other proteins in which the FERM domain is found do not share any region of similarity outside of this domain. ERM proteins are made of three domains, the FERM domain, a central helical domain and a C-terminal tail domain, which binds F-actin. The amino-acid sequence of the FERM domain is highly conserved among ERM proteins and is responsible for membrane association by direct binding to the cytoplasmic domain or tail of integral membrane proteins. ERM proteins are regulated by an intramolecular association of the FERM and C-terminal tail domains that masks their binding sites for other molecules. For cytoskeleton-membrane cross-linking, the dormant molecules becomes activated and the FERM domain attaches to the membrane by binding specific membrane proteins, while the last 34 residues of the tail bind actin filaments. Aside from binding to membranes, the activated FERM domain of ERM proteins can also bind the guanine nucleotide dissociation inhibitor of Rho GTPase (RhoDGI), which suggests that in addition to functioning as a cross-linker, ERM proteins may influence Rho signalling pathways. The crystal structure of the FERM domain reveals that it is composed of three structural modules (F1, F2, and F3) that together form a compact clover-shaped structure []. The FERM domain has also been called the amino-terminal domain, the 30kDa domain, 4.1N30, the membrane-cytoskeletal-linking domain, the ERM-like domain, the ezrin-like domain of the band 4.1 superfamily, the conserved N-terminal region, and the membrane attachment domain [].; PDB: 1EF1_B 1SGH_A 1E5W_A 2KC2_A 2KMA_A 3IVF_A 1H4R_B 3U8Z_A 1ISN_A 3BIN_A ....
Probab=87.31 E-value=4.8 Score=29.88 Aligned_cols=66 Identities=12% Similarity=0.106 Sum_probs=47.8
Q ss_pred EEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC-CCeEEEE----CC--eecCCCcccccccCCC--CCCEEEEE
Q psy15482 5 VSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA-HEIAIEF----QG--VLLQDNKKTLKTHGIQ--DGDLVLLK 72 (302)
Q Consensus 5 V~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~-~~Q~Li~----~G--k~L~D~~~TL~~~gI~--dgd~l~L~ 72 (302)
|.. ++|...+++|++++|+.+|-..|....|+.. +-.-|.| +| .-|..+ ++|.++... ...++++.
T Consensus 1 V~l-lD~~~~~~~v~~~~t~~~l~~~v~~~l~l~e~~~FgL~~~~~~~~~~~wL~~~-k~l~~q~~~~~~~~~l~fr 75 (80)
T PF09379_consen 1 VRL-LDGTTKTFEVDPKTTGQDLLEQVCDKLGLKEKEYFGLQYQVDKDGEHHWLDLD-KKLKKQLKKNNPPFTLYFR 75 (80)
T ss_dssp EEE-SSEEEEEEEEETTSBHHHHHHHHHHHHTTSSGGGEEEEE-EBTTSSEEEE-SS-SBGGGSTBTSSSSEEEEEE
T ss_pred CCC-cCCCcEEEEEcCCCcHHHHHHHHHHHcCCCCccEEEEEEeecCCCcceeccCc-ccHHHHcCCCCCCEEEEEE
Confidence 455 7889999999999999999999999999875 4456777 23 224444 788888777 33334443
No 113
>KOG4583|consensus
Probab=87.17 E-value=0.3 Score=47.17 Aligned_cols=58 Identities=14% Similarity=0.171 Sum_probs=42.5
Q ss_pred EEEEeCCCccHHHHHHHHHhhhC-CC-CCCeEEEECCeecCCCcccccccCCCC--CCEEEEE
Q psy15482 14 FLLDVSEDLELENFKAFCEVQSG-YP-AHEIAIEFQGVLLQDNKKTLKTHGIQD--GDLVLLK 72 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e~g-Ip-~~~Q~Li~~Gk~L~D~~~TL~~~gI~d--gd~l~L~ 72 (302)
.+|+.+-..||++||..++.-.- =| ..+|||+|.||.|.|. ..|++.=+|. -+++||+
T Consensus 24 l~i~~dl~wtv~~Lk~hls~VyPskpl~~dqrliYsgkllld~-qcl~d~lrkq~k~Hv~hlv 85 (391)
T KOG4583|consen 24 LSISLDLKWTVGDLKVHLSQVYPSKPLELDQRLIYSGKLLLDH-QCLTDWLRKQVKEHVKHLV 85 (391)
T ss_pred eeeehhhhhhHHHHhhhHhhcCCCCCchhhHHHHhhccccccc-hhHHHHHHHHHHHHHHHHh
Confidence 46677778899999999987642 22 3679999999999885 7888875543 3445554
No 114
>PF11620 GABP-alpha: GA-binding protein alpha chain; InterPro: IPR024668 GA-binding protein alpha is a transcription factor capable of interacting with purine rich repeats (GA repeats). This N-terminal domain found in the transcription factor GABP alpha consists of a five-stranded beta-sheet crossed by a distorted helix and has been termed OST domain. The surface of the GABP alpha OST domain contains two clusters of negatively-charged residues suggesting there are positively-charged partner proteins. The OST domain binds to the CH1 and CH3 domains of the co-activator histone acetyltransferase CBP/p300 [].; PDB: 2JUO_A.
Probab=86.61 E-value=2 Score=33.64 Aligned_cols=63 Identities=19% Similarity=0.225 Sum_probs=44.7
Q ss_pred eEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 12 ECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 12 ~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
..+...++=...+..||.+++...|++-+.-.++..+..|..+ ++|-+.||+-.-+|-+..-.
T Consensus 3 ~vI~q~mDI~epl~~Lk~lLe~Rl~~~L~~~~f~LQD~~L~~~-k~L~dQcVqgeGlVQlnvQi 65 (88)
T PF11620_consen 3 DVIMQHMDIREPLSTLKKLLERRLGISLSDYEFWLQDIQLEPH-KSLVDQCVQGEGLVQLNVQI 65 (88)
T ss_dssp EEEEEEEESSSBGGGHHHHSHHHH-S--SS-EEEETTEE--TT-SBTTTSS----SEEEEEEEE
T ss_pred ceEEEEEecCCcHHHHHHHHHHhhCCCcCCCeEEeccceecCC-ccHHHhhccccCEEEEEEEE
Confidence 4555666777899999999999999999999999999889886 99999999998888876543
No 115
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=85.87 E-value=0.75 Score=36.32 Aligned_cols=52 Identities=12% Similarity=0.201 Sum_probs=34.0
Q ss_pred eEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcc--eeeeEEEEeeeeeccc
Q psy15482 226 PVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQ--KILGRIHMAIAELTNF 284 (302)
Q Consensus 226 ~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~--~i~G~i~~~~~~ig~~ 284 (302)
.+++||||||..++++....++- +.+..+ .-.+.... ++.| .+.+.+.+|.-
T Consensus 9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~--~l~AANgt~I~tyG-~~~l~ldlGlr 62 (89)
T cd06094 9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPL--TLQAANGTPIATYG-TRSLTLDLGLR 62 (89)
T ss_pred CcEEEEeCCCceEeecccccccc----ccCCce--EEEeCCCCeEeeee-eEEEEEEcCCC
Confidence 47999999999999997765542 222223 44455544 4578 55777777653
No 116
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=85.86 E-value=4 Score=31.56 Aligned_cols=44 Identities=7% Similarity=0.128 Sum_probs=35.9
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC-CCeEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA-HEIAIEF 46 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~-~~Q~Li~ 46 (302)
|+|.+.. +|+++-+.++++.++.+|++.|...+++.. ....|.|
T Consensus 1 ~~vK~~~--~~d~~r~~l~~~~~~~~L~~~i~~r~~~~~~~~f~LkY 45 (82)
T cd06407 1 VRVKATY--GEEKIRFRLPPSWGFTELKQEIAKRFKLDDMSAFDLKY 45 (82)
T ss_pred CEEEEEe--CCeEEEEEcCCCCCHHHHHHHHHHHhCCCCCCeeEEEE
Confidence 3444444 788999999999999999999999999875 5667766
No 117
>smart00727 STI1 Heat shock chaperonin-binding motif.
Probab=85.21 E-value=1.1 Score=29.60 Aligned_cols=20 Identities=35% Similarity=0.617 Sum_probs=9.4
Q ss_pred HhcCHHHHHHHHhcChhHHH
Q psy15482 122 LLDNPKEIALLQQNNPRLAD 141 (302)
Q Consensus 122 ~l~NP~~~~~L~~~NPeLa~ 141 (302)
+.+||+.+..+.+.||+++.
T Consensus 20 ~~~nP~~~~~~~~~nP~~~~ 39 (41)
T smart00727 20 MQQNPDMLAQMLQENPQLLQ 39 (41)
T ss_pred HHHCHHHHHHHHHhCHHhHh
Confidence 33455544444333555543
No 118
>PF00564 PB1: PB1 domain; InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=84.92 E-value=4 Score=30.55 Aligned_cols=45 Identities=9% Similarity=0.282 Sum_probs=37.2
Q ss_pred EEEEeCCCCeEEE-EEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC
Q psy15482 3 VTVSTQHTNECFL-LDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ 47 (302)
Q Consensus 3 ItV~~~~~g~~~~-leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~ 47 (302)
+.||....+..+- +.+..+.|..+|+..|+..++.+.....|.|.
T Consensus 2 ~~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~~~~~~l~Y~ 47 (84)
T PF00564_consen 2 VRVKVRYGGDIRRIISLPSDVSFDDLRSKIREKFGLLDEDFQLKYK 47 (84)
T ss_dssp EEEEEEETTEEEEEEEECSTSHHHHHHHHHHHHHTTSTSSEEEEEE
T ss_pred EEEEEEECCeeEEEEEcCCCCCHHHHHHHHHHHhCCCCccEEEEee
Confidence 3455434677776 99999999999999999999999888888883
No 119
>PRK01777 hypothetical protein; Validated
Probab=84.89 E-value=7.3 Score=30.99 Aligned_cols=63 Identities=17% Similarity=0.117 Sum_probs=42.8
Q ss_pred CEEEEEeCCC--CeEEEEEeCCCccHHHHHHHHHhhhCCCCC--C-----eEEEECCeecCCCcccccccCCCCCCEEEE
Q psy15482 1 MKVTVSTQHT--NECFLLDVSEDLELENFKAFCEVQSGYPAH--E-----IAIEFQGVLLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 1 MqItV~~~~~--g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~--~-----Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L 71 (302)
|+|+|..-.. .....+++++.+||.++-..+ ||+.. + -.+.-+|+...- +.-+++||.|-+
T Consensus 4 i~v~V~ya~~~~~~~~~l~vp~GtTv~dal~~s----gi~~~~pei~~~~~~vgI~Gk~v~~------d~~L~dGDRVeI 73 (95)
T PRK01777 4 IRVEVVYALPERQYLQRLTLQEGATVEEAIRAS----GLLELRTDIDLAKNKVGIYSRPAKL------TDVLRDGDRVEI 73 (95)
T ss_pred eEEEEEEECCCceEEEEEEcCCCCcHHHHHHHc----CCCccCcccccccceEEEeCeECCC------CCcCCCCCEEEE
Confidence 5777765222 334578999999999976654 66555 2 355567776643 336899999998
Q ss_pred EE
Q psy15482 72 KH 73 (302)
Q Consensus 72 ~~ 73 (302)
.+
T Consensus 74 yr 75 (95)
T PRK01777 74 YR 75 (95)
T ss_pred ec
Confidence 65
No 120
>KOG4250|consensus
Probab=82.94 E-value=2.2 Score=45.07 Aligned_cols=69 Identities=20% Similarity=0.207 Sum_probs=49.1
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCC-cccccccCCCCCCEEEEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDN-KKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~-~~TL~~~gI~dgd~l~L~ 72 (302)
|+|-.-.+...+.+-|+++.|+.+|++.|...+|||...|.|+|.|...... +++-.--| .+.++..+.
T Consensus 316 vhiFs~~~~~~~~~~~~~~ntl~~~~~~I~~~Tgipe~~qeLL~e~~~~h~~~~~Q~~~dg-~~~~l~l~~ 385 (732)
T KOG4250|consen 316 VHIFSMVQATSHEYYVHADNTLHSLIERISKQTGIPEGKQELLFEGGLSHLEDSAQCIPDG-LDSPLYLVS 385 (732)
T ss_pred eEEEeeccceEEEEecChhhhHHHHHHHHHHhhCCCCccceeeeecCccccCcccccCCCC-CCCceEEEe
Confidence 4443324677889999999999999999999999999999999987654332 22322334 344544443
No 121
>PRK11840 bifunctional sulfur carrier protein/thiazole synthase protein; Provisional
Probab=82.90 E-value=4.2 Score=39.37 Aligned_cols=64 Identities=17% Similarity=0.279 Sum_probs=48.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
|+|+| +|+. +++++..||.+|-.. .+++.+...+.+||+++.. ..-.++-+++||.|.++.-.+
T Consensus 1 M~I~V----NGk~--~el~e~~TL~dLL~~----L~i~~~~VAVeVNgeIVpr--~~w~~t~LkeGD~IEII~~Vg 64 (326)
T PRK11840 1 MRIRL----NGEP--RQVPAGLTIAALLAE----LGLAPKKVAVERNLEIVPR--SEYGQVALEEGDELEIVHFVG 64 (326)
T ss_pred CEEEE----CCEE--EecCCCCcHHHHHHH----cCCCCCeEEEEECCEECCH--HHcCccccCCCCEEEEEEEec
Confidence 56655 4454 566778898887654 5899999999999999964 356777899999999987653
No 122
>cd06411 PB1_p51 The PB1 domain is present in the p51 protein, a homolog of the p67 protein. p51 plays an important role in NADPH oxidase activation during phagosytosis. The PB1 domain is a modular domain mediating specific protein-protein interaction in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain mo
Probab=82.84 E-value=3.1 Score=32.09 Aligned_cols=42 Identities=14% Similarity=0.174 Sum_probs=36.1
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ 47 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~ 47 (302)
|||.+ .-++.+.|.+..+..+|..+|+++...+++.-.|.|.
T Consensus 1 ~~Vh~---~fTVai~v~~g~~y~~L~~~ls~kL~l~~~~~~LSY~ 42 (78)
T cd06411 1 VTVQC---AFTVALRAPRGADVSSLRALLSQALPQQAQRGQLSYR 42 (78)
T ss_pred CEEEE---EEEEEEEccCCCCHHHHHHHHHHHhcCChhhcEEEec
Confidence 35555 3456789999999999999999999999999999994
No 123
>cd06409 PB1_MUG70 The MUG70 protein is a product of the meiotically up-regulated gene 70 which has a role in meiosis and harbors a PB1 domain. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domains depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic amino acid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is
Probab=82.54 E-value=8.7 Score=30.11 Aligned_cols=42 Identities=12% Similarity=0.217 Sum_probs=36.0
Q ss_pred EEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC---CCeEEEE
Q psy15482 4 TVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA---HEIAIEF 46 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~---~~Q~Li~ 46 (302)
.++. .+|.++-+.+.|+..+.+|++.|..++|+.. ....|.|
T Consensus 4 K~~~-~~GrvhRf~~~~s~~~~~L~~~I~~Rl~~d~~~~~~~~L~Y 48 (86)
T cd06409 4 KFKD-PKGRVHRFRLRPSESLEELRTLISQRLGDDDFETHLYALSY 48 (86)
T ss_pred EeeC-CCCCEEEEEecCCCCHHHHHHHHHHHhCCccccCCcccEEE
Confidence 3556 6899999999999999999999999999997 4666666
No 124
>KOG2982|consensus
Probab=81.88 E-value=3.2 Score=40.23 Aligned_cols=59 Identities=17% Similarity=0.191 Sum_probs=47.0
Q ss_pred EEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE---CCeec----CCCcccccccCCCCCCEEEEEE
Q psy15482 15 LLDVSEDLELENFKAFCEVQSGYPAHEIAIEF---QGVLL----QDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~---~Gk~L----~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
+.-|+-.-||.|||..++.+.|+.++.++|+| .||.- .|..+.|-.|+|++||-+.+..
T Consensus 351 s~~I~~~~TV~D~~~~Ld~~VGvk~trMkLf~L~eD~rt~~~ss~~~N~~L~~fkIedGDs~lvqe 416 (418)
T KOG2982|consen 351 SGLICMTRTVLDFMKILDPKVGVKFTRMKLFLLREDGRTDDFSSSDYNMPLHYFKIEDGDSFLVQE 416 (418)
T ss_pred ceEEEeehHHHHHHHHhccccccccceeEEEEEcccCccCCccccCCCCcceEEeccCCCEeeeec
Confidence 34455567999999999999999999999998 56543 2334678999999999987754
No 125
>PF10790 DUF2604: Protein of Unknown function (DUF2604); InterPro: IPR019726 This entry represents bacterial proteins with undetermined function.
Probab=80.92 E-value=6.8 Score=29.23 Aligned_cols=66 Identities=20% Similarity=0.284 Sum_probs=51.5
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhh---CCCCCCeEEE-ECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQS---GYPAHEIAIE-FQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~---gIp~~~Q~Li-~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
+|+...+|..+...+--+.+.--+++ |-|++.=.|- -+|..|+-+ +.+++||+.+|-+++|..+..
T Consensus 4 NGqPv~VEANvnaPLh~v~akALe~sgNvgQP~ENWElkDe~G~vlD~~-kKveD~GftngvkLFLsLKAG 73 (76)
T PF10790_consen 4 NGQPVQVEANVNAPLHPVRAKALEQSGNVGQPPENWELKDESGQVLDVN-KKVEDFGFTNGVKLFLSLKAG 73 (76)
T ss_pred CCCceeeecCCCCcchHHHHHHHhhccccCCCcccceeeccCCcEeecc-chhhhccccccceEEEEeecc
Confidence 57888888888887776666554443 5788877776 578888665 899999999999999988754
No 126
>TIGR01682 moaD molybdopterin converting factor, subunit 1, non-archaeal. The C-terminal Gly-Gly of this model is critical to function.
Probab=80.61 E-value=11 Score=28.34 Aligned_cols=55 Identities=11% Similarity=0.181 Sum_probs=40.2
Q ss_pred EEEEEeCCC-ccHHHHHHHHHhhhC-C-C-CCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 13 CFLLDVSED-LELENFKAFCEVQSG-Y-P-AHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 13 ~~~leV~~~-~TV~dLK~~Ie~e~g-I-p-~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
...++++++ .||.+|+..+....+ + . .....+..||+...+ +.-|++||.|.+..
T Consensus 17 ~~~~~~~~~~~tv~~L~~~L~~~~p~l~~~~~~~~v~vn~~~v~~------~~~l~dgDevai~P 75 (80)
T TIGR01682 17 EETLELPDESTTVGELKEHLAKEGPELAASRGQVMVAVNEEYVTD------DALLNEGDEVAFIP 75 (80)
T ss_pred eEEEECCCCCcCHHHHHHHHHHhCchhhhhccceEEEECCEEcCC------CcCcCCCCEEEEeC
Confidence 357888877 899999999988763 1 1 123456678888764 34689999998864
No 127
>cd05992 PB1 The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as a noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants.
Probab=80.29 E-value=4.3 Score=30.14 Aligned_cols=44 Identities=14% Similarity=0.202 Sum_probs=34.4
Q ss_pred EEEeCCCCeEEEEEeC-CCccHHHHHHHHHhhhCCCCCCeEEEEC
Q psy15482 4 TVSTQHTNECFLLDVS-EDLELENFKAFCEVQSGYPAHEIAIEFQ 47 (302)
Q Consensus 4 tV~~~~~g~~~~leV~-~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~ 47 (302)
+||...+|..+.+.++ .+.|..+|+++|...++++.....|.|.
T Consensus 2 ~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~~~~~~l~y~ 46 (81)
T cd05992 2 RVKVKYGGEIRRFVVVSRSISFEDLRSKIAEKFGLDAVSFKLKYP 46 (81)
T ss_pred cEEEEecCCCEEEEEecCCCCHHHHHHHHHHHhCCCCCcEEEEee
Confidence 3444346778888888 9999999999999999998755566553
No 128
>PF11834 DUF3354: Domain of unknown function (DUF3354); InterPro: IPR021789 Potassium channels take part in important processes of higher plants, including opening and closing of stomatal pores and leaf movement. Inward rectifying potassium (K(+)in) channels play an important role in turgor regulation and ion uptake in higher plants. All of them comprise, from their N-terminal to their C-terminal ends: a short hydrophilic region, a hydrophobic region structurally analogous and partially homologous to the transmembrane domain of voltage-gated animal channels from the Shaker superfamily, a putative cyclic nucleotide-binding domain, and a conserved C-terminal KHA domain. Between these last two regions, some of them (AKT1, AKT2 and SKT1) contain an ankyrin-repeat domain with six repeats homologous to those of human erythrocyte ankyrin. This entry represents the KHA domain which is unique to plant K(+)in channels. The KHA domain contains two high-homology blocks enriched for hydrophobic and acidic residues, respectively. The KHA domain is essential for interaction of plant K(+)in channels. The KHA domain mediates tetramerization and/or stabilisation of the heteromers [, , ].
Probab=79.45 E-value=4.8 Score=30.24 Aligned_cols=44 Identities=23% Similarity=0.323 Sum_probs=34.4
Q ss_pred ccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 22 LELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 22 ~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
.|+++|....++++|++ ..-.+.-+|-.+.|= . =|.|||.|+++
T Consensus 26 ~SleeLl~ia~~kfg~~-~~~v~~~dgaeIdDI--~----~IRDgD~L~~~ 69 (69)
T PF11834_consen 26 DSLEELLKIASEKFGFS-ATKVLNEDGAEIDDI--D----VIRDGDHLYLV 69 (69)
T ss_pred ccHHHHHHHHHHHhCCC-ceEEEcCCCCEEeEE--E----EEEcCCEEEEC
Confidence 69999999999999997 555566677777653 1 38899999873
No 129
>TIGR01687 moaD_arch MoaD family protein, archaeal. Members of this family appear to be archaeal versions of MoaD, subunit 1 of molybdopterin converting factor. This model has been split from the bacterial/eukaryotic equivalog model TIGR01682 because the presence of two members of this family in a substantial number of archaeal species suggests that roles might not be interchangeable.
Probab=79.32 E-value=15 Score=27.87 Aligned_cols=57 Identities=11% Similarity=0.085 Sum_probs=40.1
Q ss_pred eEEEEEeCCCccHHHHHHHHHhhhCC------C-----CCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 12 ECFLLDVSEDLELENFKAFCEVQSGY------P-----AHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 12 ~~~~leV~~~~TV~dLK~~Ie~e~gI------p-----~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
....++++ ..||.+|.+.+.....- . -....+..||+....+ .. .-+++||.|.+..
T Consensus 16 ~~~~v~~~-~~tv~~l~~~l~~~~p~~~~~~l~~~~~~~~~~~v~vN~~~v~~~-~~---~~l~dgdev~i~P 83 (88)
T TIGR01687 16 KSEEIEIE-GKTVGDLLNELMARYPKEFSELFKEGLGLVPNVIILVNGRNVDWG-LG---TELKDGDVVAIFP 83 (88)
T ss_pred ceEEEEeC-CCCHHHHHHHHHHHCcHHHHHhCccCCcccccEEEEECCEecCcc-CC---CCCCCCCEEEEeC
Confidence 44677886 89999999999877531 0 1235566799887553 21 5689999998864
No 130
>cd01760 RBD Ubiquitin-like domain of RBD-like S/T kinases. The ras-binding domain (RBD) of the serine/threonine kinase raf is structurally quite similar to the beta-grasp fold of ubiquitin. A raf-like RBD is also present in RGS12 and other members of a family of GTPase activating proteins and TIAM1, a guanine nucleotide exchange protein.
Probab=78.68 E-value=12 Score=28.34 Aligned_cols=45 Identities=11% Similarity=0.096 Sum_probs=39.4
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG 48 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G 48 (302)
++|-. .+|+...+.|.|.+|+.|+-+.+=++-|+.++.-.|++.|
T Consensus 2 ~~V~L-Png~~t~V~vrpg~ti~d~L~~~c~kr~l~~~~~~v~~~~ 46 (72)
T cd01760 2 CRVYL-PNGQRTVVPVRPGMSVRDVLAKACKKRGLNPECCDVFLLG 46 (72)
T ss_pred EEEEC-cCCCeEEEEECCCCCHHHHHHHHHHHcCCCHHHEEEEEec
Confidence 34555 6788999999999999999999999999999999888764
No 131
>PF12382 Peptidase_A2E: Retrotransposon peptidase; InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=78.63 E-value=2.9 Score=33.87 Aligned_cols=55 Identities=22% Similarity=0.231 Sum_probs=33.1
Q ss_pred EEEEEcCCCccccCCHHHHHHcCchh--hhhccccceecccCcceeeeEEEEeeeeeccc
Q psy15482 227 VKAFIDSGAQTTIMSLACAERVNIMR--LLDTRWAGIAKGVGVQKILGRIHMAIAELTNF 284 (302)
Q Consensus 227 v~a~vDsGaq~~i~s~~~a~~~gl~~--~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~ 284 (302)
+-+++|||||.+||+......--|.. +..+- +.-||=-.||--+-..+.|.+.++
T Consensus 48 ipclidtgaq~niiteetvrahklptrpw~~sv---iyggvyp~kinrkt~kl~i~lngi 104 (137)
T PF12382_consen 48 IPCLIDTGAQVNIITEETVRAHKLPTRPWSQSV---IYGGVYPNKINRKTIKLNINLNGI 104 (137)
T ss_pred ceeEEccCceeeeeehhhhhhccCCCCcchhhe---EeccccccccccceEEEEEEecce
Confidence 56899999999999999876665542 22221 223444445544444445555444
No 132
>cd06408 PB1_NoxR The PB1 domain is present in the Epichloe festucae NoxR protein (NADPH oxidase regulator), a key regulator of NADPH oxidase isoform, NoxA. NoxA is essential for growth control of the fungal endophyte in plant tissue in the process of symbiotic interaction between a fungi and its plant host. The Epichloe festucae p67(phox)-like regulator, NoxR, dispensable in culture but essential in plants for the symbiotic interaction. Plants infected with a noxR deletion mutant show severe stunting and premature senescence, whereas hyphae in the meristematic tissues show increased branching leading to increased fungal colonization of pseudostem and leaf blade tissue. The PB1 domain is a modular domain mediating specific protein-protein interactions which a play role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is
Probab=75.19 E-value=13 Score=29.15 Aligned_cols=44 Identities=14% Similarity=0.318 Sum_probs=34.9
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF 46 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~ 46 (302)
+|+||.--.|.++.+-|+++.+..+|...|...+++. ....|-|
T Consensus 2 ~ikVKv~~~~Dv~~i~v~~~i~f~dL~~kIrdkf~~~-~~~~iKy 45 (86)
T cd06408 2 KIRVKVHAQDDTRYIMIGPDTGFADFEDKIRDKFGFK-RRLKIKM 45 (86)
T ss_pred cEEEEEEecCcEEEEEcCCCCCHHHHHHHHHHHhCCC-CceEEEE
Confidence 4566662378899999999999999999999999995 3444444
No 133
>cd01768 RA RA (Ras-associating) ubiquitin domain. The RA (Ras-associating) domain is structurally similar to ubiquitin and is present in one or two copies in a number of signalling molecules that bind and regulate a small GTPase called Ras or the Ras-related GTPases, Ral and Rap. RA-containing proteins include RalGDS, AF6, RIN1, RASSF1, SNX27, CYR1, STE50, and phospholipase C epsilon.
Probab=74.64 E-value=31 Score=26.01 Aligned_cols=49 Identities=18% Similarity=0.149 Sum_probs=37.2
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCC--CCCeEEE--EC----CeecCCCccccc
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYP--AHEIAIE--FQ----GVLLQDNKKTLK 59 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp--~~~Q~Li--~~----Gk~L~D~~~TL~ 59 (302)
+...+|.|+.++|..++-..+..+++++ +..-.|+ +. .+.|.|++..|.
T Consensus 12 ~~~kti~V~~~~t~~~Vi~~~l~k~~l~~~~~~y~L~ev~~~~~~er~L~~~e~pl~ 68 (87)
T cd01768 12 GTYKTLRVSKDTTAQDVIQQLLKKFGLDDDPEDYALVEVLGDGGLERLLLPDECPLQ 68 (87)
T ss_pred ccEEEEEECCCCCHHHHHHHHHHHhCCcCCcccEEEEEEECCceEEEEeCCCCChHH
Confidence 5667999999999999999999999998 5555665 22 346666656654
No 134
>smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Probab=74.53 E-value=25 Score=30.26 Aligned_cols=72 Identities=13% Similarity=0.115 Sum_probs=50.0
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC-CCeEEEE---CC---eecCCCcccccccCCC-CCCEEEEEE
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA-HEIAIEF---QG---VLLQDNKKTLKTHGIQ-DGDLVLLKH 73 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~-~~Q~Li~---~G---k~L~D~~~TL~~~gI~-dgd~l~L~~ 73 (302)
.|.|.. .+|....+.+++++|++++...|..+.|++. ...-|.+ ++ ..|... ++|.+...+ ....+++..
T Consensus 5 ~~~V~l-~dg~~~~~~~~~~~t~~ev~~~v~~~~~l~~~~~F~L~~~~~~~~~~~~l~~~-~~l~~~~~~~~~~~l~fr~ 82 (207)
T smart00295 5 VLKVYL-LDGTTLEFEVDSSTTAEELLETVCRKLGIRESEYFGLQFEDPDEDLSHWLDPA-KTLLDQDVKSEPLTLYFRV 82 (207)
T ss_pred EEEEEe-cCCCEEEEEECCCCCHHHHHHHHHHHhCCCccceeEEEEEcCCCCcCeeCCCc-cCHHHhcCCCCCcEEEEEE
Confidence 356667 7899999999999999999999999999964 2224443 11 345443 677777665 344555555
Q ss_pred ec
Q psy15482 74 LG 75 (302)
Q Consensus 74 ~~ 75 (302)
+-
T Consensus 83 r~ 84 (207)
T smart00295 83 KF 84 (207)
T ss_pred EE
Confidence 43
No 135
>KOG2086|consensus
Probab=73.64 E-value=7.4 Score=38.39 Aligned_cols=65 Identities=15% Similarity=0.321 Sum_probs=51.6
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhh-CCCCCCeEEE--ECCeecCCCcccccccCCCCCCE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQS-GYPAHEIAIE--FQGVLLQDNKKTLKTHGIQDGDL 68 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~-gIp~~~Q~Li--~~Gk~L~D~~~TL~~~gI~dgd~ 68 (302)
|-|+. .+|.-..+-+.-..||.|++..|...- +-+...+.|+ |=-|+|.|++.||++.|+.+--+
T Consensus 308 IQIRL-anG~RlV~~fN~sHTv~DIR~fI~~aRp~~~~~~F~L~~~FPpk~l~D~sqTle~AgL~Nsvl 375 (380)
T KOG2086|consen 308 IQIRL-ANGTRLVLKFNHSHTVSDIREFIDTARPGDSSTYFILMMAFPPKPLSDDSQTLEEAGLLNSVL 375 (380)
T ss_pred EEEEe-cCCceeeeeccCcccHHHHHHHHHhcCCCCcCCceeeeecCCCcccCCcchhHHhccchhhhh
Confidence 56777 778888888888899999999998764 4455567776 56789988889999999986533
No 136
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=73.26 E-value=11 Score=29.33 Aligned_cols=46 Identities=20% Similarity=0.284 Sum_probs=35.4
Q ss_pred EEEEeCCCCeEEEEEeCC--CccHHHHHHHHHhhhCCCCCCeEEEECCeecCCC
Q psy15482 3 VTVSTQHTNECFLLDVSE--DLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDN 54 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~--~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~ 54 (302)
|+||..-+|++..+.+++ +.+..+|++.|...++++ ...| |-|+|+
T Consensus 1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~--~f~l----KYlDde 48 (81)
T cd06396 1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN--DIQI----KYVDEE 48 (81)
T ss_pred CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC--ccee----EEEcCC
Confidence 355554588898899988 779999999999999999 3444 446554
No 137
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=70.66 E-value=10 Score=35.92 Aligned_cols=35 Identities=17% Similarity=0.414 Sum_probs=27.6
Q ss_pred EEEEEEEEcCee--EEEEEcCCCc---------cccCCHHHHHHcC
Q psy15482 215 MLYINCKVNGYP--VKAFIDSGAQ---------TTIMSLACAERVN 249 (302)
Q Consensus 215 mlyi~~~ing~~--v~a~vDsGaq---------~~i~s~~~a~~~g 249 (302)
..-|.++++|.. +|||||||-| ..|++.++.+.+-
T Consensus 158 ~~~v~i~~~g~~~~~~alvDTGN~L~DPlT~~PV~Ive~~~~~~~l 203 (288)
T TIGR02854 158 IYELEICLDGKKVTIKGFLDTGNQLRDPLTKLPVIVVEYDSLKSIL 203 (288)
T ss_pred EEEEEEEECCEEEEEEEEEecCCcccCCCCCCCEEEEEHHHhhhhC
Confidence 344677889987 8999999999 4578888877774
No 138
>PF08825 E2_bind: E2 binding domain; InterPro: IPR014929 E1 and E2 enzymes play a central role in ubiquitin and ubiquitin-like protein transfer cascades. This is an E2 binding domain that is found on NEDD8 activating E1 enzyme. The protein resembles ubiquitin, and recruits the catalytic core of the E2 enzyme Ubc12 in a similar manner to that in which ubiquitin interacts with ubiquitin binding domains []. ; GO: 0005524 ATP binding, 0016881 acid-amino acid ligase activity, 0045116 protein neddylation; PDB: 3GZN_D 3DBL_F 1R4N_H 1R4M_D 2NVU_B 1TT5_D 3DBR_D 3DBH_H 1YOV_B 3FN1_A ....
Probab=70.55 E-value=6.6 Score=30.54 Aligned_cols=58 Identities=17% Similarity=0.257 Sum_probs=41.4
Q ss_pred EEeCCCccHHHHHHHHHhhhCC-------CCCCeEEEECCe-ecC-----CCcccccccCCCCCCEEEEEEe
Q psy15482 16 LDVSEDLELENFKAFCEVQSGY-------PAHEIAIEFQGV-LLQ-----DNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 16 leV~~~~TV~dLK~~Ie~e~gI-------p~~~Q~Li~~Gk-~L~-----D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
|+|++++|+.+|-..+.....+ ....-.||+.+- .|. +=+++|.++ +.+|+-|.+.-.
T Consensus 1 i~v~~~~TL~~lid~L~~~~~~qlk~PSlt~~~k~LYm~~pp~Lee~Tr~NL~k~l~eL-~~~g~ei~VtD~ 71 (84)
T PF08825_consen 1 IEVSPSWTLQDLIDSLCEKPEFQLKKPSLTTANKTLYMQSPPSLEEATRPNLSKKLKEL-LSDGEEITVTDP 71 (84)
T ss_dssp EEESTTSBSHHHHHHHHHSTTT--SS-EEESSEEEEEESSSHHHHHHTGGGGSSBTTTT-HHSSEEEEEEET
T ss_pred CCcCccchHHHHHHHHHhChhhhcCCCcccCCCceEEEeCCHHHHHHhhhhhhhhHHHH-hcCCCEEEEECC
Confidence 6899999999999999887433 344445555443 121 224789999 999999998543
No 139
>smart00455 RBD Raf-like Ras-binding domain.
Probab=69.52 E-value=15 Score=27.33 Aligned_cols=47 Identities=15% Similarity=0.138 Sum_probs=40.4
Q ss_pred EEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC--eec
Q psy15482 4 TVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG--VLL 51 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G--k~L 51 (302)
.|-. .+|+...+.+.|..|+.|+-+.+=++.|+.++.-.|++.| ++|
T Consensus 3 ~v~L-P~~~~~~V~vrpg~tl~e~L~~~~~kr~l~~~~~~v~~~g~~k~l 51 (70)
T smart00455 3 KVHL-PDNQRTVVKVRPGKTVRDALAKALKKRGLNPECCVVRLRGEKKPL 51 (70)
T ss_pred EEEC-CCCCEEEEEECCCCCHHHHHHHHHHHcCCCHHHEEEEEcCCCcce
Confidence 3444 6789999999999999999999999999999999998854 455
No 140
>PF12754 Blt1: Cell-cycle control medial ring component; InterPro: IPR024737 During size-dependent cell cycle transitions controlled by the ubiquitous cyclin-dependent kinase Cdk1, Blt1 has been shown to co-localise with Cdr2 in the medial interphase nodes, as well as with Mid1 which was previously shown to localise to similar interphase structures. Physical interactions between Blt1-Mid1, Blt1-Cdr2 and Cdr2-Mid1 were detected, indicating that medial cortical nodes are formed by the ordered, Cdr2-dependent assembly of multiple interacting proteins during interphase[].; PDB: 2LO0_A.
Probab=69.46 E-value=1.5 Score=41.99 Aligned_cols=43 Identities=16% Similarity=0.241 Sum_probs=0.0
Q ss_pred CccHHHHHHHHHh----------hhCCCCCCeE-----EEECCeecCCCcccccccCCC
Q psy15482 21 DLELENFKAFCEV----------QSGYPAHEIA-----IEFQGVLLQDNKKTLKTHGIQ 64 (302)
Q Consensus 21 ~~TV~dLK~~Ie~----------e~gIp~~~Q~-----Li~~Gk~L~D~~~TL~~~gI~ 64 (302)
+.||.|+|..++. ++++|.+..+ |.|+-|++.|. +||.+..-.
T Consensus 103 ttSv~dlk~~v~~rv~~~~~~~~~~~vp~dKik~~~~~lL~~kkPv~~~-ktl~e~l~~ 160 (309)
T PF12754_consen 103 TTSVQDLKDAVQQRVHPSQATYDETRVPLDKIKNFRCRLLYKKKPVGDS-KTLAEVLAD 160 (309)
T ss_dssp -----------------------------------------------------------
T ss_pred cCcHHHHHHHHHhhhcccccccccccCCHHHhhhhhhhheecCccCCCc-CcHHHHHhc
Confidence 6899999999999 9999999999 99999999774 898887543
No 141
>PF14451 Ub-Mut7C: Mut7-C ubiquitin
Probab=69.05 E-value=22 Score=27.45 Aligned_cols=55 Identities=16% Similarity=0.198 Sum_probs=43.1
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE-ECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE-FQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li-~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
+..+.+++++..||.++-+ ..|||-.+-.++ -||+...= +|-+++||.|.+....
T Consensus 22 ~~~~~~~~~~~~tvkd~IE----sLGVP~tEV~~i~vNG~~v~~------~~~~~~Gd~v~V~P~~ 77 (81)
T PF14451_consen 22 GGPFTHPFDGGATVKDVIE----SLGVPHTEVGLILVNGRPVDF------DYRLKDGDRVAVYPVF 77 (81)
T ss_pred CCceEEecCCCCcHHHHHH----HcCCChHHeEEEEECCEECCC------cccCCCCCEEEEEecc
Confidence 4567889999999998644 479999988777 48887643 3679999999987543
No 142
>PF02597 ThiS: ThiS family; InterPro: IPR003749 ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in Escherichia coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS []. MoaD, a protein involved in sulphur transfer during molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end.; PDB: 1JW9_D 1JWB_D 1JWA_D 3BII_D 1NVI_D 1FMA_D 1FM0_D 2QIE_G 2Q5W_D 2K5P_A ....
Probab=68.78 E-value=17 Score=26.45 Aligned_cols=58 Identities=16% Similarity=0.157 Sum_probs=44.3
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCC--CCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGY--PAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gI--p~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
...+++....||.+|.+.+.....- ....-.+..||+.+.+ .-.+.-+++||.|.+..
T Consensus 13 ~~~~~~~~~~tv~~ll~~l~~~~p~~~~~~~~~v~vN~~~v~~---~~~~~~l~~gD~V~i~p 72 (77)
T PF02597_consen 13 EEEIEVPEGSTVRDLLEALAERYPELALRDRVAVAVNGEIVPD---DGLDTPLKDGDEVAILP 72 (77)
T ss_dssp EEEEEESSTSBHHHHHHHHCHHTGGGHTTTTEEEEETTEEEGG---GTTTSBEETTEEEEEEE
T ss_pred CeEEecCCCCcHHHHHHHHHhhccccccCccEEEEECCEEcCC---ccCCcCcCCCCEEEEEC
Confidence 5678899999999999999877531 2356677889998865 13445679999999865
No 143
>PF08337 Plexin_cytopl: Plexin cytoplasmic RasGAP domain; InterPro: IPR013548 This domain is found at C terminus of various plexins (e.g. P51805 from SWISSPROT). Plexins are receptors for semaphorins, and plexin signalling is important in pathfinding and patterning of both neurons and developing blood vessels [, ]. The cytoplasmic region, which has been called a SEX domain [], and is involved in downstream signalling pathways, by interaction with proteins such as Rac1, RhoD, Rnd1 and other plexins []. ; PDB: 3H6N_A 4E71_A 4E74_A 3IG3_A 2REX_C 2JPH_A 2R2O_A 3HM6_X 3SU8_X 3SUA_E ....
Probab=67.59 E-value=14 Score=38.21 Aligned_cols=67 Identities=24% Similarity=0.271 Sum_probs=43.9
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhh--hCCC------CCCeEEEE----CCe-ecCCCc------------ccccccCCC
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQ--SGYP------AHEIAIEF----QGV-LLQDNK------------KTLKTHGIQ 64 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e--~gIp------~~~Q~Li~----~Gk-~L~D~~------------~TL~~~gI~ 64 (302)
.+..+.+.|=..+||.++|++|=.. .+.| +++..|-+ .|+ +|.|.+ .||.+|||.
T Consensus 200 ~~~~i~VkVLdCDTItQVKeKiLDavyk~~p~S~rp~~~d~dLEwr~~~~~~~iL~D~D~ts~~~~~wkrLNTL~HY~V~ 279 (539)
T PF08337_consen 200 GSEEIPVKVLDCDTITQVKEKILDAVYKNTPYSQRPRADDVDLEWRQGRGGRLILQDEDSTSKVEGGWKRLNTLAHYKVP 279 (539)
T ss_dssp SSTCEEEEEETTSBHHHHHHHHHHHHTTTS-GGGS--GGGEEEEEEETTSEEEEESSSSTTSEEETTEEE--BHHHHT--
T ss_pred CCceEEEEEEecCcccHHHHHHHHHHHcCCCCCCCCCccccceeeecCCCCcccccCCCCCcccCCCceEeccHhhcCCC
Confidence 3455788998999999999988765 2333 34455543 133 455532 269999999
Q ss_pred CCCEEEEEEecC
Q psy15482 65 DGDLVLLKHLGS 76 (302)
Q Consensus 65 dgd~l~L~~~~~ 76 (302)
||.+|-|+.+..
T Consensus 280 dga~vaLv~k~~ 291 (539)
T PF08337_consen 280 DGATVALVPKQH 291 (539)
T ss_dssp TTEEEEEEES--
T ss_pred CCceEEEeeccc
Confidence 999999998863
No 144
>smart00727 STI1 Heat shock chaperonin-binding motif.
Probab=67.00 E-value=8.5 Score=25.20 Aligned_cols=9 Identities=22% Similarity=0.674 Sum_probs=4.5
Q ss_pred ChHHHHHHH
Q psy15482 147 SPEDFERIL 155 (302)
Q Consensus 147 d~e~f~~~l 155 (302)
||+.++.++
T Consensus 23 nP~~~~~~~ 31 (41)
T smart00727 23 NPDMLAQML 31 (41)
T ss_pred CHHHHHHHH
Confidence 555555444
No 145
>PF14732 UAE_UbL: Ubiquitin/SUMO-activating enzyme ubiquitin-like domain; PDB: 1Y8Q_B 1Y8R_E 3KYD_B 3KYC_B.
Probab=66.43 E-value=6.9 Score=30.46 Aligned_cols=52 Identities=15% Similarity=0.223 Sum_probs=27.6
Q ss_pred CccHHHHHHHH-HhhhCCCCCC----eEEEECCee---cCCCcccccccCCCCCCEEEEE
Q psy15482 21 DLELENFKAFC-EVQSGYPAHE----IAIEFQGVL---LQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 21 ~~TV~dLK~~I-e~e~gIp~~~----Q~Li~~Gk~---L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
.+|+.+|-..| ..+.|+..-. -.++|..-. =.+..++|+++||++|+.|.+.
T Consensus 8 ~~TL~~lv~~Vlk~~Lg~~~P~v~~~~~ilyd~de~~~~~~l~k~L~elgi~~gs~L~v~ 67 (87)
T PF14732_consen 8 KMTLGDLVEKVLKKKLGMNEPDVSVGGTILYDSDEEEYDDNLPKKLSELGIVNGSILTVD 67 (87)
T ss_dssp T-BHHHHHHHCCCCCS--SSEEEEES-EEEE-SSSSSSTTCTTSBGGGGT--TT-EEEEE
T ss_pred hCcHHHHHHHHHHhccCCCCCEEEeCCCEEEcCCcchhhhcccCChhHcCCCCCCEEEEE
Confidence 57899988854 5566655422 233333222 1122478999999999999874
No 146
>PF02505 MCR_D: Methyl-coenzyme M reductase operon protein D; InterPro: IPR003901 Methyl-coenzyme M reductase (MCR) catalyses the reduction of methyl-coenzyme M (CH3-SCoM) and coenzyme B (HS-CoB) to methane and the corresponding heterosulphide CoM-S-S-CoB (2.8.4.1 from EC), the final step in methane biosynthesis. This reaction proceeds under anaerobic conditions by methanogenic Archaea [], and requires a nickel-porphinoid prosthetic group, coenzyme F430, which is in the EPR-detectable Ni(I) oxidation state in the active enzyme. Studies on a catalytically inactive enzyme aerobically co-crystallized with coenzyme M displayed a fully occupied coenzyme M-binding site with no alternate conformations. The binding of coenzyme M appears to induce specific conformational changes that suggests a molecular mechanism by which the enzyme ensures that methyl-coenzyme M enters the substrate channel prior to coenzyme B, as required by the active-site geometry []. MCR is a hexamer composed of 2 alpha, 2 beta, and 2 gamma subunits with two identical nickel porphinoid active sites, which form two long active site channels with F430 embedded at the bottom [, ]. Genes encoding the beta (mcrB) and gamma (mcrG) subunits of MCR are separated by two open reading frames coding for two proteins C and D [, ]. The function of proteins C and D is unknown. This entry represents protein D.; GO: 0015948 methanogenesis
Probab=66.23 E-value=12 Score=32.42 Aligned_cols=55 Identities=16% Similarity=0.335 Sum_probs=40.2
Q ss_pred EEEEEeCCCCeEEEEEeCC-CccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCC
Q psy15482 2 KVTVSTQHTNECFLLDVSE-DLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDG 66 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~-~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dg 66 (302)
.++|+. |.. .++++. ...++.+++.|++-+-++-+ ++-|+-+... .|++|| +|-|
T Consensus 69 eL~V~v---Gri-~lele~~~~~ie~I~~iCee~lpf~y~----i~~G~f~r~~-~TvtDY-~KyG 124 (153)
T PF02505_consen 69 ELTVKV---GRI-ILELEDEEDVIEKIREICEEVLPFGYD----IKEGKFIRTK-PTVTDY-AKYG 124 (153)
T ss_pred EEEEEE---eEE-EEEecCcHHHHHHHHHHHHHhCCCceE----eeeeEEeccC-Cchhhh-hhcC
Confidence 455555 555 478888 88999999999998755544 3469999885 899998 3444
No 147
>PRK08507 prephenate dehydrogenase; Validated
Probab=66.07 E-value=15 Score=33.95 Aligned_cols=47 Identities=17% Similarity=0.236 Sum_probs=28.1
Q ss_pred CCCCHHHHHHHHhcCH----HHHHHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 111 LSDDPATVRQLLLDNP----KEIALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 111 ~~~npe~iR~~~l~NP----~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
..+||++.+++++.|+ ..++.++..--+|.++|.++|.+.|.+.|..
T Consensus 217 a~~~p~l~~~i~~~N~~~~~~~l~~~~~~l~~~~~~l~~~d~~~~~~~~~~ 267 (275)
T PRK08507 217 AKSSPAMWSDIFKQNKENVLEAIDEFIKELEQFKQLIENEDWEELEEWMEQ 267 (275)
T ss_pred ccCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3566777777777775 2234444455566666666666666666643
No 148
>cd01775 CYR1_RA Ubiquitin domain of CYR1 adenylate cyclase. CYR1 is a fungal adenylate cyclase with at least four domains, an N-terminal RA (Ras association) domain, a middle leucine-rich repeat domain, a catalytic domain. The N-terminal RA domain of CYR1 post-translationally modifies a small GTPase called Ras. The Ras-CYR1 pathway has been implicated in the transduction of a glucose-triggered signal to an intracellular environment where a protein phosphorylation cascade is initiated by cyclic AMP.
Probab=65.13 E-value=28 Score=27.98 Aligned_cols=66 Identities=11% Similarity=0.036 Sum_probs=46.6
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCC-CCeEE-EECC---eecCCCccc-------ccccCCCCCCEEEEEEe
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPA-HEIAI-EFQG---VLLQDNKKT-------LKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~-~~Q~L-i~~G---k~L~D~~~T-------L~~~gI~dgd~l~L~~~ 74 (302)
.++...++.++.++||.++-..+..++.++. ..-+| ++.| |+|...+++ |...|-++.|-|....+
T Consensus 10 ~D~Tf~Tls~~l~tTv~eli~~L~rK~~l~~~~ny~l~l~~~~l~RvL~p~ErPl~IqkrlL~q~GY~~~D~l~~lGr 87 (97)
T cd01775 10 SDGTFTTLSCPLNTTVSELIPQLAKKFYLPSGGNYQLSLKKHDLSRVLRPTEKPLLIQKRLLLQVGYEERDRIEDIGR 87 (97)
T ss_pred cCCcEEEEEcCCcCcHHHHHHHHHHhhcCCCCCCeEEEEEECCeeeecCCcCCcHHHHHHHHHHcCCCCCCcHHHhCc
Confidence 3566678999999999999999999999887 33344 3443 455433333 67777777777765544
No 149
>PF02153 PDH: Prephenate dehydrogenase; InterPro: IPR003099 Members of this family are prephenate dehydrogenases 1.3.1.12 from EC involved in tyrosine biosynthesis. ; GO: 0004665 prephenate dehydrogenase (NADP+) activity, 0008977 prephenate dehydrogenase activity, 0006571 tyrosine biosynthetic process, 0055114 oxidation-reduction process; PDB: 2F1K_B 2PV7_A 3DZB_B 3KTD_B 3B1F_A 2G5C_D 3GGP_C 3GGG_C 3GGO_D.
Probab=64.69 E-value=12 Score=34.49 Aligned_cols=13 Identities=38% Similarity=0.754 Sum_probs=4.7
Q ss_pred CHHHHHHHHhcCH
Q psy15482 114 DPATVRQLLLDNP 126 (302)
Q Consensus 114 npe~iR~~~l~NP 126 (302)
||++.++++..||
T Consensus 211 ~p~l~~~I~~~N~ 223 (258)
T PF02153_consen 211 DPELWADIFLSNP 223 (258)
T ss_dssp -HHHHHHHHHHTH
T ss_pred ChHHHHHHHHHCH
Confidence 3333333333333
No 150
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site
Probab=64.52 E-value=11 Score=29.35 Aligned_cols=65 Identities=12% Similarity=0.270 Sum_probs=40.5
Q ss_pred EEEEEcC--eeEEEEEcCCCccccCCHHHHHHcCchhhhhccc---------------cceecccCcceeeeEEEEeeee
Q psy15482 218 INCKVNG--YPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRW---------------AGIAKGVGVQKILGRIHMAIAE 280 (302)
Q Consensus 218 i~~~ing--~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~---------------~~~a~gvg~~~i~G~i~~~~~~ 280 (302)
|++.|.. ++++..+|||...+.+..+.+..|+.... .+| ..+..|-| .+.|.+-.-.+.
T Consensus 1 ~~i~vGtP~q~~~~~~DTGSs~~Wv~~~~c~~~~~~~~--~~~~~~~~sst~~~~~~~~~~~Y~~g--~~~g~~~~D~v~ 76 (109)
T cd05470 1 IEIGIGTPPQTFNVLLDTGSSNLWVPSVDCQSLAIYSH--SSYDDPSASSTYSDNGCTFSITYGTG--SLSGGLSTDTVS 76 (109)
T ss_pred CEEEeCCCCceEEEEEeCCCCCEEEeCCCCCCcccccc--cccCCcCCCCCCCCCCcEEEEEeCCC--eEEEEEEEEEEE
Confidence 4677775 88999999999988777665455543211 111 11223333 356777777888
Q ss_pred eccccc
Q psy15482 281 LTNFNG 286 (302)
Q Consensus 281 ig~~~~ 286 (302)
||+..+
T Consensus 77 ig~~~~ 82 (109)
T cd05470 77 IGDIEV 82 (109)
T ss_pred ECCEEE
Confidence 887653
No 151
>PF08968 DUF1885: Domain of unknown function (DUF1885); InterPro: IPR015062 This family consists of hypothetical proteins produced by bacteria of the Bacillus genus. ; PDB: 1T6A_A.
Probab=64.33 E-value=4.4 Score=33.92 Aligned_cols=46 Identities=33% Similarity=0.398 Sum_probs=26.8
Q ss_pred hccccceecccCcceeeeEEEEeeeeecccccchhHH--HHHHHHhhcc
Q psy15482 255 DTRWAGIAKGVGVQKILGRIHMAIAELTNFNGDKKKA--LAALLAKSLI 301 (302)
Q Consensus 255 d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~~~~~--~~~~~~~~~~ 301 (302)
+.||+.+.-|||+.+.---| ++.+-=|..++||-|| +...|||.|.
T Consensus 67 ~~~Y~~I~iGVg~e~e~~~I-Qv~LP~~AThGDK~KANEfckfLAk~l~ 114 (130)
T PF08968_consen 67 NDRYKYIVIGVGTENEQSYI-QVVLPDGATHGDKGKANEFCKFLAKKLK 114 (130)
T ss_dssp STT--EEEEEEEEETTEEEE-EEE--TT--HHHHHHHHHHHHHHHHHH-
T ss_pred CCceeeEEEeeccCCcceEE-EEECCCCCccCcchhHHHHHHHHHHHhh
Confidence 36799999999954332222 4444557899999998 5667787663
No 152
>TIGR03260 met_CoM_red_D methyl-coenzyme M reductase operon protein D. Members of this protein family are protein D, a non-structural protein, of the operon for methyl coenzyme M reductase, also called coenzyme-B sulfoethylthiotransferase (EC 2.8.4.1). That enzyme, with alpha, beta, and gamma subunits, catalyzes the last step in methanogenesis; it has several modified sites, so accessory proteins are expected. Several methanogens have encode two such enzymes, designated I and II; this model does not separate the isozymes. Proteins in this family are expressed at much lower levels than the methyl-coenzyme M reductase itself and associate and have been shown to form at least transient associations. The precise function is unknown.
Probab=64.25 E-value=13 Score=32.00 Aligned_cols=51 Identities=14% Similarity=0.263 Sum_probs=38.4
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCccccccc
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTH 61 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~ 61 (302)
.++|+. |.++ |++.....++.+++.|++.+-++-+ +.-|+-+... .|++||
T Consensus 68 eL~V~V---GrI~-le~~~~~~i~~I~eiC~e~~pF~y~----i~~g~f~r~~-~TvtDY 118 (150)
T TIGR03260 68 ELRVQV---GRII-LELEDEDIVEEIEEICKEMLPFGYE----VRVGKFLRTK-PTVTDY 118 (150)
T ss_pred EEEEEE---eEEE-EEecCHHHHHHHHHHHHhhCCCceE----eeeeeEeecC-Cchhhh
Confidence 455555 5554 7777889999999999998766644 4568888875 899998
No 153
>PF03539 Spuma_A9PTase: Spumavirus aspartic protease (A9); InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=63.34 E-value=3.7 Score=35.44 Aligned_cols=57 Identities=32% Similarity=0.270 Sum_probs=30.5
Q ss_pred EcCeeEEEEEcCCCccccCCHHHHHHc-Cchh-hhhccccceecccC---cceeeeEEEEeee
Q psy15482 222 VNGYPVKAFIDSGAQTTIMSLACAERV-NIMR-LLDTRWAGIAKGVG---VQKILGRIHMAIA 279 (302)
Q Consensus 222 ing~~v~a~vDsGaq~~i~s~~~a~~~-gl~~-~~d~~~~~~a~gvg---~~~i~G~i~~~~~ 279 (302)
|-|..++++-||||+.|-++....+.- -+.. +|.+ ..|+.++-= +-||.||=..+.|
T Consensus 1 ikg~~l~~~wDsga~ITCiP~~fl~~E~Pi~~~~i~T-ihg~~~~~vYYl~fKi~grkv~aEV 62 (163)
T PF03539_consen 1 IKGTKLKGHWDSGAQITCIPESFLEEEQPIGKTLIKT-IHGEKEQDVYYLTFKINGRKVEAEV 62 (163)
T ss_dssp ETTEEEEEEE-TT-SSEEEEGGGTTT---SEEEEEE--SS-EEEEEEEEEEEEESS-EEEEEE
T ss_pred CCCceeeEEecCCCeEEEccHHHhCccccccceEEEE-ecCceeccEEEEEEEEcCeEEEEEE
Confidence 568999999999999999988764322 2222 3333 333333211 3456666555544
No 154
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=62.40 E-value=41 Score=29.38 Aligned_cols=26 Identities=35% Similarity=0.597 Sum_probs=13.9
Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhhHH
Q psy15482 171 RMLNAHPFDTEAQKLIAEEIKKNNIE 196 (302)
Q Consensus 171 ~~l~~DP~d~e~Q~~i~e~i~q~~i~ 196 (302)
.++.+||||+++=+-+.+.+.+..++
T Consensus 89 ~Li~Ad~FDeaAvra~~~kma~~~~e 114 (162)
T PRK12751 89 KLITADKFDEAAVRAQAEKMSQNQIE 114 (162)
T ss_pred HHHhcCCCCHHHHHHHHHHHHHHHHH
Confidence 34455666665555555555554444
No 155
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=62.08 E-value=8.9 Score=31.98 Aligned_cols=59 Identities=14% Similarity=0.209 Sum_probs=41.9
Q ss_pred EEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCccccccc--CCC-CCCEEEEEEec
Q psy15482 16 LDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTH--GIQ-DGDLVLLKHLG 75 (302)
Q Consensus 16 leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~--gI~-dgd~l~L~~~~ 75 (302)
+=|+.+.||++|...|....++++++.-|+.|+..+.. +.|+++. .-+ +|-.|.+....
T Consensus 45 llVP~d~tV~qF~~iIRkrl~l~~~k~flfVnn~lp~~-s~~mg~lYe~~KDeDGFLYi~Ys~ 106 (121)
T PTZ00380 45 LALPRDATVAELEAAVRQALGTSAKKVTLAIEGSTPAV-TATVGDIADACKRDDGFLYVSVRT 106 (121)
T ss_pred EEcCCCCcHHHHHHHHHHHcCCChhHEEEEECCccCCc-cchHHHHHHHhcCCCCeEEEEEcc
Confidence 46999999999999999999999998544556644444 4677654 112 34567776654
No 156
>PF00026 Asp: Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.; InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) . More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=61.59 E-value=9.1 Score=35.24 Aligned_cols=71 Identities=14% Similarity=0.280 Sum_probs=45.9
Q ss_pred EEEEEEc--CeeEEEEEcCCCccccCC-HHHHHH--cCchhhhhcc----cc----ceecccCcceeeeEEEEeeeeecc
Q psy15482 217 YINCKVN--GYPVKAFIDSGAQTTIMS-LACAER--VNIMRLLDTR----WA----GIAKGVGVQKILGRIHMAIAELTN 283 (302)
Q Consensus 217 yi~~~in--g~~v~a~vDsGaq~~i~s-~~~a~~--~gl~~~~d~~----~~----~~a~gvg~~~i~G~i~~~~~~ig~ 283 (302)
|++++|- +++++++||||.-.+.+. ..|-.. |.-....+.. +. .....-|...+-|.++.-.+.||+
T Consensus 3 ~~~v~iGtp~q~~~~~iDTGS~~~wv~~~~c~~~~~~~~~~~y~~~~S~t~~~~~~~~~~~y~~g~~~G~~~~D~v~ig~ 82 (317)
T PF00026_consen 3 YINVTIGTPPQTFRVLIDTGSSDTWVPSSNCNSCSSCASSGFYNPSKSSTFSNQGKPFSISYGDGSVSGNLVSDTVSIGG 82 (317)
T ss_dssp EEEEEETTTTEEEEEEEETTBSSEEEEBTTECSHTHHCTSC-BBGGGSTTEEEEEEEEEEEETTEEEEEEEEEEEEEETT
T ss_pred EEEEEECCCCeEEEEEEecccceeeeceeccccccccccccccccccccccccceeeeeeeccCcccccccccceEeeee
Confidence 7899996 899999999998887665 344433 4333333321 11 011112333489999999999999
Q ss_pred cccc
Q psy15482 284 FNGD 287 (302)
Q Consensus 284 ~~~~ 287 (302)
+.+.
T Consensus 83 ~~~~ 86 (317)
T PF00026_consen 83 LTIP 86 (317)
T ss_dssp EEEE
T ss_pred cccc
Confidence 7765
No 157
>cd06398 PB1_Joka2 The PB1 domain is present in the Nicotiana plumbaginifolia Joka2 protein which interacts with sulfur stress inducible UP9 protein. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module
Probab=60.96 E-value=31 Score=27.16 Aligned_cols=44 Identities=9% Similarity=0.181 Sum_probs=32.9
Q ss_pred EEEeCCCCeEEEEEeC-----CCccHHHHHHHHHhhhCCCC-CCeEEEEC
Q psy15482 4 TVSTQHTNECFLLDVS-----EDLELENFKAFCEVQSGYPA-HEIAIEFQ 47 (302)
Q Consensus 4 tV~~~~~g~~~~leV~-----~~~TV~dLK~~Ie~e~gIp~-~~Q~Li~~ 47 (302)
+||..-+|...-+.++ ++.+..+|+.+|...+++++ ....|.|.
T Consensus 2 ~vKv~y~~~~rRf~l~~~~~~~d~~~~~L~~kI~~~f~l~~~~~~~l~Y~ 51 (91)
T cd06398 2 VVKVKYGGTLRRFTFPVAENQLDLNMDGLREKVEELFSLSPDADLSLTYT 51 (91)
T ss_pred EEEEEeCCEEEEEEeccccccCCCCHHHHHHHHHHHhCCCCCCcEEEEEE
Confidence 4554346667666665 58999999999999999998 56666663
No 158
>smart00314 RA Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.)
Probab=60.89 E-value=54 Score=24.83 Aligned_cols=47 Identities=15% Similarity=0.122 Sum_probs=35.4
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCC--CCeEEE--E-CC--eecCCCccc
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPA--HEIAIE--F-QG--VLLQDNKKT 57 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~--~~Q~Li--~-~G--k~L~D~~~T 57 (302)
+...+|.|++++|..++-..+..+++++. ..-.|+ + +| +.|.+++..
T Consensus 15 ~~~kti~v~~~tTa~~Vi~~~l~k~~l~~~~~~y~L~e~~~~~~er~L~~~e~P 68 (90)
T smart00314 15 GTYKTLRVSSRTTARDVIQQLLEKFHLTDDPEEYVLVEVLPDGKERVLPDDENP 68 (90)
T ss_pred CcEEEEEECCCCCHHHHHHHHHHHhCCCCCcccEEEEEEeCCcEEEEeCCCCcc
Confidence 55679999999999999999999999975 455555 2 44 556554344
No 159
>PRK07417 arogenate dehydrogenase; Reviewed
Probab=60.47 E-value=23 Score=32.84 Aligned_cols=46 Identities=17% Similarity=0.151 Sum_probs=31.2
Q ss_pred CCCHHHHHHHHhcCHHH----HHHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 112 SDDPATVRQLLLDNPKE----IALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 112 ~~npe~iR~~~l~NP~~----~~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
.+||++-+++++.|+.- ++.+++.--+|.++|..+|.+.+.+.|.+
T Consensus 220 ~~~p~~w~~i~~~N~~~i~~~l~~~~~~l~~~~~~l~~~d~~~l~~~~~~ 269 (279)
T PRK07417 220 GGNPELGVMMAEYNRAALLRSLASYRQSLDQLEELIEQENWSALEQKLEQ 269 (279)
T ss_pred CCChHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 56788888888888743 34445555667777777777777777643
No 160
>PF02160 Peptidase_A3: Cauliflower mosaic virus peptidase (A3); InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=60.36 E-value=5.9 Score=35.87 Aligned_cols=67 Identities=16% Similarity=0.220 Sum_probs=39.5
Q ss_pred EEEEEEE--EcC---eeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccC-c-ceeeeEEEEeeeeeccc
Q psy15482 215 MLYINCK--VNG---YPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVG-V-QKILGRIHMAIAELTNF 284 (302)
Q Consensus 215 mlyi~~~--ing---~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg-~-~~i~G~i~~~~~~ig~~ 284 (302)
..||.++ +.| ..+.|+|||||.+-+.+..+-=. ..+...++.-..+|+- . ..+-=++....|.|++-
T Consensus 4 siyI~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~---e~we~~~~~i~v~~an~~~~~i~~~~~~~~i~I~~~ 77 (201)
T PF02160_consen 4 SIYIKVKISFPGYKKFNYHCYVDTGATICCASKKIIPE---EYWEKSKKPIKVKGANGSIIQINKKAKNGKIQIADK 77 (201)
T ss_pred cEEEEEEEEEcCceeEEEEEEEeCCCceEEecCCcCCH---HHHHhCCCcEEEEEecCCceEEEEEecCceEEEccE
Confidence 4566554 455 34888999999987766554210 1234444555666663 2 34555566666666653
No 161
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=59.20 E-value=9.7 Score=35.91 Aligned_cols=34 Identities=21% Similarity=0.477 Sum_probs=27.1
Q ss_pred EEEEEEEEcCee--EEEEEcCCCcc---------ccCCHHHHHHc
Q psy15482 215 MLYINCKVNGYP--VKAFIDSGAQT---------TIMSLACAERV 248 (302)
Q Consensus 215 mlyi~~~ing~~--v~a~vDsGaq~---------~i~s~~~a~~~ 248 (302)
..-|.++++|+. ++||+|||-|. .|++.++.+.+
T Consensus 157 ~~~v~i~~~~~~~~~~allDTGN~L~DPitg~PV~Vve~~~~~~~ 201 (293)
T PF03419_consen 157 LYPVTIEIGGKKIELKALLDTGNQLRDPITGRPVIVVEYEALEKL 201 (293)
T ss_pred EEEEEEEECCEEEEEEEEEECCCcccCCCCCCcEEEEEHHHHHhh
Confidence 345677899987 89999999994 47778877777
No 162
>cd06410 PB1_UP2 Uncharacterized protein 2. The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions.
Probab=57.27 E-value=39 Score=26.92 Aligned_cols=37 Identities=8% Similarity=0.168 Sum_probs=33.0
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF 46 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~ 46 (302)
.+|++.-+.|+.+.|..+|+..+.+.++++.. ..|-|
T Consensus 20 ~GG~tr~i~V~r~~s~~el~~kl~~~~~~~~~-~~lky 56 (97)
T cd06410 20 VGGETRIVSVDRSISFKELVSKLSELFGAGVV-VTLKY 56 (97)
T ss_pred cCCceEEEEEcCCCCHHHHHHHHHHHhCCCCc-eEEEE
Confidence 57888999999999999999999999999977 66666
No 163
>PF14841 FliG_M: FliG middle domain; PDB: 3HJL_A 3AJC_A 1LKV_X 3SOH_D 3USY_B 3USW_A.
Probab=57.05 E-value=82 Score=23.75 Aligned_cols=69 Identities=25% Similarity=0.366 Sum_probs=39.1
Q ss_pred CHHHHHHHHh-cCHHHHHHH-HhcChhHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy15482 114 DPATVRQLLL-DNPKEIALL-QQNNPRLADALLNGSPEDFERILRQQYEEREKREQLRLRMLNAHPFDTEAQKLIAEEIK 191 (302)
Q Consensus 114 npe~iR~~~l-~NP~~~~~L-~~~NPeLa~al~~~d~e~f~~~lr~~~~~~~~~e~~~~~~l~~DP~d~e~Q~~i~e~i~ 191 (302)
||+.+..++. ..|+..+.+ ..-+|+.|..+.+.=|+..+.-+ -.++...+..+|+.-+.|++.++
T Consensus 1 dp~~l~~~L~~EhPq~iAliLs~L~~~~AA~VL~~lp~e~r~~v-------------~~Ria~~~~v~~~~i~~ie~~L~ 67 (79)
T PF14841_consen 1 DPEQLAALLKDEHPQTIALILSYLPPEQAAEVLSQLPEELRAEV-------------VRRIARLESVSPEVIEEIEEVLE 67 (79)
T ss_dssp -HHHHHHHHTTS-HHHHHHHHHTS-HHHHHHHHHTS-HHHHHHH-------------HHHHHTCCCCSHHHHHHHHHHHH
T ss_pred CHHHHHHHHHhcCHHHHHHHHHcCCHHHHHHHHHHCCHHHHHHH-------------HHHHHccCCCCHHHHHHHHHHHH
Confidence 4666666654 566666653 33455555544432222211111 23566789999999999999998
Q ss_pred HhhH
Q psy15482 192 KNNI 195 (302)
Q Consensus 192 q~~i 195 (302)
++--
T Consensus 68 ~~l~ 71 (79)
T PF14841_consen 68 EKLE 71 (79)
T ss_dssp HHCH
T ss_pred HHHH
Confidence 7543
No 164
>PRK07502 cyclohexadienyl dehydrogenase; Validated
Probab=57.00 E-value=29 Score=32.57 Aligned_cols=47 Identities=17% Similarity=0.328 Sum_probs=33.3
Q ss_pred CCCCHHHHHHHHhcCHHH----HHHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 111 LSDDPATVRQLLLDNPKE----IALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 111 ~~~npe~iR~~~l~NP~~----~~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
..+||++.+++++.|+.- ++.++..--++.++|.++|.+.|.+.|.+
T Consensus 230 a~~~~~~w~~i~~~N~~~~~~~l~~~~~~l~~~~~~l~~~d~~~l~~~~~~ 280 (307)
T PRK07502 230 AASDPTMWRDVFLHNKDAVLEMLGRFTEDLAALQRAIRWGDGDALFDLFTR 280 (307)
T ss_pred ccCChHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 356788888888888753 34555566677788877888777777754
No 165
>PF14533 USP7_C2: Ubiquitin-specific protease C-terminal; PDB: 2YLM_A.
Probab=56.27 E-value=64 Score=29.06 Aligned_cols=58 Identities=7% Similarity=0.126 Sum_probs=33.4
Q ss_pred eEEEEEeCCCccHHHHHHHHHhhhCCCCC---CeEEE--ECCeec---CCCcccccccCCCCCCEEEEE
Q psy15482 12 ECFLLDVSEDLELENFKAFCEVQSGYPAH---EIAIE--FQGVLL---QDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 12 ~~~~leV~~~~TV~dLK~~Ie~e~gIp~~---~Q~Li--~~Gk~L---~D~~~TL~~~gI~dgd~l~L~ 72 (302)
+.+.+-|+.+.||.||...+..+.+++.+ ..+|+ ++||.. ..+ .+|.+. .+...+.+-
T Consensus 34 ~~~~~~vpk~~tV~Dll~~l~~k~~~~~~~~~~lrl~ev~~~ki~~~~~~d-~~i~~l--~~~~~~r~E 99 (213)
T PF14533_consen 34 QEYELLVPKTGTVSDLLEELQKKVGFSEEGTGKLRLWEVSNHKIYKILSED-EPISSL--NDYITLRIE 99 (213)
T ss_dssp -EEEE--BTT-BHHHHHHHHHTT----TT----EEEEEEETTEEEEEE-TT-SBGGGS----TTEEEEE
T ss_pred eEEEEEECCCCCHHHHHHHHHHHcCCCcCCcCcEEEEEeECCEEEeecCCC-Cchhhc--cCcceeeee
Confidence 35788999999999999999999999775 44553 677764 443 677776 444445443
No 166
>PRK14806 bifunctional cyclohexadienyl dehydrogenase/ 3-phosphoshikimate 1-carboxyvinyltransferase; Provisional
Probab=55.65 E-value=17 Score=38.58 Aligned_cols=50 Identities=14% Similarity=0.270 Sum_probs=39.1
Q ss_pred CCCCHHHHHHHHhcCHHH----HHHHHhcChhHHHHhhcCChHHHHHHHHHHHH
Q psy15482 111 LSDDPATVRQLLLDNPKE----IALLQQNNPRLADALLNGSPEDFERILRQQYE 160 (302)
Q Consensus 111 ~~~npe~iR~~~l~NP~~----~~~L~~~NPeLa~al~~~d~e~f~~~lr~~~~ 160 (302)
...||+..++++.+|+.. ++.+.+.--+|++++..+|.+.|.+.|.+-.+
T Consensus 226 a~~~p~~~~di~~~n~~~~~~~l~~~~~~l~~~~~~l~~~d~~~~~~~~~~~~~ 279 (735)
T PRK14806 226 AASDPVMWHDIFLANKEAVLRALDHFRDDLDALRAAIEAGDGHALLGVFTRARA 279 (735)
T ss_pred ccCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 367999999999999753 35554444789999999999999999976433
No 167
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=54.91 E-value=19 Score=33.02 Aligned_cols=70 Identities=13% Similarity=0.086 Sum_probs=44.1
Q ss_pred EEEEEEcC--eeEEEEEcCCCccccCCHHHHHHcCchh--hhhcc-----------ccceecccCcceeeeEEEEeeeee
Q psy15482 217 YINCKVNG--YPVKAFIDSGAQTTIMSLACAERVNIMR--LLDTR-----------WAGIAKGVGVQKILGRIHMAIAEL 281 (302)
Q Consensus 217 yi~~~ing--~~v~a~vDsGaq~~i~s~~~a~~~gl~~--~~d~~-----------~~~~a~gvg~~~i~G~i~~~~~~i 281 (302)
|++++|.. ++++++||||.-.+.+..+-+..|+-.. ..|.. ...+..|-| ..+.|.+..-.|.|
T Consensus 2 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~c~~~~~~~~~~y~~~~Sst~~~~~~~~~~i~Y~~G-~~~~G~~~~D~v~i 80 (278)
T cd06097 2 LTPVKIGTPPQTLNLDLDTGSSDLWVFSSETPAAQQGGHKLYDPSKSSTAKLLPGATWSISYGDG-SSASGIVYTDTVSI 80 (278)
T ss_pred eeeEEECCCCcEEEEEEeCCCCceeEeeCCCCchhhccCCcCCCccCccceecCCcEEEEEeCCC-CeEEEEEEEEEEEE
Confidence 78899988 8899999999888877654333343221 12211 111223333 24788888888888
Q ss_pred cccccc
Q psy15482 282 TNFNGD 287 (302)
Q Consensus 282 g~~~~~ 287 (302)
|+..+.
T Consensus 81 g~~~~~ 86 (278)
T cd06097 81 GGVEVP 86 (278)
T ss_pred CCEEEC
Confidence 887654
No 168
>KOG0010|consensus
Probab=54.49 E-value=36 Score=34.77 Aligned_cols=65 Identities=15% Similarity=0.335 Sum_probs=39.0
Q ss_pred HHHHHHhcCHHHHHHHHhcChhHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------CCHHHHHHHHHHH
Q psy15482 117 TVRQLLLDNPKEIALLQQNNPRLADALLNGSPEDFERILRQQYEEREKREQLRLRMLNAHP------FDTEAQKLIAEEI 190 (302)
Q Consensus 117 ~iR~~~l~NP~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~~~~~~~~~e~~~~~~l~~DP------~d~e~Q~~i~e~i 190 (302)
.+.+++.+||.++.++ ..||-+...+ +||+.+++++...-+. .+++..+| .+|+..+--.|..
T Consensus 147 ~~q~~~~snpe~~~~~-m~nP~vq~ll--~Npd~mrq~I~anPqm--------q~lm~~npei~h~ln~p~i~rQtle~a 215 (493)
T KOG0010|consen 147 QMQNQLLSNPEALRQM-MENPIVQSLL--NNPDLMRQLIMANPQM--------QDLMQRNPEIGHLLNNPLILRQTLESA 215 (493)
T ss_pred cchhccccCHHHHHHh-hhChHHHHHh--cChHHHHHHHhcCHHH--------HHHHhhCCcchhhhcChHHHHHHHHhc
Confidence 3455567777777776 5677777777 6788888776541110 11233333 4667777777766
Q ss_pred HH
Q psy15482 191 KK 192 (302)
Q Consensus 191 ~q 192 (302)
|.
T Consensus 216 rN 217 (493)
T KOG0010|consen 216 RN 217 (493)
T ss_pred cC
Confidence 64
No 169
>PF02196 RBD: Raf-like Ras-binding domain; InterPro: IPR003116 This is the Ras-binding domain found in proteins related to Ras. It is found in association with the PE-bind and pkinase domains.; GO: 0005057 receptor signaling protein activity, 0007165 signal transduction; PDB: 1RFA_A 1C1Y_B 3KUD_B 1GUA_B 3KUC_B 2L05_A 3NY5_D 1RRB_A 1WFY_A 1WXM_A.
Probab=53.18 E-value=61 Score=24.10 Aligned_cols=48 Identities=8% Similarity=0.075 Sum_probs=36.0
Q ss_pred EEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC--CeecC
Q psy15482 4 TVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ--GVLLQ 52 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~--Gk~L~ 52 (302)
.|-. .+|+...+.|.|..|+.|+-+.+=+.-|+.++.-.++.. .++|.
T Consensus 4 ~v~L-P~~q~t~V~vrpg~ti~d~L~~~~~kr~L~~~~~~V~~~~~~k~l~ 53 (71)
T PF02196_consen 4 RVHL-PNGQRTVVQVRPGMTIRDALSKACKKRGLNPECCDVRLVGEKKPLD 53 (71)
T ss_dssp EEEE-TTTEEEEEEE-TTSBHHHHHHHHHHTTT--CCCEEEEEEEEEEEE-
T ss_pred EEEC-CCCCEEEEEEcCCCCHHHHHHHHHHHcCCCHHHEEEEEcCCCcccc
Confidence 4555 789999999999999999999999999999987777643 44554
No 170
>cd01766 Ufm1 Urm1-like ubiquitin domain. Ufm1 (ubiquitin-fold modifier 1) is a post-translational UBL (ubiquitin-like) modifier with a tertiary structure similar to that of ubiquitin. Ufm1 is initially expressed as a precursor which undergoes C-terminal cleavage to expose a conserved glycine residue that is required for the conjugation reactions involving Ufm1.
Probab=52.97 E-value=82 Score=24.18 Aligned_cols=59 Identities=20% Similarity=0.190 Sum_probs=46.4
Q ss_pred EEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC-CeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 15 LLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ-GVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~-Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
.+.|+++.-...+-...++++++|+..--++-| |-=+.- ..|-..+=+|-|+-|.|..|
T Consensus 19 vlsVpE~aPftAvlkfaAEeFkv~~~TsAiiTndGvGINP-~qtAGnvflkhgselrliPR 78 (82)
T cd01766 19 VLSVPESTPFTAVLKFAAEEFKVPAATSAIITNDGIGINP-AQTAGNVFLKHGSELRLIPR 78 (82)
T ss_pred EEeccccCchHHHHHHHHHhcCCCccceeEEecCccccCh-hhcccceeeecCCEeeeccc
Confidence 678999998999999999999999877666654 544544 47888888888888887655
No 171
>KOG2561|consensus
Probab=52.74 E-value=11 Score=38.20 Aligned_cols=59 Identities=17% Similarity=0.218 Sum_probs=50.2
Q ss_pred EEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 15 LLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
.++.+-+.|-.+|.+.|..++||+-+..+.+-+||+|.-. +||.+-|++.+--+++...
T Consensus 53 l~k~sL~i~Gselqa~iakklgi~enhvKci~~~Kils~~-ktlaeQglk~nq~~mv~~~ 111 (568)
T KOG2561|consen 53 LKKCSLHITGSELQALIAKKLGIKENHVKCIINGKILSCR-KTLAEQGLKINQELMVAVG 111 (568)
T ss_pred hhhcccccccHHHHHHHHHHcCCchhhhheeeccceeecc-cchhhhhhhhhhHHHHHhc
Confidence 3456667788899999999999999999999999999986 9999999998876665444
No 172
>PF12436 USP7_ICP0_bdg: ICP0-binding domain of Ubiquitin-specific protease 7; InterPro: IPR024729 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found in proteins of the peptidase C19 family, which contains contains ubiquitinyl hydrolases like ubiquitin-specific protease 7 (USP7). USP7 regulates the turnover of p53 [].; PDB: 2KVR_A 2YLM_A.
Probab=51.62 E-value=17 Score=33.60 Aligned_cols=60 Identities=15% Similarity=0.364 Sum_probs=41.4
Q ss_pred EEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC----Ce--ecCCCcccccccCCCCCCEEEEEEec
Q psy15482 15 LLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ----GV--LLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~----Gk--~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.+-|+.+.+|.+|-..|-+..|+|++...++|. ++ .+ |.+.|+..+.|.+||+|...+..
T Consensus 88 h~~v~~~~~v~~l~~~i~~~~g~p~~t~l~lyEEi~~~~ie~i-~~~~t~~~~el~~GdIi~fQ~~~ 153 (249)
T PF12436_consen 88 HVYVPKNDKVSELVPLINERAGLPPDTPLLLYEEIKPNMIEPI-DPNQTFEKAELQDGDIICFQRAP 153 (249)
T ss_dssp EEEEETT-BGGGTHHHHHHHHT--TT--EEEEEEEETTEEEE---SSSBHHHTT--TTEEEEEEE--
T ss_pred EEEECCCCCHHHHHHHHHHHcCCCCCCceEEEEEeccceeeEc-CCCCchhhcccCCCCEEEEEecc
Confidence 467999999999999999999999998888874 22 24 44589999999999999886643
No 173
>PF14327 CSTF2_hinge: Hinge domain of cleavage stimulation factor subunit 2; PDB: 4EBA_G.
Probab=51.48 E-value=17 Score=28.07 Aligned_cols=27 Identities=44% Similarity=0.607 Sum_probs=16.4
Q ss_pred HHHHHHhcCHHHHHHHHhcChhHHHHh
Q psy15482 117 TVRQLLLDNPKEIALLQQNNPRLADAL 143 (302)
Q Consensus 117 ~iR~~~l~NP~~~~~L~~~NPeLa~al 143 (302)
.++.+...||.....|-.+||+|+-|+
T Consensus 37 ~mK~l~~~~p~~ar~lL~~nPqLa~Al 63 (84)
T PF14327_consen 37 QMKQLAQQNPEQARQLLQQNPQLAYAL 63 (84)
T ss_dssp HHHHHHC----HHHHHHHS-THHHHHH
T ss_pred HHHHHHHhCHHHHHHHHHHCcHHHHHH
Confidence 345666788877777778999999888
No 174
>COG5100 NPL4 Nuclear pore protein [Nuclear structure]
Probab=51.44 E-value=52 Score=33.07 Aligned_cols=70 Identities=20% Similarity=0.246 Sum_probs=49.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhh--hCCCCCCeEEEEC----Ce---ecCCCcccccccCCCCCCEEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQ--SGYPAHEIAIEFQ----GV---LLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e--~gIp~~~Q~Li~~----Gk---~L~D~~~TL~~~gI~dgd~l~L 71 (302)
|-+.++. .....-+++.++.+++-|-++|-.. .+..+++.-+.-+ |- .|+| .|+.++|++.|++|.|
T Consensus 1 Mi~rfRs--k~G~~Rve~qe~d~lg~l~~kll~~~~~n~~~e~~svc~~p~~qG~~~s~l~d--qt~~dlGL~hGqmLyl 76 (571)
T COG5100 1 MIFRFRS--KEGQRRVEVQESDVLGMLSPKLLAFFEVNYSPEQISVCSAPDGQGEIFSLLKD--QTPDDLGLRHGQMLYL 76 (571)
T ss_pred CeEEEec--CCCceeeeccccchhhhhhHHHHhhhccCCCccceEEEeCCCCCceeeecccc--cChhhhccccCcEEEE
Confidence 6667777 3334569999999999887766554 3555666655532 33 3444 7999999999999999
Q ss_pred EEe
Q psy15482 72 KHL 74 (302)
Q Consensus 72 ~~~ 74 (302)
...
T Consensus 77 ~ys 79 (571)
T COG5100 77 EYS 79 (571)
T ss_pred Eec
Confidence 873
No 175
>PRK11130 moaD molybdopterin synthase small subunit; Provisional
Probab=51.24 E-value=95 Score=23.29 Aligned_cols=52 Identities=15% Similarity=0.210 Sum_probs=32.5
Q ss_pred EEeCC-CccHHHHHHHHHhhhC-----CCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 16 LDVSE-DLELENFKAFCEVQSG-----YPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 16 leV~~-~~TV~dLK~~Ie~e~g-----Ip~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
+++++ ..||.+|+..+..+.. ......+...|+..... +.-|++||.|-+..
T Consensus 19 ~~v~~~~~tv~~l~~~L~~~~~~~~~~~~~~~~~~aVN~~~~~~------~~~l~dgDeVai~P 76 (81)
T PRK11130 19 LELAADFPTVEALRQHLAQKGDRWALALEDGKLLAAVNQTLVSF------DHPLTDGDEVAFFP 76 (81)
T ss_pred EEecCCCCCHHHHHHHHHHhCccHHhhhcCCCEEEEECCEEcCC------CCCCCCCCEEEEeC
Confidence 44443 4799999999987752 11223344456654332 23599999998853
No 176
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=50.96 E-value=28 Score=32.72 Aligned_cols=71 Identities=13% Similarity=0.165 Sum_probs=45.7
Q ss_pred EEEEEEEEc--CeeEEEEEcCCCcccc-CCHHHH--HHcCchhhhhcc----cc------ceecccCcceeeeEEEEeee
Q psy15482 215 MLYINCKVN--GYPVKAFIDSGAQTTI-MSLACA--ERVNIMRLLDTR----WA------GIAKGVGVQKILGRIHMAIA 279 (302)
Q Consensus 215 mlyi~~~in--g~~v~a~vDsGaq~~i-~s~~~a--~~~gl~~~~d~~----~~------~~a~gvg~~~i~G~i~~~~~ 279 (302)
.-|++++|. ++++++++|||.-.+. .+..|. ..|......|.. |+ .+..|-| .+.|.+-.-.|
T Consensus 10 ~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~~C~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G--~~~G~~~~D~v 87 (317)
T cd06098 10 QYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYKSSKSSTYKKNGTSASIQYGTG--SISGFFSQDSV 87 (317)
T ss_pred EEEEEEEECCCCeEEEEEECCCccceEEecCCCCCCccccccCcCCcccCCCcccCCCEEEEEcCCc--eEEEEEEeeEE
Confidence 457899997 6889999999976554 455563 357654444321 11 1234444 36788888888
Q ss_pred eecccccc
Q psy15482 280 ELTNFNGD 287 (302)
Q Consensus 280 ~ig~~~~~ 287 (302)
.||+..+.
T Consensus 88 ~ig~~~v~ 95 (317)
T cd06098 88 TVGDLVVK 95 (317)
T ss_pred EECCEEEC
Confidence 88886554
No 177
>smart00144 PI3K_rbd PI3-kinase family, Ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding RA domains (unpublished observation).
Probab=50.70 E-value=93 Score=24.98 Aligned_cols=71 Identities=14% Similarity=0.159 Sum_probs=44.4
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhh------hCCCCC-CeEEEECCee--cCCCccccccc-----CCCCCCE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQ------SGYPAH-EIAIEFQGVL--LQDNKKTLKTH-----GIQDGDL 68 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e------~gIp~~-~Q~Li~~Gk~--L~D~~~TL~~~-----gI~dgd~ 68 (302)
|.|..+.....+++.+++++|+.+|.+.+-.+ ..-+++ +-.|--.|+. |..+ ..|.+| -++.|..
T Consensus 20 v~i~~~~~~~~~t~~v~~~~~p~~li~~~l~k~~~~~~~~~~~~~dyvLkV~G~~Eyl~~~-~~L~~~~yIr~cl~~~~~ 98 (108)
T smart00144 20 IVVHLEKDQQTKTLKVNPNCTPDSVLAQAFTKMLSLHDQVDPTSEDYILKVCGRDEYLLGD-HPLGSFEYIRNCLKNGRE 98 (108)
T ss_pred EEEEEccCceeEEEEECCCCCHHHHHHHHHHHHHhccccccCCCCcEEEEecCcEEEEeCC-eeeechHHHHHHHhcCCC
Confidence 44444345678999999999999999866554 112233 5666666664 3333 566665 3555666
Q ss_pred EEEEEe
Q psy15482 69 VLLKHL 74 (302)
Q Consensus 69 l~L~~~ 74 (302)
++|+..
T Consensus 99 ~~L~L~ 104 (108)
T smart00144 99 PHLVLM 104 (108)
T ss_pred ceEEEE
Confidence 666543
No 178
>cd01782 AF6_RA_repeat1 Ubiquitin domain of AT-6, first repeat. The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
Probab=49.56 E-value=91 Score=25.64 Aligned_cols=56 Identities=13% Similarity=0.049 Sum_probs=37.9
Q ss_pred CEEEEEeCCCCeEE--EEEeCCCccHHHHHHHHHhhhCCC-----CCCeEEE---ECC--eecCCCccc
Q psy15482 1 MKVTVSTQHTNECF--LLDVSEDLELENFKAFCEVQSGYP-----AHEIAIE---FQG--VLLQDNKKT 57 (302)
Q Consensus 1 MqItV~~~~~g~~~--~leV~~~~TV~dLK~~Ie~e~gIp-----~~~Q~Li---~~G--k~L~D~~~T 57 (302)
|+.++.. .+++.- .|-|+.++|+.++.+.+-.++.+. +...-|+ .+| +.|.|++.+
T Consensus 24 mrf~~qd-~~~k~atK~VrVsS~~tt~eVI~~LLeKFk~d~~~~s~p~FALYevh~nGe~RKL~d~E~P 91 (112)
T cd01782 24 MRFYFQD-GGEKVATKCIRVSSTATTRDVIDTLSEKFRPDMRMLSNPTYSLYEVHENGEERRLLDDEKP 91 (112)
T ss_pred EEEEEEc-CCCcEEEEEEEEecCCCHHHHHHHHHHHhcccccccCCcceEEEEEecCCceEEcCCcCCC
Confidence 5666766 444443 589999999999999999998855 4456665 244 455554333
No 179
>PF07319 DnaI_N: Primosomal protein DnaI N-terminus; InterPro: IPR009928 This entry represents the N terminus (approximately 120 residues) of bacterial primosomal DnaI proteins, although one family member appears to be of viral origin. DnaI is one of the components of the Bacillus subtilis replication restart primosome, and is required for the DnaB75-dependent loading of the DnaC helicase [].; PDB: 2K7R_A.
Probab=48.94 E-value=26 Score=27.52 Aligned_cols=43 Identities=21% Similarity=0.384 Sum_probs=31.2
Q ss_pred CHHHHHHHHhcCHHHHHHHHhcChhHHHHhhcCChHHHHHHHH
Q psy15482 114 DPATVRQLLLDNPKEIALLQQNNPRLADALLNGSPEDFERILR 156 (302)
Q Consensus 114 npe~iR~~~l~NP~~~~~L~~~NPeLa~al~~~d~e~f~~~lr 156 (302)
+-+.+++.++++|..++-|.+..|+|-+..-..+...|.+.+.
T Consensus 20 ~~~~l~~~vl~dp~V~~Fl~~h~~eLt~~~i~rsl~kLyEy~~ 62 (94)
T PF07319_consen 20 RYEQLKQEVLSDPEVQAFLQEHQPELTQEMIERSLSKLYEYVS 62 (94)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHSTTT--HHHHHHTHHHHHHHHH
T ss_pred HHHHHHHHHHcCHHHHHHHHHhHHhcCHHHHHHHHHHHHHHHH
Confidence 4567888999999999999766699987776667777777663
No 180
>COG0287 TyrA Prephenate dehydrogenase [Amino acid transport and metabolism]
Probab=48.48 E-value=38 Score=31.99 Aligned_cols=45 Identities=22% Similarity=0.372 Sum_probs=23.9
Q ss_pred CCCHHHHHHHHhcCHH-H---HHHHHhcChhHHHHhhcCChHHHHHHHH
Q psy15482 112 SDDPATVRQLLLDNPK-E---IALLQQNNPRLADALLNGSPEDFERILR 156 (302)
Q Consensus 112 ~~npe~iR~~~l~NP~-~---~~~L~~~NPeLa~al~~~d~e~f~~~lr 156 (302)
.+||++.+++++.||. . .+.+++.--+|.+++..+|.+.|.+.++
T Consensus 222 ~~~P~m~~dI~~~N~~~~l~~i~~~~~~l~~l~~~i~~~d~~~l~~~~~ 270 (279)
T COG0287 222 SSDPEMYADIQLSNKEALLEAIERFAKSLDELKELIENGDAEALADLFE 270 (279)
T ss_pred cCChHHHHHHHHhCcHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHH
Confidence 4556666666666665 2 2333444445555555555555555553
No 181
>cd06397 PB1_UP1 Uncharacterized protein 1. The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions.
Probab=47.94 E-value=47 Score=25.85 Aligned_cols=42 Identities=7% Similarity=0.095 Sum_probs=34.2
Q ss_pred EEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE
Q psy15482 5 VSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF 46 (302)
Q Consensus 5 V~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~ 46 (302)
+|.+.+|.+.-+.++.+-|.+.|+++|+..+++|+....|.|
T Consensus 3 fKv~~~g~~RRf~~~~~pt~~~L~~kl~~Lf~lp~~~~~vtY 44 (82)
T cd06397 3 FKSSFLGDTRRIVFPDIPTWEALASKLENLYNLPEIKVGVTY 44 (82)
T ss_pred EEEEeCCceEEEecCCCccHHHHHHHHHHHhCCChhHeEEEE
Confidence 333346777777788888999999999999999998877777
No 182
>PRK11199 tyrA bifunctional chorismate mutase/prephenate dehydrogenase; Provisional
Probab=46.62 E-value=36 Score=33.28 Aligned_cols=46 Identities=17% Similarity=0.349 Sum_probs=27.2
Q ss_pred CCCHHHHHHHHhcCHHHH---HHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 112 SDDPATVRQLLLDNPKEI---ALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 112 ~~npe~iR~~~l~NP~~~---~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
.+||++.++++++||... +.++..--+|.++|.++|.+.|.+.|.+
T Consensus 297 ~~~p~lw~dI~~~N~~~~~~l~~~~~~l~~l~~~l~~~d~~~l~~~~~~ 345 (374)
T PRK11199 297 AQDPQLYADIIMSSPENLALIKRYYQRFGEALELLEQGDKQAFIDSFRK 345 (374)
T ss_pred cCCHHHHHHHHHhChhHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 456666666666666443 3334444456666766677777766644
No 183
>KOG4572|consensus
Probab=45.91 E-value=1.2e+02 Score=33.35 Aligned_cols=64 Identities=11% Similarity=0.177 Sum_probs=46.6
Q ss_pred CCeEEEEEeCC-CccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCC--CCCCEEEEEE
Q psy15482 10 TNECFLLDVSE-DLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGI--QDGDLVLLKH 73 (302)
Q Consensus 10 ~g~~~~leV~~-~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI--~dgd~l~L~~ 73 (302)
.|+..+++... ..|+.+||..|+.+.|+....|.++-+|..-.+.++.|..|.- .+-.-|++..
T Consensus 3 rGqaltFDleaetqT~adLk~aiqke~~~aIq~~tfl~egGecmaadkrl~e~StaGTdTnPiffFn 69 (1424)
T KOG4572|consen 3 RGQALTFDLEAETQTFADLKDAIQKEVGHAIQDLTFLDEGGECMAADKRLAEISTAGTDTNPIFFFN 69 (1424)
T ss_pred CCceeEEeecceeehHHHHHHHHHHHhchhhceeeeeecCCcCcccccchhhhccccCCCCceEEee
Confidence 47777777764 5699999999999999999999998877665444477776651 2333455553
No 184
>cd01787 GRB7_RA RA (RAS-associated like) domain of Grb7. Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
Probab=45.79 E-value=60 Score=25.41 Aligned_cols=63 Identities=11% Similarity=0.139 Sum_probs=41.1
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCe-EEE-E-----CCeecCCCcc---cccccCCCCCCEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEI-AIE-F-----QGVLLQDNKK---TLKTHGIQDGDLVLL 71 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q-~Li-~-----~Gk~L~D~~~---TL~~~gI~dgd~l~L 71 (302)
.+|.+.+|+|+.++|+.++-.++..+.++..+.- .|+ + =.|.++|.+. -|+..++..+..+++
T Consensus 10 ~Dg~sK~l~V~~~~Ta~dV~~~L~~K~h~~~~~~W~LvE~~P~l~lER~~EDHE~vvdvl~~W~~~~~n~l~f 82 (85)
T cd01787 10 EDGASKSLEVDERMTARDVCQLLVDKNHCQDDSSWTLVEHLPHLQLERLFEDHELVVEVLSTWHSAGNSVLFF 82 (85)
T ss_pred cCCCeeEEEEcCCCcHHHHHHHHHHHhCCCCCCCeEEEEecchhhhhhhccchHHHHHHHHhcccCCCcEEEE
Confidence 5788999999999999999999999988765432 332 1 1344555311 245556644444443
No 185
>PF00276 Ribosomal_L23: Ribosomal protein L23; InterPro: IPR013025 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This domain is found in both eukaryotic L25 and prokaryotic and eukaryotic L23 proteins.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3DF4_T 3DF2_T 4A1E_R 4A1A_R 4A1C_R 4A17_R 2HGU_W 2HGQ_W 1N88_A 2HGJ_W ....
Probab=45.30 E-value=43 Score=26.17 Aligned_cols=41 Identities=12% Similarity=0.198 Sum_probs=34.7
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeE-EEECCeec
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIA-IEFQGVLL 51 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~-Li~~Gk~L 51 (302)
...+++.|+++.|=.++|+.|+.-+|+++..-+ +.+.|+.-
T Consensus 20 ~n~~tF~V~~~atK~~Ik~aie~iy~V~V~~Vnt~~~~gk~k 61 (91)
T PF00276_consen 20 PNQYTFEVDPRATKTEIKEAIEKIYGVKVKKVNTMNYPGKKK 61 (91)
T ss_dssp SSEEEEEETTTSTHHHHHHHHHHHHTSEEEEEEEEEETSEEE
T ss_pred CCEEEEEEeCCCCHHHHHHHHHhhcCCCeeEEEEeEeCCCce
Confidence 367899999999999999999999999996654 45777754
No 186
>PRK08655 prephenate dehydrogenase; Provisional
Probab=45.26 E-value=39 Score=33.81 Aligned_cols=46 Identities=17% Similarity=0.408 Sum_probs=24.5
Q ss_pred CCCHHHHHHHHhcCHHHHHH---HHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 112 SDDPATVRQLLLDNPKEIAL---LQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 112 ~~npe~iR~~~l~NP~~~~~---L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
.+||++.+++++.||...+. +...--+|.++|.++|.+.|.+.|..
T Consensus 218 ~~~p~lw~dI~~~N~~~~~~l~~~~~~l~~l~~~l~~~D~~~l~~~~~~ 266 (437)
T PRK08655 218 GQNPYLYASIQMNNPQIPEIHETFIKECEELSELVKNGDREEFVERMKE 266 (437)
T ss_pred cCCHHHHHHHHHhCHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 44566666666666633332 23344455555556666666665543
No 187
>PF10209 DUF2340: Uncharacterized conserved protein (DUF2340); InterPro: IPR018794 This entry consists of small proteins of approximately 150 amino acids whose function is unknown.
Probab=44.76 E-value=45 Score=27.88 Aligned_cols=57 Identities=23% Similarity=0.387 Sum_probs=37.9
Q ss_pred EeCC-CccHHHHHHHHHhh----hCCCC------CCeEEEEC----------------C-eec--CCCcccccccCCCCC
Q psy15482 17 DVSE-DLELENFKAFCEVQ----SGYPA------HEIAIEFQ----------------G-VLL--QDNKKTLKTHGIQDG 66 (302)
Q Consensus 17 eV~~-~~TV~dLK~~Ie~e----~gIp~------~~Q~Li~~----------------G-k~L--~D~~~TL~~~gI~dg 66 (302)
.|+. +.||.+|++.|..+ .|++| +..+|++. . -+| .++++||.++||.+.
T Consensus 21 ~vdL~~~Tv~~l~~~v~~~I~t~~~~~Pfrn~~yDtlKIy~~AHg~Kt~nLvInle~De~~iL~~~~~~~tL~~~gv~nE 100 (122)
T PF10209_consen 21 NVDLKDTTVKDLKEQVKQDIKTRPGLPPFRNVKYDTLKIYTKAHGSKTNNLVINLEDDEDWILDVSDDDKTLKELGVENE 100 (122)
T ss_pred cCCcccCcHHHHHHHHHHHHhcCCCCCCceeeecceEEEEecCCCCCcCCceeeccCCcceeeecCCCCCcHHHcCCCcc
Confidence 5776 89999999987766 45544 44455542 1 345 233478888888888
Q ss_pred CEEEEEE
Q psy15482 67 DLVLLKH 73 (302)
Q Consensus 67 d~l~L~~ 73 (302)
.-|-+..
T Consensus 101 TEiSfF~ 107 (122)
T PF10209_consen 101 TEISFFN 107 (122)
T ss_pred ceeeeeC
Confidence 7776544
No 188
>KOG0011|consensus
Probab=44.10 E-value=57 Score=31.76 Aligned_cols=45 Identities=27% Similarity=0.554 Sum_probs=35.1
Q ss_pred CCCHH--HHHHHHhcCHHHH----HHHHhcChhHHHHhhcCChHHHHHHHHH
Q psy15482 112 SDDPA--TVRQLLLDNPKEI----ALLQQNNPRLADALLNGSPEDFERILRQ 157 (302)
Q Consensus 112 ~~npe--~iR~~~l~NP~~~----~~L~~~NPeLa~al~~~d~e~f~~~lr~ 157 (302)
.++|. .+|+++..||+++ ++|-++||.|.+.|. .|.+.|-++|.+
T Consensus 220 r~~~qf~~lR~~iqqNP~ll~~~Lqqlg~~nP~L~q~Iq-~nqe~Fl~mlne 270 (340)
T KOG0011|consen 220 RNQPQFQQLRQMIQQNPELLHPLLQQLGKQNPQLLQLIQ-ENQEAFLQLLNE 270 (340)
T ss_pred hccHHHHHHHHHHhhCHHHHHHHHHHHhhhCHHHHHHHH-HHHHHHHHHhhc
Confidence 44454 6788899999754 567778999999996 588999998875
No 189
>PRK06545 prephenate dehydrogenase; Validated
Probab=43.66 E-value=58 Score=31.49 Aligned_cols=45 Identities=27% Similarity=0.399 Sum_probs=26.6
Q ss_pred CCCHHHHHHHHhcCHH----HHHHHHhcChhHHHHhhcCChHHHHHHHH
Q psy15482 112 SDDPATVRQLLLDNPK----EIALLQQNNPRLADALLNGSPEDFERILR 156 (302)
Q Consensus 112 ~~npe~iR~~~l~NP~----~~~~L~~~NPeLa~al~~~d~e~f~~~lr 156 (302)
..||++.++++.+|+. .++.++..--++.++|.++|.+.+.+.|.
T Consensus 222 ~~~p~~w~di~~~N~~~~~~~l~~~~~~l~~~~~~l~~~d~~~l~~~~~ 270 (359)
T PRK06545 222 SSDPGMWRDILESNAEALLDALDEWIEDLDRARDALESGDAEAIAELFD 270 (359)
T ss_pred CCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 4678888888887763 23334444445555665555555555553
No 190
>PF12436 USP7_ICP0_bdg: ICP0-binding domain of Ubiquitin-specific protease 7; InterPro: IPR024729 This domain is found in eukaryotes, and is approximately 40 amino acids in length. It is found in proteins of the peptidase C19 family, which contains contains ubiquitinyl hydrolases like ubiquitin-specific protease 7 (USP7). USP7 regulates the turnover of p53 [].; PDB: 2KVR_A 2YLM_A.
Probab=42.32 E-value=51 Score=30.46 Aligned_cols=43 Identities=9% Similarity=0.111 Sum_probs=31.3
Q ss_pred EEEEeC--CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE
Q psy15482 3 VTVSTQ--HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE 45 (302)
Q Consensus 3 ItV~~~--~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li 45 (302)
|+++.. ..+..|+++++..+|..+|-+.|+...|++|+..+|+
T Consensus 179 V~f~~~~~~~~~~F~l~ls~~~tY~~la~~Va~~l~~dP~~lr~~ 223 (249)
T PF12436_consen 179 VEFKPKDNPNDPEFTLWLSKKMTYDQLAEKVAEHLNVDPEHLRFF 223 (249)
T ss_dssp EEEEETTSTT---EEEEEETT--HHHHHHHHHHHHTS-GGGEEEE
T ss_pred EEEEECCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCChHHEEEE
Confidence 444431 3456899999999999999999999999999999887
No 191
>PF06234 TmoB: Toluene-4-monooxygenase system protein B (TmoB); InterPro: IPR009355 This family consists of several Toluene-4-monooxygenase system protein B (TmoB) sequences. Pseudomonas mendocina KR1 metabolises toluene as a carbon source. The initial step of the pathway is hydroxylation of toluene to form p-cresol by a multicomponent toluene-4-monooxygenase (T4MO) system [].; PDB: 3N1Y_C 3RNG_C 3RNA_C 3RN9_C 3RNC_C 3N1X_C 3RNE_C 3RNF_C 3N1Z_C 3N20_C ....
Probab=42.20 E-value=1.3e+02 Score=23.60 Aligned_cols=59 Identities=19% Similarity=0.245 Sum_probs=42.1
Q ss_pred EEEEeCCCccHHHHHHHHHhhh-C--CCC---CCeEEEECC--eecCCCcccccccCCCCCCEEEEEE
Q psy15482 14 FLLDVSEDLELENFKAFCEVQS-G--YPA---HEIAIEFQG--VLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e~-g--Ip~---~~Q~Li~~G--k~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
.-+.|+.+.|+.++-++++... | +++ ...++.++| ..+..+ .|+.+.||+.-+.|.+..
T Consensus 17 ~Lv~VDt~dTmdqVA~k~A~HsVGrRV~~~pg~~lrVr~~g~~~~~p~~-~tVaeagl~P~e~vev~~ 83 (85)
T PF06234_consen 17 QLVPVDTEDTMDQVAAKVAHHSVGRRVAPRPGAPLRVRRQGDTQPFPRS-MTVAEAGLQPMEWVEVRF 83 (85)
T ss_dssp EEEEEETT-BHHHHHHHHHTTTTTTSS---TTSEEEEEETTTSSEE-TT--BGGGHT--TTEEEEEEE
T ss_pred EEEEeCCCCcHHHHHHHHhhhhcceecCCCCCCEEEEEecCCCccCCCc-cEehhcCCCcceEEEEEE
Confidence 3578999999999999998763 4 332 345777899 888885 899999999999988764
No 192
>PF00788 RA: Ras association (RalGDS/AF-6) domain; InterPro: IPR000159 Proteins with this domain are mostly RasGTP effectors and include guanine-nucleotide releasing factor in mammals []. This factor stimulates the dissociation of GDP from the Ras-related RALA and RALB GTPases, which allows GTP binding and activation of the GTPases. It interacts and acts as an effector molecule for R-ras, K-Ras and Rap []. The domain is also present in a number of other proteins among them the sexual differentiation protein in yeast that is essential for mating and meiosis and yeast adenylate cyclase. These proteins contain repeated leucine-rich (LRR) segments.; GO: 0007165 signal transduction; PDB: 3EC8_A 2C5L_D 2BYF_A 2CS4_A 3KH0_A 2B3A_A 1RAX_A 2RGF_A 1WGR_A 1WXA_A ....
Probab=41.94 E-value=92 Score=23.18 Aligned_cols=46 Identities=13% Similarity=0.111 Sum_probs=34.2
Q ss_pred EEEEEeCCCccHHHHHHHHHhhhCC--CCCCeEEE-E---C--CeecCCCcccc
Q psy15482 13 CFLLDVSEDLELENFKAFCEVQSGY--PAHEIAIE-F---Q--GVLLQDNKKTL 58 (302)
Q Consensus 13 ~~~leV~~~~TV~dLK~~Ie~e~gI--p~~~Q~Li-~---~--Gk~L~D~~~TL 58 (302)
..+|-|++++|+.++-..+-.++++ .+..-.|+ + . .+.|.+++..|
T Consensus 18 ~k~i~v~~~tTa~evi~~~l~k~~l~~~~~~y~L~~~~~~~~~er~L~~~E~pl 71 (93)
T PF00788_consen 18 YKTIKVSSSTTAREVIEMALEKFGLAEDPSDYCLVEVEESGGEERPLDDDECPL 71 (93)
T ss_dssp EEEEEEETTSBHHHHHHHHHHHTTTSSSGGGEEEEEEECTTTEEEEETTTSBHH
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCCCCCCCEEEEEEEcCCCEEEEcCCCCchH
Confidence 7789999999999999999999999 44556674 2 1 24566554443
No 193
>PHA00431 internal virion protein C
Probab=41.84 E-value=1.4e+02 Score=31.54 Aligned_cols=104 Identities=19% Similarity=0.272 Sum_probs=59.9
Q ss_pred HHHHHHHH-------hcCHHHHHHHHh---------cChhHHHHhhcCChHHHHHHHHHHHHHHHH-HHHHHHHHHhcCC
Q psy15482 115 PATVRQLL-------LDNPKEIALLQQ---------NNPRLADALLNGSPEDFERILRQQYEEREK-REQLRLRMLNAHP 177 (302)
Q Consensus 115 pe~iR~~~-------l~NP~~~~~L~~---------~NPeLa~al~~~d~e~f~~~lr~~~~~~~~-~e~~~~~~l~~DP 177 (302)
||..|+.+ ..+|..|+.|+. -+-+.+..|.+++. .=|+.|.+.+..|-. ....+-+...-||
T Consensus 82 PEQrReAi~nGTLLYQDDPYAMeALr~KtGRnaAy~VDdev~~Ki~nGeF-rtRqemEEyRhsrL~e~aksyAE~~GInp 160 (746)
T PHA00431 82 PEQRREAIKNGTLLYQDDPYAMEALRFKTGRNAAYLVDDEVAQKIKNGEF-RTRQEMEEYRHSRLQESAKSYAEQFGINP 160 (746)
T ss_pred HHHHHHHHhcCceeecCCHHHHHHHHHHhccchhhhccHHHHHHHhccCc-ccHHHHHHHHHHHHHHHHHHHHHHhCCCC
Confidence 88888765 378999988865 24556666654432 112333332111110 1112334456799
Q ss_pred CCHHHHHHHHHHHHHhhHH-----HHHHHHHHhCCCCcceeeEEEEEEEEcCe
Q psy15482 178 FDTEAQKLIAEEIKKNNIE-----ANMEAAMEYNPETFGTVVMLYINCKVNGY 225 (302)
Q Consensus 178 ~d~e~Q~~i~e~i~q~~i~-----~n~~~A~E~~PE~f~~v~mlyi~~~ing~ 225 (302)
.|++.|+-+-.-|-++||- .++.+..+ -.-.+|-..+++||+
T Consensus 161 ~D~~fq~GFnsdIteRNialYg~hd~~~S~q~------q~~A~lnsr~eln~v 207 (746)
T PHA00431 161 TDEEYQRGFNSDITERNIALYGAHDNFLSDQA------QKGAILNSRVELNGV 207 (746)
T ss_pred CCHHHHhhcccchHhhhHHHhhhhHHHhHHHH------hhcchhhhhhhhhhh
Confidence 9999999999999988885 22222222 234455556666654
No 194
>cd01612 APG12_C Ubiquitin-like domain of APG12. APG12_C The carboxy-terminal ubiquitin-like domain of APG12. Autophagy is a process in which cytoplasmic components are delivered to the lysosome/vacuole for degradation. Autophagy requires a ubiquitin-like protein conjugation system, in which APG12 is covalently bound to APG5.
Probab=40.99 E-value=42 Score=26.16 Aligned_cols=59 Identities=5% Similarity=0.069 Sum_probs=39.8
Q ss_pred EeCCCccHHHHHHHHHhhhCCCCCCeEEEE-CCeecCCCccccccc--CCCCCCEEEEEEec
Q psy15482 17 DVSEDLELENFKAFCEVQSGYPAHEIAIEF-QGVLLQDNKKTLKTH--GIQDGDLVLLKHLG 75 (302)
Q Consensus 17 eV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~-~Gk~L~D~~~TL~~~--gI~dgd~l~L~~~~ 75 (302)
-|+.+.|+.+|...|....++++++-..+| |...+...+.++++. .-.+|..|.+....
T Consensus 21 lv~~~~tv~~~~~~lrk~L~l~~~~slflyvnn~f~p~~d~~~g~LY~~~~~dGfLyi~Ys~ 82 (87)
T cd01612 21 KISATQSFQAVIDFLRKRLKLKASDSLFLYINNSFAPSPDENVGNLYRCFGTNGELIVSYCK 82 (87)
T ss_pred EeCCCCCHHHHHHHHHHHhCCCccCeEEEEECCccCCCchhHHHHHHHhcCCCCEEEEEEeC
Confidence 499999999999999999999887644444 543243222565543 11567788877653
No 195
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank
Probab=40.82 E-value=50 Score=30.99 Aligned_cols=70 Identities=13% Similarity=0.233 Sum_probs=45.9
Q ss_pred EEEEEEEc--CeeEEEEEcCCCccccC-CHHHH---HHcCchhhhhccc----------cceecccCcceeeeEEEEeee
Q psy15482 216 LYINCKVN--GYPVKAFIDSGAQTTIM-SLACA---ERVNIMRLLDTRW----------AGIAKGVGVQKILGRIHMAIA 279 (302)
Q Consensus 216 lyi~~~in--g~~v~a~vDsGaq~~i~-s~~~a---~~~gl~~~~d~~~----------~~~a~gvg~~~i~G~i~~~~~ 279 (302)
-|++++|- ++++++++|||.-.+-+ +..|. ..|+.....|..- -.+..|-| .+-|.+..-.|
T Consensus 7 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~G--~~~G~~~~D~v 84 (325)
T cd05490 7 YYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSLLDIACWLHHKYNSSKSSTYVKNGTEFAIQYGSG--SLSGYLSQDTV 84 (325)
T ss_pred EEEEEEECCCCcEEEEEEeCCCccEEEEcCCCCCCCccccCcCcCCcccCcceeeCCcEEEEEECCc--EEEEEEeeeEE
Confidence 47888996 48899999999776655 44554 3676544443211 11234444 36799999999
Q ss_pred eecccccc
Q psy15482 280 ELTNFNGD 287 (302)
Q Consensus 280 ~ig~~~~~ 287 (302)
.+|+..+.
T Consensus 85 ~~g~~~~~ 92 (325)
T cd05490 85 SIGGLQVE 92 (325)
T ss_pred EECCEEEc
Confidence 99987654
No 196
>cd01818 TIAM1_RBD Ubiquitin domain of Tiam1 guanine nucleotide exchange factor. Tiam1 (T lymphoma invasion and metastasis 1) a guanine nucleotide exchange factor that activates Rac, is an important regulator of Rho GTPase functions in tumor cells including regulation of cell shape and invasiveness in epithelial cells and fibroblasts. TIAM1 has an RBD (Ras-binding domain) similar to that of Raf kinase as well as PH (pleckstrin homology), PDZ, and RhoGEF domains.
Probab=40.52 E-value=61 Score=24.92 Aligned_cols=38 Identities=13% Similarity=0.127 Sum_probs=34.4
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEE
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEF 46 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~ 46 (302)
.+|...++-|.|++|++|+-+..-..-++.++...|-.
T Consensus 7 Pn~~~~~v~vrp~~tv~dvLe~aCk~~~ldp~eh~Lrl 44 (77)
T cd01818 7 PDNQPVLTYLRPGMSVEDFLESACKRKQLDPMEHYLRL 44 (77)
T ss_pred CCCceEEEEECCCCCHHHHHHHHHHhcCCChhHheeEE
Confidence 57888999999999999999999999999999887754
No 197
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=39.53 E-value=1.7e+02 Score=25.82 Aligned_cols=22 Identities=27% Similarity=0.462 Sum_probs=8.7
Q ss_pred cCCCCHHHHHHHHHHHHHhhHH
Q psy15482 175 AHPFDTEAQKLIAEEIKKNNIE 196 (302)
Q Consensus 175 ~DP~d~e~Q~~i~e~i~q~~i~ 196 (302)
+|+||..+=+-..|.+-++.++
T Consensus 87 ad~FDEaavra~a~kma~~~~e 108 (166)
T PRK10363 87 AENFDENAVRAQAEKMAQEQVA 108 (166)
T ss_pred cCCCCHHHHHHHHHHHHHHHHH
Confidence 4444444433333333333333
No 198
>PF11069 DUF2870: Protein of unknown function (DUF2870); InterPro: IPR021298 This is a eukaryotic family of proteins with unknown function.
Probab=39.39 E-value=66 Score=25.88 Aligned_cols=31 Identities=19% Similarity=0.114 Sum_probs=22.1
Q ss_pred EEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 43 AIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 43 ~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.|.|.||.|..+ ++|++| |..++.--++.+.
T Consensus 3 ~LW~aGK~l~~~-k~l~dy-~GkNEKtKiivKl 33 (98)
T PF11069_consen 3 QLWWAGKELQRG-KKLSDY-IGKNEKTKIIVKL 33 (98)
T ss_pred eEEeccccccCC-CcHHHh-cCCCcceeEEEEe
Confidence 588999999876 899999 5555444444443
No 199
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=39.16 E-value=2.1e+02 Score=25.03 Aligned_cols=27 Identities=22% Similarity=0.377 Sum_probs=17.3
Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhhHH
Q psy15482 170 LRMLNAHPFDTEAQKLIAEEIKKNNIE 196 (302)
Q Consensus 170 ~~~l~~DP~d~e~Q~~i~e~i~q~~i~ 196 (302)
..++.+||||+++=+.+.+.+.+..++
T Consensus 95 ~~Ll~a~~FDeaavral~~~~~~~~~e 121 (170)
T PRK12750 95 QALVLADDFDEAAANDLAKQMVEKQVE 121 (170)
T ss_pred HHHHhcCCCCHHHHHHHHHHHHHHHHH
Confidence 467778999998766555554444443
No 200
>PF03658 Ub-RnfH: RnfH family Ubiquitin; InterPro: IPR005346 This is a small family of proteins of unknown function.; PDB: 2HJ1_B.
Probab=39.09 E-value=1.3e+02 Score=23.39 Aligned_cols=67 Identities=18% Similarity=0.201 Sum_probs=37.8
Q ss_pred CEEEEEeC--CCCeEEEEEeCCCccHHHHHH---HHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEE
Q psy15482 1 MKVTVSTQ--HTNECFLLDVSEDLELENFKA---FCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKH 73 (302)
Q Consensus 1 MqItV~~~--~~g~~~~leV~~~~TV~dLK~---~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~ 73 (302)
|+|+|... .....++++|++.+||.+--. +++..-++..+..++=--||...- +.-+++||-|.+-+
T Consensus 1 i~VeV~yA~p~~q~~~~l~vp~GtTv~~Ai~~Sgi~~~~p~idl~~~~vGIfGk~~~~------d~~L~~GDRVEIYR 72 (84)
T PF03658_consen 1 IRVEVAYALPERQVILTLEVPEGTTVAQAIEASGILEQFPEIDLEKNKVGIFGKLVKL------DTVLRDGDRVEIYR 72 (84)
T ss_dssp EEEEEEEEETTCEEEEEEEEETT-BHHHHHHHHTHHHH-TT--TTTSEEEEEE-S--T------T-B--TT-EEEEE-
T ss_pred CEEEEEEECCCeEEEEEEECCCcCcHHHHHHHcCchhhCcccCcccceeeeeeeEcCC------CCcCCCCCEEEEec
Confidence 67787642 223345899999999998655 445555677777887544665543 23589999999865
No 201
>PF03671 Ufm1: Ubiquitin fold modifier 1 protein; InterPro: IPR005375 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin-like molecules (UBLs) can be divided into two subclasses: type-1 UBLs, which ligate to target proteins in a manner similar, but not identical, to the ubiquitylation pathway, such as SUMO, NEDD8, and UCRP/ISG15, and type-2 UBLs (also called UDPs, ubiquitin-domain proteins), which contain ubiquitin-like structure embedded in a variety of different classes of large proteins with apparently distinct functions, such as Rad23, Elongin B, Scythe, Parkin, and HOIL-1. This entry represents Ufm1 (ubiquitin-fold modifier), which is a ubiquitin-like protein with structural similarities to ubiquitin [, ]. Ufm1 is one of a number of ubiquitin-like modifiers that conjugate to target proteins in cells through Uba5 (E1) and Ufc1 (E2). The Ufm1-system is conserved in metazoa and plants, suggesting it has a potential role in multicellular organisms []. Human Ufm1 is synthesized as a precursor consisting of 85 amino-acid residues. Prior to activation by Uba5, the extra amino acids at the C-terminal region of Ufm1 are removed to expose Gly, which is necessary for conjugation to target molecule(s). C-terminal processing of Ufm1 requires two specific cysteine peptidases (IPR012462 from INTERPRO): UfSP1 and UfSP2; both peptidases are also able to release Ufm1 from Ufm1-conjugated cellular proteins. UfSP2 is present in most, if not all, of multi-cellular organisms including plant, nematode, fly, and mammal, whereas UfSP1 is not present in plants and nematodes []. For further information on ubiquitin, please see Protein of the Month [].; PDB: 1J0G_A 1WXS_A 1L7Y_A.
Probab=38.75 E-value=97 Score=23.66 Aligned_cols=57 Identities=19% Similarity=0.216 Sum_probs=40.9
Q ss_pred EEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEEC-CeecCCCcccccccCCCCCCEEEE
Q psy15482 14 FLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQ-GVLLQDNKKTLKTHGIQDGDLVLL 71 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~-Gk~L~D~~~TL~~~gI~dgd~l~L 71 (302)
..+.|+++.....+-...++++.+|+..-.++-+ |-=+.- ..|-.+.=+|-|+.|.|
T Consensus 18 kv~sVPE~apftaVlkfaAeeF~vp~~tsaiItndG~GInP-~QTag~vflKhGseLrl 75 (76)
T PF03671_consen 18 KVISVPEEAPFTAVLKFAAEEFKVPPATSAIITNDGVGINP-QQTAGNVFLKHGSELRL 75 (76)
T ss_dssp EEEEEETTSBHHHHHHHHHHHTTS-SSSEEEEESSS-EE-T-TSBHHHHHHHT-SEEEE
T ss_pred eEEecCCCCchHHHHHHHHHHcCCCCceEEEEecCCccccc-chhhhhhHhhcCcEeee
Confidence 3578999999999999999999999988888764 444544 36766666677777655
No 202
>cd01611 GABARAP Ubiquitin domain of GABA-receptor-associated protein. GABARAP (GABA-receptor-associated protein) belongs ot a large family of proteins that mediate intracellular membrane trafficking and/or fusion. GABARAP binds not only to GABA, type A but also to tubulin, gephrin, and ULK1. Orthologues of GABARAP include Gate-16 (golgi-associated ATPase enhancer), LC3 (microtubule-associated protein light chain 3), and ATG8 (autophagy protein 8). ATG8 is a ubiquitin-like protein that is conjugated to the membrane phospholipid, phosphatidylethanolamine as part of a ubiquitin-like conjugation system essential for autophagosome-formation.
Probab=38.72 E-value=42 Score=27.41 Aligned_cols=57 Identities=11% Similarity=0.075 Sum_probs=40.0
Q ss_pred EeCCCccHHHHHHHHHhhhCCCCCCeEE-EECCeecCCCccccccc----CCCCCCEEEEEEec
Q psy15482 17 DVSEDLELENFKAFCEVQSGYPAHEIAI-EFQGVLLQDNKKTLKTH----GIQDGDLVLLKHLG 75 (302)
Q Consensus 17 eV~~~~TV~dLK~~Ie~e~gIp~~~Q~L-i~~Gk~L~D~~~TL~~~----gI~dgd~l~L~~~~ 75 (302)
-|+.+.||.+|...|....++++++-.. +.|+ .+...+.++++. +-. |..|+|....
T Consensus 46 lVp~~~tv~~f~~~irk~l~l~~~~slfl~Vn~-~~p~~~~~~~~lY~~~kd~-DGfLyl~Ys~ 107 (112)
T cd01611 46 LVPSDLTVGQFVYIIRKRIQLRPEKALFLFVNN-SLPPTSATMSQLYEEHKDE-DGFLYMTYSS 107 (112)
T ss_pred EecCCCCHHHHHHHHHHHhCCCccceEEEEECC-ccCCchhHHHHHHHHhCCC-CCEEEEEEec
Confidence 3999999999999999999998876444 4455 453334677654 433 5577776653
No 203
>cd01764 Urm1 Urm1-like ubuitin domain. Urm1 (Ubiquitin-Related Modifier1) The Urm1 fold, like those of two closely related proteins MoaD (molybdopterin synthase) and ThiS (sulfur carrier protein), is similar to that of ubiquitin although there is little or no sequence similarity. The C-terminal glycines of Urm1 are conjugated to an E1-like protein Uba4 as part of a novel conjugation system in yeast. The Urm1 fold is found only in eukaryotes.
Probab=38.48 E-value=87 Score=24.60 Aligned_cols=60 Identities=13% Similarity=0.234 Sum_probs=34.4
Q ss_pred eEEEE--EeCCCccHHHHHHHHHhhhCCCCCCeEEEE-CC------eecCCCc--ccc--cccCCCCCCEEEEEE
Q psy15482 12 ECFLL--DVSEDLELENFKAFCEVQSGYPAHEIAIEF-QG------VLLQDNK--KTL--KTHGIQDGDLVLLKH 73 (302)
Q Consensus 12 ~~~~l--eV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~-~G------k~L~D~~--~TL--~~~gI~dgd~l~L~~ 73 (302)
+.+.+ +++...||.+|-..+.... |...-.|+. +| -+|-|+. ..| .++-+++||.|.+.+
T Consensus 17 ~~~~~~~~~~~~~tV~dll~~L~~~~--~~~~~~lf~~~g~lr~~i~VlvN~~di~~l~g~~t~L~dgD~v~i~P 89 (94)
T cd01764 17 KEHHVVLDGEKPVTVGDLLDYVASNL--LEERPDLFIEGGSVRPGIIVLINDTDWELLGEEDYILEDGDHVVFIS 89 (94)
T ss_pred eEEEEeccCCCCCcHHHHHHHHHHhC--chhhhhhEecCCcccCCEEEEECCccccccCCcccCCCCcCEEEEEC
Confidence 44444 4446779999999998776 232333322 11 1121211 223 356799999999865
No 204
>PF02991 Atg8: Autophagy protein Atg8 ubiquitin like; InterPro: IPR004241 Autophagy is generally known as a process involved in the degradation of bulk cytoplasmic components that are non-specifically sequestered into an autophagosome, where they are sequestered into double-membrane vesicles and delivered to the degradative organelle, the lysosome/vacuole, for breakdown and eventual recycling of the resulting macromolecules. The yeast proteins are involved in the autophagosome, and Atg8 binds Atg19, via its N terminus and the C terminus of Atg19. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity [] Related proteins that belong to this group include the human ganglioside expression factor and a symbiosis-related fungal protein.; PDB: 3ECI_A 3D32_B 1GNU_A 1KM7_A 1KLV_A 1KOT_A 3DOW_A 1KJT_A 1V49_A 2ZJD_C ....
Probab=36.18 E-value=53 Score=26.52 Aligned_cols=56 Identities=11% Similarity=0.172 Sum_probs=37.5
Q ss_pred eCCCccHHHHHHHHHhhhCCCCCC-eEEEECCeecCCCccccccc----CCCCCCEEEEEEec
Q psy15482 18 VSEDLELENFKAFCEVQSGYPAHE-IAIEFQGVLLQDNKKTLKTH----GIQDGDLVLLKHLG 75 (302)
Q Consensus 18 V~~~~TV~dLK~~Ie~e~gIp~~~-Q~Li~~Gk~L~D~~~TL~~~----gI~dgd~l~L~~~~ 75 (302)
|+.+.||.+|...|.....+++++ .-|+.|+..... +.||++. . .+|..|.+....
T Consensus 39 vp~~~tv~qf~~~ir~rl~l~~~~alfl~Vn~~lp~~-s~tm~elY~~~k-deDGFLY~~Ys~ 99 (104)
T PF02991_consen 39 VPKDLTVGQFVYIIRKRLQLSPEQALFLFVNNTLPST-SSTMGELYEKYK-DEDGFLYMTYSS 99 (104)
T ss_dssp EETTSBHHHHHHHHHHHTT--TTS-EEEEBTTBESST-TSBHHHHHHHHB--TTSSEEEEEES
T ss_pred EcCCCchhhHHHHhhhhhcCCCCceEEEEEcCcccch-hhHHHHHHHHhC-CCCCeEEEEecc
Confidence 789999999999999999998864 444456644444 4787764 2 245567776543
No 205
>PF09269 DUF1967: Domain of unknown function (DUF1967); InterPro: IPR015349 The Obg family comprises a group of ancient P-loop small G proteins (GTPases) belonging to the TRAFAC (for translation factors) class and can be subdivided into several distinct protein subfamilies []. OBG GTPases have been found in both prokaryotes and eukaryotes []. The structure of the OBG GTPase from Thermus thermophilus has been determined []. This entry represents a C-terminal domain found in certain OBG GTPases. This domain contains a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. It is predominantly found in the bacterial GTP-binding protein Obg, and is functionally uncharacterised. ; GO: 0000166 nucleotide binding; PDB: 1UDX_A.
Probab=35.83 E-value=17 Score=27.01 Aligned_cols=16 Identities=38% Similarity=0.704 Sum_probs=11.3
Q ss_pred ccccccCCCCCCEEEE
Q psy15482 56 KTLKTHGIQDGDLVLL 71 (302)
Q Consensus 56 ~TL~~~gI~dgd~l~L 71 (302)
+.|...|+++||+|.+
T Consensus 47 ~~L~~~G~~~GD~V~I 62 (69)
T PF09269_consen 47 KALRKAGAKEGDTVRI 62 (69)
T ss_dssp HHHHTTT--TT-EEEE
T ss_pred HHHHHcCCCCCCEEEE
Confidence 5788999999999976
No 206
>PRK11861 bifunctional prephenate dehydrogenase/3-phosphoshikimate 1-carboxyvinyltransferase; Provisional
Probab=35.65 E-value=88 Score=33.11 Aligned_cols=48 Identities=19% Similarity=0.270 Sum_probs=38.5
Q ss_pred CCCCHHHHHHHHhcCHHH----HHHHHhcChhHHHHhhcCChHHHHHHHHHH
Q psy15482 111 LSDDPATVRQLLLDNPKE----IALLQQNNPRLADALLNGSPEDFERILRQQ 158 (302)
Q Consensus 111 ~~~npe~iR~~~l~NP~~----~~~L~~~NPeLa~al~~~d~e~f~~~lr~~ 158 (302)
...||++-+++|++|+.- ++.++..--++.++|.++|.+.|.+.+.+-
T Consensus 160 a~~~p~lw~di~~~N~~~i~~~l~~~~~~l~~~~~~l~~~d~~~l~~~~~~~ 211 (673)
T PRK11861 160 AASSPEMWRDVCLANRAALLDELDAYTAVLARLRAAIDAGDGAALEAVFARS 211 (673)
T ss_pred ccCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 467899999999999874 455666777888899888988888888553
No 207
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=35.65 E-value=1e+02 Score=28.07 Aligned_cols=55 Identities=13% Similarity=0.138 Sum_probs=40.6
Q ss_pred EEEEEEEcC--eeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeecccccc
Q psy15482 216 LYINCKVNG--YPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 216 lyi~~~ing--~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~~~~ 287 (302)
-|++++|-. +++++.+|||.-.+.+. .| .+..|- ...+.|.+..-.+.+|+..+.
T Consensus 3 Y~~~i~iGtp~q~~~v~~DTgS~~~wv~---------------~~-~~~Y~~-g~~~~G~~~~D~v~~g~~~~~ 59 (295)
T cd05474 3 YSAELSVGTPPQKVTVLLDTGSSDLWVP---------------DF-SISYGD-GTSASGTWGTDTVSIGGATVK 59 (295)
T ss_pred EEEEEEECCCCcEEEEEEeCCCCcceee---------------ee-EEEecc-CCcEEEEEEEEEEEECCeEec
Confidence 478889988 88999999999998888 12 233332 246788888888888876543
No 208
>PF04126 Cyclophil_like: Cyclophilin-like; InterPro: IPR007256 Proteins of this family have no known function.; PDB: 2KA0_A 1ZX8_C 2NNZ_A.
Probab=35.36 E-value=38 Score=27.84 Aligned_cols=32 Identities=13% Similarity=0.221 Sum_probs=26.7
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhh
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFKAFCEVQ 34 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e 34 (302)
|+|.|+. +++.+..++.++.|..+|.+++--.
T Consensus 1 mkI~i~i--~~~~~~a~L~d~~ta~~~~~~LPlt 32 (120)
T PF04126_consen 1 MKIKITI--GGQEIEAELNDSPTARAFAAQLPLT 32 (120)
T ss_dssp EEEEEEE--TTEEEEEEEETTHHHHHHHHC-SEE
T ss_pred CeEEEEE--CCEEEEEEECCCHHHHHHHHhCCeE
Confidence 8999998 5899999999999999998876433
No 209
>KOG2689|consensus
Probab=35.29 E-value=1e+02 Score=29.38 Aligned_cols=68 Identities=12% Similarity=0.229 Sum_probs=51.3
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC---eec--CCCcccccccCCCCCCEEEEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG---VLL--QDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G---k~L--~D~~~TL~~~gI~dgd~l~L~ 72 (302)
|-|+. .+|++++.+..+..++.+|+..|+...|...+=-. ++.| +.+ .|..++|..+++.+-.+|.+.
T Consensus 213 lQiRl-~DG~Tl~~tF~a~E~L~~VR~wVd~n~~~~~~P~~-f~t~fPR~tf~edD~~KpLq~L~L~Psa~lil~ 285 (290)
T KOG2689|consen 213 LQIRL-PDGQTLTQTFNARETLAAVRLWVDLNRGDGLDPYS-FHTGFPRVTFTEDDELKPLQELDLVPSAVLILE 285 (290)
T ss_pred EEEEc-CCCCeeeeecCchhhHHHHHHHHHHhccCCCCCee-eecCCCceecccccccccHHHhccccchheecc
Confidence 55777 78999999999999999999999999887663332 2333 223 344578999999888887764
No 210
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=33.68 E-value=81 Score=27.13 Aligned_cols=46 Identities=26% Similarity=0.419 Sum_probs=28.7
Q ss_pred HHHHHHHHhcChhHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy15482 126 PKEIALLQQNNPRLADALLNGSPEDFERILRQQYEEREKREQLRLRMLNAHPFDTEAQKLIAEEIKK 192 (302)
Q Consensus 126 P~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~~~~~~~~~e~~~~~~l~~DP~d~e~Q~~i~e~i~q 192 (302)
.++.+.|++.+|.|+..+ +.+...++.. -|=.+||+|..+.+.|.+
T Consensus 43 de~i~~LK~ksP~L~~k~-----~~l~~~~k~k----------------i~~L~peak~Fv~~li~~ 88 (154)
T PF05823_consen 43 DEMIAALKEKSPSLYEKA-----EKLRDKLKKK----------------IDKLSPEAKAFVKELIAK 88 (154)
T ss_dssp TTHHHHHHHH-HHHHHHH-----HHHHHHHHHT----------------TTT--HHHHHHHHHHHHH
T ss_pred HHHHHHHHHhCHHHHHHH-----HHHHHHHHHH----------------HHcCCHHHHHHHHHHHHH
Confidence 457777888888888777 3444444321 355678888888877775
No 211
>TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence.
Probab=33.64 E-value=21 Score=26.60 Aligned_cols=17 Identities=35% Similarity=0.628 Sum_probs=15.0
Q ss_pred ccccccCCCCCCEEEEE
Q psy15482 56 KTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 56 ~TL~~~gI~dgd~l~L~ 72 (302)
+.|...|+++||+|.+-
T Consensus 47 ~~L~~~G~~~GD~V~Ig 63 (69)
T TIGR03595 47 DALRKAGAKDGDTVRIG 63 (69)
T ss_pred HHHHHcCCCCCCEEEEc
Confidence 57999999999999873
No 212
>cd01817 RGS12_RBD Ubiquitin domain of RGS12 and RGS14. RGS12 (regulator of G signalling 12), and RGS14, are members of a family of GTPase-activating proteins (GAP's) specific for the G-alpha subunit, which act as key inhibitors of G-protein-mediated cell responses in eukaryotes. Their domain architecture includes tandem RBD domains as well as PDZ , PTB, and RGS, and GoLoco domains.
Probab=33.63 E-value=2.1e+02 Score=21.73 Aligned_cols=62 Identities=10% Similarity=0.070 Sum_probs=45.4
Q ss_pred CCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECC--eecCCCcccccccCCCCCCEEEEEEe
Q psy15482 9 HTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQG--VLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 9 ~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~G--k~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
.+|+...+.+.|.+||.|.-..+-++-|+..+.-.++.-| ++|.=+ .+..+-.|.-|.|..+
T Consensus 7 Pdg~~T~V~vrpG~ti~d~L~kllekRgl~~~~~~vf~~g~~k~l~~~----qD~~~L~~~El~vE~r 70 (73)
T cd01817 7 PDGSTTVVPTRPGESIRDLLSGLCEKRGINYAAVDLFLVGGDKPLVLD----QDSSVLAGQEVRLEKR 70 (73)
T ss_pred CCCCeEEEEecCCCCHHHHHHHHHHHcCCChhHEEEEEecCCcccccC----CccceeeccEEEEEEe
Confidence 5788889999999999999999999999999888888766 344211 2334444555665544
No 213
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme. Proteinase A preferentially hydro
Probab=33.00 E-value=63 Score=30.37 Aligned_cols=71 Identities=13% Similarity=0.266 Sum_probs=46.1
Q ss_pred EEEEEEEEc--CeeEEEEEcCCCccccCC-HHHH-HHcCchhhhhcc----c------cceecccCcceeeeEEEEeeee
Q psy15482 215 MLYINCKVN--GYPVKAFIDSGAQTTIMS-LACA-ERVNIMRLLDTR----W------AGIAKGVGVQKILGRIHMAIAE 280 (302)
Q Consensus 215 mlyi~~~in--g~~v~a~vDsGaq~~i~s-~~~a-~~~gl~~~~d~~----~------~~~a~gvg~~~i~G~i~~~~~~ 280 (302)
.-|++++|. ++++.+++|||...+.+. ..|. ..|......|.. | ..+..|-| .+-|.+..-.+.
T Consensus 10 ~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~~C~~~~C~~~~~y~~~~Sst~~~~~~~~~~~y~~g--~~~G~~~~D~v~ 87 (320)
T cd05488 10 QYFTDITLGTPPQKFKVILDTGSSNLWVPSVKCGSIACFLHSKYDSSASSTYKANGTEFKIQYGSG--SLEGFVSQDTLS 87 (320)
T ss_pred EEEEEEEECCCCcEEEEEEecCCcceEEEcCCCCCcccCCcceECCCCCcceeeCCCEEEEEECCc--eEEEEEEEeEEE
Confidence 358899997 488999999998877664 3443 246543333211 1 12233434 468999999999
Q ss_pred ecccccc
Q psy15482 281 LTNFNGD 287 (302)
Q Consensus 281 ig~~~~~ 287 (302)
||++.+.
T Consensus 88 ig~~~~~ 94 (320)
T cd05488 88 IGDLTIK 94 (320)
T ss_pred ECCEEEC
Confidence 9987654
No 214
>KOG3391|consensus
Probab=32.94 E-value=51 Score=28.21 Aligned_cols=58 Identities=21% Similarity=0.213 Sum_probs=38.9
Q ss_pred CCCccHHHHHHHHHhh------hCCCCCCeEEEE-----------------CCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 19 SEDLELENFKAFCEVQ------SGYPAHEIAIEF-----------------QGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 19 ~~~~TV~dLK~~Ie~e------~gIp~~~Q~Li~-----------------~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.-++|+.+|-.+|.+- .|..-+ ..++| .|+.+-|+++||++++++-||.|-+..+.
T Consensus 59 W~datL~ELtsLvkevnpeaR~kgt~f~-fa~Vf~d~~~~~y~~RevG~t~~g~Kg~ddnktL~~~kf~iGD~lDVaI~~ 137 (151)
T KOG3391|consen 59 WMDATLRELTSLVKEVNPEARKKGTSFD-FAVVFPDKKSPRYIVREVGTTCLGRKGIDDNKTLQQTKFEIGDYLDVAITP 137 (151)
T ss_pred hhhhhHHHHHHHHHHcCHHHhccCceEE-EEEEeccCCCCCceeeeecccccCcccCCccchhhhCCccccceEEEEecC
Confidence 3467899998888762 122211 12222 35555555699999999999999998876
Q ss_pred CC
Q psy15482 76 SY 77 (302)
Q Consensus 76 ~~ 77 (302)
+.
T Consensus 138 p~ 139 (151)
T KOG3391|consen 138 PN 139 (151)
T ss_pred cc
Confidence 54
No 215
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=32.87 E-value=2.3e+02 Score=21.73 Aligned_cols=42 Identities=29% Similarity=0.329 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy15482 151 FERILRQQYEEREKREQLRLRMLNAHPFDTEAQKLIAEEIKK 192 (302)
Q Consensus 151 f~~~lr~~~~~~~~~e~~~~~~l~~DP~d~e~Q~~i~e~i~q 192 (302)
|+.-+++...+.+....+...++.++|+|++.=+.+.+.|..
T Consensus 57 ~~~~~~~~r~~~~~~r~~l~~ll~~~~~D~~~i~a~~~~~~~ 98 (125)
T PF13801_consen 57 FRQEMRALRQELRAARQELRALLAAPPPDEAAIEALLEEIRE 98 (125)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCCSSS-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHH
Confidence 333333333333333344557788999999976666666654
No 216
>PF06487 SAP18: Sin3 associated polypeptide p18 (SAP18); InterPro: IPR010516 This family consists of several eukaryotic Sin3 associated polypeptide p18 (SAP18) sequences. SAP18 is known to be a component of the Sin3-containing complex, which is responsible for the repression of transcription via the modification of histone polypeptides []. SAP18 is also present in the ASAP complex which is thought to be involved in the regulation of splicing during the execution of programmed cell death [].; PDB: 2HDE_A 4A90_A 4A6Q_A 4A8X_C.
Probab=32.55 E-value=58 Score=27.03 Aligned_cols=59 Identities=17% Similarity=0.159 Sum_probs=35.5
Q ss_pred eEEEEEeCCCccHHHHHHHHHhhhCCCCC-------CeEEE-----------------ECCeecCCCcccccccCCCCCC
Q psy15482 12 ECFLLDVSEDLELENFKAFCEVQSGYPAH-------EIAIE-----------------FQGVLLQDNKKTLKTHGIQDGD 67 (302)
Q Consensus 12 ~~~~leV~~~~TV~dLK~~Ie~e~gIp~~-------~Q~Li-----------------~~Gk~L~D~~~TL~~~gI~dgd 67 (302)
..+.|-...|.|+.+|-.+|... .|+. ..+++ +.|+...|+++||.+++..-||
T Consensus 37 ~elqIYtW~d~TLrEL~~Lik~~--~~~~r~~~tr~~F~~VypD~~~~r~~~kdlGsv~~g~~~~d~~kTL~~~~F~iGD 114 (120)
T PF06487_consen 37 NELQIYTWMDATLRELADLIKDV--NPPARRRGTRLSFRLVYPDTRSGRYVSKDLGSVVSGRKGPDDNKTLADLRFVIGD 114 (120)
T ss_dssp TEEEEEE-TT-BHHHHHHHHHHH---HHHHSTT-EEEEEEEEECTTTTCEEEEEEEEEETTB--TTTTSBCGGGT--TT-
T ss_pred CeeEEEEcccCCHHHHHHHHHHh--CcccCCCCCEEEEEEEeecCCCCceeeecCCeEECCCCCCCcccCHhhCCcccCC
Confidence 45567788999999999998873 1110 12333 3355545667999999999999
Q ss_pred EEEEE
Q psy15482 68 LVLLK 72 (302)
Q Consensus 68 ~l~L~ 72 (302)
.|.+.
T Consensus 115 yidva 119 (120)
T PF06487_consen 115 YIDVA 119 (120)
T ss_dssp EEEEE
T ss_pred EEEEe
Confidence 99764
No 217
>PRK05738 rplW 50S ribosomal protein L23; Reviewed
Probab=31.99 E-value=69 Score=25.16 Aligned_cols=40 Identities=18% Similarity=0.295 Sum_probs=33.9
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeE-EEECCee
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIA-IEFQGVL 50 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~-Li~~Gk~ 50 (302)
...+++.|+++.|=.++|..|+.-+|+++..-. +.+.|+.
T Consensus 20 ~n~~~F~V~~~a~K~eIK~aie~lf~VkV~~VnT~~~~gk~ 60 (92)
T PRK05738 20 QNKYVFEVAPDATKPEIKAAVEKLFGVKVESVNTLNVKGKT 60 (92)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHcCCceeEEEEEEeCCce
Confidence 578999999999999999999999999996654 4466654
No 218
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=30.73 E-value=77 Score=29.64 Aligned_cols=70 Identities=10% Similarity=0.185 Sum_probs=44.6
Q ss_pred EEEEEEEcC--eeEEEEEcCCCccccCCH-HHHH-HcCchhhhhcc----c------cceecccCcceeeeEEEEeeeee
Q psy15482 216 LYINCKVNG--YPVKAFIDSGAQTTIMSL-ACAE-RVNIMRLLDTR----W------AGIAKGVGVQKILGRIHMAIAEL 281 (302)
Q Consensus 216 lyi~~~ing--~~v~a~vDsGaq~~i~s~-~~a~-~~gl~~~~d~~----~------~~~a~gvg~~~i~G~i~~~~~~i 281 (302)
-|+++.|-. +++.++||||.-.+-+.. .|.. .|+-....|.. | .....|-| .+.|.+..-.|.+
T Consensus 4 y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~~C~~~~C~~~~~f~~~~SsT~~~~~~~~~~~Yg~G--s~~G~~~~D~i~~ 81 (318)
T cd05477 4 YYGEISIGTPPQNFLVLFDTGSSNLWVPSVLCQSQACTNHTKFNPSQSSTYSTNGETFSLQYGSG--SLTGIFGYDTVTV 81 (318)
T ss_pred EEEEEEECCCCcEEEEEEeCCCccEEEccCCCCCccccccCCCCcccCCCceECCcEEEEEECCc--EEEEEEEeeEEEE
Confidence 467788874 889999999987766543 3432 46543334321 1 12233444 3678888888889
Q ss_pred cccccc
Q psy15482 282 TNFNGD 287 (302)
Q Consensus 282 g~~~~~ 287 (302)
|+..+.
T Consensus 82 g~~~i~ 87 (318)
T cd05477 82 QGIIIT 87 (318)
T ss_pred CCEEEc
Confidence 887664
No 219
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which
Probab=30.66 E-value=57 Score=30.54 Aligned_cols=29 Identities=21% Similarity=0.356 Sum_probs=24.0
Q ss_pred EEEcCeeE------EEEEcCCCccccCCHHHHHHc
Q psy15482 220 CKVNGYPV------KAFIDSGAQTTIMSLACAERV 248 (302)
Q Consensus 220 ~~ing~~v------~a~vDsGaq~~i~s~~~a~~~ 248 (302)
+.|||..+ .|+||||...+.++....++.
T Consensus 194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l 228 (317)
T cd05478 194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANI 228 (317)
T ss_pred EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHH
Confidence 56788765 589999999999999987653
No 220
>PF01606 Arteri_env: Arterivirus envelope protein; InterPro: IPR002556 This family consists of viral envelope proteins from the Arteriviridae; this includes Porcine reproductive and respiratory syndrome virus (PRRSV) envelope protein GP3 and Lactate dehydrogenase-elevating virus (LDV) structural glycoprotein. Arteriviruses consists of positive ssRNA and do not have a DNA stage.
Probab=30.10 E-value=36 Score=30.27 Aligned_cols=38 Identities=32% Similarity=0.527 Sum_probs=26.5
Q ss_pred HHHHHHhCCCCc--ceeeEEEEEEEEcCeeEEEEEcCCCcccc
Q psy15482 199 MEAAMEYNPETF--GTVVMLYINCKVNGYPVKAFIDSGAQTTI 239 (302)
Q Consensus 199 ~~~A~E~~PE~f--~~v~mlyi~~~ing~~v~a~vDsGaq~~i 239 (302)
+-.|..++||.| +.|+.+||+.+ .+-+-|.-| |-..|+
T Consensus 118 fsyaaqfhPEiFgiGNVs~V~vd~~--hq~iCa~hd-g~NsT~ 157 (214)
T PF01606_consen 118 FSYAAQFHPEIFGIGNVSRVYVDIK--HQFICAEHD-GQNSTL 157 (214)
T ss_pred HHHHHhhChhhhccCceeEEEEecc--ceEEEEeec-CCCCcc
Confidence 457888999999 99999999843 334444444 544444
No 221
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=29.94 E-value=1.1e+02 Score=30.79 Aligned_cols=72 Identities=19% Similarity=0.302 Sum_probs=48.5
Q ss_pred eEEEEEEEEc--CeeEEEEEcCCCccccCCH-HHH-HHcCchhhhhccc----------cceecccCcceeeeEEEEeee
Q psy15482 214 VMLYINCKVN--GYPVKAFIDSGAQTTIMSL-ACA-ERVNIMRLLDTRW----------AGIAKGVGVQKILGRIHMAIA 279 (302)
Q Consensus 214 ~mlyi~~~in--g~~v~a~vDsGaq~~i~s~-~~a-~~~gl~~~~d~~~----------~~~a~gvg~~~i~G~i~~~~~ 279 (302)
.+-|.+++|. +++++.++|||.-.+-+.. .|. ..|.-....|.+- -.+..|-| .+.|.+..-.+
T Consensus 137 ~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C~~~~C~~~~~yd~s~SsT~~~~~~~~~i~YG~G--sv~G~~~~Dtv 214 (450)
T PTZ00013 137 IMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCDSIGCSIKNLYDSSKSKSYEKDGTKVDITYGSG--TVKGFFSKDLV 214 (450)
T ss_pred CEEEEEEEECCCCeEEEEEEeCCCCceEEecccCCccccccCCCccCccCcccccCCcEEEEEECCc--eEEEEEEEEEE
Confidence 3678888886 7889999999998887653 443 2354333343211 12344544 37899999999
Q ss_pred eecccccc
Q psy15482 280 ELTNFNGD 287 (302)
Q Consensus 280 ~ig~~~~~ 287 (302)
.||+..++
T Consensus 215 ~iG~~~~~ 222 (450)
T PTZ00013 215 TLGHLSMP 222 (450)
T ss_pred EECCEEEc
Confidence 99987664
No 222
>PRK10872 relA (p)ppGpp synthetase I/GTP pyrophosphokinase; Provisional
Probab=29.52 E-value=2.1e+02 Score=30.99 Aligned_cols=64 Identities=17% Similarity=0.174 Sum_probs=47.1
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEecC
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLGS 76 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~~ 76 (302)
.|+|-+ ..|+ .+.++...|+-||--.|..+.|--..-= ..||+...- ++-+++||+|.+.....
T Consensus 405 ~V~VfT-PkG~--~~~Lp~gaT~lDfAy~iHt~iG~~~~gA--kvng~~v~l------~~~L~~GD~VeIits~~ 468 (743)
T PRK10872 405 RVYVFT-PKGD--VVDLPAGSTPLDFAYHIHSDVGHRCIGA--KIGGRIVPF------TYQLQMGDQIEIITQKQ 468 (743)
T ss_pred eEEEEC-CCCC--eEEcCCCCcHHHHHHHHhHHHHhhceEE--EECCEECCC------CcCCCCCCEEEEEeCCC
Confidence 477878 6676 4678889999999999998877544222 368876643 35789999999986543
No 223
>PF14847 Ras_bdg_2: Ras-binding domain of Byr2; PDB: 1I35_A 1K8R_B.
Probab=29.10 E-value=2.5e+02 Score=22.72 Aligned_cols=45 Identities=13% Similarity=0.124 Sum_probs=31.6
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCC---CCeEEEECC
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPA---HEIAIEFQG 48 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~---~~Q~Li~~G 48 (302)
|+|.. .+|.+.+|.|..-.+-.++|..+=.++|++. +--..+.+|
T Consensus 3 i~~I~-~dG~tk~VNV~~c~~a~eI~~rvLKKfg~~~~~~~~~~~v~d~ 50 (105)
T PF14847_consen 3 IRFIL-EDGSTKTVNVSGCFNAQEIKRRVLKKFGLPEHPRNYCFYVLDG 50 (105)
T ss_dssp EEEEE-TTTEEEEEE--S--HHHHHHHHHHHHHTSS--CCCEEEEEE-S
T ss_pred EEEEC-CCCcEEEEEECCCCCHHHHHHHHHHHcCCccccccceEEEecc
Confidence 56666 7999999999999999999999999999987 323444556
No 224
>PTZ00147 plasmepsin-1; Provisional
Probab=28.49 E-value=1.1e+02 Score=30.79 Aligned_cols=72 Identities=21% Similarity=0.329 Sum_probs=46.6
Q ss_pred eEEEEEEEEc--CeeEEEEEcCCCccccC-CHHHH-HHcCchhhhhccc----------cceecccCcceeeeEEEEeee
Q psy15482 214 VMLYINCKVN--GYPVKAFIDSGAQTTIM-SLACA-ERVNIMRLLDTRW----------AGIAKGVGVQKILGRIHMAIA 279 (302)
Q Consensus 214 ~mlyi~~~in--g~~v~a~vDsGaq~~i~-s~~~a-~~~gl~~~~d~~~----------~~~a~gvg~~~i~G~i~~~~~ 279 (302)
.+-|.+++|- +++.+.++|||.-.+.+ |..|. ..|.-....|..- -.+..|-| .+.|.+-.-.|
T Consensus 138 ~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~C~~~~C~~~~~yd~s~SsT~~~~~~~f~i~Yg~G--svsG~~~~DtV 215 (453)
T PTZ00147 138 VMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIKCTTEGCETKNLYDSSKSKTYEKDGTKVEMNYVSG--TVSGFFSKDLV 215 (453)
T ss_pred CEEEEEEEECCCCeEEEEEEeCCCCcEEEeecCCCcccccCCCccCCccCcceEECCCEEEEEeCCC--CEEEEEEEEEE
Confidence 4567899997 78899999999776655 44564 2455444444321 12234544 36788888888
Q ss_pred eecccccc
Q psy15482 280 ELTNFNGD 287 (302)
Q Consensus 280 ~ig~~~~~ 287 (302)
.||+..+.
T Consensus 216 tiG~~~v~ 223 (453)
T PTZ00147 216 TIGNLSVP 223 (453)
T ss_pred EECCEEEE
Confidence 88887553
No 225
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=28.26 E-value=1e+02 Score=23.60 Aligned_cols=34 Identities=9% Similarity=0.020 Sum_probs=30.0
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI 44 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L 44 (302)
...+++.|+++.|=.++|..|+..+|+.+..-.-
T Consensus 14 ~n~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt 47 (77)
T TIGR03636 14 ENKLTFIVDRKATKGDIKRAVEKLFDVKVEKVNT 47 (77)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHhCCceEEEEe
Confidence 3689999999999999999999999998866544
No 226
>PRK10455 periplasmic protein; Reviewed
Probab=27.72 E-value=3.8e+02 Score=23.20 Aligned_cols=12 Identities=17% Similarity=0.210 Sum_probs=5.2
Q ss_pred CCCHHHHHHHHH
Q psy15482 177 PFDTEAQKLIAE 188 (302)
Q Consensus 177 P~d~e~Q~~i~e 188 (302)
-++||-++.+.+
T Consensus 129 vLTPEQr~q~~~ 140 (161)
T PRK10455 129 VLTPEQKKQFNA 140 (161)
T ss_pred hCCHHHHHHHHH
Confidence 344444444433
No 227
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=27.65 E-value=90 Score=24.26 Aligned_cols=34 Identities=9% Similarity=0.013 Sum_probs=30.3
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI 44 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L 44 (302)
...+++.|+++.+=.++|..|+..+|+.+..-.-
T Consensus 21 ~n~y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT 54 (84)
T PRK14548 21 ENKLTFIVDRRATKPDIKRAVEELFDVKVEKVNT 54 (84)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHhCCceEEEEe
Confidence 4689999999999999999999999999876554
No 228
>PF02192 PI3K_p85B: PI3-kinase family, p85-binding domain; InterPro: IPR003113 This is the region of the p110 phosphatidylinositol 3-kinase (PI3-Kinase) that binds the p85 subunit.; GO: 0046934 phosphatidylinositol-4,5-bisphosphate 3-kinase activity, 0007165 signal transduction, 0005942 phosphatidylinositol 3-kinase complex; PDB: 3HIZ_A 3HHM_A 2RD0_A 4A55_A 2Y3A_A 2V1Y_A.
Probab=27.60 E-value=71 Score=24.49 Aligned_cols=21 Identities=29% Similarity=0.180 Sum_probs=17.5
Q ss_pred EEEEeCCCccHHHHHHHHHhh
Q psy15482 14 FLLDVSEDLELENFKAFCEVQ 34 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e 34 (302)
++++|+.+.|+.++|+.+-.+
T Consensus 2 i~l~~~~~~Tl~~iK~~lw~~ 22 (78)
T PF02192_consen 2 IPLRVSRDATLSEIKEELWEE 22 (78)
T ss_dssp EEEEEETT-BHHHHHHHHHHH
T ss_pred eEEEccCcCcHHHHHHHHHHH
Confidence 578999999999999977765
No 229
>KOG2507|consensus
Probab=27.30 E-value=1.1e+02 Score=31.09 Aligned_cols=75 Identities=8% Similarity=0.087 Sum_probs=61.7
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEE--ECCeec--CCCcccccccCCCCCCEEEEEEecCC
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIE--FQGVLL--QDNKKTLKTHGIQDGDLVLLKHLGSY 77 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li--~~Gk~L--~D~~~TL~~~gI~dgd~l~L~~~~~~ 77 (302)
+|-|+. .+|..|+=..+.+.-+..++..|...-+|.....-|- |=-|+. +|.++||.++.+-+...|+|..+..+
T Consensus 316 rLqiRL-PdGssfte~Fps~~vL~~vr~yvrq~~~i~~g~f~LatpyPRReft~eDy~KtllEl~L~psaalvvlpk~r~ 394 (506)
T KOG2507|consen 316 RLQIRL-PDGSSFTEKFPSTSVLRMVRDYVRQNQTIGLGAFDLATPYPRREFTDEDYDKTLLELRLFPSAALVVLPKKRA 394 (506)
T ss_pred EEEEec-CCccchhhcCCcchHHHHHHHHHHhcccccccceeeccccccccccchhhhhhHHHhccCCcceEEEEecCCc
Confidence 567788 8899998888888999999999999888888777763 666666 45568999999999999888877654
No 230
>COG3159 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.91 E-value=2.7e+02 Score=25.53 Aligned_cols=28 Identities=21% Similarity=0.269 Sum_probs=24.0
Q ss_pred CCHHHHHHHHhcCHHHHHHHHhcChhHHHHhh
Q psy15482 113 DDPATVRQLLLDNPKEIALLQQNNPRLADALL 144 (302)
Q Consensus 113 ~npe~iR~~~l~NP~~~~~L~~~NPeLa~al~ 144 (302)
-|++.+.+.+..+|++. .+||+|.+.|.
T Consensus 4 lda~~VadyL~~hPeFf----~~h~~Ll~~L~ 31 (218)
T COG3159 4 LDAEDVADYLRQHPEFF----IQHAELLEELR 31 (218)
T ss_pred CCHHHHHHHHHhCcHHH----HhCHHHHHHcC
Confidence 47889999999999877 67999999984
No 231
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which
Probab=26.76 E-value=1.4e+02 Score=27.06 Aligned_cols=53 Identities=11% Similarity=0.197 Sum_probs=38.0
Q ss_pred EEEEEEc--CeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeeccc
Q psy15482 217 YINCKVN--GYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNF 284 (302)
Q Consensus 217 yi~~~in--g~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~ 284 (302)
|++++|. +++++..+|||...+-+.. | .| ....| ....+.|.+..-.+.||+.
T Consensus 3 ~~~i~iGtP~q~~~v~~DTGSs~~wv~~-----~--------~~-~~~Y~-dg~~~~G~~~~D~v~~g~~ 57 (265)
T cd05476 3 LVTLSIGTPPQPFSLIVDTGSDLTWTQC-----C--------SY-EYSYG-DGSSTSGVLATETFTFGDS 57 (265)
T ss_pred EEEEecCCCCcceEEEecCCCCCEEEcC-----C--------ce-EeEeC-CCceeeeeEEEEEEEecCC
Confidence 7888888 6789999999987777652 3 12 12333 2446788888888888887
No 232
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate r
Probab=26.32 E-value=1.1e+02 Score=28.91 Aligned_cols=69 Identities=16% Similarity=0.246 Sum_probs=45.3
Q ss_pred EEEEEEEc--CeeEEEEEcCCCccccCCH-HHH---HHcCchhhhhcc----c------cceecccCcceeeeEEEEeee
Q psy15482 216 LYINCKVN--GYPVKAFIDSGAQTTIMSL-ACA---ERVNIMRLLDTR----W------AGIAKGVGVQKILGRIHMAIA 279 (302)
Q Consensus 216 lyi~~~in--g~~v~a~vDsGaq~~i~s~-~~a---~~~gl~~~~d~~----~------~~~a~gvg~~~i~G~i~~~~~ 279 (302)
-|++++|- +++++.+||||--.+.+.. .|. ..|.-..+.|.. | -.+..|-| .+-|.+..-.+
T Consensus 9 y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~SsT~~~~~~~~~~~Yg~g--~~~G~~~~D~v 86 (326)
T cd05487 9 YYGEIGIGTPPQTFKVVFDTGSSNLWVPSSKCSPLYTACVTHNLYDASDSSTYKENGTEFTIHYASG--TVKGFLSQDIV 86 (326)
T ss_pred EEEEEEECCCCcEEEEEEeCCccceEEccCCCcCcchhhcccCcCCCCCCeeeeECCEEEEEEeCCc--eEEEEEeeeEE
Confidence 46788887 7889999999988887753 453 356544444321 0 11233434 47888888888
Q ss_pred eeccccc
Q psy15482 280 ELTNFNG 286 (302)
Q Consensus 280 ~ig~~~~ 286 (302)
.||+..+
T Consensus 87 ~~g~~~~ 93 (326)
T cd05487 87 TVGGIPV 93 (326)
T ss_pred EECCEEe
Confidence 8887654
No 233
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=26.03 E-value=69 Score=29.60 Aligned_cols=38 Identities=29% Similarity=0.286 Sum_probs=26.3
Q ss_pred CcceeeEEEEEEEEcCeeE----------EEEEcCCCccccCCHHHHHHc
Q psy15482 209 TFGTVVMLYINCKVNGYPV----------KAFIDSGAQTTIMSLACAERV 248 (302)
Q Consensus 209 ~f~~v~mlyi~~~ing~~v----------~a~vDsGaq~~i~s~~~a~~~ 248 (302)
.|-.|.+--| .|+|..+ .|+||||...+.++.+..++.
T Consensus 146 ~~y~v~l~~i--~vg~~~~~~~~~~~~~~~~ivDSGTt~~~lp~~~~~~l 193 (299)
T cd05472 146 TFYYVGLTGI--SVGGRRLPIPPASFGAGGVIIDSGTVITRLPPSAYAAL 193 (299)
T ss_pred CeEEEeeEEE--EECCEECCCCccccCCCCeEEeCCCcceecCHHHHHHH
Confidence 3444444433 3566554 489999999999999887653
No 234
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=25.91 E-value=2.7e+02 Score=26.64 Aligned_cols=20 Identities=20% Similarity=0.277 Sum_probs=16.2
Q ss_pred cCCCCHHHHHHHHHHHHHhh
Q psy15482 175 AHPFDTEAQKLIAEEIKKNN 194 (302)
Q Consensus 175 ~DP~d~e~Q~~i~e~i~q~~ 194 (302)
.+-+|||.|+|++|.=+++.
T Consensus 299 ~~~lspeeQrK~eeKe~kk~ 318 (321)
T PF07946_consen 299 LSKLSPEEQRKYEEKERKKE 318 (321)
T ss_pred HhcCCHHHHHHHHHHHHHHh
Confidence 46689999999999877654
No 235
>PF08783 DWNN: DWNN domain; InterPro: IPR014891 The ~75-residue DWNN (Domain With No Name) domain is highly conserved through eukaryotic species but is absent in prokaryotes. The DWNN domain is found only at the N terminus of the RBBP6 family of proteins which includes: Mammalian RBBP6, a splicing-associated protein that plays a role in the induction of apoptosis and regulation of the cell cycle. Drosophila melanogaster (Fruit fly) SNAMA (something that sticks like glue), a protein that appears to play a role in apoptosis. All of the identified RBBP6 homologues include the DWNN domain, a CCHC-type zinc finger (see PDOC50158 from PROSITEDOC) and a RING-type zinc finger (see PDOC00449 from PROSITEDOC). The three domain form is found in plants, protozoa, fungi and microsporidia. The RBBP6 homologues in vertebrates, insects and worms are longer and include additional domains. In addition to forming part of the full-length RBBP6 protein, the DWNN domain is also expressed in vertebrates as a small protein containing a DWNN domain and a short C-terminal tail (RBBP6 variant 3). The DWNN domain adopts a fold similar to the ubiquitin one, characterised by two alpha-helices and four beta-sheets ordered as beta-beta-alpha-beta-alpha-beta along the sequence. The similarity of DWNN domain to ubiquitin and the presence of the RING finger suggest that the DWNN domain may act as an ubiquitin-like modifier, possibly playing a role in the regulation of the splicing machinery [, ]. ; GO: 0008270 zinc ion binding, 0005634 nucleus; PDB: 2C7H_A.
Probab=25.89 E-value=3e+02 Score=20.89 Aligned_cols=34 Identities=3% Similarity=-0.030 Sum_probs=22.1
Q ss_pred EEEeCCCCeEEEEEeCC-CccHHHHHHHHHhhhCC
Q psy15482 4 TVSTQHTNECFLLDVSE-DLELENFKAFCEVQSGY 37 (302)
Q Consensus 4 tV~~~~~g~~~~leV~~-~~TV~dLK~~Ie~e~gI 37 (302)
+-+..+.-...+|.++. ..+|.|||..|..+.++
T Consensus 2 ~YKFkS~k~~~~i~fdG~~Isv~dLKr~I~~~~~l 36 (74)
T PF08783_consen 2 HYKFKSQKDYDTITFDGTSISVFDLKREIIEKKKL 36 (74)
T ss_dssp EEEETT-SSEEEEEESSSEEEHHHHHHHHHHHHT-
T ss_pred eEEecccCCccEEEECCCeeEHHHHHHHHHHHhCC
Confidence 33443344455677774 67999999999777665
No 236
>PF10454 DUF2458: Protein of unknown function (DUF2458); InterPro: IPR018858 This entry represents a family of uncharacterised proteins.
Probab=25.54 E-value=2.6e+02 Score=23.98 Aligned_cols=10 Identities=20% Similarity=0.541 Sum_probs=5.0
Q ss_pred hcChhHHHHh
Q psy15482 134 QNNPRLADAL 143 (302)
Q Consensus 134 ~~NPeLa~al 143 (302)
.+||++...|
T Consensus 20 ~~n~~~~~~I 29 (150)
T PF10454_consen 20 AQNPEFLQRI 29 (150)
T ss_pred hcCHHHHHHH
Confidence 4455555444
No 237
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco. CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=25.47 E-value=1.5e+02 Score=27.40 Aligned_cols=57 Identities=14% Similarity=0.283 Sum_probs=37.2
Q ss_pred EEEEEEEc--CeeEEEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcceeeeEEEEeeeeeccc
Q psy15482 216 LYINCKVN--GYPVKAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQKILGRIHMAIAELTNF 284 (302)
Q Consensus 216 lyi~~~in--g~~v~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~i~G~i~~~~~~ig~~ 284 (302)
-|+++.|. .+++++++|||...+-+. | ..| ..| .+..|-| ..+.|....-.+.||+.
T Consensus 2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~--c-~~c-------~~~-~i~Yg~G-s~~~G~~~~D~v~ig~~ 60 (299)
T cd05472 2 YVVTVGLGTPARDQTVIVDTGSDLTWVQ--C-QPC-------CLY-QVSYGDG-SYTTGDLATDTLTLGSS 60 (299)
T ss_pred eEEEEecCCCCcceEEEecCCCCccccc--C-CCC-------Cee-eeEeCCC-ceEEEEEEEEEEEeCCC
Confidence 37788887 578999999998877763 2 234 112 2334433 23467777777778775
No 238
>COG4055 McrD Methyl coenzyme M reductase, subunit D [Coenzyme metabolism]
Probab=25.39 E-value=1.7e+02 Score=25.42 Aligned_cols=56 Identities=14% Similarity=0.286 Sum_probs=38.5
Q ss_pred EEEEeCCCCeEEEEEeCCCc-cHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCE
Q psy15482 3 VTVSTQHTNECFLLDVSEDL-ELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDL 68 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~-TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~ 68 (302)
+.|++ |+.+ |+++++. +++.++..|..-+-+.-+ ++-|+-+... .|+.|| +|-|.-
T Consensus 78 L~Vkv---Gri~-~eie~e~~~~e~ie~ic~e~lPf~y~----v~vG~F~r~k-pTVTDy-~KyG~~ 134 (165)
T COG4055 78 LKVKV---GRII-LEIEDEDETMEKIEEICDEMLPFGYE----VRVGKFTRRK-PTVTDY-IKYGED 134 (165)
T ss_pred EEEEe---eEEE-EEecCcHhHHHHHHHHHHHhCCCcee----eeeeeeeccC-Ccchhh-hhhCcc
Confidence 44444 5553 6777664 999999999877544433 4568888885 899999 566654
No 239
>PF00794 PI3K_rbd: PI3-kinase family, ras-binding domain; InterPro: IPR000341 Phosphatidylinositol 3-kinase (PI3K) (2.7.1.137 from EC) is an enzyme that phosphorylates phosphoinositides on the 3-hydroxyl group of the inositol ring. A subset of PI3Ks has the capacity to bind and be activated by the GTP-bound small GTPase p21Ras (Ras). PI3Ks are recognised as one of the principal effectors of Ras signalling to the cell-cycle control machinery. In the structure of the Ras-PI3K gamma complex, contacts between the two molecules are made primarily via the so-called switch I region of Ras and the PI3K RBD. The RBD fold comprises a five-stranded mixed beta-sheet, flanked by two alpha-helices. Interaction between Ras and the PI3K RBD is primarily polar in character and, as characterised by kinetic measurements, is reversible and transient [].; GO: 0016303 1-phosphatidylinositol-3-kinase activity, 0005942 phosphatidylinositol 3-kinase complex; PDB: 2RD0_A 3HIZ_A 3HHM_A 1E8W_A 1E8X_A 1E7V_A 1E90_A 1E7U_A 2Y3A_A 3L54_A ....
Probab=25.33 E-value=1.6e+02 Score=23.14 Aligned_cols=59 Identities=17% Similarity=0.213 Sum_probs=36.0
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhh--hCCCCC----CeEEEECCee--cCCCccccccc
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQ--SGYPAH----EIAIEFQGVL--LQDNKKTLKTH 61 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e--~gIp~~----~Q~Li~~Gk~--L~D~~~TL~~~ 61 (302)
.|.|.....+..+++.|+.+.|+.+|-+.+-.+ .+.... +-.|--.|+. |.++ ..|.+|
T Consensus 18 ~v~v~~~~~~~~~t~~~~~~~t~~~li~~~l~k~~~~~~~~~~~~dyvLKV~G~~EyL~g~-~~L~~y 84 (106)
T PF00794_consen 18 KVSVHFENSQQSFTFQVDPNSTPEELIAQALKKKLKDLLPPDPEDDYVLKVCGREEYLLGD-HPLSQY 84 (106)
T ss_dssp EEEEEETTCSEEEEEEEETTS-HHHHHHHHHHHHHHHTT-CHHHHGEEEEETTSSEEE-SS-S-GGGB
T ss_pred EEEEEEcCCCcEEEEEECCCCCHHHHHHHHHHHHHhhcCCcccccCEEEEecCceEEeeCC-eeeecc
Confidence 356666457888999999999999998866655 222221 4556555553 3333 566665
No 240
>COG2080 CoxS Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]
Probab=25.28 E-value=1.1e+02 Score=26.75 Aligned_cols=26 Identities=27% Similarity=0.293 Sum_probs=21.5
Q ss_pred CEEEEEeCCCCeEEEEEeCCCccHHHHH
Q psy15482 1 MKVTVSTQHTNECFLLDVSEDLELENFK 28 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~~~~TV~dLK 28 (302)
|.|+++. +|+.+.++|+|++++.++-
T Consensus 2 ~~i~ltv--NG~~~~~~~~p~~~Ll~~L 27 (156)
T COG2080 2 MPITLTV--NGEPVELDVDPRTPLLDVL 27 (156)
T ss_pred CcEEEEE--CCeEEEEEeCCCChHHHHH
Confidence 4566666 8999999999999988864
No 241
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=24.63 E-value=85 Score=29.35 Aligned_cols=28 Identities=18% Similarity=0.339 Sum_probs=22.8
Q ss_pred EEEcCeeE-------EEEEcCCCccccCCHHHHHH
Q psy15482 220 CKVNGYPV-------KAFIDSGAQTTIMSLACAER 247 (302)
Q Consensus 220 ~~ing~~v-------~a~vDsGaq~~i~s~~~a~~ 247 (302)
+.|+|..+ .|+||||...+.++....++
T Consensus 188 i~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~ 222 (318)
T cd05477 188 FQINGQATGWCSQGCQAIVDTGTSLLTAPQQVMST 222 (318)
T ss_pred EEECCEEecccCCCceeeECCCCccEECCHHHHHH
Confidence 45777653 58999999999999988775
No 242
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5. Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=24.63 E-value=1.4e+02 Score=28.02 Aligned_cols=72 Identities=11% Similarity=0.159 Sum_probs=46.7
Q ss_pred EEEEEEEEc--CeeEEEEEcCCCccccCCHHHHHHcCch--hhhhc----c----------------------ccceecc
Q psy15482 215 MLYINCKVN--GYPVKAFIDSGAQTTIMSLACAERVNIM--RLLDT----R----------------------WAGIAKG 264 (302)
Q Consensus 215 mlyi~~~in--g~~v~a~vDsGaq~~i~s~~~a~~~gl~--~~~d~----~----------------------~~~~a~g 264 (302)
.-|+++.|- ++++..++|||.-.+-+...-+..|+.. ...|. . -..+..|
T Consensus 3 ~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~~C~~c~~~~~~~y~~~~Sst~~~~~C~~~~c~~~~~~~~~~~~~~i~Y~ 82 (326)
T cd06096 3 YYFIDIFIGNPPQKQSLILDTGSSSLSFPCSQCKNCGIHMEPPYNLNNSITSSILYCDCNKCCYCLSCLNNKCEYSISYS 82 (326)
T ss_pred eEEEEEEecCCCeEEEEEEeCCCCceEEecCCCCCcCCCCCCCcCcccccccccccCCCccccccCcCCCCcCcEEEEEC
Confidence 347888888 5889999999988877765544567643 11111 0 1122334
Q ss_pred cCcceeeeEEEEeeeeecccccc
Q psy15482 265 VGVQKILGRIHMAIAELTNFNGD 287 (302)
Q Consensus 265 vg~~~i~G~i~~~~~~ig~~~~~ 287 (302)
-| ..+.|....-.+.||+..++
T Consensus 83 ~g-s~~~G~~~~D~v~lg~~~~~ 104 (326)
T cd06096 83 EG-SSISGFYFSDFVSFESYLNS 104 (326)
T ss_pred CC-CceeeEEEEEEEEeccCCCC
Confidence 33 35789888889999987665
No 243
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=24.32 E-value=89 Score=29.53 Aligned_cols=71 Identities=14% Similarity=0.230 Sum_probs=44.2
Q ss_pred eEEEEEEEEc--CeeEEEEEcCCCccccCC-HHHH---HHcCchhhhhcc----cc------ceecccCcceeeeEEEEe
Q psy15482 214 VMLYINCKVN--GYPVKAFIDSGAQTTIMS-LACA---ERVNIMRLLDTR----WA------GIAKGVGVQKILGRIHMA 277 (302)
Q Consensus 214 ~mlyi~~~in--g~~v~a~vDsGaq~~i~s-~~~a---~~~gl~~~~d~~----~~------~~a~gvg~~~i~G~i~~~ 277 (302)
..-|++++|- ++++++++|||.-.+-+. ..|. ..|+.....|.. |+ .+..|-|. +-|.+-.-
T Consensus 10 ~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~~C~~~~~~c~~~~~y~~~~Sst~~~~~~~~~i~Y~~g~--~~G~~~~D 87 (329)
T cd05485 10 AQYYGVITIGTPPQSFKVVFDTGSSNLWVPSKKCSWTNIACLLHNKYDSTKSSTYKKNGTEFAIQYGSGS--LSGFLSTD 87 (329)
T ss_pred CeEEEEEEECCCCcEEEEEEcCCCccEEEecCCCCCCCccccCCCeECCcCCCCeEECCeEEEEEECCce--EEEEEecC
Confidence 3568899998 588999999998876554 4443 356654333210 11 12344443 56777777
Q ss_pred eeeeccccc
Q psy15482 278 IAELTNFNG 286 (302)
Q Consensus 278 ~~~ig~~~~ 286 (302)
.+.||+..+
T Consensus 88 ~v~ig~~~~ 96 (329)
T cd05485 88 TVSVGGVSV 96 (329)
T ss_pred cEEECCEEE
Confidence 777776654
No 244
>cd06404 PB1_aPKC PB1 domain is an essential modular domain of the atypical protein kinase C (aPKC) which in complex with Par6 and Par3 proteins is crucial for establishment of apical-basal polarity of animal cells. PB1 domain is a modular domain mediating specific protein-protein interaction which play roles in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi,
Probab=24.21 E-value=1.7e+02 Score=22.90 Aligned_cols=44 Identities=9% Similarity=0.225 Sum_probs=34.1
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCC-CeEEEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAH-EIAIEF 46 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~-~Q~Li~ 46 (302)
|+||..-+|.++-..++++.|.+.|.+.+..-+..+.+ ...+.|
T Consensus 1 i~~K~~y~gdi~it~~d~~~s~e~L~~~v~~~c~~~~~q~ft~kw 45 (83)
T cd06404 1 VRVKAAYNGDIMITSIDPSISLEELCNEVRDMCRFHNDQPFTLKW 45 (83)
T ss_pred CeEEEEecCcEEEEEcCCCcCHHHHHHHHHHHhCCCCCCcEEEEE
Confidence 35555448999999999999999999999999998764 334433
No 245
>cd01776 Rin1_RA Ubiquitin domain of RIN1 RAS effector. Rin1_RA RIN1 is a RAS effector that binds with specificity and high affinity to activated RAS via its carboxy-terminal RA (RAS-associated) domain. RIN1 competes directly with RAF1 for RAS binding and is thought to divert signaling away from RAF and the MAPK pathway while also shunting RAS signals through alternate pathways. In addition, Rin1 and Rin2 are Rab5-binding proteins, binding preferentially to the GTP-bound form, that enhance the GDP-GTP exchange reaction on Rab5 that regulate the docking and fusion processes of endocytic vesicles. In addition to the RA domain, RIN1 and RIN2 have an SH2 (Src homology 2) domain, a proline-rich SH3 domain, and a Vps9 domain.
Probab=24.10 E-value=1.2e+02 Score=23.71 Aligned_cols=37 Identities=14% Similarity=0.101 Sum_probs=30.2
Q ss_pred CCCeE-EEEEeCCCccHHHHHHHHHhhhCCC-CCCeEEE
Q psy15482 9 HTNEC-FLLDVSEDLELENFKAFCEVQSGYP-AHEIAIE 45 (302)
Q Consensus 9 ~~g~~-~~leV~~~~TV~dLK~~Ie~e~gIp-~~~Q~Li 45 (302)
.+|.+ .+|-|.|++|.++|=.+|+.++.|. |+.-.|+
T Consensus 10 ~sgct~KTL~V~P~~tt~~vc~lcA~Kf~V~qPe~y~LF 48 (87)
T cd01776 10 NSGCTGKTLLVRPYITTEDVCQLCAEKFKVTQPEEYSLF 48 (87)
T ss_pred CCCceeeeeecCCCCcHHHHHHHHHHHhccCChhheeEE
Confidence 34444 4899999999999999999999985 5666665
No 246
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=23.94 E-value=85 Score=29.35 Aligned_cols=29 Identities=17% Similarity=0.290 Sum_probs=22.7
Q ss_pred EEEcCeeE------EEEEcCCCccccCCHHHHHHc
Q psy15482 220 CKVNGYPV------KAFIDSGAQTTIMSLACAERV 248 (302)
Q Consensus 220 ~~ing~~v------~a~vDsGaq~~i~s~~~a~~~ 248 (302)
++|+|..+ .|+||||.-.+.++.+...++
T Consensus 186 i~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l 220 (316)
T cd05486 186 IQVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQL 220 (316)
T ss_pred EEEecceEecCCCCEEEECCCcchhhcCHHHHHHH
Confidence 56677654 599999999999998876554
No 247
>PF13856 Gifsy-2: ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=23.72 E-value=81 Score=24.54 Aligned_cols=19 Identities=26% Similarity=0.401 Sum_probs=15.5
Q ss_pred EEEEEcCeeEEEEEcCCCc
Q psy15482 218 INCKVNGYPVKAFIDSGAQ 236 (302)
Q Consensus 218 i~~~ing~~v~a~vDsGaq 236 (302)
-.+.|||.+++|.+|....
T Consensus 22 ~~~~InG~~~~~v~d~~~~ 40 (95)
T PF13856_consen 22 EEHTINGKPYRAVFDEPEL 40 (95)
T ss_dssp EEEEETTEEEEEEEGGGT-
T ss_pred CeEEECCEEEEEEECCchh
Confidence 4678999999999997643
No 248
>COG0089 RplW Ribosomal protein L23 [Translation, ribosomal structure and biogenesis]
Probab=23.51 E-value=1e+02 Score=24.55 Aligned_cols=39 Identities=18% Similarity=0.213 Sum_probs=32.3
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE-EECCe
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI-EFQGV 49 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L-i~~Gk 49 (302)
...+++.|+++.|=.++|+.++..+|+-+..-.. +..|+
T Consensus 21 ~nk~vF~V~~~AtK~~IK~AvE~lF~VkV~kVNTl~~k~k 60 (94)
T COG0089 21 ENKYVFIVDPDATKPEIKAAVEELFGVKVEKVNTLNTKGK 60 (94)
T ss_pred CCEEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEEEeCCc
Confidence 4788999999999999999999999998866543 44554
No 249
>cd01666 TGS_DRG_C TGS_DRG_C: DRG (developmentally regulated GTP-binding protein) represents a family of GTP-binding proteins that includes two members, DRG1 and DRG2. DRG1 and DRG2 have a C-terminal TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) with a predominantly beta-sheet structure. The function of TGS is unknown but its presence in two types of regulatory proteins (the DRG GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
Probab=23.41 E-value=2.1e+02 Score=21.61 Aligned_cols=53 Identities=9% Similarity=-0.036 Sum_probs=33.2
Q ss_pred EEEEeCCCccHHHHHHHHHhhhCCCCCCeEE-----EECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 14 FLLDVSEDLELENFKAFCEVQSGYPAHEIAI-----EFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L-----i~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
-++.+....||+|+-..|....+-...--.+ -|+|+...- ++-++|||+|.+.
T Consensus 17 ~~liL~~GaTV~D~a~~iH~di~~~f~~A~v~g~s~~~~gq~Vgl------~~~L~d~DvVeI~ 74 (75)
T cd01666 17 EPVILRRGSTVEDVCNKIHKDLVKQFKYALVWGSSVKHSPQRVGL------DHVLEDEDVVQIV 74 (75)
T ss_pred CCEEECCCCCHHHHHHHHHHHHHHhCCeeEEeccCCcCCCeECCC------CCEecCCCEEEEe
Confidence 3677888999999999998653221111111 146655532 3357889998774
No 250
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=23.34 E-value=91 Score=22.56 Aligned_cols=20 Identities=30% Similarity=0.431 Sum_probs=9.7
Q ss_pred hcCh----hHHHHhhcCChHHHHH
Q psy15482 134 QNNP----RLADALLNGSPEDFER 153 (302)
Q Consensus 134 ~~NP----eLa~al~~~d~e~f~~ 153 (302)
++|| .+-+-|...||+.++.
T Consensus 18 q~NP~lL~~lLqql~~~nP~l~q~ 41 (59)
T PF09280_consen 18 QQNPQLLPPLLQQLGQSNPQLLQL 41 (59)
T ss_dssp HC-GGGHHHHHHHHHCCSHHHHHH
T ss_pred HHCHHHHHHHHHHHhccCHHHHHH
Confidence 5665 3555555556654443
No 251
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases. They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=23.27 E-value=1e+02 Score=28.89 Aligned_cols=30 Identities=17% Similarity=0.238 Sum_probs=23.6
Q ss_pred EEEEcCeeE-------EEEEcCCCccccCCHHHHHHc
Q psy15482 219 NCKVNGYPV-------KAFIDSGAQTTIMSLACAERV 248 (302)
Q Consensus 219 ~~~ing~~v-------~a~vDsGaq~~i~s~~~a~~~ 248 (302)
.++|||..+ .|+||||.-.+.++.+..+..
T Consensus 196 ~i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i 232 (317)
T cd06098 196 DVLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQI 232 (317)
T ss_pred eEEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhh
Confidence 356788653 599999999999999877643
No 252
>PF02824 TGS: TGS domain; InterPro: IPR004095 The TGS domain is present in a number of enzymes, for example, in threonyl-tRNA synthetase (ThrRS), GTPase, and guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase (SpoT) []. The TGS domain is also present at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochaete Borrelia burgdorferi). TGS is a small domain that consists of ~50 amino acid residues and is predicted to possess a predominantly beta-sheet structure. There is no direct information on the functions of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role []. ; PDB: 1WXQ_A 1NYR_B 1NYQ_B 2KMM_A 1WWT_A 1TKE_A 1TKG_A 1TJE_A 1QF6_A 1TKY_A ....
Probab=23.22 E-value=2.1e+02 Score=20.27 Aligned_cols=59 Identities=19% Similarity=0.193 Sum_probs=37.9
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLK 72 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~ 72 (302)
|+|.+ .+|+... ++...|+.|+-..|....+=.. -.-..||+...- ++-+++||+|.+.
T Consensus 1 I~v~l-pdG~~~~--~~~g~T~~d~A~~I~~~l~~~~--~~A~Vng~~vdl------~~~L~~~d~v~ii 59 (60)
T PF02824_consen 1 IRVYL-PDGSIKE--LPEGSTVLDVAYSIHSSLAKRA--VAAKVNGQLVDL------DHPLEDGDVVEII 59 (60)
T ss_dssp EEEEE-TTSCEEE--EETTBBHHHHHHHHSHHHHHCE--EEEEETTEEEET------TSBB-SSEEEEEE
T ss_pred CEEEC-CCCCeee--CCCCCCHHHHHHHHCHHHHhhe--eEEEEcCEECCC------CCCcCCCCEEEEE
Confidence 46667 6677654 7788999999999987643221 122357866533 2357788888764
No 253
>TIGR02867 spore_II_P stage II sporulation protein P. Stage II sporulation protein P is a protein of the endospore formation program in a number of lineages in the Firmicutes (low-GC Gram-positive bacteria). It is expressed in the mother cell compartment, under control of Sigma-E. SpoIIP, along with SpoIIM and SpoIID, is one of three major proteins involved in engulfment of the forespore by the mother cell. This protein family is named for the single member in Bacillus subtilis, although most sporulating bacteria have two members.
Probab=23.16 E-value=1.4e+02 Score=26.88 Aligned_cols=101 Identities=16% Similarity=0.194 Sum_probs=62.1
Q ss_pred HHHHHHhCC--CCcceeeEE-----EEEEEEcCeeE-EEEEcCCCccccCCHHHHHHcCchhhhhccccceecccCcce-
Q psy15482 199 MEAAMEYNP--ETFGTVVML-----YINCKVNGYPV-KAFIDSGAQTTIMSLACAERVNIMRLLDTRWAGIAKGVGVQK- 269 (302)
Q Consensus 199 ~~~A~E~~P--E~f~~v~ml-----yi~~~ing~~v-~a~vDsGaq~~i~s~~~a~~~gl~~~~d~~~~~~a~gvg~~~- 269 (302)
.+.+++.+| ++|.-+++= .-.++|||.++ |.++=.|....=.....+=..-|...+++.|.|..+||=.-.
T Consensus 73 v~~~l~~~p~i~~viDiHRDs~~~~~~t~~inG~~~Aki~fVvG~~np~~~~N~~fA~~l~~~~~~~yPgl~rgi~~k~~ 152 (196)
T TIGR02867 73 VKKALKENKDLKYIIDLHRDSVRRKKTTVNINGESVAKVMFVIGKNNPHFEKNLQLANKLHAKLEKKYPGLSRGIFYKDY 152 (196)
T ss_pred HHHHHHHCCCceEEEEeecCCCCCCcceEEECCEEEEEEEEEEcCCCCCHHHHHHHHHHHHHHHHhhCCCcccceEEccC
Confidence 346678888 677766663 45789999995 444444555443344455555678888999999988864310
Q ss_pred eeeE------EEEeeeeecccccchhHH--HHHHHHhh
Q psy15482 270 ILGR------IHMAIAELTNFNGDKKKA--LAALLAKS 299 (302)
Q Consensus 270 i~G~------i~~~~~~ig~~~~~~~~~--~~~~~~~~ 299 (302)
--|+ -..+-|+||+..=..+.| -+.+||+.
T Consensus 153 ~~~~YNQdl~~~alLiEiG~~~Nt~eEa~~s~~~lA~~ 190 (196)
T TIGR02867 153 GNGVYNQDLSPNAILIEVGGVDNTLEEAKRSADYLAEA 190 (196)
T ss_pred CCCcccccCCCCeEEEEECCCCCcHHHHHHHHHHHHHH
Confidence 1111 235778999885554443 33455543
No 254
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=23.16 E-value=46 Score=28.17 Aligned_cols=22 Identities=36% Similarity=0.468 Sum_probs=15.6
Q ss_pred cCeeEEEEEcCCCccccCCHHH
Q psy15482 223 NGYPVKAFIDSGAQTTIMSLAC 244 (302)
Q Consensus 223 ng~~v~a~vDsGaq~~i~s~~~ 244 (302)
-|..++|=|||||.++-++..-
T Consensus 13 ~~~~~~aKiDTGA~tSSLhA~~ 34 (138)
T PF05618_consen 13 LGLTIKAKIDTGAKTSSLHATD 34 (138)
T ss_dssp GTEEEEEEE-TT-SSEEEE-EE
T ss_pred CCCEEEEEEcCCCcccceeecc
Confidence 4667999999999999887653
No 255
>TIGR00691 spoT_relA (p)ppGpp synthetase, RelA/SpoT family. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species.
Probab=23.00 E-value=2.9e+02 Score=29.53 Aligned_cols=63 Identities=13% Similarity=0.205 Sum_probs=45.0
Q ss_pred EEEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEec
Q psy15482 2 KVTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHLG 75 (302)
Q Consensus 2 qItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~~ 75 (302)
.|+|-+ ..|+. +.++...|+.||--.|..+.|.-... -.-||+...- ++-+++||+|.+....
T Consensus 361 ~i~vfT-PkG~~--~~lp~gst~~DfAy~ih~~~g~~~~~--a~vng~~v~l------~~~l~~gd~vei~t~~ 423 (683)
T TIGR00691 361 EIYVFT-PKGDV--VELPSGSTPVDFAYAVHTDVGNKCTG--AKVNGKIVPL------DKELENGDVVEIITGK 423 (683)
T ss_pred ceEEEC-CCCeE--EEcCCCCCHHHHHHHHhHHhHhceeE--EEECCEECCC------CccCCCCCEEEEEeCC
Confidence 366667 66666 56788899999999999877654421 1257876543 3468999999998654
No 256
>TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR. Members of this family are FadR, a transcriptional regulator of fatty acid metabolism, including both biosynthesis and beta-oxidation. It is found exclusively in a subset of Gammaproteobacteria, with strictly one copy per genome. It has an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR from this family heterologously expressed in Escherichia coli show differences in regulatory response and fatty acid binding profiles. The family is nevertheless designated equivalog, as all member proteins have at least nominally the same function.
Probab=22.91 E-value=2.6e+02 Score=24.87 Aligned_cols=34 Identities=9% Similarity=0.008 Sum_probs=23.4
Q ss_pred hcCCCCHHHHHHHHHHHHHhhHHHHHHHHHHhCCCCcc
Q psy15482 174 NAHPFDTEAQKLIAEEIKKNNIEANMEAAMEYNPETFG 211 (302)
Q Consensus 174 ~~DP~d~e~Q~~i~e~i~q~~i~~n~~~A~E~~PE~f~ 211 (302)
+.|| .++.+.+.+.|.. ..+.+...++.+|+.|.
T Consensus 201 ~~d~--~~A~~~m~~H~~~--~~~~~~~~~~~~~~~~~ 234 (235)
T TIGR02812 201 AGNH--DEVPDCIRQYGIE--SGEIWHQMQDELPQDLS 234 (235)
T ss_pred cCCH--HHHHHHHHHHHHH--hhhHHHHHHhccccccc
Confidence 3455 4666677777754 45567778899999884
No 257
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two
Probab=22.90 E-value=91 Score=29.89 Aligned_cols=20 Identities=25% Similarity=0.388 Sum_probs=17.7
Q ss_pred EEEEcCCCccccCCHHHHHH
Q psy15482 228 KAFIDSGAQTTIMSLACAER 247 (302)
Q Consensus 228 ~a~vDsGaq~~i~s~~~a~~ 247 (302)
.|+||||.-.+.++.+..+.
T Consensus 213 ~~ivDSGTs~~~lp~~~~~~ 232 (364)
T cd05473 213 KAIVDSGTTNLRLPVKVFNA 232 (364)
T ss_pred cEEEeCCCcceeCCHHHHHH
Confidence 48999999999999997665
No 258
>PF12053 DUF3534: Domain of unknown function (DUF3534); InterPro: IPR021922 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 150 amino acids in length. This domain is found associated with PF00595 from PFAM. This domain has a conserved GILD sequence motif. ; PDB: 2NS5_A.
Probab=22.86 E-value=4.9e+02 Score=22.35 Aligned_cols=72 Identities=17% Similarity=0.179 Sum_probs=38.4
Q ss_pred CEEEEEeCCCCeEEEEEeC-CCccHHHHHHHHHh----hhCCCCCCe----EEEE-CCeecCCCcccccccCCCCCCEEE
Q psy15482 1 MKVTVSTQHTNECFLLDVS-EDLELENFKAFCEV----QSGYPAHEI----AIEF-QGVLLQDNKKTLKTHGIQDGDLVL 70 (302)
Q Consensus 1 MqItV~~~~~g~~~~leV~-~~~TV~dLK~~Ie~----e~gIp~~~Q----~Li~-~Gk~L~D~~~TL~~~gI~dgd~l~ 70 (302)
|||||.. ....+.|-+. .+.||.+|-..--. ..+..++.. .|.| .|-+|..+ ..|.+. +-|.|.|+
T Consensus 1 mkvtV~f--g~~~vvVPC~dg~~tV~~L~~~A~~RY~K~~~~~~~~~v~V~~l~~~dggiLd~D-D~l~dV-~dd~d~li 76 (145)
T PF12053_consen 1 MKVTVCF--GRTRVVVPCGDGQLTVRDLIQQALRRYRKAKEKDPDYWVVVHHLEYTDGGILDPD-DVLCDV-VDDRDQLI 76 (145)
T ss_dssp -EEEEEE--TTEEEEEEESSS---HHHHHHHHHHHHHHHTT--TTS-EEEEEEE-SSS-EE-TT-S-HHHH-S-TTEEEE
T ss_pred CeEEEEe--CCeEEEEEeCCCCccHHHHHHHHhHhHHHhhccCCCceEEEeeEEecCCceeccc-cceeEe-ccChhhhh
Confidence 9999999 4455677777 45899998653322 234444433 3333 56677443 677776 56888888
Q ss_pred EEEecC
Q psy15482 71 LKHLGS 76 (302)
Q Consensus 71 L~~~~~ 76 (302)
.+..-+
T Consensus 77 AvydEq 82 (145)
T PF12053_consen 77 AVYDEQ 82 (145)
T ss_dssp EEEEE-
T ss_pred eeeccc
Confidence 877653
No 259
>PF08154 NLE: NLE (NUC135) domain; InterPro: IPR012972 This domain is located N-terminal to WD40 repeats(IPR001680 from INTERPRO). It is found in the microtubule-associated protein Q12024 from SWISSPROT [].
Probab=22.53 E-value=3e+02 Score=19.90 Aligned_cols=43 Identities=9% Similarity=0.142 Sum_probs=32.4
Q ss_pred EEEeCCCccHHHHHHHHHhhh--CCCCCCeEEEECCeecCCCcccccc
Q psy15482 15 LLDVSEDLELENFKAFCEVQS--GYPAHEIAIEFQGVLLQDNKKTLKT 60 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~--gIp~~~Q~Li~~Gk~L~D~~~TL~~ 60 (302)
+++|+.+.|..+|-++|..-. .-.+-....+.+|..|.+ +|.+
T Consensus 19 ~~~VP~~~t~~~Ls~LvN~LL~~~~~~vpfdF~i~~~~lr~---sL~~ 63 (65)
T PF08154_consen 19 PISVPSNITRKELSELVNQLLDDEEEPVPFDFLINGEELRT---SLEE 63 (65)
T ss_pred CEEEeCCCCHHHHHHHHHHHhccCCCCCcEEEEECCEEeec---hHHH
Confidence 688999999999999988876 344455677789988864 4554
No 260
>PF14533 USP7_C2: Ubiquitin-specific protease C-terminal; PDB: 2YLM_A.
Probab=22.41 E-value=83 Score=28.31 Aligned_cols=31 Identities=19% Similarity=0.161 Sum_probs=22.6
Q ss_pred CeEEEEEeCCCccHHHHHHHHHhhhCCCCCC
Q psy15482 11 NECFLLDVSEDLELENFKAFCEVQSGYPAHE 41 (302)
Q Consensus 11 g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~ 41 (302)
|..|.+-|.++.|+.++|..|+..+|++-.+
T Consensus 132 GiPF~f~v~~gE~f~~tK~Rl~~rlgv~~ke 162 (213)
T PF14533_consen 132 GIPFLFVVKPGETFSDTKERLQKRLGVSDKE 162 (213)
T ss_dssp EEEEEEEEETT--HHHHHHHHHHHH---HHH
T ss_pred CCCEEEEeeCCCcHHHHHHHHHHHhCCChhh
Confidence 6677889999999999999999999998643
No 261
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=22.32 E-value=1.3e+02 Score=28.17 Aligned_cols=68 Identities=13% Similarity=0.272 Sum_probs=42.1
Q ss_pred EEEEEEc--CeeEEEEEcCCCccccC-CHHHH-HHcCchhhhhcc----c------cceecccCcceeeeEEEEeeeeec
Q psy15482 217 YINCKVN--GYPVKAFIDSGAQTTIM-SLACA-ERVNIMRLLDTR----W------AGIAKGVGVQKILGRIHMAIAELT 282 (302)
Q Consensus 217 yi~~~in--g~~v~a~vDsGaq~~i~-s~~~a-~~~gl~~~~d~~----~------~~~a~gvg~~~i~G~i~~~~~~ig 282 (302)
|++++|- ++++++++|||.-..-+ |..|. ..|.-....|.. | ..+..|-| .+.|.+-.-.+.||
T Consensus 2 ~~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~~C~~~~C~~~~~y~~~~SsT~~~~~~~~~i~Yg~g--~~~G~~~~D~v~ig 79 (316)
T cd05486 2 FGQISIGTPPQNFTVIFDTGSSNLWVPSIYCTSQACTKHNRFQPSESSTYVSNGEAFSIQYGTG--SLTGIIGIDQVTVE 79 (316)
T ss_pred eEEEEECCCCcEEEEEEcCCCccEEEecCCCCCcccCccceECCCCCcccccCCcEEEEEeCCc--EEEEEeeecEEEEC
Confidence 6778887 67899999999876655 44453 246533333321 1 12234444 36788887788888
Q ss_pred cccc
Q psy15482 283 NFNG 286 (302)
Q Consensus 283 ~~~~ 286 (302)
++.+
T Consensus 80 ~~~~ 83 (316)
T cd05486 80 GITV 83 (316)
T ss_pred CEEE
Confidence 7654
No 262
>PRK10963 hypothetical protein; Provisional
Probab=22.18 E-value=3.7e+02 Score=24.28 Aligned_cols=27 Identities=33% Similarity=0.477 Sum_probs=22.6
Q ss_pred CHHHHHHHHhcCHHHHHHHHhcChhHHHHhh
Q psy15482 114 DPATVRQLLLDNPKEIALLQQNNPRLADALL 144 (302)
Q Consensus 114 npe~iR~~~l~NP~~~~~L~~~NPeLa~al~ 144 (302)
+++.++..++.||... .++|+|-..|.
T Consensus 4 ~~~~V~~yL~~~PdFf----~~h~~Ll~~L~ 30 (223)
T PRK10963 4 DDRAVVDYLLQNPDFF----IRNARLVEQMR 30 (223)
T ss_pred CHHHHHHHHHHCchHH----hhCHHHHHhcc
Confidence 5788999999999876 67899988885
No 263
>cd01777 SNX27_RA Ubiquitin domain of SNX27 (sorting nexin protein 27). SNX27_RA SNX27 (sorting nexin protein 27) belongs to a large family of endosome-localized proteins related to sorting nexin1 which is implicated in regulating membrane traffic. The domain architecture of SNX27 includes an amino-terminal PDZ domain, a PX (PhoX homologous) domain, and a carboxy-terminal RA (RAS-associated) domain.
Probab=21.82 E-value=1.7e+02 Score=22.97 Aligned_cols=41 Identities=5% Similarity=0.066 Sum_probs=35.0
Q ss_pred EEEEeCCCCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE
Q psy15482 3 VTVSTQHTNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI 44 (302)
Q Consensus 3 ItV~~~~~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L 44 (302)
|-|-. .+|...+++|..+++.+++-+.+..+.|+|.+-...
T Consensus 4 L~V~L-pdg~~i~V~v~~s~~a~~Vleav~~kl~L~~e~~~Y 44 (87)
T cd01777 4 LRIAL-PDKATVTVRVRKNATTDQVYQALVAKAGMDSYTQNY 44 (87)
T ss_pred EEEEc-cCCCEEEEEEEEcccHHHHHHHHHHHhCCCHHHHhh
Confidence 44555 679999999999999999999999999999876643
No 264
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=21.73 E-value=83 Score=28.79 Aligned_cols=23 Identities=22% Similarity=0.263 Sum_probs=19.4
Q ss_pred eEEEEEcCCCccccCCHHHHHHc
Q psy15482 226 PVKAFIDSGAQTTIMSLACAERV 248 (302)
Q Consensus 226 ~v~a~vDsGaq~~i~s~~~a~~~ 248 (302)
...|+||||...+.++.+.+++.
T Consensus 198 ~~~~iiDSGTs~~~lP~~~~~~l 220 (278)
T cd06097 198 GFSAIADTGTTLILLPDAIVEAY 220 (278)
T ss_pred CceEEeecCCchhcCCHHHHHHH
Confidence 46799999999999999877554
No 265
>CHL00030 rpl23 ribosomal protein L23
Probab=21.66 E-value=1.4e+02 Score=23.72 Aligned_cols=40 Identities=13% Similarity=0.165 Sum_probs=32.3
Q ss_pred CCeEEEEEeCCCccHHHHHHHHHhhhCCCCCCeEE-EECCe
Q psy15482 10 TNECFLLDVSEDLELENFKAFCEVQSGYPAHEIAI-EFQGV 49 (302)
Q Consensus 10 ~g~~~~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~L-i~~Gk 49 (302)
....+++.|+++.|=.++|..||..+|+.+..-.- ...|+
T Consensus 18 e~n~y~F~V~~~anK~eIK~avE~lf~VkV~~VNt~~~~~k 58 (93)
T CHL00030 18 EKNQYTFDVDSGSTKTEIKHWIELFFGVKVIAVNSHRLPRK 58 (93)
T ss_pred HCCEEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEEEcCCC
Confidence 35789999999999999999999999998765543 34444
No 266
>KOG1464|consensus
Probab=21.05 E-value=1.9e+02 Score=27.97 Aligned_cols=32 Identities=31% Similarity=0.511 Sum_probs=24.2
Q ss_pred HhcCHHHHHHHHhcChhHHHHhhcCChHHHHHHHHHH
Q psy15482 122 LLDNPKEIALLQQNNPRLADALLNGSPEDFERILRQQ 158 (302)
Q Consensus 122 ~l~NP~~~~~L~~~NPeLa~al~~~d~e~f~~~lr~~ 158 (302)
+.++|+.+++- .|..|.++||..+|.++|+..
T Consensus 298 yKNdPEIlAMT-----nlv~aYQ~NdI~eFE~Il~~~ 329 (440)
T KOG1464|consen 298 YKNDPEILAMT-----NLVAAYQNNDIIEFERILKSN 329 (440)
T ss_pred CCCCHHHHHHH-----HHHHHHhcccHHHHHHHHHhh
Confidence 45666666553 577888899999999999664
No 267
>KOG3483|consensus
Probab=20.81 E-value=4e+02 Score=20.58 Aligned_cols=60 Identities=17% Similarity=0.105 Sum_probs=44.0
Q ss_pred EEEeCCCccHHHHHHHHHhhhCCCCCCeEEEECCeecCCCcccccccCCCCCCEEEEEEe
Q psy15482 15 LLDVSEDLELENFKAFCEVQSGYPAHEIAIEFQGVLLQDNKKTLKTHGIQDGDLVLLKHL 74 (302)
Q Consensus 15 ~leV~~~~TV~dLK~~Ie~e~gIp~~~Q~Li~~Gk~L~D~~~TL~~~gI~dgd~l~L~~~ 74 (302)
.++|+.+.....+...-++++.+|+..-.++-|.-+=-+...+-...=++.|+-|.+..+
T Consensus 30 v~svpestpftavlkfaaeefkvpaatsaiitndgiginpaq~agnvflkhgselr~ipr 89 (94)
T KOG3483|consen 30 VLSVPESTPFTAVLKFAAEEFKVPAATSAIITNDGIGINPAQTAGNVFLKHGSELRIIPR 89 (94)
T ss_pred eecCCCCCchHHHHHHHHHHccCCccceeEEecCccccCccccccceeeccCCEEEeccc
Confidence 457888888888888889999999987777765443323336666677888888877655
No 268
>cd01783 DAGK_delta_RA Ubiquitin-like domain of Diacylgylcerol kinase (DAGK). DAGK_delta_RA Diacylgylcerol kinase (DAGK) phosphorylates the second messenger diacylglycerol to phosphatidic acid as part of a protein kinase C pathway. Nine mammalian DAGK isotypes have been identified, which are classified into five subgroups according to their domain architecture and the DAGK-delta and -theta isozymes, which fall into one such group, contain an RA (Ras-associated) domain. DAGKs also contain a conserved catalytic domain (DAGKc), an assesory domain (DAGKa), and an array of conserved motifs that are likely to play a role in lipid-protein and protein-protein interactions in various DAG/PA-dependent signalling pathways.
Probab=20.61 E-value=2.5e+02 Score=22.48 Aligned_cols=28 Identities=14% Similarity=0.063 Sum_probs=24.1
Q ss_pred EEEEeCCCccHHHHHHHHHhhhCCCCCC
Q psy15482 14 FLLDVSEDLELENFKAFCEVQSGYPAHE 41 (302)
Q Consensus 14 ~~leV~~~~TV~dLK~~Ie~e~gIp~~~ 41 (302)
.+|.|+.++|+.++-...-.++|+..++
T Consensus 19 ~sv~V~~~tt~~dvv~eaL~kfGl~~~~ 46 (97)
T cd01783 19 VSIRVNKDTTVQDVILEVLPLFGLQAEC 46 (97)
T ss_pred EEEEecccchHHHHHHHHHHHhCcccCC
Confidence 3799999999999999999999987643
Done!