Query psy16982
Match_columns 116
No_of_seqs 155 out of 1006
Neff 8.4
Searched_HMMs 46136
Date Fri Aug 16 18:40:47 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16982.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16982hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1991|consensus 99.9 3E-25 6.6E-30 178.8 12.5 113 1-116 46-158 (1010)
2 KOG1992|consensus 99.9 1.1E-23 2.3E-28 167.7 12.0 110 2-116 48-157 (960)
3 COG5657 CSE1 CAS/CSE protein i 99.8 9.4E-21 2E-25 152.2 9.9 108 2-116 47-154 (947)
4 KOG1993|consensus 99.8 8.4E-21 1.8E-25 151.0 8.5 107 2-115 43-153 (978)
5 KOG2274|consensus 99.8 9.9E-20 2.1E-24 146.3 9.7 112 1-114 46-157 (1005)
6 KOG2171|consensus 99.8 9.6E-19 2.1E-23 143.2 10.8 102 2-115 46-148 (1075)
7 COG5656 SXM1 Importin, protein 99.7 2.1E-16 4.6E-21 125.7 9.9 112 1-116 45-157 (970)
8 KOG2023|consensus 99.6 2.5E-15 5.4E-20 118.6 6.4 99 4-114 59-157 (885)
9 KOG1241|consensus 99.5 9.4E-15 2E-19 116.5 6.2 113 2-115 46-160 (859)
10 COG5215 KAP95 Karyopherin (imp 99.3 9.7E-12 2.1E-16 97.6 9.6 111 4-115 52-164 (858)
11 PF03810 IBN_N: Importin-beta 99.3 2.5E-12 5.4E-17 76.7 2.3 53 2-58 25-77 (77)
12 PF08389 Xpo1: Exportin 1-like 99.0 1.2E-09 2.5E-14 71.9 5.5 55 60-115 1-55 (148)
13 COG5101 CRM1 Importin beta-rel 98.8 4.9E-08 1.1E-12 78.0 8.5 98 4-114 58-164 (1053)
14 KOG2020|consensus 98.3 9E-07 2E-11 74.4 5.7 96 6-114 58-162 (1041)
15 KOG2022|consensus 97.1 0.0038 8.1E-08 52.0 8.6 83 3-97 51-136 (982)
16 KOG2081|consensus 96.9 0.0048 1E-07 49.1 7.5 90 5-109 29-121 (559)
17 KOG2021|consensus 96.5 0.03 6.4E-07 46.4 9.2 59 39-97 73-140 (980)
18 PF12717 Cnd1: non-SMC mitotic 96.2 0.15 3.2E-06 34.9 10.2 92 7-115 2-93 (178)
19 PF02985 HEAT: HEAT repeat; I 96.0 0.025 5.4E-07 27.6 4.1 30 86-115 1-30 (31)
20 PF12348 CLASP_N: CLASP N term 95.9 0.062 1.4E-06 37.7 7.7 102 3-115 17-124 (228)
21 PF04510 DUF577: Family of unk 95.6 0.063 1.4E-06 36.9 6.2 48 47-94 3-55 (174)
22 cd00020 ARM Armadillo/beta-cat 95.5 0.062 1.3E-06 33.1 5.8 97 6-114 20-120 (120)
23 PF12830 Nipped-B_C: Sister ch 94.7 0.69 1.5E-05 32.0 9.6 68 48-116 9-76 (187)
24 PF12755 Vac14_Fab1_bd: Vacuol 94.4 0.4 8.6E-06 29.8 7.1 67 47-114 27-96 (97)
25 PF01602 Adaptin_N: Adaptin N 94.4 0.34 7.4E-06 37.9 8.3 71 41-112 108-179 (526)
26 PF09324 DUF1981: Domain of un 94.3 0.46 1E-05 28.8 7.1 66 45-111 16-85 (86)
27 PF13646 HEAT_2: HEAT repeats; 94.2 0.15 3.3E-06 30.0 4.7 56 47-109 31-87 (88)
28 PF13646 HEAT_2: HEAT repeats; 94.0 0.7 1.5E-05 27.1 8.3 55 51-112 3-58 (88)
29 cd00020 ARM Armadillo/beta-cat 93.9 0.58 1.3E-05 28.6 7.2 65 49-114 9-78 (120)
30 PF13925 Katanin_con80: con80 93.9 0.26 5.6E-06 33.5 5.9 50 67-116 46-100 (164)
31 KOG1410|consensus 93.3 2.7 5.9E-05 35.2 11.5 94 8-114 54-157 (1082)
32 PF13513 HEAT_EZ: HEAT-like re 93.0 0.14 3E-06 27.9 2.9 51 62-112 2-55 (55)
33 PTZ00429 beta-adaptin; Provisi 93.0 0.97 2.1E-05 37.8 8.8 110 4-114 79-208 (746)
34 PF02985 HEAT: HEAT repeat; I 92.9 0.44 9.6E-06 23.0 4.4 28 50-77 3-30 (31)
35 PF12530 DUF3730: Protein of u 92.7 0.39 8.5E-06 34.3 5.6 51 63-114 100-151 (234)
36 PF12717 Cnd1: non-SMC mitotic 92.3 2 4.2E-05 29.3 8.4 77 4-96 36-112 (178)
37 PF11698 V-ATPase_H_C: V-ATPas 92.1 0.56 1.2E-05 30.4 5.1 73 39-113 36-114 (119)
38 PF01465 GRIP: GRIP domain; I 92.0 0.57 1.2E-05 25.1 4.4 36 43-78 3-38 (46)
39 PF08167 RIX1: rRNA processing 91.3 3.3 7.1E-05 28.0 9.3 96 6-116 38-145 (165)
40 PF10363 DUF2435: Protein of u 91.2 2.3 5.1E-05 26.1 7.3 66 51-116 7-74 (92)
41 PF01602 Adaptin_N: Adaptin N 90.0 7.8 0.00017 30.3 10.9 68 46-116 78-145 (526)
42 cd00197 VHS_ENTH_ANTH VHS, ENT 89.8 3.2 6.9E-05 26.1 7.1 50 61-115 18-67 (115)
43 PF00514 Arm: Armadillo/beta-c 89.8 0.88 1.9E-05 23.2 3.8 28 86-113 13-40 (41)
44 smart00755 Grip golgin-97, Ran 88.8 1.2 2.7E-05 23.9 3.9 34 44-78 3-36 (46)
45 PF10508 Proteasom_PSMB: Prote 88.5 3.9 8.4E-05 32.6 8.1 98 4-114 130-231 (503)
46 KOG1062|consensus 88.5 8.9 0.00019 32.5 10.2 112 2-116 303-416 (866)
47 PF10508 Proteasom_PSMB: Prote 88.0 5.7 0.00012 31.6 8.8 97 4-114 88-189 (503)
48 PF12460 MMS19_C: RNAPII trans 87.6 4.1 8.9E-05 31.5 7.6 54 61-115 337-395 (415)
49 PTZ00429 beta-adaptin; Provisi 87.5 5.4 0.00012 33.6 8.6 56 59-116 380-435 (746)
50 PF07571 DUF1546: Protein of u 86.9 4.2 9.1E-05 24.9 6.0 51 61-112 20-76 (92)
51 PF12348 CLASP_N: CLASP N term 85.7 9.7 0.00021 26.4 8.6 67 49-115 133-207 (228)
52 smart00185 ARM Armadillo/beta- 85.5 2 4.4E-05 21.2 3.5 28 86-113 13-40 (41)
53 PF05918 API5: Apoptosis inhib 85.0 13 0.00027 30.4 9.2 84 6-107 35-121 (556)
54 PF03810 IBN_N: Importin-beta 85.0 5.2 0.00011 22.8 5.6 41 72-115 6-46 (77)
55 PF12719 Cnd3: Nuclear condens 84.9 8.4 0.00018 28.4 7.8 74 43-116 110-183 (298)
56 KOG1824|consensus 84.6 10 0.00022 33.1 8.7 92 1-113 827-918 (1233)
57 PF03224 V-ATPase_H_N: V-ATPas 84.0 4.4 9.4E-05 30.0 6.0 65 50-114 108-179 (312)
58 PF14225 MOR2-PAG1_C: Cell mor 83.5 7.4 0.00016 28.5 6.9 50 65-114 168-217 (262)
59 PF12755 Vac14_Fab1_bd: Vacuol 82.3 3.9 8.4E-05 25.3 4.4 35 81-115 23-57 (97)
60 cd03571 ENTH_epsin ENTH domain 82.0 5.7 0.00012 25.8 5.2 45 71-115 22-69 (123)
61 PF13513 HEAT_EZ: HEAT-like re 81.9 4.8 0.0001 21.5 4.3 54 8-74 2-55 (55)
62 PF13251 DUF4042: Domain of un 81.3 2.1 4.6E-05 29.7 3.2 54 62-115 1-70 (182)
63 cd03572 ENTH_epsin_related ENT 80.6 4.4 9.5E-05 26.3 4.3 36 80-115 33-68 (122)
64 PF10521 DUF2454: Protein of u 80.2 2.9 6.3E-05 30.7 3.8 49 65-115 98-149 (282)
65 COG5096 Vesicle coat complex, 79.7 13 0.00028 31.4 7.7 56 58-114 138-195 (757)
66 COG5096 Vesicle coat complex, 79.5 13 0.00029 31.3 7.7 88 15-114 69-156 (757)
67 KOG1058|consensus 79.2 20 0.00044 30.5 8.5 72 44-116 353-427 (948)
68 PRK13800 putative oxidoreducta 78.9 11 0.00024 32.2 7.3 55 51-112 625-679 (897)
69 PRK13800 putative oxidoreducta 78.5 23 0.0005 30.3 9.1 26 50-75 655-680 (897)
70 PRK09687 putative lyase; Provi 78.1 13 0.00027 27.4 6.6 61 51-113 58-119 (280)
71 PF12719 Cnd3: Nuclear condens 77.8 25 0.00055 25.8 8.2 70 46-116 26-95 (298)
72 PRK09687 putative lyase; Provi 77.6 11 0.00024 27.8 6.2 54 52-110 164-217 (280)
73 PF01417 ENTH: ENTH domain; I 75.7 11 0.00024 24.1 5.2 45 71-115 24-72 (125)
74 KOG2171|consensus 74.3 25 0.00054 30.9 8.1 68 47-114 159-235 (1075)
75 KOG1837|consensus 73.6 18 0.0004 32.9 7.3 53 62-115 1519-1571(1621)
76 PF08620 RPAP1_C: RPAP1-like, 72.6 7.9 0.00017 22.9 3.5 30 87-116 41-70 (73)
77 KOG1943|consensus 72.1 22 0.00047 31.3 7.2 68 43-114 337-409 (1133)
78 KOG2023|consensus 71.7 21 0.00045 30.1 6.7 70 46-115 661-733 (885)
79 KOG1240|consensus 71.6 11 0.00023 33.6 5.3 52 61-115 631-686 (1431)
80 PLN03076 ARF guanine nucleotid 71.4 68 0.0015 30.0 10.4 99 7-116 1151-1255(1780)
81 KOG0267|consensus 70.8 20 0.00043 30.2 6.5 70 47-116 676-745 (825)
82 PF10978 DUF2785: Protein of u 69.2 34 0.00073 23.4 8.8 81 6-96 12-105 (175)
83 smart00273 ENTH Epsin N-termin 69.2 16 0.00034 23.3 4.8 38 77-114 31-69 (127)
84 KOG1242|consensus 66.5 52 0.0011 27.0 8.0 49 66-114 276-324 (569)
85 KOG1060|consensus 65.4 27 0.00059 29.9 6.3 53 60-113 156-208 (968)
86 PF14500 MMS19_N: Dos2-interac 65.2 14 0.00031 27.0 4.4 33 84-116 207-239 (262)
87 PF06628 Catalase-rel: Catalas 63.9 22 0.00049 20.3 4.3 36 37-72 17-53 (68)
88 PF02194 PXA: PXA domain; Int 63.5 28 0.00061 23.4 5.4 24 2-26 1-24 (185)
89 KOG2956|consensus 63.3 55 0.0012 26.4 7.4 71 44-115 284-360 (516)
90 PF13251 DUF4042: Domain of un 63.3 47 0.001 23.0 7.7 108 5-113 52-173 (182)
91 KOG0212|consensus 62.2 31 0.00067 28.4 6.0 66 48-115 209-280 (675)
92 cd08050 TAF6 TATA Binding Prot 61.5 27 0.00058 26.5 5.4 51 62-113 283-339 (343)
93 KOG1062|consensus 61.2 76 0.0017 27.2 8.2 101 14-115 105-209 (866)
94 KOG2160|consensus 60.5 31 0.00067 26.4 5.5 59 58-116 135-198 (342)
95 PF09047 MEF2_binding: MEF2 bi 59.8 11 0.00024 18.6 2.0 18 39-56 8-25 (35)
96 PF11698 V-ATPase_H_C: V-ATPas 59.7 18 0.00039 23.4 3.6 36 44-79 83-118 (119)
97 KOG2137|consensus 59.5 59 0.0013 27.4 7.2 53 61-113 403-457 (700)
98 KOG1248|consensus 59.5 1.3E+02 0.0029 26.9 10.1 77 39-116 819-900 (1176)
99 KOG0392|consensus 58.9 22 0.00048 32.0 4.9 102 6-114 90-196 (1549)
100 PF10274 ParcG: Parkin co-regu 58.4 24 0.00052 24.6 4.3 67 48-114 39-109 (183)
101 PF07571 DUF1546: Protein of u 57.6 41 0.00089 20.4 5.7 55 5-72 18-72 (92)
102 KOG4524|consensus 56.3 94 0.002 27.3 8.0 71 41-115 794-877 (1014)
103 KOG0414|consensus 55.7 1.2E+02 0.0026 27.3 8.7 68 48-115 356-429 (1251)
104 PF06012 DUF908: Domain of Unk 55.0 22 0.00048 26.7 4.0 30 87-116 28-57 (329)
105 PF04826 Arm_2: Armadillo-like 54.8 81 0.0017 23.0 9.5 76 39-114 82-163 (254)
106 KOG1059|consensus 54.1 1.3E+02 0.0028 25.8 8.3 58 4-78 310-367 (877)
107 cd08818 CARD_MDA5_1 Caspase ac 53.9 47 0.001 20.4 4.5 48 38-91 29-77 (88)
108 KOG2062|consensus 52.5 19 0.00042 30.5 3.5 62 47-113 555-617 (929)
109 PF08506 Cse1: Cse1; InterPro 51.7 1.1E+02 0.0024 23.6 8.1 63 43-106 301-368 (370)
110 PF02847 MA3: MA3 domain; Int 51.0 55 0.0012 20.0 7.3 45 69-113 20-64 (113)
111 KOG2137|consensus 50.7 1.1E+02 0.0024 25.8 7.5 67 45-113 422-495 (700)
112 PF04802 SMK-1: Component of I 50.7 84 0.0018 22.0 7.1 63 46-114 106-176 (193)
113 KOG1020|consensus 50.1 1.1E+02 0.0023 28.4 7.7 58 51-108 820-878 (1692)
114 KOG1991|consensus 50.1 1.8E+02 0.0039 25.6 9.4 55 42-96 457-513 (1010)
115 KOG1059|consensus 50.0 1.5E+02 0.0033 25.4 8.1 76 39-115 136-211 (877)
116 PF06086 Pox_A30L_A26L: Orthop 49.5 24 0.00052 25.3 3.2 47 8-59 17-63 (220)
117 PF11262 Tho2: Transcription f 49.2 33 0.00071 25.5 4.1 35 82-116 223-258 (298)
118 PF14852 Fis1_TPR_N: Fis1 N-te 48.4 31 0.00068 17.2 2.7 29 87-115 5-33 (35)
119 KOG1824|consensus 47.9 1.2E+02 0.0025 27.0 7.3 54 61-114 188-242 (1233)
120 KOG1061|consensus 47.5 1.5E+02 0.0033 25.2 7.8 69 46-114 120-189 (734)
121 PF06371 Drf_GBD: Diaphanous G 47.2 27 0.00059 23.3 3.2 52 62-113 131-186 (187)
122 PF07539 DRIM: Down-regulated 46.6 40 0.00086 22.3 3.8 30 82-111 14-43 (141)
123 KOG2057|consensus 46.5 26 0.00056 27.0 3.1 35 81-115 57-94 (499)
124 PF14631 FancD2: Fanconi anaem 46.1 78 0.0017 29.0 6.4 60 56-116 443-504 (1426)
125 COG5656 SXM1 Importin, protein 45.1 1.6E+02 0.0035 25.4 7.6 53 45-98 458-512 (970)
126 KOG1020|consensus 44.7 1.2E+02 0.0026 28.1 7.2 73 43-116 827-923 (1692)
127 KOG1242|consensus 44.6 1E+02 0.0022 25.4 6.4 65 48-112 217-281 (569)
128 TIGR02270 conserved hypothetic 44.6 1.2E+02 0.0026 23.8 6.6 51 53-110 153-203 (410)
129 KOG0414|consensus 44.4 1.2E+02 0.0025 27.4 6.9 90 6-113 936-1026(1251)
130 PF00790 VHS: VHS domain; Int 44.2 87 0.0019 20.3 5.7 36 80-115 37-72 (140)
131 PLN03200 cellulose synthase-in 44.1 58 0.0013 31.0 5.4 63 51-113 450-516 (2102)
132 PF12460 MMS19_C: RNAPII trans 43.7 1.5E+02 0.0033 22.9 8.4 65 48-113 190-258 (415)
133 PF03392 OS-D: Insect pheromon 43.1 68 0.0015 19.7 4.2 47 42-89 34-84 (95)
134 PF05397 Med15_fungi: Mediator 43.1 74 0.0016 20.3 4.5 41 5-55 4-44 (115)
135 PF05085 DUF685: Protein of un 42.9 92 0.002 23.0 5.4 82 6-95 30-111 (265)
136 PF08767 CRM1_C: CRM1 C termin 42.6 86 0.0019 23.5 5.5 55 61-115 134-195 (319)
137 TIGR02548 casB_cse2 CRISPR sys 42.4 99 0.0022 20.4 5.9 48 47-94 94-142 (159)
138 cd03561 VHS VHS domain family; 41.8 94 0.002 20.0 5.6 50 61-115 18-67 (133)
139 PF10363 DUF2435: Protein of u 41.7 81 0.0018 19.2 6.7 72 5-94 15-86 (92)
140 KOG1241|consensus 41.7 65 0.0014 27.6 4.9 100 4-114 185-288 (859)
141 KOG1967|consensus 41.1 41 0.00088 29.3 3.7 53 62-116 232-286 (1030)
142 TIGR01914 cas_Csa4 CRISPR-asso 41.0 81 0.0018 24.3 5.0 45 65-110 278-333 (354)
143 PF12765 Cohesin_HEAT: HEAT re 41.0 53 0.0011 16.9 4.1 36 71-107 3-40 (42)
144 PF14664 RICTOR_N: Rapamycin-i 39.9 78 0.0017 24.4 4.9 65 10-85 213-277 (371)
145 KOG2149|consensus 39.8 1.9E+02 0.004 22.8 8.8 99 5-116 70-172 (393)
146 KOG0213|consensus 39.4 2E+02 0.0043 25.1 7.3 31 47-77 799-829 (1172)
147 PF01603 B56: Protein phosphat 38.8 1.1E+02 0.0023 23.9 5.6 41 38-78 208-249 (409)
148 cd00256 VATPase_H VATPase_H, r 38.7 2E+02 0.0043 22.8 7.7 63 52-114 106-173 (429)
149 KOG2056|consensus 38.4 83 0.0018 24.1 4.8 47 68-114 39-88 (336)
150 PF15601 Imm42: Immunity prote 37.7 1E+02 0.0022 20.4 4.6 52 64-115 18-71 (134)
151 PF02944 BESS: BESS motif; In 37.7 56 0.0012 16.2 2.7 20 39-58 17-36 (37)
152 PF08785 Ku_PK_bind: Ku C term 37.6 1.1E+02 0.0023 19.5 4.9 55 43-97 22-80 (120)
153 KOG1061|consensus 37.4 1.2E+02 0.0026 25.7 5.9 55 60-116 363-417 (734)
154 PF04826 Arm_2: Armadillo-like 37.2 1.4E+02 0.003 21.8 5.6 62 50-111 15-80 (254)
155 KOG0168|consensus 37.2 1.1E+02 0.0024 26.7 5.6 67 45-116 550-631 (1051)
156 PF11625 DUF3253: Protein of u 36.7 20 0.00044 21.7 1.1 42 45-92 5-48 (83)
157 PF14663 RasGEF_N_2: Rapamycin 36.6 80 0.0017 19.9 3.9 29 86-114 9-37 (115)
158 PF13001 Ecm29: Proteasome sta 36.5 1.9E+02 0.0042 23.1 6.8 73 39-112 354-441 (501)
159 KOG1848|consensus 36.4 3.6E+02 0.0078 25.1 9.0 48 67-114 1073-1132(1610)
160 PF10130 PIN_2: PIN domain; I 35.6 1.1E+02 0.0023 20.0 4.5 49 39-88 51-100 (133)
161 PF04439 Adenyl_transf: Strept 35.1 96 0.0021 23.0 4.6 73 43-115 166-252 (282)
162 PF14620 YPEB: YpeB sporulatio 34.5 2E+02 0.0042 22.2 6.3 71 39-116 111-181 (361)
163 KOG0413|consensus 34.3 2.3E+02 0.0051 25.5 7.1 105 6-113 944-1072(1529)
164 PF12463 DUF3689: Protein of u 34.2 2E+02 0.0044 21.6 6.8 73 42-114 90-173 (303)
165 KOG1291|consensus 34.0 78 0.0017 25.5 4.1 45 68-113 88-134 (503)
166 KOG4500|consensus 34.0 1.3E+02 0.0029 24.3 5.4 66 48-113 316-389 (604)
167 COG5181 HSH155 U2 snRNP splice 33.7 1.5E+02 0.0031 25.3 5.7 51 61-114 295-346 (975)
168 PF08158 NUC130_3NT: NUC130/3N 33.3 49 0.0011 18.1 2.2 18 79-96 9-26 (52)
169 cd03567 VHS_GGA VHS domain fam 33.2 1.2E+02 0.0025 20.0 4.4 49 62-115 20-68 (139)
170 PLN03200 cellulose synthase-in 33.1 3.6E+02 0.0079 26.1 8.5 63 52-114 656-723 (2102)
171 PF14961 BROMI: Broad-minded p 33.1 1.6E+02 0.0036 26.6 6.2 64 50-113 164-230 (1296)
172 PF09280 XPC-binding: XPC-bind 33.1 33 0.00072 19.3 1.5 18 84-101 21-38 (59)
173 smart00544 MA3 Domain in DAP-5 33.0 1.2E+02 0.0026 18.5 5.8 20 74-93 25-44 (113)
174 cd03564 ANTH_AP180_CALM ANTH d 32.8 1.3E+02 0.0027 18.8 4.8 34 81-114 33-66 (117)
175 PF10232 Med8: Mediator of RNA 32.8 1.9E+02 0.004 20.7 5.7 31 61-92 21-52 (226)
176 KOG1851|consensus 32.7 4.3E+02 0.0093 24.9 8.9 74 40-113 1562-1637(1710)
177 PF12054 DUF3535: Domain of un 32.7 2.1E+02 0.0046 22.6 6.4 52 63-115 103-156 (441)
178 COG1413 FOG: HEAT repeat [Ener 32.2 2E+02 0.0044 21.0 6.3 47 60-111 193-239 (335)
179 KOG1967|consensus 32.2 2E+02 0.0043 25.4 6.4 69 45-114 861-938 (1030)
180 KOG0168|consensus 32.0 81 0.0017 27.5 4.1 64 51-116 171-243 (1051)
181 COG5098 Chromosome condensatio 32.0 2.2E+02 0.0048 24.6 6.5 65 50-114 302-375 (1128)
182 TIGR02270 conserved hypothetic 31.9 1.9E+02 0.0041 22.7 6.0 23 90-112 152-174 (410)
183 KOG2745|consensus 31.6 2.2E+02 0.0047 21.5 5.9 31 64-94 175-206 (321)
184 PF10083 DUF2321: Uncharacteri 31.2 1.7E+02 0.0037 19.9 4.9 22 39-60 97-118 (158)
185 PF01603 B56: Protein phosphat 31.0 2.6E+02 0.0055 21.8 7.0 57 40-96 145-205 (409)
186 PF12783 Sec7_N: Guanine nucle 30.9 1.6E+02 0.0035 19.5 7.3 103 6-115 35-147 (168)
187 PF09485 CRISPR_Cse2: CRISPR-a 30.8 82 0.0018 20.4 3.4 52 46-97 82-134 (146)
188 KOG1077|consensus 30.7 3.6E+02 0.0077 23.4 8.5 54 43-100 365-419 (938)
189 KOG3036|consensus 30.5 2.2E+02 0.0048 21.2 5.7 64 50-113 123-196 (293)
190 cd03569 VHS_Hrs_Vps27p VHS dom 30.5 1.3E+02 0.0029 19.7 4.4 49 62-115 23-71 (142)
191 KOG0166|consensus 30.0 3.1E+02 0.0067 22.4 7.9 61 54-114 201-266 (514)
192 KOG4224|consensus 29.8 1.4E+02 0.0031 23.7 4.8 57 54-110 215-276 (550)
193 PF08578 DUF1765: Protein of u 29.6 1.6E+02 0.0034 18.8 4.5 34 81-114 46-79 (126)
194 cd07911 RNRR2_Rv0233_like Ribo 29.1 1.2E+02 0.0025 22.2 4.2 41 15-55 11-54 (280)
195 PF12243 CTK3: CTD kinase subu 29.0 1.8E+02 0.0038 19.3 6.7 61 49-115 7-70 (139)
196 PF07346 DUF1477: Protein of u 28.7 1.5E+02 0.0032 19.2 4.1 31 86-116 84-117 (118)
197 PHA02855 anti-apoptotic membra 28.4 1.6E+02 0.0034 20.4 4.3 57 40-96 69-130 (180)
198 PF05004 IFRD: Interferon-rela 28.2 2.6E+02 0.0055 20.9 8.4 94 7-111 57-157 (309)
199 PF07923 N1221: N1221-like pro 27.2 2.4E+02 0.0052 20.8 5.6 59 18-79 34-92 (293)
200 PF09133 SANTA: SANTA (SANT As 27.1 33 0.00072 21.0 0.9 13 77-89 79-91 (93)
201 PF09241 Herp-Cyclin: Herpesvi 27.1 96 0.0021 18.9 2.9 35 78-112 16-55 (106)
202 COG1413 FOG: HEAT repeat [Ener 27.0 2.6E+02 0.0055 20.5 6.3 55 51-112 78-133 (335)
203 KOG2160|consensus 27.0 3E+02 0.0064 21.2 6.6 95 4-114 135-240 (342)
204 PF09492 Pec_lyase: Pectic aci 26.7 1E+02 0.0022 23.1 3.5 64 51-114 47-114 (289)
205 PF09337 zf-H2C2: His(2)-Cys(2 26.4 84 0.0018 16.0 2.2 25 69-96 10-34 (39)
206 PRK08326 ribonucleotide-diphos 26.2 1.3E+02 0.0028 22.5 4.1 42 16-58 30-73 (311)
207 PF14500 MMS19_N: Dos2-interac 26.1 2.6E+02 0.0057 20.4 10.9 101 4-116 10-113 (262)
208 KOG1248|consensus 26.1 5E+02 0.011 23.6 8.7 73 40-113 690-766 (1176)
209 COG1907 Predicted archaeal sug 26.0 19 0.00042 27.0 -0.3 12 73-84 153-164 (312)
210 PF10273 WGG: Pre-rRNA-process 25.9 1.5E+02 0.0033 17.6 3.6 52 17-79 16-67 (82)
211 PF12252 SidE: Dot/Icm substra 25.8 1.8E+02 0.0039 26.2 5.1 48 38-86 956-1005(1439)
212 KOG2956|consensus 25.7 2.2E+02 0.0048 23.1 5.3 44 71-114 434-477 (516)
213 KOG0212|consensus 25.6 4E+02 0.0087 22.3 7.5 64 47-113 377-443 (675)
214 smart00288 VHS Domain present 25.4 1.9E+02 0.0042 18.6 5.7 36 80-115 32-67 (133)
215 KOG2025|consensus 25.1 4.5E+02 0.0099 22.7 10.2 70 39-109 118-188 (892)
216 PF07778 CENP-I: Mis6 ; Inter 24.9 1.3E+02 0.0028 24.4 4.0 55 62-116 439-493 (511)
217 PF06543 Lac_bphage_repr: Lact 24.6 72 0.0016 17.3 1.8 14 39-52 30-43 (49)
218 KOG0213|consensus 24.0 2.6E+02 0.0056 24.5 5.6 42 55-96 561-605 (1172)
219 COG4168 SapB ABC-type antimicr 23.7 1.8E+02 0.004 21.6 4.3 44 61-106 243-288 (321)
220 KOG0166|consensus 23.5 4.1E+02 0.0089 21.7 7.4 74 39-112 101-179 (514)
221 PF10654 DUF2481: Protein of u 23.4 1.6E+02 0.0035 19.1 3.5 31 69-99 14-45 (126)
222 PLN03060 inositol phosphatase- 23.4 2.2E+02 0.0047 20.3 4.5 35 40-74 66-100 (206)
223 PF13979 SopA_C: SopA-like cat 23.3 2.6E+02 0.0057 19.3 4.9 71 46-116 4-87 (172)
224 PRK14626 hypothetical protein; 23.2 1.5E+02 0.0032 18.8 3.4 37 39-75 61-97 (110)
225 COG5181 HSH155 U2 snRNP splice 22.6 3.5E+02 0.0076 23.2 6.0 54 40-96 723-776 (975)
226 COG4306 Uncharacterized protei 22.5 2.4E+02 0.0052 18.6 4.9 39 39-77 97-137 (160)
227 cd01049 RNRR2 Ribonucleotide R 22.4 1.6E+02 0.0034 21.4 3.9 40 16-55 14-55 (288)
228 KOG3077|consensus 22.0 58 0.0013 24.0 1.5 22 74-95 228-249 (260)
229 PRK09614 nrdF ribonucleotide-d 22.0 1.4E+02 0.0031 22.2 3.7 39 16-54 25-65 (324)
230 PF08167 RIX1: rRNA processing 21.9 2.5E+02 0.0055 18.7 7.8 75 40-114 18-97 (165)
231 PF08623 TIP120: TATA-binding 21.7 2.8E+02 0.006 19.0 7.0 52 62-114 42-94 (169)
232 PF11784 DUF3320: Protein of u 21.6 1.5E+02 0.0032 15.9 4.0 31 64-94 10-40 (52)
233 PLN00047 photosystem II biogen 21.4 2.1E+02 0.0046 21.4 4.3 34 40-73 119-152 (283)
234 COG2124 CypX Cytochrome P450 [ 21.3 2.3E+02 0.005 21.7 4.8 44 46-91 235-278 (411)
235 KOG0159|consensus 21.2 3.2E+02 0.007 22.3 5.5 55 43-99 312-366 (519)
236 TIGR03060 PS_II_psb29 photosys 20.9 2.6E+02 0.0056 20.0 4.5 35 40-74 68-102 (214)
237 PF05918 API5: Apoptosis inhib 20.8 2.7E+02 0.0059 23.0 5.1 54 60-114 35-88 (556)
238 PF13001 Ecm29: Proteasome sta 20.7 4.5E+02 0.0097 21.1 6.9 40 5-60 76-115 (501)
239 PF08542 Rep_fac_C: Replicatio 20.7 1.3E+02 0.0028 17.4 2.7 9 85-93 6-14 (89)
240 PF08732 HIM1: HIM1; InterPro 20.6 53 0.0011 25.8 1.1 12 81-92 173-184 (410)
241 KOG0211|consensus 20.5 3.7E+02 0.0079 23.1 6.0 72 42-113 232-304 (759)
242 PLN03205 ATR interacting prote 20.4 4.4E+02 0.0095 21.3 6.0 50 65-114 302-352 (652)
243 KOG2256|consensus 20.3 4.5E+02 0.0097 22.2 6.2 79 37-116 232-313 (661)
244 KOG1293|consensus 20.3 1.6E+02 0.0035 24.7 3.8 93 6-114 22-125 (678)
245 PF12921 ATP13: Mitochondrial 20.3 1.5E+02 0.0033 19.0 3.0 30 85-116 53-82 (126)
246 PF10390 ELL: RNA polymerase I 20.2 68 0.0015 23.8 1.6 21 35-55 262-282 (284)
247 PLN02169 fatty acid (omega-1)- 20.2 4.1E+02 0.0089 20.9 6.0 48 45-94 299-346 (500)
248 KOG1943|consensus 20.1 6.6E+02 0.014 22.7 10.0 99 2-116 350-459 (1133)
No 1
>KOG1991|consensus
Probab=99.93 E-value=3e-25 Score=178.76 Aligned_cols=113 Identities=50% Similarity=0.922 Sum_probs=106.3
Q ss_pred CCCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC
Q psy16982 1 MMSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 1 ~~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P 80 (116)
|+++.+.+|||+|||||||.|.++|+... ..| ....+++++|..||++|++.+.+.+..+|.|+..|+..|.+.|||
T Consensus 46 ~~d~~~l~vrqaaaIYlKN~I~~~W~~~~-~~g--~~~~I~e~dk~~irenIl~~iv~~p~~iRvql~~~l~~Ii~~D~p 122 (1010)
T KOG1991|consen 46 MDDGVPLPVRQAAAIYLKNKITKSWSSHE-APG--RPFGIPEEDKAVIRENILETIVQVPELIRVQLTACLNTIIKADYP 122 (1010)
T ss_pred HccCCchhHHHHHHHHHHHHHHhcCCccC-CCC--CcCCCChHHHHHHHHHHHHHHHhCchHHHHHHHHHHHHHHhcCCc
Confidence 78899999999999999999999999875 223 245799999999999999999999999999999999999999999
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
++||++++.+.+.+++++...+++||.||+.+||+|
T Consensus 123 ~~Wp~l~d~i~~~Lqs~~~~~vy~aLl~l~qL~k~y 158 (1010)
T KOG1991|consen 123 EQWPGLLDKIKNLLQSQDANHVYGALLCLYQLFKTY 158 (1010)
T ss_pred ccchhHHHHHHHHhcCcchhhHHHHHHHHHHHHHHH
Confidence 999999999999999999999999999999999998
No 2
>KOG1992|consensus
Probab=99.91 E-value=1.1e-23 Score=167.68 Aligned_cols=110 Identities=30% Similarity=0.489 Sum_probs=103.3
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG 81 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~ 81 (116)
+++.|.++|.+||++|||+|+++|....+. ...+.+++|+.||..|+.++.+.+..|+.|++++|+.|+++|||+
T Consensus 48 ~~~~d~~~r~aaav~fKN~iKr~W~~~~~~-----~~~i~~~~~e~ikslIv~lMl~s~~~iQ~qlseal~~Ig~~DFP~ 122 (960)
T KOG1992|consen 48 NGQQDPQIRVAAAVYFKNYIKRNWIPAEDS-----PIKIIEEDREQIKSLIVTLMLSSPFNIQKQLSEALSLIGKRDFPD 122 (960)
T ss_pred ccCcChhHHHHHHHHHHHHHHhccCcCCCC-----ccccchhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHhccccch
Confidence 456799999999999999999999875532 357999999999999999999999999999999999999999999
Q ss_pred chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 82 KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+||+|+|++++.++++|.+...|.|.+-+.++|||
T Consensus 123 kWptLl~dL~~~ls~~D~~~~~gVL~tahsiFkr~ 157 (960)
T KOG1992|consen 123 KWPTLLPDLVARLSSGDFNVINGVLVTAHSIFKRY 157 (960)
T ss_pred hhHHHHHHHHhhccccchHHHHHHHHHHHHHHHhc
Confidence 99999999999999999999999999999999998
No 3
>COG5657 CSE1 CAS/CSE protein involved in chromosome segregation [Cell division and chromosome partitioning]
Probab=99.84 E-value=9.4e-21 Score=152.15 Aligned_cols=108 Identities=25% Similarity=0.438 Sum_probs=98.1
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG 81 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~ 81 (116)
+.+.+.++|+.|+|+|||+|++||..... ..+.++++..||..++..+.+.++.+.-|.+++++.||+.|||+
T Consensus 47 ~~~~~m~lR~~a~i~fkn~I~~~W~~~~~-------~~i~p~e~v~IR~~l~~lii~s~n~l~iq~a~avs~IA~~DfPd 119 (947)
T COG5657 47 SAFNSMSLRWAALIQFKNYIDKHWREENG-------NSILPDENVLIRDELFSLIISSSNQLQIQNALAVSRIARLDFPD 119 (947)
T ss_pred ccccchhHHHHHHHHHHhhHHHHhhhhcc-------cCCCCccchHHHHHHHHHHHcccchHHHHHHHHHHHHHhccCcc
Confidence 34678899999999999999999985332 25777777799999999999999888889999999999999999
Q ss_pred chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 82 KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+||+|+|.+.+.+++.|....++.|.+++.++|+|
T Consensus 120 eWpTL~~DL~~~Ls~~D~~tn~~~L~~~h~Ifk~~ 154 (947)
T COG5657 120 EWPTLVPDLLSLLSEKDMVTNENSLRVLHHIFKRL 154 (947)
T ss_pred cchhHHHHHHhhhcccchHHHHHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999986
No 4
>KOG1993|consensus
Probab=99.84 E-value=8.4e-21 Score=151.04 Aligned_cols=107 Identities=26% Similarity=0.397 Sum_probs=97.9
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG 81 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~ 81 (116)
|.+.+.+||.+|+|||||.|.++|+... ...+|+|+|+.||..++..+..+.+++..|.|.++++||+.|||-
T Consensus 43 ~~t~dv~vRWmAviyfKNgIdryWR~~~-------~~sl~~EEK~~iR~~Ll~~~~E~~nQlaiQ~AvlisrIARlDyPr 115 (978)
T KOG1993|consen 43 SKTNDVSVRWMAVIYFKNGIDRYWRRNT-------KMSLPPEEKDFIRCNLLLHSDEENNQLAIQNAVLISRIARLDYPR 115 (978)
T ss_pred ccccceeeeeehhhhHhcchhHHhhcCC-------cccCCHHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHHhcCCCc
Confidence 4567789999999999999999999543 247999999999999999999777999999999999999999999
Q ss_pred chhHHHHHHHHhhCCC----ChHHHhHHHHHHHHHHhh
Q psy16982 82 KWTQIVDKVSIYLQNP----DATPWFGALLCLYQLVKN 115 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~----~~~~~~~~L~~L~~l~~~ 115 (116)
+||+|+|.+.+.+++. |....+..|.+|+.+.|.
T Consensus 116 eWP~Lf~~L~~~Lq~~~~~gD~~~~~RiLi~l~~ilK~ 153 (978)
T KOG1993|consen 116 EWPDLFPDLLGQLQSSLGTGDSLVQHRILITLHHILKA 153 (978)
T ss_pred cchhHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHH
Confidence 9999999999999875 999999999999999874
No 5
>KOG2274|consensus
Probab=99.81 E-value=9.9e-20 Score=146.28 Aligned_cols=112 Identities=24% Similarity=0.416 Sum_probs=101.4
Q ss_pred CCCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC
Q psy16982 1 MMSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 1 ~~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P 80 (116)
|+.+.+.+.||+|.|+||++|.+||....+.+.. ....+++.|..||+.+++.+.++..++++..+++|+.||+.|||
T Consensus 46 ~~~~~sl~lRQ~A~v~L~~yie~hW~~~~E~fr~--~~~~~e~~K~~IRe~Ll~~l~~sn~ki~~~vay~is~Ia~~D~P 123 (1005)
T KOG2274|consen 46 ANKDASLPLRQIALVLLKRYIEKHWSPNFEAFRY--PLIVSEEVKALIREQLLNLLDDSNSKIRSAVAYAISSIAAVDYP 123 (1005)
T ss_pred hCcccCchHHHHHHHHHHHHHHHhCCChHhhccC--CCcccHHHHHHHHHHHHhhhhccccccchHHHHHHHHHHhccCc
Confidence 4566788999999999999999999987765543 22378999999999999999987789999999999999999999
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
++||+|++.++.+++++|.+.+++++.+|..+..
T Consensus 124 d~WpElv~~i~~~l~~~n~n~i~~am~vL~el~~ 157 (1005)
T KOG2274|consen 124 DEWPELVPFILKLLSSGNENSIHGAMRVLAELSD 157 (1005)
T ss_pred hhhHHHHHHHHHHHhccchhhhhhHHHHHHHHHH
Confidence 9999999999999999999999999999998864
No 6
>KOG2171|consensus
Probab=99.78 E-value=9.6e-19 Score=143.19 Aligned_cols=102 Identities=25% Similarity=0.457 Sum_probs=96.1
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCC
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P 80 (116)
..+.+.++||+|||.+|+.+.++|+ .++++.|+.||+.||.++.+++ +.||+.+|++|+.||+.++|
T Consensus 46 ~~~~~p~~Rq~aaVl~Rkl~~~~w~------------~l~~e~~~siks~lL~~~~~E~~~~vr~k~~dviAeia~~~l~ 113 (1075)
T KOG2171|consen 46 ATSADPQVRQLAAVLLRKLLTKHWS------------RLSAEVQQSIKSSLLEIIQSETEPSVRHKLADVIAEIARNDLP 113 (1075)
T ss_pred hcCCChHHHHHHHHHHHHHHHHHhh------------cCCHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHhccc
Confidence 4577899999999999999999997 6999999999999999999987 79999999999999999999
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
++||+|++.+++..+|+|+...+.|+.+|..+-+.
T Consensus 114 e~WPell~~L~q~~~S~~~~~rE~al~il~s~~~~ 148 (1075)
T KOG2171|consen 114 EKWPELLQFLFQSTKSPNPSLRESALLILSSLPET 148 (1075)
T ss_pred cchHHHHHHHHHHhcCCCcchhHHHHHHHHhhhhh
Confidence 99999999999999999999999999999987653
No 7
>COG5656 SXM1 Importin, protein involved in nuclear import [Posttranslational modification, protein turnover, chaperones]
Probab=99.68 E-value=2.1e-16 Score=125.70 Aligned_cols=112 Identities=29% Similarity=0.495 Sum_probs=94.2
Q ss_pred CCCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC
Q psy16982 1 MMSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 1 ~~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P 80 (116)
|+++.+.++|+.|||||||.|.+.|+...+... ..-..++.|+.++++++......+...|+.+-.++..|...|||
T Consensus 45 s~de~~lnvklsAaIYfKNkI~rsWss~~d~~i---~~Dek~e~K~~lienil~v~l~sp~~tr~~l~ail~~I~seD~p 121 (970)
T COG5656 45 SKDEGDLNVKLSAAIYFKNKIIRSWSSKRDDGI---KADEKSEAKKYLIENILDVFLYSPEVTRTALNAILVNIFSEDKP 121 (970)
T ss_pred hhccCCchhhHHHHHHHhhhhhhhhhhcccCCC---CCcccHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHhccccCc
Confidence 467899999999999999999999998443210 01234556666666666666777788999999999999999999
Q ss_pred -CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 81 -GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 81 -~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+.|| |+|...+++.+++...++.||.|+.++||.|
T Consensus 122 s~~wg-l~p~~~nll~s~ea~~vy~gLlcl~elfkay 157 (970)
T COG5656 122 SDLWG-LFPKAANLLRSSEANHVYTGLLCLEELFKAY 157 (970)
T ss_pred hhhcc-cchHHHHhhcccchhHHHHHHHHHHHHHHHH
Confidence 9999 9999999999999999999999999999987
No 8
>KOG2023|consensus
Probab=99.58 E-value=2.5e-15 Score=118.65 Aligned_cols=99 Identities=19% Similarity=0.340 Sum_probs=94.3
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCch
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW 83 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W 83 (116)
+.+..+|.+|++.|||.++.+|. .++.+.+.+||+.++.++.++++.||...+.+|++|++..+-..|
T Consensus 59 ~~d~~~Rs~aGLlLKNnvr~~~~------------~~~~~~~~yiKs~~l~~lgd~~~lIr~tvGivITTI~s~~~~~~w 126 (885)
T KOG2023|consen 59 SEDVPTRSLAGLLLKNNVRGHYN------------SIPSEVLDYIKSECLHGLGDASPLIRATVGIVITTIASTGGLQHW 126 (885)
T ss_pred ccchhHHHHhhhhHhcccccccc------------CCChHHHHHHHHHHHhhccCchHHHHhhhhheeeeeecccccccc
Confidence 45778999999999999999997 577899999999999999999999999999999999999988999
Q ss_pred hHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 84 TQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 84 p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
|+++|.+.+++.++|.+...|++.+|.+|||
T Consensus 127 pelLp~L~~~L~s~d~n~~EgA~~AL~KIcE 157 (885)
T KOG2023|consen 127 PELLPQLCELLDSPDYNTCEGAFGALQKICE 157 (885)
T ss_pred hhHHHHHHHHhcCCcccccchhHHHHHHHHh
Confidence 9999999999999999999999999999998
No 9
>KOG1241|consensus
Probab=99.54 E-value=9.4e-15 Score=116.45 Aligned_cols=113 Identities=16% Similarity=0.186 Sum_probs=95.7
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP- 80 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P- 80 (116)
+.+.+..+|++|++.|||.+..+-....... ...|..++.+.|++||..++.+|..+.+...+..++|++.||..|.|
T Consensus 46 n~~~~~~~R~~AGL~LKN~L~akd~~~k~~~-~qRWl~l~~e~reqVK~~il~tL~~~ep~~~s~Aaq~va~IA~~ElP~ 124 (859)
T KOG1241|consen 46 NDNSSDVARMAAGLQLKNSLTAKDPERKQQY-QQRWLQLPAEIREQVKNNILRTLGSPEPRRPSSAAQCVAAIACIELPQ 124 (859)
T ss_pred ccCCcHHHHHHHhHHHhhhhccCCHHHHHHH-HHHHHcCCHHHHHHHHHHHHHHcCCCCCCccchHHHHHHHHHHhhCch
Confidence 3466788999999999999986544322110 12367899999999999999999998888999999999999999999
Q ss_pred CchhHHHHHHHHhhCCCChH-HHhHHHHHHHHHHhh
Q psy16982 81 GKWTQIVDKVSIYLQNPDAT-PWFGALLCLYQLVKN 115 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~-~~~~~L~~L~~l~~~ 115 (116)
+.||+|++.++.+..++.+. ...++|.+++++|+.
T Consensus 125 n~wp~li~~lv~nv~~~~~~~~k~~slealGyice~ 160 (859)
T KOG1241|consen 125 NQWPELIVTLVSNVGEEQASMVKESSLEALGYICED 160 (859)
T ss_pred hhCHHHHHHHHHhcccccchHHHHHHHHHHHHHHcc
Confidence 89999999999999887664 778999999999973
No 10
>COG5215 KAP95 Karyopherin (importin) beta [Intracellular trafficking and secretion]
Probab=99.33 E-value=9.7e-12 Score=97.59 Aligned_cols=111 Identities=11% Similarity=0.124 Sum_probs=94.0
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-Cc
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GK 82 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~ 82 (116)
.....+|.+|++.|||.+..+-....... ...|..++.|.|++||...+.+|.++.+.+-+..+..++.||+.|.| +.
T Consensus 52 ns~~~~Rm~agl~LKN~l~a~d~~~~~~~-~qrW~~~~~E~k~qvK~~al~aL~s~epr~~~~Aaql~aaIA~~Elp~~~ 130 (858)
T COG5215 52 NSNDQLRMVAGLILKNSLHANDPELQKGC-SQRWLGMRHESKEQVKGMALRALKSPEPRFCTMAAQLLAAIARMELPNSL 130 (858)
T ss_pred CCcHHHHHHHHHHHhhhhhcCCHHHHHHH-HHhhccCCHHHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHhhCcccc
Confidence 44678999999999999987654322111 12367899999999999999999999999999999999999999999 89
Q ss_pred hhHHHHHHHHhhCCCCh-HHHhHHHHHHHHHHhh
Q psy16982 83 WTQIVDKVSIYLQNPDA-TPWFGALLCLYQLVKN 115 (116)
Q Consensus 83 Wp~Ll~~l~~~l~~~~~-~~~~~~L~~L~~l~~~ 115 (116)
||+|+..++.+...+.+ ..+..+|.++++.|++
T Consensus 131 wp~lm~~mv~nvg~eqp~~~k~~sl~~~gy~ces 164 (858)
T COG5215 131 WPGLMEEMVRNVGDEQPVSGKCESLGICGYHCES 164 (858)
T ss_pred chHHHHHHHHhccccCchHhHHHHHHHHHHHhhc
Confidence 99999999999988766 4567899999999975
No 11
>PF03810 IBN_N: Importin-beta N-terminal domain; InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=99.26 E-value=2.5e-12 Score=76.66 Aligned_cols=53 Identities=26% Similarity=0.392 Sum_probs=44.8
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhc
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVM 58 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~ 58 (116)
+++.+..+|++|+++|||.|+++|..... ..|..+++++|+.||+.|+++|.+
T Consensus 25 ~~~~~~~~R~~A~i~LKn~I~~~W~~~~~----~~~~~~~~~~k~~Ik~~ll~~l~~ 77 (77)
T PF03810_consen 25 SNSQDPEVRQLAAILLKNLIKKNWSPSKQ----KGWSQLPEEEKEQIKSQLLQLLLQ 77 (77)
T ss_dssp CTTSCHHHHHHHHHHHHHHHHHSGGHHHH----HHHHGSSHHHHHHHHHHHHHHHHC
T ss_pred ccCCCHHHHHHHHHHHHHHHHHcCchhhc----cCCCCCCHHHHHHHHHHHHHHHcC
Confidence 56789999999999999999999985431 124579999999999999999864
No 12
>PF08389 Xpo1: Exportin 1-like protein; InterPro: IPR013598 The exchange of macromolecules between the nucleus and cytoplasm takes place through nuclear pore complexes within the nuclear membrane. Active transport of large molecules through these pore complexes require carrier proteins, called karyopherins (importins and exportins), which shuttle between the two compartments. This domain is found close to the N terminus of yeast exportin 1 (Xpo1, Crm1, P14068 from SWISSPROT), as well as adjacent to the N-terminal domain of importin-beta (IPR001494 from INTERPRO). Exportin 1 is a nuclear export receptor that translocates proteins out of the nucleus; it interacts with leucine-rich nuclear export signal (NES) sequences in proteins to be transported, as well as with RanGTP [, ]. Importin-beta is a nuclear import receptor that translocates proteins into the nucleus; it interacts with RanGTP and importin-alpha, the latter binding with the nuclear localisation signal (NLS) sequences in proteins to be transported []. More information about these proteins can be found at Protein of the Month: Importins [].; PDB: 3IBV_A 3ICQ_U 3M1I_C 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 2XWU_B 2X19_B ....
Probab=98.98 E-value=1.2e-09 Score=71.95 Aligned_cols=55 Identities=27% Similarity=0.431 Sum_probs=49.2
Q ss_pred cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 60 PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 60 ~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
|+.|+++++.+++.|+.+|||..||++++.+++.+++ ++.....+|.+|..+.+.
T Consensus 1 p~~i~~kl~~~l~~i~~~~~P~~Wp~~l~~l~~~~~~-~~~~~~~~L~iL~~l~eE 55 (148)
T PF08389_consen 1 PPFIRNKLAQVLAEIAKRDWPQQWPDFLEDLLQLLQS-SPQHLELVLRILRILPEE 55 (148)
T ss_dssp -HHHHHHHHHHHHHHHHHHTTTTSTTHHHHHHHHHHT-THHHHHHHHHHHHHHHHH
T ss_pred ChhHHHHHHHHHHHHHHHHChhhCchHHHHHHHHhcc-chhHHHHHHHHHHHHHHH
Confidence 4579999999999999999999999999999999988 578888899999888764
No 13
>COG5101 CRM1 Importin beta-related nuclear transport receptor [Nuclear structure / Intracellular trafficking and secretion]
Probab=98.75 E-value=4.9e-08 Score=78.02 Aligned_cols=98 Identities=19% Similarity=0.302 Sum_probs=78.9
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc---------HHHHHHHHHHHHHH
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP---------EVIRVQLAVCVSNI 74 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~---------~~i~~~l~~~i~~I 74 (116)
+.-++.+..|.-.|-..|++.|+ .+|++.|..||..+++.+...+ +.+-+.+-.++..|
T Consensus 58 S~~pqskyiALs~LdklIttkWk------------llp~~~r~GiRnyvv~~vI~~s~dd~v~~~qk~~lnkldltLvqI 125 (1053)
T COG5101 58 SKLPQSKYIALSLLDKLITTKWK------------LLPEGMRQGIRNYVVQLVIEKSQDDKVRDKQKYVLNKLDLTLVQI 125 (1053)
T ss_pred ccCcchhhhHHHHHHHHHHhhhh------------hCCcHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHhhhHHHHH
Confidence 33456778888999999999997 6999999999999999987632 23456678889999
Q ss_pred HhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 75 VKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 75 a~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
++.|||.+||+++|++++..+ .+.+.....+.+|+.+-|
T Consensus 126 lKqeWP~nWP~FIpeli~~S~-~s~~vCeNnmivLklLsE 164 (1053)
T COG5101 126 LKQEWPRNWPTFIPELINVSQ-ISMEVCENNMIVLKLLSE 164 (1053)
T ss_pred HHHhcccccchhhHHHHhhcc-chHHHHhccHHHHHHhHH
Confidence 999999999999999999766 456666677777766544
No 14
>KOG2020|consensus
Probab=98.33 E-value=9e-07 Score=74.36 Aligned_cols=96 Identities=17% Similarity=0.334 Sum_probs=77.0
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcC---------cHHHHHHHHHHHHHHHh
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMA---------PEVIRVQLAVCVSNIVK 76 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~---------~~~i~~~l~~~i~~Ia~ 76 (116)
...+|.+|--+|-+.|+.+|+ .+|.+.|..+|..+++..... .+.++.+++.++..|++
T Consensus 58 ~~~~k~f~Lqlle~vik~~W~------------~~~~~~r~glk~~v~~~~~~~~~~~~~~~~~~~~~~kL~~i~Vqi~K 125 (1041)
T KOG2020|consen 58 NPILKYFALQLLENVIKFRWN------------SLPVEERVGLKNYVLTLIIEASPDEDVSETEKHLLNKLNLILVQIVK 125 (1041)
T ss_pred CchhheeeHHHHHHHHHHhcc------------cCCccccHHHHHHHHHHHhhcCCcHhHHHHHHHHHHHHhHHHHHHHH
Confidence 457888899999999999997 588899999999999998652 13588999999999999
Q ss_pred ccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 77 HDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 77 ~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.|||+.||++++.+.+.+..+ .......+.+|+.+.|
T Consensus 126 ~eWp~~wp~~i~dl~~~s~~s-~~~~el~m~Il~lLsE 162 (1041)
T KOG2020|consen 126 REWPAIWPTFIPDLAQSSKTS-ETVCELSMIILLLLSE 162 (1041)
T ss_pred HHHHhhcchhhhhHHHHhhcC-cccchHHHHHHHHHHH
Confidence 999999999999999988743 3333444555555444
No 15
>KOG2022|consensus
Probab=97.11 E-value=0.0038 Score=52.03 Aligned_cols=83 Identities=14% Similarity=0.292 Sum_probs=67.4
Q ss_pred CCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcC---cHHHHHHHHHHHHHHHhccC
Q psy16982 3 SEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMA---PEVIRVQLAVCVSNIVKHDF 79 (116)
Q Consensus 3 ~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~---~~~i~~~l~~~i~~Ia~~d~ 79 (116)
+.--..+|.++|+.|--.|.++|. .+++++...+|..++..+... ++.|-+.++..++..+-+-.
T Consensus 51 ~~k~~evqyFGAltL~~ki~~~~e------------~~~~~~~~qL~~klf~~l~~~~g~~~lVl~kl~~sLasl~l~~~ 118 (982)
T KOG2022|consen 51 PDKSSEVQYFGALTLHDKINTRWE------------ECPANEAVQLKLKLFLILSRFAGGPKLVLNKLCASLASLILYMV 118 (982)
T ss_pred CCchhHHHHHhHHHHHHHHHhhhc------------cCChhHHHHHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHc
Confidence 344456799999999999999997 588899999999999998763 25566667777776666667
Q ss_pred CCchhHHHHHHHHhhCCC
Q psy16982 80 PGKWTQIVDKVSIYLQNP 97 (116)
Q Consensus 80 P~~Wp~Ll~~l~~~l~~~ 97 (116)
|+.||+-+..++..++.+
T Consensus 119 ~d~Wp~ai~~vi~~l~~q 136 (982)
T KOG2022|consen 119 PDLWPTAIQDVIPTLQGQ 136 (982)
T ss_pred cccCCchHHHHHHHHhcc
Confidence 999999999999998864
No 16
>KOG2081|consensus
Probab=96.94 E-value=0.0048 Score=49.05 Aligned_cols=90 Identities=27% Similarity=0.355 Sum_probs=68.1
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhc---CcHHHHHHHHHHHHHHHhccCCC
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVM---APEVIRVQLAVCVSNIVKHDFPG 81 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~---~~~~i~~~l~~~i~~Ia~~d~P~ 81 (116)
.+...-.+|+-.+||.|+..+. .+++.+-...|+.++..+.. .++.++.|++.+++.+|-+ +|
T Consensus 29 ~~~~~~~FaaqTlr~Ki~~~F~------------~Lp~~~~~slrdsl~thl~~l~~~~~~i~tQL~vavA~Lal~-~~- 94 (559)
T KOG2081|consen 29 CDVEALLFAAQTLRNKIQYDFS------------ELPPLTHASLRDSLITHLKELHDHPDVIRTQLAVAVAALALH-MP- 94 (559)
T ss_pred chHHHHHHHHHHHHHHHHhhHH------------hcCcchhHHHHHHHHHHHHHHHhCCchHHHHHHHHHHHHHHH-hH-
Confidence 4667889999999999999886 46776667777766666554 2249999999999999988 56
Q ss_pred chhHHHHHHHHhhCCCChHHHhHHHHHH
Q psy16982 82 KWTQIVDKVSIYLQNPDATPWFGALLCL 109 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L 109 (116)
+|.+-++++++..++..+. +..-|.+|
T Consensus 95 ~W~n~I~e~v~~~~~~~~~-~~~lLeiL 121 (559)
T KOG2081|consen 95 EWVNPIFELVRALSNKHPA-VPILLEIL 121 (559)
T ss_pred hhcchHHHHHHHhhcCCcc-HHHHHHHH
Confidence 8999888888888776554 33334333
No 17
>KOG2021|consensus
Probab=96.49 E-value=0.03 Score=46.44 Aligned_cols=59 Identities=14% Similarity=0.339 Sum_probs=51.0
Q ss_pred CCCHhHHHHHHHHHHHHHhc-----C----cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC
Q psy16982 39 SLHEQDKAMIRDAIVDAVVM-----A----PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP 97 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~-----~----~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~ 97 (116)
..+..+.+.||..+..-+.- + +.-|++.++++++.+.-.+||..||+++..+...++-+
T Consensus 73 e~nl~elqlvR~sv~swlk~qvl~ne~~~~p~fi~Nk~aqvlttLf~~eYp~~WnsfF~dlmsv~~~~ 140 (980)
T KOG2021|consen 73 EANLNELQLVRFSVTSWLKFQVLGNEQTKLPDFIMNKIAQVLTTLFMLEYPDCWNSFFDDLMSVFQVD 140 (980)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHhCcccCCCChHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhcc
Confidence 47888999999999876642 3 35799999999999999999999999999999988754
No 18
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=96.16 E-value=0.15 Score=34.88 Aligned_cols=92 Identities=13% Similarity=0.095 Sum_probs=72.2
Q ss_pred hHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHH
Q psy16982 7 MPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQI 86 (116)
Q Consensus 7 ~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~L 86 (116)
+.+|.-|.+.+=-...++-+- + +..=..+..+|.++++.||.+...+++.+...|+-.--+.+
T Consensus 2 ~~vR~n~i~~l~DL~~r~~~~------------v-----e~~~~~l~~~L~D~~~~VR~~al~~Ls~Li~~d~ik~k~~l 64 (178)
T PF12717_consen 2 PSVRNNAIIALGDLCIRYPNL------------V-----EPYLPNLYKCLRDEDPLVRKTALLVLSHLILEDMIKVKGQL 64 (178)
T ss_pred HHHHHHHHHHHHHHHHhCcHH------------H-----HhHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCceeehhhh
Confidence 467777777777766665431 1 12234677888888899999999999999999986555777
Q ss_pred HHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 87 VDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 87 l~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..++..+..+|++....|-.+|.++.++
T Consensus 65 ~~~~l~~l~D~~~~Ir~~A~~~~~e~~~~ 93 (178)
T PF12717_consen 65 FSRILKLLVDENPEIRSLARSFFSELLKK 93 (178)
T ss_pred hHHHHHHHcCCCHHHHHHHHHHHHHHHHh
Confidence 79999999988999999999999888765
No 19
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=95.96 E-value=0.025 Score=27.57 Aligned_cols=30 Identities=17% Similarity=0.272 Sum_probs=23.3
Q ss_pred HHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 86 IVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 86 Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
++|.++++++++++.....+..+|..+++.
T Consensus 1 llp~l~~~l~D~~~~VR~~a~~~l~~i~~~ 30 (31)
T PF02985_consen 1 LLPILLQLLNDPSPEVRQAAAECLGAIAEH 30 (31)
T ss_dssp HHHHHHHHHT-SSHHHHHHHHHHHHHHHHT
T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHHhh
Confidence 577888888888888888888888888764
No 20
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=95.94 E-value=0.062 Score=37.66 Aligned_cols=102 Identities=16% Similarity=0.150 Sum_probs=64.1
Q ss_pred CCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHH---HHHHHHHhcCcHHHHHHHHHHHHHHHhc--
Q psy16982 3 SEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIR---DAIVDAVVMAPEVIRVQLAVCVSNIVKH-- 77 (116)
Q Consensus 3 ~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK---~~ll~~l~~~~~~i~~~l~~~i~~Ia~~-- 77 (116)
++.+-..|.-|...|++.+..+ ... ...+..-..+| ..+...+.+.-+.+....+.+++.++..
T Consensus 17 ~~~~W~~r~~al~~L~~l~~~~-~~~----------~~~~~~~~~l~~~~~~i~~~l~d~Rs~v~~~A~~~l~~l~~~l~ 85 (228)
T PF12348_consen 17 SESDWEERVEALQKLRSLIKGN-APE----------DFPPDFVECLRQLLDAIIKQLSDLRSKVSKTACQLLSDLARQLG 85 (228)
T ss_dssp T-SSHHHHHHHHHHHHHHHHH--B---------------HHHHHHHH---HHHHH-S-HH---HHHHHHHHHHHHHHHHG
T ss_pred CccCHHHHHHHHHHHHHHHHcC-Ccc----------ccHHHHHHHHHHhHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHh
Confidence 4567789999999999999988 211 34566667777 3444444333357788888888888755
Q ss_pred -cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 78 -DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 78 -d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+|...=+.++|.|+..+.+++.-....+..||..+++.
T Consensus 86 ~~~~~~~~~~l~~Ll~~~~~~~~~i~~~a~~~L~~i~~~ 124 (228)
T PF12348_consen 86 SHFEPYADILLPPLLKKLGDSKKFIREAANNALDAIIES 124 (228)
T ss_dssp GGGHHHHHHHHHHHHHGGG---HHHHHHHHHHHHHHHTT
T ss_pred HhHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHH
Confidence 33223466888888888888777778888888888764
No 21
>PF04510 DUF577: Family of unknown function (DUF577); InterPro: IPR007598 This is a family of Arabidopsis thaliana (Mouse-ear cress) proteins. Many of these members contain a repeated region.
Probab=95.58 E-value=0.063 Score=36.93 Aligned_cols=48 Identities=19% Similarity=0.257 Sum_probs=37.6
Q ss_pred HHHHHHHHHHhcCc--HHHHHHHHHHHHHHHhcc--CC-CchhHHHHHHHHhh
Q psy16982 47 MIRDAIVDAVVMAP--EVIRVQLAVCVSNIVKHD--FP-GKWTQIVDKVSIYL 94 (116)
Q Consensus 47 ~IK~~ll~~l~~~~--~~i~~~l~~~i~~Ia~~d--~P-~~Wp~Ll~~l~~~l 94 (116)
.||..|+.||..+. ...-+.++.+++.+|..- +| +.|++|.+.|.+..
T Consensus 3 eikplLIsCL~~q~~k~s~~KiL~~iVs~Va~~v~~~~~~~W~eL~d~Ils~~ 55 (174)
T PF04510_consen 3 EIKPLLISCLTMQETKESDFKILRRIVSHVAYEVFDLQEGGWDELSDCILSLS 55 (174)
T ss_pred chHHHHHHHHHhhcccHhHHHHHHHHHHHHHHHHHhcCCCCchhHHHHHHHhh
Confidence 47899999998765 356677777887777653 46 89999999998866
No 22
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=95.54 E-value=0.062 Score=33.11 Aligned_cols=97 Identities=16% Similarity=0.029 Sum_probs=66.9
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC---c
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG---K 82 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~---~ 82 (116)
+..+|..|...|.|.....-.. ...-.+..+=..++++|.+.+..++...+.+++.|+...-+. -
T Consensus 20 ~~~~~~~a~~~l~~l~~~~~~~------------~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~~~~~~~~~ 87 (120)
T cd00020 20 DENVQREAAWALSNLSAGNNDN------------IQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAGPEDNKLIV 87 (120)
T ss_pred CHHHHHHHHHHHHHHhcCCHHH------------HHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHccCcHHHHHHH
Confidence 3578888888888876652110 000011123346666676777899999999999998764221 2
Q ss_pred h-hHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 83 W-TQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 83 W-p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
+ .++++.+.+.++..+......++.+|..+|+
T Consensus 88 ~~~g~l~~l~~~l~~~~~~~~~~a~~~l~~l~~ 120 (120)
T cd00020 88 LEAGGVPKLVNLLDSSNEDIQKNATGALSNLAS 120 (120)
T ss_pred HHCCChHHHHHHHhcCCHHHHHHHHHHHHHhhC
Confidence 3 2479999999999899999999999998874
No 23
>PF12830 Nipped-B_C: Sister chromatid cohesion C-terminus
Probab=94.74 E-value=0.69 Score=31.96 Aligned_cols=68 Identities=13% Similarity=0.139 Sum_probs=58.8
Q ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 48 IRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 48 IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.=..++++..+++..+|....+++..|.+.-+-+=+ +-+|.++.+..+.|+.....|...+..+++||
T Consensus 9 yl~~Il~~~~~~~~~vr~~Al~~l~~il~qGLvnP~-~cvp~lIAL~ts~~~~ir~~A~~~l~~l~eK~ 76 (187)
T PF12830_consen 9 YLKNILELCLSSDDSVRLAALQVLELILRQGLVNPK-QCVPTLIALETSPNPSIRSRAYQLLKELHEKH 76 (187)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCChH-HHHhHhhhhhCCCChHHHHHHHHHHHHHHHHh
Confidence 334778888888899999999999999999875433 48999999999999999999999999999997
No 24
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=94.44 E-value=0.4 Score=29.84 Aligned_cols=67 Identities=19% Similarity=0.181 Sum_probs=47.2
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhc---cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKH---DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~---d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.|=.-++.++.+++..||-..++++..|++. +.....+++++.+...+...|++. ..|-..|-.++|
T Consensus 27 ~Il~pVL~~~~D~d~rVRy~AcEaL~ni~k~~~~~~l~~f~~IF~~L~kl~~D~d~~V-r~~a~~Ld~llk 96 (97)
T PF12755_consen 27 EILPPVLKCFDDQDSRVRYYACEALYNISKVARGEILPYFNEIFDALCKLSADPDENV-RSAAELLDRLLK 96 (97)
T ss_pred HHHHHHHHHcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCchhH-HHHHHHHHHHhc
Confidence 3444566777788899999999999988766 222356778888888777776654 444467766655
No 25
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=94.41 E-value=0.34 Score=37.86 Aligned_cols=71 Identities=14% Similarity=0.177 Sum_probs=59.6
Q ss_pred CHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhH-HHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 41 HEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQ-IVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 41 ~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~-Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
+++.-+.+-..+.+++.++.+.||+..+.++..|.+. +|+.-+. +++.+.+.+.+.|+..+.+|+.++..+
T Consensus 108 ~~~~~~~l~~~v~~ll~~~~~~VRk~A~~~l~~i~~~-~p~~~~~~~~~~l~~lL~d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 108 TPEMAEPLIPDVIKLLSDPSPYVRKKAALALLKIYRK-DPDLVEDELIPKLKQLLSDKDPSVVSAALSLLSEI 179 (526)
T ss_dssp SHHHHHHHHHHHHHHHHSSSHHHHHHHHHHHHHHHHH-CHCCHHGGHHHHHHHHTTHSSHHHHHHHHHHHHHH
T ss_pred ccchhhHHHHHHHHHhcCCchHHHHHHHHHHHHHhcc-CHHHHHHHHHHHHhhhccCCcchhHHHHHHHHHHH
Confidence 5566666777788888888889999999999999988 4776666 799999999888988888999988887
No 26
>PF09324 DUF1981: Domain of unknown function (DUF1981); InterPro: IPR015403 This domain is functionally uncharacterised and found in various plant and yeast protein transport proteins. It is noramlly associated with and C-termianl to the SEC7 domain. The SEC7 domain was named after the first protein found to contain such a region []. It has been shown to be linked with guanine nucleotide exchange function [, ].
Probab=94.33 E-value=0.46 Score=28.81 Aligned_cols=66 Identities=12% Similarity=0.127 Sum_probs=49.1
Q ss_pred HHHHHHHHHHHHhcC-cHHHHHHHHHHHHHHHhccCC---CchhHHHHHHHHhhCCCChHHHhHHHHHHHH
Q psy16982 45 KAMIRDAIVDAVVMA-PEVIRVQLAVCVSNIVKHDFP---GKWTQIVDKVSIYLQNPDATPWFGALLCLYQ 111 (116)
Q Consensus 45 k~~IK~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~d~P---~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~ 111 (116)
++.+|- +...+.+. +..+|..+..|+..+...-.. ..||.++..+-.....++...+..|..++..
T Consensus 16 ~~fL~P-f~~i~~~~~~~~vre~il~ci~qil~~~~~~i~SGW~~if~il~~aa~~~~e~lv~~af~~~~~ 85 (86)
T PF09324_consen 16 KDFLKP-FEYIMSNNPSIDVRELILECILQILQSRGENIKSGWKVIFSILRAAAKDNDESLVRLAFQIVQL 85 (86)
T ss_pred HHHHHH-HHHHHhccCcHHHHHHHHHHHHHHHHHhHHHHHhccHHHHHHHHHHHhCCCccHHHHHHHHHhh
Confidence 344443 33343444 368999999999998875443 5899999999998888888889999888764
No 27
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=94.16 E-value=0.15 Score=30.02 Aligned_cols=56 Identities=16% Similarity=0.087 Sum_probs=32.9
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHH-HhHHHHHH
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATP-WFGALLCL 109 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~-~~~~L~~L 109 (116)
..-..|++.+.+++..+|.+.+.+++.|. -|+.++.+...+.+++... ...+..+|
T Consensus 31 ~~~~~L~~~l~d~~~~vr~~a~~aL~~i~-------~~~~~~~L~~~l~~~~~~~vr~~a~~aL 87 (88)
T PF13646_consen 31 EAIPALIELLKDEDPMVRRAAARALGRIG-------DPEAIPALIKLLQDDDDEVVREAAAEAL 87 (88)
T ss_dssp HHHHHHHHHHTSSSHHHHHHHHHHHHCCH-------HHHTHHHHHHHHTC-SSHHHHHHHHHHH
T ss_pred hHHHHHHHHHcCCCHHHHHHHHHHHHHhC-------CHHHHHHHHHHHcCCCcHHHHHHHHhhc
Confidence 44455666665555677777777777663 3566777777666654333 44555544
No 28
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=93.96 E-value=0.7 Score=27.07 Aligned_cols=55 Identities=13% Similarity=0.106 Sum_probs=42.3
Q ss_pred HHHHHH-hcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 51 AIVDAV-VMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 51 ~ll~~l-~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
.|++.| .+++..+|...+.+++.+. =|+.++.+...++++|+.....+..+|..+
T Consensus 3 ~L~~~l~~~~~~~vr~~a~~~L~~~~-------~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i 58 (88)
T PF13646_consen 3 ALLQLLQNDPDPQVRAEAARALGELG-------DPEAIPALIELLKDEDPMVRRAAARALGRI 58 (88)
T ss_dssp HHHHHHHTSSSHHHHHHHHHHHHCCT-------HHHHHHHHHHHHTSSSHHHHHHHHHHHHCC
T ss_pred HHHHHHhcCCCHHHHHHHHHHHHHcC-------CHhHHHHHHHHHcCCCHHHHHHHHHHHHHh
Confidence 567777 4455899988888877332 247999999999999998888888887653
No 29
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=93.91 E-value=0.58 Score=28.60 Aligned_cols=65 Identities=18% Similarity=0.146 Sum_probs=50.0
Q ss_pred HHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchh-----HHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 49 RDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWT-----QIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 49 K~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp-----~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
-..+++.|.+.+..++...+.++..++... |..+. +.++.+.+.++++|+..+..++.+|..++.
T Consensus 9 i~~l~~~l~~~~~~~~~~a~~~l~~l~~~~-~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~ 78 (120)
T cd00020 9 LPALVSLLSSSDENVQREAAWALSNLSAGN-NDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAA 78 (120)
T ss_pred hHHHHHHHHcCCHHHHHHHHHHHHHHhcCC-HHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHcc
Confidence 345677777777889999999999998763 33232 566778888888889999999999998875
No 30
>PF13925 Katanin_con80: con80 domain of Katanin
Probab=93.87 E-value=0.26 Score=33.49 Aligned_cols=50 Identities=22% Similarity=0.259 Sum_probs=41.2
Q ss_pred HHHHHHHHHhccCCCch-----hHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 67 LAVCVSNIVKHDFPGKW-----TQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 67 l~~~i~~Ia~~d~P~~W-----p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+++++..+....=|+.| ++++|.+..+++|....++..||.+|..++++|
T Consensus 46 lvD~L~vl~~~~~~~~~tLd~c~~lLP~i~~LL~Sk~E~~i~~aL~~L~~i~~~f 100 (164)
T PF13925_consen 46 LVDVLSVLNQSLKPEKWTLDLCVDLLPLIEELLQSKYESYISVALEMLRSILKKF 100 (164)
T ss_pred HHHHHHHHHHhcCcCcccHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHH
Confidence 46677777744445556 589999999999999999999999999999876
No 31
>KOG1410|consensus
Probab=93.32 E-value=2.7 Score=35.19 Aligned_cols=94 Identities=20% Similarity=0.239 Sum_probs=76.0
Q ss_pred HHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhc-Cc---HHHHHHHHHHHHHHHhccCCC--
Q psy16982 8 PVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVM-AP---EVIRVQLAVCVSNIVKHDFPG-- 81 (116)
Q Consensus 8 ~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~-~~---~~i~~~l~~~i~~Ia~~d~P~-- 81 (116)
--..+|+..|-+.+.++- ++|-+.|--||..+++.+.+ .| +-+...+++.++.|-+..|-+
T Consensus 54 Ys~mlAst~L~Klvs~~t-------------~lpl~qrldir~Yilnylat~~Pk~~~fvi~sLiQl~arlTK~gW~d~~ 120 (1082)
T KOG1410|consen 54 YSQMLASTCLMKLVSRKT-------------PLPLEQRLDIRNYILNYLATGAPKLAPFVIQSLIQLFARLTKLGWFDQQ 120 (1082)
T ss_pred hHHHHHHHHHHHHHcCCC-------------CCcHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHhccccccc
Confidence 345667777777766543 58999999999999999998 33 457888999999999988762
Q ss_pred ----chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 82 ----KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 82 ----~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.+-+.+..+...++.++.+...-|+.+|..+..
T Consensus 121 k~~y~FRd~v~~~~kfl~~~~ve~~~igv~iLsqLvq 157 (1082)
T KOG1410|consen 121 KDEYVFRDPVDDVTKFLQMDNVEHCIIGVQILSQLVQ 157 (1082)
T ss_pred cccchhhhhHHHHHHHhccCchHHHHHHHHHHHHHHH
Confidence 356889999999999999999999999988764
No 32
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=93.01 E-value=0.14 Score=27.93 Aligned_cols=51 Identities=16% Similarity=0.089 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHhccCC---CchhHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 62 VIRVQLAVCVSNIVKHDFP---GKWTQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P---~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
.+|...+.+|+.++....+ ..=|++++.+...+.+++......+..+|..|
T Consensus 2 ~vR~~A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 2 RVRRAAAWALGRLAEGCPELLQPYLPELLPALIPLLQDDDDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence 4677778888776644322 25578999999999887777777888887653
No 33
>PTZ00429 beta-adaptin; Provisional
Probab=92.95 E-value=0.97 Score=37.84 Aligned_cols=110 Identities=10% Similarity=0.096 Sum_probs=71.1
Q ss_pred CCChHHHHHHHHHhhhhhcccccc---------ccCCCCCC---------ccCCCCHhHHHHHHHHHHHHHhcCcHHHHH
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVE---------KEVDPGTP---------LAFSLHEQDKAMIRDAIVDAVVMAPEVIRV 65 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~---------~~~~~~~~---------~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~ 65 (116)
+.+..+|.+.-+|+.++-+.+-.. .+-.+..+ ....-.++.-+.+=..+.+++.+.++-||+
T Consensus 79 S~d~elKKLvYLYL~~ya~~~pelalLaINtl~KDl~d~Np~IRaLALRtLs~Ir~~~i~e~l~~~lkk~L~D~~pYVRK 158 (746)
T PTZ00429 79 STDLELKKLVYLYVLSTARLQPEKALLAVNTFLQDTTNSSPVVRALAVRTMMCIRVSSVLEYTLEPLRRAVADPDPYVRK 158 (746)
T ss_pred CCCHHHHHHHHHHHHHHcccChHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHhcCCCHHHHH
Confidence 346788888888888876543320 00000000 000112333344445556666667789999
Q ss_pred HHHHHHHHHHhccCCCchh--HHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 66 QLAVCVSNIVKHDFPGKWT--QIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 66 ~l~~~i~~Ia~~d~P~~Wp--~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+.|+..|-+.+ |+.-+ ++++.+..++..+|+..+.+++.+|.+|++
T Consensus 159 tAalai~Kly~~~-pelv~~~~~~~~L~~LL~D~dp~Vv~nAl~aL~eI~~ 208 (746)
T PTZ00429 159 TAAMGLGKLFHDD-MQLFYQQDFKKDLVELLNDNNPVVASNAAAIVCEVND 208 (746)
T ss_pred HHHHHHHHHHhhC-cccccccchHHHHHHHhcCCCccHHHHHHHHHHHHHH
Confidence 9999999997764 54332 456777788889999999999999999875
No 34
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=92.92 E-value=0.44 Score=22.98 Aligned_cols=28 Identities=18% Similarity=0.332 Sum_probs=23.3
Q ss_pred HHHHHHHhcCcHHHHHHHHHHHHHHHhc
Q psy16982 50 DAIVDAVVMAPEVIRVQLAVCVSNIVKH 77 (116)
Q Consensus 50 ~~ll~~l~~~~~~i~~~l~~~i~~Ia~~ 77 (116)
..+++++.+++..||...+.++..|+++
T Consensus 3 p~l~~~l~D~~~~VR~~a~~~l~~i~~~ 30 (31)
T PF02985_consen 3 PILLQLLNDPSPEVRQAAAECLGAIAEH 30 (31)
T ss_dssp HHHHHHHT-SSHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHcCCCCHHHHHHHHHHHHHHHhh
Confidence 4577888888899999999999999864
No 35
>PF12530 DUF3730: Protein of unknown function (DUF3730) ; InterPro: IPR022542 This domain is found in eukaryotes, and is typically between 220 and 262 amino acids in length.
Probab=92.72 E-value=0.39 Score=34.35 Aligned_cols=51 Identities=22% Similarity=0.186 Sum_probs=42.5
Q ss_pred HHHHHHHHHHHHHhccCCCchhHHHHHHHHhh-CCCChHHHhHHHHHHHHHHh
Q psy16982 63 IRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL-QNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 63 i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l-~~~~~~~~~~~L~~L~~l~~ 114 (116)
..-..+.++..|+..+ |+.|+++++.|.+.+ +++++...-.++..+..+|+
T Consensus 100 ~~i~~a~s~~~ic~~~-p~~g~~ll~~ls~~L~~~~~~~~~alale~l~~Lc~ 151 (234)
T PF12530_consen 100 CLISIAASIRDICCSR-PDHGVDLLPLLSGCLNQSCDEVAQALALEALAPLCE 151 (234)
T ss_pred HHHHHHHHHHHHHHhC-hhhHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHH
Confidence 3444567888898885 889999999999999 67777777889999999985
No 36
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=92.26 E-value=2 Score=29.30 Aligned_cols=77 Identities=19% Similarity=0.217 Sum_probs=51.8
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCch
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW 83 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W 83 (116)
+.+..||..|.+.|.+.+....-... ..+=..++.++.++++.||.....++..++..-.|+--
T Consensus 36 D~~~~VR~~al~~Ls~Li~~d~ik~k----------------~~l~~~~l~~l~D~~~~Ir~~A~~~~~e~~~~~~~~~i 99 (178)
T PF12717_consen 36 DEDPLVRKTALLVLSHLILEDMIKVK----------------GQLFSRILKLLVDENPEIRSLARSFFSELLKKRNPNII 99 (178)
T ss_pred CCCHHHHHHHHHHHHHHHHcCceeeh----------------hhhhHHHHHHHcCCCHHHHHHHHHHHHHHHHhccchHH
Confidence 44679999999999999987654322 12223455666666677777777777777777666655
Q ss_pred hHHHHHHHHhhCC
Q psy16982 84 TQIVDKVSIYLQN 96 (116)
Q Consensus 84 p~Ll~~l~~~l~~ 96 (116)
.+.++.++..+++
T Consensus 100 ~~~~~e~i~~l~~ 112 (178)
T PF12717_consen 100 YNNFPELISSLNN 112 (178)
T ss_pred HHHHHHHHHHHhC
Confidence 6666666665554
No 37
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=92.07 E-value=0.56 Score=30.41 Aligned_cols=73 Identities=14% Similarity=0.284 Sum_probs=51.4
Q ss_pred CCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCCc---h--hHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPGK---W--TQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~~---W--p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
.+.+++-+.+| .|+++|..+. +..-.-.|.=|+..+++ +|.. . -..=+.++++++++|++..+-||.|+..+
T Consensus 36 kf~~~~~~llk-~L~~lL~~s~d~~~laVac~Dig~~vr~-~p~gr~ii~~lg~K~~vM~Lm~h~d~eVr~eAL~avQkl 113 (119)
T PF11698_consen 36 KFEENNFELLK-KLIKLLDKSDDPTTLAVACHDIGEFVRH-YPNGRNIIEKLGAKERVMELMNHEDPEVRYEALLAVQKL 113 (119)
T ss_dssp GGSSGGGHHHH-HHHHHH-SHHHHHHHHHHHHHHHHHHHH--GGGHHHHHHHSHHHHHHHHTS-SSHHHHHHHHHHHHHH
T ss_pred HHHHcccHHHH-HHHHHHccCCCcceeehhhcchHHHHHH-ChhHHHHHHhcChHHHHHHHhcCCCHHHHHHHHHHHHHH
Confidence 56666667777 6788885543 55666677888888888 6742 1 12345678889999999999999999988
Q ss_pred H
Q psy16982 113 V 113 (116)
Q Consensus 113 ~ 113 (116)
+
T Consensus 114 m 114 (119)
T PF11698_consen 114 M 114 (119)
T ss_dssp H
T ss_pred H
Confidence 6
No 38
>PF01465 GRIP: GRIP domain; InterPro: IPR000237 The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain [, , ] is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi []. The GRIP domain contains a completely conserved tyrosine residue.; GO: 0005515 protein binding, 0000042 protein targeting to Golgi; PDB: 1R4A_H 1UPT_B.
Probab=92.02 E-value=0.57 Score=25.11 Aligned_cols=36 Identities=11% Similarity=0.202 Sum_probs=27.6
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhcc
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHD 78 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d 78 (116)
.+.+++|+-+++.|...+...|.++..+|+.|.+.+
T Consensus 3 ~~~eYLKNvl~~fl~~~~~~~~~~llpvi~tlL~fs 38 (46)
T PF01465_consen 3 INLEYLKNVLLQFLESREPSEREQLLPVIATLLKFS 38 (46)
T ss_dssp HHHHHHHHHHHHHHTTSS---HHHHHHHHHHHTT--
T ss_pred hhHHHHHHHHHHHhcCCchhhHHHHHHHHHHHHCCC
Confidence 467899999999999988888999999999998765
No 39
>PF08167 RIX1: rRNA processing/ribosome biogenesis
Probab=91.27 E-value=3.3 Score=28.00 Aligned_cols=96 Identities=17% Similarity=0.200 Sum_probs=61.9
Q ss_pred ChHHHHHHHHHhhhhhccc-cccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhc--cCCC
Q psy16982 6 DMPVRQAGVVYLKNLITNQ-WVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKH--DFPG 81 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~-w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~--d~P~ 81 (116)
+..-|..|...+|-.+... |. .+.+.-..-++ .++..|.+++ ..+....+.+++.|... +||+
T Consensus 38 ~~~~rw~G~~Ll~~~~~~~~~e------------~l~~~~~~W~~-~Ll~~L~~~~~~~~~~~ai~~L~~l~~~~~~~p~ 104 (165)
T PF08167_consen 38 SAYSRWAGLCLLKVTVEQCSWE------------ILLSHGSQWLR-ALLSILEKPDPPSVLEAAIITLTRLFDLIRGKPT 104 (165)
T ss_pred ChhhHHHHHHHHHHHHHHhhHH------------HHHHHHHHHHH-HHHHHHcCCCCHHHHHHHHHHHHHHHHHhcCCCc
Confidence 4567888888888888876 75 34444445555 4788887755 57788888888888754 3332
Q ss_pred --------chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 82 --------KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 82 --------~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.=|.+++.+++.++. ......+|.+|..+++.|
T Consensus 105 l~Rei~tp~l~~~i~~ll~l~~~--~~~~~~~l~~L~~ll~~~ 145 (165)
T PF08167_consen 105 LTREIATPNLPKFIQSLLQLLQD--SSCPETALDALATLLPHH 145 (165)
T ss_pred hHHHHhhccHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHC
Confidence 223344444444443 345667888888887765
No 40
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=91.15 E-value=2.3 Score=26.11 Aligned_cols=66 Identities=15% Similarity=0.169 Sum_probs=46.1
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHh-ccCC-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVK-HDFP-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~-~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
..+.-+.++...+|.+--..++.... .+.+ ...|.++.-+.+.+..+|+=.-.+++.+|..++..|
T Consensus 7 ~al~~L~dp~~PvRa~gL~~L~~Li~~~~~~~~~~~~il~l~l~~L~d~DsyVYL~aI~~L~~La~~~ 74 (92)
T PF10363_consen 7 EALSDLNDPLPPVRAHGLVLLRKLIESKSEPVIDIPKILDLFLSQLKDEDSYVYLNAIKGLAALADRH 74 (92)
T ss_pred HHHHHccCCCcchHHHHHHHHHHHHHcCCcchhhHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHHHC
Confidence 34455556667899887666666654 4535 788999999999999887755556777666666543
No 41
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=90.03 E-value=7.8 Score=30.29 Aligned_cols=68 Identities=19% Similarity=0.191 Sum_probs=52.1
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 46 AMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
-.+=+.+.+-|.++++.+|.....+++.|+. |+--|.+++.+.+.+.++++.....|+.++.++++.+
T Consensus 78 ~l~~n~l~kdl~~~n~~~~~lAL~~l~~i~~---~~~~~~l~~~v~~ll~~~~~~VRk~A~~~l~~i~~~~ 145 (526)
T PF01602_consen 78 ILIINSLQKDLNSPNPYIRGLALRTLSNIRT---PEMAEPLIPDVIKLLSDPSPYVRKKAALALLKIYRKD 145 (526)
T ss_dssp HHHHHHHHHHHCSSSHHHHHHHHHHHHHH-S---HHHHHHHHHHHHHHHHSSSHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHhhcCCCHHHHHHHHhhhhhhcc---cchhhHHHHHHHHHhcCCchHHHHHHHHHHHHHhccC
Confidence 3344555555666667888877777777774 4568999999999999999988889999999998753
No 42
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=89.84 E-value=3.2 Score=26.06 Aligned_cols=50 Identities=18% Similarity=0.110 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
...-..+++++. +.|...++.+..|...+.+.++..++-+|.+|.++++.
T Consensus 18 ~~~i~~i~d~~~-----~~~~~~~~~~~~l~kRl~~~~~~~~lkaL~lLe~lvkN 67 (115)
T cd00197 18 WPLIMEICDLIN-----ETNVGPKEAVDAIKKRINNKNPHVVLKALTLLEYCVKN 67 (115)
T ss_pred HHHHHHHHHHHH-----CCCccHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHH
Confidence 344445555543 34788899999999999999999999999999999874
No 43
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=89.83 E-value=0.88 Score=23.18 Aligned_cols=28 Identities=21% Similarity=0.141 Sum_probs=20.5
Q ss_pred HHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 86 IVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 86 Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
.+|.|++++.++|...+..++.+|+.++
T Consensus 13 ~i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 13 GIPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp HHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred cHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 5677777777777777777777777665
No 44
>smart00755 Grip golgin-97, RanBP2alpha,Imh1p and p230/golgin-245.
Probab=88.81 E-value=1.2 Score=23.89 Aligned_cols=34 Identities=9% Similarity=0.258 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhcc
Q psy16982 44 DKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHD 78 (116)
Q Consensus 44 ~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d 78 (116)
+.+++|+-+++.|...+.. |.++..+|+.+.+.+
T Consensus 3 n~eYLKNVll~fl~~~e~~-r~~ll~vi~tlL~fs 36 (46)
T smart00755 3 NFEYLKNVLLQFLTLRESE-RETLLKVISTVLQLS 36 (46)
T ss_pred cHHHHHHHHHHHhccCcch-HHHHHHHHHHHhCCC
Confidence 4578999999999887755 899999999988775
No 45
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=88.48 E-value=3.9 Score=32.58 Aligned_cols=98 Identities=17% Similarity=0.173 Sum_probs=69.2
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC--CC
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF--PG 81 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~--P~ 81 (116)
+.+..|...|+-.|++..+..-.- . ..++++. ...|-..+.+++..+|-.+.+++..|++..- -.
T Consensus 130 ~~d~~Va~~A~~~L~~l~~~~~~~-~--------~l~~~~~----~~~L~~l~~~~~~~vR~Rv~el~v~i~~~S~~~~~ 196 (503)
T PF10508_consen 130 DPDLSVAKAAIKALKKLASHPEGL-E--------QLFDSNL----LSKLKSLMSQSSDIVRCRVYELLVEIASHSPEAAE 196 (503)
T ss_pred CCcHHHHHHHHHHHHHHhCCchhH-H--------HHhCcch----HHHHHHHHhccCHHHHHHHHHHHHHHHhcCHHHHH
Confidence 456788888999999888642210 0 0122222 4556666666677888889999999987741 12
Q ss_pred chh--HHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 82 KWT--QIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 82 ~Wp--~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..- ++++.++..+.++|.-...+++.+|..+++
T Consensus 197 ~~~~sgll~~ll~eL~~dDiLvqlnalell~~La~ 231 (503)
T PF10508_consen 197 AVVNSGLLDLLLKELDSDDILVQLNALELLSELAE 231 (503)
T ss_pred HHHhccHHHHHHHHhcCccHHHHHHHHHHHHHHHc
Confidence 222 499999999999888888999999999876
No 46
>KOG1062|consensus
Probab=88.46 E-value=8.9 Score=32.52 Aligned_cols=112 Identities=10% Similarity=0.035 Sum_probs=74.8
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCC--CCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDP--GTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF 79 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~--~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~ 79 (116)
+=..+...|-+|+=.|-+++...-..-.+.. +-..-.+.+++.-++=|..+++||.+++..||+-.-+....|...
T Consensus 303 ~I~~~~~LrvlainiLgkFL~n~d~NirYvaLn~L~r~V~~d~~avqrHr~tIleCL~DpD~SIkrralELs~~lvn~-- 380 (866)
T KOG1062|consen 303 DIRSNSGLRVLAINILGKFLLNRDNNIRYVALNMLLRVVQQDPTAVQRHRSTILECLKDPDVSIKRRALELSYALVNE-- 380 (866)
T ss_pred hccCCchHHHHHHHHHHHHhcCCccceeeeehhhHHhhhcCCcHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHhcc--
Confidence 3345668899999888888876655433210 001123567778888899999999999999988877777777665
Q ss_pred CCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 80 PGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 80 P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.+=-.++.+++..+.+.|++..-..-.-+-.+.|+|
T Consensus 381 -~Nv~~mv~eLl~fL~~~d~~~k~~~as~I~~laEkf 416 (866)
T KOG1062|consen 381 -SNVRVMVKELLEFLESSDEDFKADIASKIAELAEKF 416 (866)
T ss_pred -ccHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHhc
Confidence 355667777777777766655444444444555554
No 47
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=87.98 E-value=5.7 Score=31.64 Aligned_cols=97 Identities=9% Similarity=0.079 Sum_probs=66.7
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCch
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW 83 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W 83 (116)
+.+..||.+|.-.+.+.+...-.... .+.+ ..+=..++.++.+++..+....+.++..|+++. ..-
T Consensus 88 h~~~~Vr~l~l~~l~~~~~~~~~~~~---------~~~~---~~l~~~i~~~L~~~d~~Va~~A~~~L~~l~~~~--~~~ 153 (503)
T PF10508_consen 88 HPSPKVRRLALKQLGRIARHSEGAAQ---------LLVD---NELLPLIIQCLRDPDLSVAKAAIKALKKLASHP--EGL 153 (503)
T ss_pred CCCHHHHHHHHHHHHHHhcCCHHHHH---------HhcC---ccHHHHHHHHHcCCcHHHHHHHHHHHHHHhCCc--hhH
Confidence 34568999999888888754321000 0111 124456888888888999999999999999864 333
Q ss_pred hHH-----HHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 84 TQI-----VDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 84 p~L-----l~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+ +..+.+.+..+|.......+.++-.+++
T Consensus 154 ~~l~~~~~~~~L~~l~~~~~~~vR~Rv~el~v~i~~ 189 (503)
T PF10508_consen 154 EQLFDSNLLSKLKSLMSQSSDIVRCRVYELLVEIAS 189 (503)
T ss_pred HHHhCcchHHHHHHHHhccCHHHHHHHHHHHHHHHh
Confidence 444 7888888877666666677777777664
No 48
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=87.63 E-value=4.1 Score=31.51 Aligned_cols=54 Identities=19% Similarity=0.312 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHHHHhccCC-----CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFP-----GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P-----~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..++...-.+++.|.++ -| .+=|+|+|-+++.++.+|.+...++|.+|..+++.
T Consensus 337 ~~~k~~yL~ALs~ll~~-vP~~vl~~~l~~LlPLLlqsL~~~~~~v~~s~L~tL~~~l~~ 395 (415)
T PF12460_consen 337 DEIKSNYLTALSHLLKN-VPKSVLLPELPTLLPLLLQSLSLPDADVLLSSLETLKMILEE 395 (415)
T ss_pred hhhHHHHHHHHHHHHhh-CCHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHc
Confidence 34777888888888885 46 26789999999999999999999999999988763
No 49
>PTZ00429 beta-adaptin; Provisional
Probab=87.50 E-value=5.4 Score=33.57 Aligned_cols=56 Identities=7% Similarity=0.081 Sum_probs=44.0
Q ss_pred CcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 59 APEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 59 ~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.+..++.....+|+.+|.. ||......++.|++++..+. +.+..++.+++.|+++|
T Consensus 380 ~D~ef~r~aIrAIg~lA~k-~~~~a~~cV~~Ll~ll~~~~-~~v~e~i~vik~Ilrky 435 (746)
T PTZ00429 380 VDMVFVVEVVRAIASLAIK-VDSVAPDCANLLLQIVDRRP-ELLPQVVTAAKDIVRKY 435 (746)
T ss_pred CCHHHHHHHHHHHHHHHHh-ChHHHHHHHHHHHHHhcCCc-hhHHHHHHHHHHHHHHC
Confidence 3456778888889999855 78888999999999987643 34556788999998887
No 50
>PF07571 DUF1546: Protein of unknown function (DUF1546); InterPro: IPR011442 These proteins are associated with IPR004823 from INTERPRO in transcription initiation factor TFIID subunit 6 (TAF6).; GO: 0051090 regulation of sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=86.87 E-value=4.2 Score=24.90 Aligned_cols=51 Identities=18% Similarity=0.302 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhh----CC--CChHHHhHHHHHHHHH
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL----QN--PDATPWFGALLCLYQL 112 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l----~~--~~~~~~~~~L~~L~~l 112 (116)
-.+|...+.+++.|++. |-...|+|-+.+...+ .+ .+....+||+..|..+
T Consensus 20 ~~LRd~AA~lL~~I~~~-~~~~~~~L~~Ri~~tl~k~l~d~~~~~~t~YGAi~gL~~l 76 (92)
T PF07571_consen 20 WALRDFAASLLAQICRK-FSSSYPTLQPRITRTLLKALLDPKKPLGTHYGAIVGLSAL 76 (92)
T ss_pred HHHHHHHHHHHHHHHHH-hccccchHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHH
Confidence 37999999999999977 6666777666665544 32 3557899999998876
No 51
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=85.69 E-value=9.7 Score=26.44 Aligned_cols=67 Identities=16% Similarity=0.149 Sum_probs=45.4
Q ss_pred HHHHHHHHhcCcHHHHHHHHHHHHHHHhccC------CC--chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 49 RDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF------PG--KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 49 K~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~------P~--~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
...+..++.+.++.+|...+..+..+...-- .. ..+.+.+.+...++.+|++....|-.++..+.+.
T Consensus 133 ~~~l~~~~~~Kn~~vR~~~~~~l~~~l~~~~~~~~~l~~~~~~~~l~~~l~~~l~D~~~~VR~~Ar~~~~~l~~~ 207 (228)
T PF12348_consen 133 LEILSQGLKSKNPQVREECAEWLAIILEKWGSDSSVLQKSAFLKQLVKALVKLLSDADPEVREAARECLWALYSH 207 (228)
T ss_dssp HHHHHHHTT-S-HHHHHHHHHHHHHHHTT-----GGG--HHHHHHHHHHHHHHHTSS-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHHHccchHhhhcccchHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH
Confidence 4444445555567899999988888876532 11 2477999999999999998888888887776653
No 52
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=85.53 E-value=2 Score=21.20 Aligned_cols=28 Identities=18% Similarity=0.039 Sum_probs=17.3
Q ss_pred HHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 86 IVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 86 Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
.++.++++++++++..+..++.+|+.++
T Consensus 13 ~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 13 GLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred CHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 4555666666666666666666666554
No 53
>PF05918 API5: Apoptosis inhibitory protein 5 (API5); InterPro: IPR008383 This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterised by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in Homo sapiens chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors [,].; PDB: 3U0R_A 3V6A_A.
Probab=85.00 E-value=13 Score=30.39 Aligned_cols=84 Identities=17% Similarity=0.177 Sum_probs=50.4
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhH
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQ 85 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~ 85 (116)
+...+++||-.+-++++.+-+ +.+ .-=+.++.+.-+++..||.+....|-.+++.. |+.=+-
T Consensus 35 ~~k~K~Laaq~I~kffk~FP~-------------l~~----~Ai~a~~DLcEDed~~iR~~aik~lp~~ck~~-~~~v~k 96 (556)
T PF05918_consen 35 SPKEKRLAAQFIPKFFKHFPD-------------LQE----EAINAQLDLCEDEDVQIRKQAIKGLPQLCKDN-PEHVSK 96 (556)
T ss_dssp -HHHHHHHHHHHHHHHCC-GG-------------GHH----HHHHHHHHHHT-SSHHHHHHHHHHGGGG--T---T-HHH
T ss_pred CHHHHHHHHHHHHHHHhhChh-------------hHH----HHHHHHHHHHhcccHHHHHHHHHhHHHHHHhH-HHHHhH
Confidence 456788888766665554332 221 11246677777778899999888888888875 544588
Q ss_pred HHHHHHHhhCCCChHH---HhHHHH
Q psy16982 86 IVDKVSIYLQNPDATP---WFGALL 107 (116)
Q Consensus 86 Ll~~l~~~l~~~~~~~---~~~~L~ 107 (116)
+.+.|+|+|+++++.. +..+|.
T Consensus 97 vaDvL~QlL~tdd~~E~~~v~~sL~ 121 (556)
T PF05918_consen 97 VADVLVQLLQTDDPVELDAVKNSLM 121 (556)
T ss_dssp HHHHHHHHTT---HHHHHHHHHHHH
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHH
Confidence 9999999999987643 444553
No 54
>PF03810 IBN_N: Importin-beta N-terminal domain; InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=84.99 E-value=5.2 Score=22.81 Aligned_cols=41 Identities=24% Similarity=0.392 Sum_probs=20.5
Q ss_pred HHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 72 SNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 72 ~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..+...+ |+-|..|+..+.+ .+.+......|...|+..+++
T Consensus 6 ~~~~~~~-p~~~~~l~~il~~--~~~~~~~R~~A~i~LKn~I~~ 46 (77)
T PF03810_consen 6 KQFQKQN-PGFWQYLLQILSS--NSQDPEVRQLAAILLKNLIKK 46 (77)
T ss_dssp HHHHHSC-TCHHHHHHHHHHC--TTSCHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHH-hhHHHHHHHHHHc--cCCCHHHHHHHHHHHHHHHHH
Confidence 3444444 5555544444433 233555555666666655544
No 55
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=84.87 E-value=8.4 Score=28.36 Aligned_cols=74 Identities=8% Similarity=0.083 Sum_probs=49.6
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.....+-+.+.+.+.+.+..++...++.++++.-.+--..||+++..|+-..=+..-.....--+||..++..|
T Consensus 110 ~~~~~l~~~l~~~l~~~~~~~~~~a~EGl~KLlL~~~i~~~~~vL~~Lll~yF~p~t~~~~~LrQ~L~~Ffp~y 183 (298)
T PF12719_consen 110 VDSKSLLKILTKFLDSENPELQAIAVEGLCKLLLSGRISDPPKVLSRLLLLYFNPSTEDNQRLRQCLSVFFPVY 183 (298)
T ss_pred chHhHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcCcccCCcHHHHHHHHHHHHHH
Confidence 34456777788888888889999999999999877765557999999976543321111112334455555443
No 56
>KOG1824|consensus
Probab=84.57 E-value=10 Score=33.06 Aligned_cols=92 Identities=9% Similarity=0.122 Sum_probs=61.0
Q ss_pred CCCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC
Q psy16982 1 MMSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 1 ~~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P 80 (116)
|++.-...+|.+|-+.|-..=.+ .+..-.+.+++.+++++.++..-+++..+.++..++.-
T Consensus 827 ~s~~s~~~ikvfa~LslGElgr~----------------~~~s~~~e~~~~iieaf~sp~edvksAAs~ALGsl~vg--- 887 (1233)
T KOG1824|consen 827 QSPKSSDSIKVFALLSLGELGRR----------------KDLSPQNELKDTIIEAFNSPSEDVKSAASYALGSLAVG--- 887 (1233)
T ss_pred hCCCCchhHHHHHHhhhhhhccC----------------CCCCcchhhHHHHHHHcCCChHHHHHHHHHHhhhhhcC---
Confidence 34445566777777666433222 12222245777999999999999999999999999875
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
+-|+++|++...+.++ +...|--|..|+++.
T Consensus 888 -nl~~yLpfil~qi~sq-pk~QyLLLhSlkevi 918 (1233)
T KOG1824|consen 888 -NLPKYLPFILEQIESQ-PKRQYLLLHSLKEVI 918 (1233)
T ss_pred -chHhHHHHHHHHHhcc-hHhHHHHHHHHHHHH
Confidence 4557777777777653 555555555555554
No 57
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=84.04 E-value=4.4 Score=30.05 Aligned_cols=65 Identities=14% Similarity=0.251 Sum_probs=44.2
Q ss_pred HHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchh---HHHHHHHHhhCC----CChHHHhHHHHHHHHHHh
Q psy16982 50 DAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWT---QIVDKVSIYLQN----PDATPWFGALLCLYQLVK 114 (116)
Q Consensus 50 ~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp---~Ll~~l~~~l~~----~~~~~~~~~L~~L~~l~~ 114 (116)
..++..+.+++..++.+.+.+++.++...-+..=. +.++.+++.+.+ ++.+....|+.+|..+++
T Consensus 108 ~~fl~ll~~~D~~i~~~a~~iLt~Ll~~~~~~~~~~~~~~l~~ll~~L~~~l~~~~~~~~~~av~~L~~LL~ 179 (312)
T PF03224_consen 108 SPFLKLLDRNDSFIQLKAAFILTSLLSQGPKRSEKLVKEALPKLLQWLSSQLSSSDSELQYIAVQCLQNLLR 179 (312)
T ss_dssp HHHHHH-S-SSHHHHHHHHHHHHHHHTSTTT--HHHHHHHHHHHHHHHH-TT-HHHH---HHHHHHHHHHHT
T ss_pred HHHHHHhcCCCHHHHHHHHHHHHHHHHcCCccccchHHHHHHHHHHHHHHhhcCCCcchHHHHHHHHHHHhC
Confidence 45566666677899999999999999887554444 778888877765 344556889999988875
No 58
>PF14225 MOR2-PAG1_C: Cell morphogenesis C-terminal
Probab=83.54 E-value=7.4 Score=28.51 Aligned_cols=50 Identities=24% Similarity=0.079 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 65 VQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 65 ~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.=+..+++.|..+-||+.+.+.+..+.+.+.++-.......|.+|+.++.
T Consensus 168 dfl~~v~~~l~~~f~P~~~~~~l~~Ll~lL~n~~~w~~~~~L~iL~~ll~ 217 (262)
T PF14225_consen 168 DFLSQVVSYLREAFFPDHEFQILTFLLGLLENGPPWLRRKTLQILKVLLP 217 (262)
T ss_pred HHHHHHHHHHHHHhCchhHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHhc
Confidence 33566778888887899999999999999999888888889999988764
No 59
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=82.30 E-value=3.9 Score=25.35 Aligned_cols=35 Identities=20% Similarity=0.222 Sum_probs=30.1
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..-+++++.++.+++.+|....+.+..+|+.++|.
T Consensus 23 ~~l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~k~ 57 (97)
T PF12755_consen 23 KYLDEILPPVLKCFDDQDSRVRYYACEALYNISKV 57 (97)
T ss_pred HHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHHH
Confidence 45578888889999999999999999999998874
No 60
>cd03571 ENTH_epsin ENTH domain, Epsin family; The epsin (Eps15 interactor) N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that E/ANTH domains are univ
Probab=81.97 E-value=5.7 Score=25.77 Aligned_cols=45 Identities=20% Similarity=0.278 Sum_probs=32.7
Q ss_pred HHHHHhccCC-CchhHHHHHHHHhhCCC--ChHHHhHHHHHHHHHHhh
Q psy16982 71 VSNIVKHDFP-GKWTQIVDKVSIYLQNP--DATPWFGALLCLYQLVKN 115 (116)
Q Consensus 71 i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~--~~~~~~~~L~~L~~l~~~ 115 (116)
+..||..-|- ..+++.+..|...|.+. +-..++-+|.+|.++++.
T Consensus 22 m~eIa~~t~~~~~~~~Im~~l~kRL~~~~k~WR~vyKaL~lleyLl~n 69 (123)
T cd03571 22 MAEIARATYNYVEFQEIMSMLWKRLNDKGKNWRHVYKALTLLEYLLKN 69 (123)
T ss_pred HHHHHHHhCCHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHh
Confidence 3444443333 68888999998888765 667888999999988763
No 61
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=81.94 E-value=4.8 Score=21.53 Aligned_cols=54 Identities=19% Similarity=0.133 Sum_probs=31.6
Q ss_pred HHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHH
Q psy16982 8 PVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNI 74 (116)
Q Consensus 8 ~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~I 74 (116)
.+|..|+-.|-+.....-. ........+=..|+.+|.++++.||...+.+++.|
T Consensus 2 ~vR~~A~~aLg~l~~~~~~-------------~~~~~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 2 RVRRAAAWALGRLAEGCPE-------------LLQPYLPELLPALIPLLQDDDDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHCTTTTTHH-------------HHHHHHHHHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHhhHhcccHH-------------HHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence 5788888888763222111 11112233444555566566679999999888754
No 62
>PF13251 DUF4042: Domain of unknown function (DUF4042)
Probab=81.32 E-value=2.1 Score=29.70 Aligned_cols=54 Identities=19% Similarity=0.350 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHhc----cCCCchhHHHHHH-----------HH-hhCCCChHHHhHHHHHHHHHHhh
Q psy16982 62 VIRVQLAVCVSNIVKH----DFPGKWTQIVDKV-----------SI-YLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~----d~P~~Wp~Ll~~l-----------~~-~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+||.....++..|++. .+.+.||.|+|+- +. .+.+.++....+++.+|..+++.
T Consensus 1 kvR~~Al~~L~al~k~~~~r~l~~yW~~llP~~~~~~~~~~~sLlt~il~Dp~~kvR~aA~~~l~~lL~g 70 (182)
T PF13251_consen 1 KVRQAALQCLQALAKSTDKRSLFGYWPALLPDSVLQGRPATPSLLTCILKDPSPKVRAAAASALAALLEG 70 (182)
T ss_pred ChhHHHHHHHHHHHHhcCCceeHhhHHHHCCCCCCcCCCCCcchhHHHHcCCchhHHHHHHHHHHHHHHc
Confidence 3677788888888887 3447999999988 11 11122445677888888887753
No 63
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=80.59 E-value=4.4 Score=26.34 Aligned_cols=36 Identities=11% Similarity=0.166 Sum_probs=30.2
Q ss_pred CCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 80 PGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 80 P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
|..-.++++.|...|+..++....-+|.+|.++|.+
T Consensus 33 ~~~~~ei~d~L~kRL~~~~~hVK~K~Lrilk~l~~~ 68 (122)
T cd03572 33 VGSCQELLEYLLKRLKRSSPHVKLKVLKIIKHLCEK 68 (122)
T ss_pred HHHHHHHHHHHHHHhcCCCCcchHHHHHHHHHHHhh
Confidence 356778999999999988877778899999999874
No 64
>PF10521 DUF2454: Protein of unknown function (DUF2454); InterPro: IPR018870 Putative protein of unknown function; subunit of the ASTRA complex which is part of the chromatin remodeling machinery; similar to Schizosaccharomyces pombe (Fission yeast) Tti2p; may interact with Rsm23p [].
Probab=80.17 E-value=2.9 Score=30.71 Aligned_cols=49 Identities=18% Similarity=0.196 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHhccCC---CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 65 VQLAVCVSNIVKHDFP---GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 65 ~~l~~~i~~Ia~~d~P---~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..++-++. ...+.| ..||-++|.++..++..++..+..|+.+|+.++++
T Consensus 98 ~~l~w~v~--~~~~~~~i~~~~~liiP~iL~llDD~~~~~K~~G~~lL~~ll~~ 149 (282)
T PF10521_consen 98 HVLSWIVL--SQLDRPWISQHWPLIIPPILNLLDDYSPEIKIQGCQLLHHLLEK 149 (282)
T ss_pred HHHHHHHH--hcCCcchHHHhhhHHHhhHHHHhcCCCHHHHHHHHHHHHHHHHh
Confidence 45555555 233444 69999999999999999999999999999999874
No 65
>COG5096 Vesicle coat complex, various subunits [Intracellular trafficking and secretion]
Probab=79.70 E-value=13 Score=31.41 Aligned_cols=56 Identities=14% Similarity=0.186 Sum_probs=42.6
Q ss_pred cCcHHHHHHHHHHHHHHHhccCCCchhHH-HHHHH-HhhCCCChHHHhHHHHHHHHHHh
Q psy16982 58 MAPEVIRVQLAVCVSNIVKHDFPGKWTQI-VDKVS-IYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 58 ~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~L-l~~l~-~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
+....||+..+.||..+-+.| |+..+++ +..+. .++..+||..+-+|+.+|+.+..
T Consensus 138 d~~ayVRk~Aalav~kly~ld-~~l~~~~g~~~~l~~l~~D~dP~Vi~nAl~sl~~i~~ 195 (757)
T COG5096 138 DPHAYVRKTAALAVAKLYRLD-KDLYHELGLIDILKELVADSDPIVIANALASLAEIDP 195 (757)
T ss_pred CCcHHHHHHHHHHHHHHHhcC-HhhhhcccHHHHHHHHhhCCCchHHHHHHHHHHHhch
Confidence 344689999999999998887 6677766 44444 45557889999999999888765
No 66
>COG5096 Vesicle coat complex, various subunits [Intracellular trafficking and secretion]
Probab=79.51 E-value=13 Score=31.32 Aligned_cols=88 Identities=19% Similarity=0.132 Sum_probs=61.3
Q ss_pred HHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhh
Q psy16982 15 VYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL 94 (116)
Q Consensus 15 i~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l 94 (116)
+-||+.+--+|..- ....++.--..=+.+..-+.++.+.||.-. |..+....-|+-|+.+++.+.+++
T Consensus 69 ~ElKrL~ylYl~~y---------ak~~P~~~lLavNti~kDl~d~N~~iR~~A---lR~ls~l~~~el~~~~~~~ik~~l 136 (757)
T COG5096 69 VELKRLLYLYLERY---------AKLKPELALLAVNTIQKDLQDPNEEIRGFA---LRTLSLLRVKELLGNIIDPIKKLL 136 (757)
T ss_pred HHHHHHHHHHHHHH---------hccCHHHHHHHHHHHHhhccCCCHHHHHHH---HHHHHhcChHHHHHHHHHHHHHHc
Confidence 56777777777642 234444333333455556666778999874 444555555567999999999999
Q ss_pred CCCChHHHhHHHHHHHHHHh
Q psy16982 95 QNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 95 ~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+++-....|..|+.++++
T Consensus 137 ~d~~ayVRk~Aalav~kly~ 156 (757)
T COG5096 137 TDPHAYVRKTAALAVAKLYR 156 (757)
T ss_pred cCCcHHHHHHHHHHHHHHHh
Confidence 99999888888888888774
No 67
>KOG1058|consensus
Probab=79.18 E-value=20 Score=30.50 Aligned_cols=72 Identities=8% Similarity=0.169 Sum_probs=56.1
Q ss_pred HHHHHHHHHHHHHhcCc---HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 44 DKAMIRDAIVDAVVMAP---EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 44 ~k~~IK~~ll~~l~~~~---~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.-+.+|..+...-..+. .+-|..+..+|...|.. ||+-=.+.++.++..+++.|+..-...|..+++++++|
T Consensus 353 iv~~Lkke~~kT~~~e~d~~~~yRqlLiktih~cav~-Fp~~aatvV~~ll~fisD~N~~aas~vl~FvrE~iek~ 427 (948)
T KOG1058|consen 353 IVQFLKKEVMKTHNEESDDNGKYRQLLIKTIHACAVK-FPEVAATVVSLLLDFISDSNEAAASDVLMFVREAIEKF 427 (948)
T ss_pred HHHHHHHHHHhccccccccchHHHHHHHHHHHHHhhc-ChHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHhC
Confidence 34555555555544322 35688888888887766 89888899999999999999999999999999999986
No 68
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=78.89 E-value=11 Score=32.17 Aligned_cols=55 Identities=11% Similarity=0.006 Sum_probs=28.0
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
.|++.|.++++.||...+..+..+.. |+.++.|...+.++|......++..|..+
T Consensus 625 ~L~~~L~D~d~~VR~~Av~~L~~~~~-------~~~~~~L~~aL~D~d~~VR~~Aa~aL~~l 679 (897)
T PRK13800 625 ELAPYLADPDPGVRRTAVAVLTETTP-------PGFGPALVAALGDGAAAVRRAAAEGLREL 679 (897)
T ss_pred HHHHHhcCCCHHHHHHHHHHHhhhcc-------hhHHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 44444444555566655555554431 33555555555555555555555555444
No 69
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=78.48 E-value=23 Score=30.34 Aligned_cols=26 Identities=23% Similarity=0.233 Sum_probs=13.8
Q ss_pred HHHHHHHhcCcHHHHHHHHHHHHHHH
Q psy16982 50 DAIVDAVVMAPEVIRVQLAVCVSNIV 75 (116)
Q Consensus 50 ~~ll~~l~~~~~~i~~~l~~~i~~Ia 75 (116)
..|.+.|.+++..+|...+..+..++
T Consensus 655 ~~L~~aL~D~d~~VR~~Aa~aL~~l~ 680 (897)
T PRK13800 655 PALVAALGDGAAAVRRAAAEGLRELV 680 (897)
T ss_pred HHHHHHHcCCCHHHHHHHHHHHHHHH
Confidence 34445554455556665555555553
No 70
>PRK09687 putative lyase; Provisional
Probab=78.10 E-value=13 Score=27.42 Aligned_cols=61 Identities=10% Similarity=-0.045 Sum_probs=33.7
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHh-hCCCChHHHhHHHHHHHHHH
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIY-LQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~-l~~~~~~~~~~~L~~L~~l~ 113 (116)
.+..++.+++..+|...+.+++.|...+= .=++.++.|... +...++.....+..+|++++
T Consensus 58 ~l~~ll~~~d~~vR~~A~~aLg~lg~~~~--~~~~a~~~L~~l~~~D~d~~VR~~A~~aLG~~~ 119 (280)
T PRK09687 58 LAIELCSSKNPIERDIGADILSQLGMAKR--CQDNVFNILNNLALEDKSACVRASAINATGHRC 119 (280)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHhcCCCcc--chHHHHHHHHHHHhcCCCHHHHHHHHHHHhccc
Confidence 33343334445677777777777654321 123455556555 34456666667777777665
No 71
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=77.79 E-value=25 Score=25.83 Aligned_cols=70 Identities=19% Similarity=0.144 Sum_probs=55.8
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 46 AMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
..+.+-+++++.+.+..||...-.|++..+-.|- +-=.+.++.+.+.++.++......+|.++..+.-.|
T Consensus 26 ~ll~~lI~P~v~~~~~~vR~~al~cLGl~~Lld~-~~a~~~l~l~~~~~~~~~~~v~~~al~~l~Dll~~~ 95 (298)
T PF12719_consen 26 SLLDSLILPAVQSSDPAVRELALKCLGLCCLLDK-ELAKEHLPLFLQALQKDDEEVKITALKALFDLLLTH 95 (298)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHhCh-HHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHc
Confidence 6677777788888889999999999999999984 233567777888887677777788898888877654
No 72
>PRK09687 putative lyase; Provisional
Probab=77.64 E-value=11 Score=27.75 Aligned_cols=54 Identities=15% Similarity=0.130 Sum_probs=30.0
Q ss_pred HHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHH
Q psy16982 52 IVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLY 110 (116)
Q Consensus 52 ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~ 110 (116)
|+.+|.+++..||...+.++..+ .. .=|+..+.|+..+...|......+...|.
T Consensus 164 L~~~L~d~~~~VR~~A~~aLg~~-~~----~~~~~~~~L~~~L~D~~~~VR~~A~~aLg 217 (280)
T PRK09687 164 LINLLKDPNGDVRNWAAFALNSN-KY----DNPDIREAFVAMLQDKNEEIRIEAIIGLA 217 (280)
T ss_pred HHHHhcCCCHHHHHHHHHHHhcC-CC----CCHHHHHHHHHHhcCCChHHHHHHHHHHH
Confidence 33344444455666666666665 11 23356666666666666666556555554
No 73
>PF01417 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (Epsin N-terminal homology) domain is approximately 150 amino acids in length and is always found located at the N-termini of proteins. The domain forms a compact globular structure, composed of 9 alpha-helices connected by loops of varying length. The general topology is determined by three helical hairpins that are stacked consecutively with a right hand twist []. An N-terminal helix folds back, forming a deep basic groove that forms the binding pocket for the Ins(1,4,5)P3 ligand []. The ligand is coordinated by residues from surrounding alpha-helices and all three phosphates are multiply coordinated. The coordination of Ins(1,4,5)P3 suggests that ENTH is specific for particular head groups. Proteins containing this domain have been found to bind PtdIns(4,5)P2 and PtdIns(1,4,5)P3 suggesting that the domain may be a membrane interacting module. The main function of proteins containing this domain appears to be to act as accessory clathrin adaptors in endocytosis, Epsin is able to recruit and promote clathrin polymerisation on a lipid monolayer, but may have additional roles in signalling and actin regulation []. Epsin causes a strong degree of membrane curvature and tubulation, even fragmentation of membranes with a high PtdIns(4,5)P2 content. Epsin binding to membranes facilitates their deformation by insertion of the N-terminal helix into the outer leaflet of the bilayer, pushing the head groups apart. This would reduce the energy needed to curve the membrane into a vesicle, making it easier for the clathrin cage to fix and stabilise the curved membrane. This points to a pioneering role for epsin in vesicle budding as it provides both a driving force and a link between membrane invagination and clathrin polymerisation. ; PDB: 1H0A_A 1EYH_A 1EDU_A 2QY7_B 1XGW_A 2V8S_E 1VDY_A 2DCP_A 1INZ_A 3ONL_B ....
Probab=75.70 E-value=11 Score=24.07 Aligned_cols=45 Identities=22% Similarity=0.256 Sum_probs=32.2
Q ss_pred HHHHHhccCC-CchhHHHHHHHHhh---CCCChHHHhHHHHHHHHHHhh
Q psy16982 71 VSNIVKHDFP-GKWTQIVDKVSIYL---QNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 71 i~~Ia~~d~P-~~Wp~Ll~~l~~~l---~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..|+..-|- ...+.++..|...+ ...+...++-+|.+|.+++..
T Consensus 24 l~eIa~~t~~~~~~~~I~~~l~kRL~~~~~k~wr~~~KaL~ll~yLl~n 72 (125)
T PF01417_consen 24 LAEIAQLTYNSKDCQEIMDVLWKRLSKSDGKNWRHVYKALTLLEYLLKN 72 (125)
T ss_dssp HHHHHHHTTSCHHHHHHHHHHHHHHHSSTSSGHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhccccHHHHHHHHHHHHHhcCCcchhHHHHHHHHHHHHHHH
Confidence 3455544443 67888889998888 345667888899999888763
No 74
>KOG2171|consensus
Probab=74.25 E-value=25 Score=30.91 Aligned_cols=68 Identities=13% Similarity=0.093 Sum_probs=46.9
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC--CCchh---HHHHHHHHhh----CCCChHHHhHHHHHHHHHHh
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF--PGKWT---QIVDKVSIYL----QNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~--P~~Wp---~Ll~~l~~~l----~~~~~~~~~~~L~~L~~l~~ 114 (116)
.++..+.++|.+++..||.....++..++.+-= +..|+ .++|.++..+ +.+|.+....+|.+|.++.+
T Consensus 159 ~l~~lf~q~~~d~s~~vr~~a~rA~~a~~~~~~~~~~~~~~~~~llP~~l~vl~~~i~~~d~~~a~~~l~~l~El~e 235 (1075)
T KOG2171|consen 159 DLLRLFSQTMTDPSSPVRVAAVRALGAFAEYLENNKSEVDKFRDLLPSLLNVLQEVIQDGDDDAAKSALEALIELLE 235 (1075)
T ss_pred HHHHHHHHhccCCcchHHHHHHHHHHHHHHHhccchHHHHHHHHHhHHHHHHhHhhhhccchHHHHHHHHHHHHHHh
Confidence 567788888888887799999998888876631 34443 5666655544 45666666677777776654
No 75
>KOG1837|consensus
Probab=73.63 E-value=18 Score=32.88 Aligned_cols=53 Identities=9% Similarity=0.094 Sum_probs=44.2
Q ss_pred HHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 62 VIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
.+-..++.+++.++..| |++|+.|..+++....++++...+.++.++..+.++
T Consensus 1519 ~v~~~li~~i~~~~~a~-~~d~~pl~~k~l~~trss~~~~r~~ai~~~~~l~~~ 1571 (1621)
T KOG1837|consen 1519 IVSKLLIAEIASDSVAD-KDDLKPLNQKILKKTRSSSRKARYLAIIQVKLLYTK 1571 (1621)
T ss_pred HHHHHHHHHHHhhccCC-hhhhHHHHHHHHHHhccccHHHHHHHHHHHHHHHHH
Confidence 46677888888888887 455999999999999999999999999888877654
No 76
>PF08620 RPAP1_C: RPAP1-like, C-terminal; InterPro: IPR013929 Inhibition of RNA polymerase II-associated protein 1 (RPAP1) synthesis in Saccharomyces cerevisiae (Baker's yeast) results in changes in global gene expression that are similar to those caused by the loss of the RNAPII subunit Rpb11 []. This entry represents the C-terminal region that contains the motif GLHHH. This region is conserved from yeast to humans.
Probab=72.60 E-value=7.9 Score=22.88 Aligned_cols=30 Identities=10% Similarity=0.013 Sum_probs=24.8
Q ss_pred HHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 87 VDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 87 l~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+++++.+..|..+...-.||.+|..|+.++
T Consensus 41 i~El~~L~RSsv~~QR~~al~~L~~Il~~~ 70 (73)
T PF08620_consen 41 IQELFHLSRSSVPSQRCIALQTLGRILYRA 70 (73)
T ss_pred HHHHHHHHhcCcHHHHHHHHHHHHHHHHHH
Confidence 677778888888888889999999988763
No 77
>KOG1943|consensus
Probab=72.07 E-value=22 Score=31.25 Aligned_cols=68 Identities=21% Similarity=0.329 Sum_probs=53.6
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHh----hCC-CChHHHhHHHHHHHHHHh
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIY----LQN-PDATPWFGALLCLYQLVK 114 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~----l~~-~~~~~~~~~L~~L~~l~~ 114 (116)
+--+.+=+.|++.+.+.+..||-.+|--+..|+.+. | ++|.++++.. ++. ++....+||+.+|.++..
T Consensus 337 eivE~vie~Lls~l~d~dt~VrWSaAKg~grvt~rl-p---~~Lad~vi~svid~~~p~e~~~aWHgacLaLAELA~ 409 (1133)
T KOG1943|consen 337 EIVEFVIEHLLSALSDTDTVVRWSAAKGLGRVTSRL-P---PELADQVIGSVIDLFNPAEDDSAWHGACLALAELAL 409 (1133)
T ss_pred HHHHHHHHHHHHhccCCcchhhHHHHHHHHHHHccC-c---HHHHHHHHHHHHHhcCcCCchhHHHHHHHHHHHHHh
Confidence 455677788888888888999999999999999996 6 5666666543 333 357789999999998765
No 78
>KOG2023|consensus
Probab=71.72 E-value=21 Score=30.11 Aligned_cols=70 Identities=9% Similarity=0.119 Sum_probs=54.3
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC---chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 46 AMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG---KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~---~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..|++.+++|+.++.+.||...-..+..+.+.=|+. .-.++++.+-.++..++......|.-+++.++.+
T Consensus 661 snl~~lll~C~~D~~peVRQS~FALLGDltk~c~~~v~p~~~~fl~~lg~Nl~~~~isv~nNA~WAiGeia~k 733 (885)
T KOG2023|consen 661 SNLLDLLLQCLQDEVPEVRQSAFALLGDLTKACFEHVIPNLADFLPILGANLNPENISVCNNAIWAIGEIALK 733 (885)
T ss_pred ccHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHhhcCChhhchHHHHHHHHHHHHHHH
Confidence 448999999999998999988888888888775654 4557888888777777666677777777777654
No 79
>KOG1240|consensus
Probab=71.61 E-value=11 Score=33.65 Aligned_cols=52 Identities=15% Similarity=0.115 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHhccCCCchh----HHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWT----QIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp----~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..+|..+-+.|..++-+- .|- -|+|-|.|.++.+.+..+..+|.+|+.+||.
T Consensus 631 w~LR~aFfdsI~gvsi~V---G~rs~seyllPLl~Q~ltD~EE~Viv~aL~~ls~Lik~ 686 (1431)
T KOG1240|consen 631 WRLRGAFFDSIVGVSIFV---GWRSVSEYLLPLLQQGLTDGEEAVIVSALGSLSILIKL 686 (1431)
T ss_pred HHHHHHHHhhccceEEEE---eeeeHHHHHHHHHHHhccCcchhhHHHHHHHHHHHHHh
Confidence 445555555555554442 343 4788888888888888889999999999873
No 80
>PLN03076 ARF guanine nucleotide exchange factor (ARF-GEF); Provisional
Probab=71.36 E-value=68 Score=30.05 Aligned_cols=99 Identities=14% Similarity=0.208 Sum_probs=67.7
Q ss_pred hHHHHHHHHHhhhhhccccccccCCCCCCccCCCC--HhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCC---
Q psy16982 7 MPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLH--EQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFP--- 80 (116)
Q Consensus 7 ~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~--~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P--- 80 (116)
..++.+|.-.||..-.+.-...+ ++ .-.|..+| -+...+.+.. ..+|..+.+|+..|...-..
T Consensus 1151 ~~va~fAidsLrQLs~kfle~eE----------L~~f~FQkefLk-Pfe~im~~s~~~eVrE~ILeCv~qmI~s~~~nIk 1219 (1780)
T PLN03076 1151 LSIAIFAMDSLRQLSMKFLEREE----------LANYNFQNEFMK-PFVIVMRKSNAVEIRELIIRCVSQMVLSRVNNVK 1219 (1780)
T ss_pred hhHHHHHHHHHHHHHHHhcchhh----------hhchhHHHHHHH-HHHHHHHhcCchHHHHHHHHHHHHHHHHHHhhhh
Confidence 45778888777777766554221 11 12345554 3344444433 57888888888877654332
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
..|+.++..+.......+...+..|..++..|++.|
T Consensus 1220 SGWktIF~VLs~aa~d~~e~iV~lAFetl~~I~~d~ 1255 (1780)
T PLN03076 1220 SGWKSMFMVFTTAAYDDHKNIVLLAFEIIEKIIREY 1255 (1780)
T ss_pred cCcHHHHHHHHHHHhCccHHHHHHHHHHHHHHHHhh
Confidence 599999999988887778888999999999888754
No 81
>KOG0267|consensus
Probab=70.76 E-value=20 Score=30.24 Aligned_cols=70 Identities=16% Similarity=0.137 Sum_probs=49.4
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
-||..|..+..-.+..++.-+..++..++..---+-.-.|+|.+-.++.+.-...+..+|..|.+++..|
T Consensus 676 DiK~sI~s~~kl~D~sV~ADvL~Iltek~eiLtLDl~t~l~P~lt~LLgS~~e~~v~vsld~Llklv~~f 745 (825)
T KOG0267|consen 676 DIKGSIGSLRKLADNSVQADVLNILTEKIEILTLDLCTQLLPVLTALLGSKTERPVNVSLDMLLKLVAVF 745 (825)
T ss_pred hhhHHHHHHHHhhhhhHHHHHHHHHhhhhhHhhHHHHHHHHHHHHHHhcccchhhhhhHHHHHHHHHHHh
Confidence 3565655555555666666666666665555322567789999999999988888888888888887765
No 82
>PF10978 DUF2785: Protein of unknown function (DUF2785); InterPro: IPR021247 Some members in this family are annotated as hypothetical membrane spanning proteins however this cannot be confirmed. The family has no known function.
Probab=69.20 E-value=34 Score=23.39 Aligned_cols=81 Identities=9% Similarity=0.108 Sum_probs=64.6
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc-----------HHHHHHHHHHHHHH
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP-----------EVIRVQLAVCVSNI 74 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~-----------~~i~~~l~~~i~~I 74 (116)
....|.++++.+=..+..+=.. +.++++++..+.+.++..+..+. ..--.+.++++..+
T Consensus 12 ~vf~RSFsaL~la~il~~d~~~----------~fL~~~~~~~~~~~~~~Yl~~E~D~RGfv~~~GWaHa~AH~aD~l~el 81 (175)
T PF10978_consen 12 AVFTRSFSALLLALILYADRNY----------PFLSQEERQELIDQLIRYLLLEKDTRGFVEEKGWAHAFAHGADLLDEL 81 (175)
T ss_pred ceeeehHHHHHHHHHHhcCCcc----------cCCCHHHHHHHHHHHHHHHHHccccccCCccCcHHHHHHHHHHHHHHH
Confidence 4568999999998888875541 36999999999999999998752 23557788999999
Q ss_pred Hhcc-CC-CchhHHHHHHHHhhCC
Q psy16982 75 VKHD-FP-GKWTQIVDKVSIYLQN 96 (116)
Q Consensus 75 a~~d-~P-~~Wp~Ll~~l~~~l~~ 96 (116)
+.+. ++ .....++..+.+.+..
T Consensus 82 ~~~p~~~~~~~~~lL~~i~~~~~~ 105 (175)
T PF10978_consen 82 VQHPELDRADKIELLAAILEKYKR 105 (175)
T ss_pred HcCCCCCHHHHHHHHHHHHHHHcC
Confidence 8774 45 6789999999888875
No 83
>smart00273 ENTH Epsin N-terminal homology (ENTH) domain.
Probab=69.15 E-value=16 Score=23.35 Aligned_cols=38 Identities=18% Similarity=0.259 Sum_probs=26.6
Q ss_pred ccCCCchhHHHHHHHHhhCCC-ChHHHhHHHHHHHHHHh
Q psy16982 77 HDFPGKWTQIVDKVSIYLQNP-DATPWFGALLCLYQLVK 114 (116)
Q Consensus 77 ~d~P~~Wp~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~~ 114 (116)
+..|...++++..|...+... +...++-+|.+|+++++
T Consensus 31 ~~~~~~~~~i~~~l~~Rl~~~~~w~~v~KsL~llh~ll~ 69 (127)
T smart00273 31 HNEKSSFAEIMAVLWRRLNDTKNWRVVYKALILLHYLLR 69 (127)
T ss_pred ccCHhhHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHHh
Confidence 445566777777777777665 66777777777777765
No 84
>KOG1242|consensus
Probab=66.55 E-value=52 Score=27.00 Aligned_cols=49 Identities=8% Similarity=0.058 Sum_probs=34.1
Q ss_pred HHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 66 QLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 66 ~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
++..+++..+..-.-..-|+++|.+.+.+.+..++...+|..|+..++.
T Consensus 276 ellg~m~~~ap~qLs~~lp~iiP~lsevl~DT~~evr~a~~~~l~~~~s 324 (569)
T KOG1242|consen 276 ELLGAMADCAPKQLSLCLPDLIPVLSEVLWDTKPEVRKAGIETLLKFGS 324 (569)
T ss_pred HHHHHHHHhchHHHHHHHhHhhHHHHHHHccCCHHHHHHHHHHHHHHHH
Confidence 3333444444332224678899999998888888888888888888775
No 85
>KOG1060|consensus
Probab=65.42 E-value=27 Score=29.92 Aligned_cols=53 Identities=13% Similarity=0.189 Sum_probs=46.4
Q ss_pred cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 60 PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 60 ~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
++-||...|.+|-.+=+.| |++-++|++.|-.++...++..+-++..++..+|
T Consensus 156 s~yVRk~AA~AIpKLYsLd-~e~k~qL~e~I~~LLaD~splVvgsAv~AF~evC 208 (968)
T KOG1060|consen 156 SPYVRKTAAHAIPKLYSLD-PEQKDQLEEVIKKLLADRSPLVVGSAVMAFEEVC 208 (968)
T ss_pred cHHHHHHHHHhhHHHhcCC-hhhHHHHHHHHHHHhcCCCCcchhHHHHHHHHhc
Confidence 3679999999999998888 7788899999999999888888888999988887
No 86
>PF14500 MMS19_N: Dos2-interacting transcription regulator of RNA-Pol-II
Probab=65.15 E-value=14 Score=26.96 Aligned_cols=33 Identities=15% Similarity=0.133 Sum_probs=28.7
Q ss_pred hHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 84 TQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 84 p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
|..+|.+++.+.++.+....-+|.+|...+++|
T Consensus 207 ~~~~p~LleKL~s~~~~~K~D~L~tL~~c~~~y 239 (262)
T PF14500_consen 207 PFAFPLLLEKLDSTSPSVKLDSLQTLKACIENY 239 (262)
T ss_pred HHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHHC
Confidence 568888888888888888888999999999887
No 87
>PF06628 Catalase-rel: Catalase-related immune-responsive; InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=63.87 E-value=22 Score=20.30 Aligned_cols=36 Identities=8% Similarity=0.134 Sum_probs=28.0
Q ss_pred cCCCCHhHHHHHHHHHHHHHhcCcHH-HHHHHHHHHH
Q psy16982 37 AFSLHEQDKAMIRDAIVDAVVMAPEV-IRVQLAVCVS 72 (116)
Q Consensus 37 ~~~i~~~~k~~IK~~ll~~l~~~~~~-i~~~l~~~i~ 72 (116)
|..+++++|+.+=+++...|...+.. |+......+.
T Consensus 17 y~~l~~~er~~lv~nia~~l~~v~~~~i~~r~l~~f~ 53 (68)
T PF06628_consen 17 YRVLSDEERERLVENIAGHLSGVSDEEIQERVLAYFY 53 (68)
T ss_dssp HHHSSHHHHHHHHHHHHHHHTTSSHHHHHHHHHHHHH
T ss_pred HHHCCHHHHHHHHHHHHHHHccCChhhHHHHHHHHHH
Confidence 44689999999999999999888865 7766444433
No 88
>PF02194 PXA: PXA domain; InterPro: IPR003114 This domain is found associated with PX domains. The PX (phox) domain [] occurs in a variety of eukaryotic proteins associated with intracellular signalling pathways.
Probab=63.48 E-value=28 Score=23.43 Aligned_cols=24 Identities=21% Similarity=0.475 Sum_probs=20.6
Q ss_pred CCCCChHHHHHHHHHhhhhhccccc
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWV 26 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~ 26 (116)
|+.++..+..+-...+|++|.. |-
T Consensus 1 s~~vd~~l~~li~~I~rdfV~s-WY 24 (185)
T PF02194_consen 1 SPEVDEALHELIDLILRDFVNS-WY 24 (185)
T ss_pred ChHHHHHHHHHHHHHHHHHHHh-hh
Confidence 4567888999999999999998 75
No 89
>KOG2956|consensus
Probab=63.35 E-value=55 Score=26.37 Aligned_cols=71 Identities=17% Similarity=0.171 Sum_probs=49.5
Q ss_pred HHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCCchh----HHHHHHHHhhCC-CChHHHhHHHHHHHHHHhh
Q psy16982 44 DKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPGKWT----QIVDKVSIYLQN-PDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 44 ~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~~Wp----~Ll~~l~~~l~~-~~~~~~~~~L~~L~~l~~~ 115 (116)
.-+.|++.+.+.=.++. ..-...++++...+..-.| .-|. +++..+++.++. .|.....-+|.+|.++|+.
T Consensus 284 ~~~~v~~~l~~~~g~e~a~~~k~alsel~~m~~e~sf-svWeq~f~~iL~~l~EvL~d~~~~~~k~laLrvL~~ml~~ 360 (516)
T KOG2956|consen 284 QSALVADLLKEISGSERASERKEALSELPKMLCEGSF-SVWEQHFAEILLLLLEVLSDSEDEIIKKLALRVLREMLTN 360 (516)
T ss_pred hhHHHHHHHHhccCccchhHHHHHHHHHHHHHHccch-hHHHHHHHHHHHHHHHHHccchhhHHHHHHHHHHHHHHHh
Confidence 33566666555544444 4556677888888888877 5776 555556677776 6777788999999999873
No 90
>PF13251 DUF4042: Domain of unknown function (DUF4042)
Probab=63.28 E-value=47 Score=22.99 Aligned_cols=108 Identities=21% Similarity=0.260 Sum_probs=66.9
Q ss_pred CChHHHHHHHHHhhhhhccc---cccccCCCCC-CccCCCCHhHH---HHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHh
Q psy16982 5 VDMPVRQAGVVYLKNLITNQ---WVEKEVDPGT-PLAFSLHEQDK---AMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVK 76 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~---w~~~~~~~~~-~~~~~i~~~~k---~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~ 76 (116)
-+..+|+.|+-.+--.+... -..-++..+. ..+..++..-= ..+-..|+.+|..+. ..+-.|+..|++..+.
T Consensus 52 p~~kvR~aA~~~l~~lL~gsk~~L~~Ae~~~~~~~sFtslS~tLa~~i~~lH~~Ll~~L~~E~~~~~l~q~lK~la~Lv~ 131 (182)
T PF13251_consen 52 PSPKVRAAAASALAALLEGSKPFLAQAEESKGPSGSFTSLSSTLASMIMELHRGLLLALQAEKSPPVLTQLLKCLAVLVQ 131 (182)
T ss_pred CchhHHHHHHHHHHHHHHccHHHHHHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHc
Confidence 45678999988887777642 1100101111 01233443322 344557788888776 5788899999988876
Q ss_pred ccCC-Cch-hHHHHHHHH----hhCCCChHHHhHHHHHHHHHH
Q psy16982 77 HDFP-GKW-TQIVDKVSI----YLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 77 ~d~P-~~W-p~Ll~~l~~----~l~~~~~~~~~~~L~~L~~l~ 113 (116)
. -| ++- ++|++.++. .+.+.|.+....+|.|+..+.
T Consensus 132 ~-tPY~rL~~~ll~~~v~~v~~~l~~~d~~v~v~~l~~~~~l~ 173 (182)
T PF13251_consen 132 A-TPYHRLPPGLLTEVVTQVRPLLRHRDPNVRVAALSCLGALL 173 (182)
T ss_pred c-CChhhcCHhHHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHH
Confidence 5 34 333 456666554 455678888899999998775
No 91
>KOG0212|consensus
Probab=62.18 E-value=31 Score=28.43 Aligned_cols=66 Identities=17% Similarity=0.301 Sum_probs=50.9
Q ss_pred HHHHHHHHHhcCcHHHH----HHHHHHHHHHHhccCC-C-chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 48 IRDAIVDAVVMAPEVIR----VQLAVCVSNIVKHDFP-G-KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 48 IK~~ll~~l~~~~~~i~----~~l~~~i~~Ia~~d~P-~-~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+-+.|++.|.+++..|| +.+.+.++.|.+. | . +.++.++.++..++++++.....+|.-+.++++-
T Consensus 209 ~ldGLf~~LsD~s~eVr~~~~t~l~~fL~eI~s~--P~s~d~~~~i~vlv~~l~ss~~~iq~~al~Wi~efV~i 280 (675)
T KOG0212|consen 209 LLDGLFNMLSDSSDEVRTLTDTLLSEFLAEIRSS--PSSMDYDDMINVLVPHLQSSEPEIQLKALTWIQEFVKI 280 (675)
T ss_pred HHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHhcC--ccccCcccchhhccccccCCcHHHHHHHHHHHHHHhcC
Confidence 44566677777777777 4456777888755 5 3 7899999999999999999888888888877654
No 92
>cd08050 TAF6 TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs functions such as serving as
Probab=61.55 E-value=27 Score=26.52 Aligned_cols=51 Identities=20% Similarity=0.338 Sum_probs=37.7
Q ss_pred HHHHHHHHHHHHHHhccCCCchhHHHHHHHHhh-----CCCCh-HHHhHHHHHHHHHH
Q psy16982 62 VIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL-----QNPDA-TPWFGALLCLYQLV 113 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l-----~~~~~-~~~~~~L~~L~~l~ 113 (116)
.+|...|.+++.|++. |....|+|-+.+...+ +...+ ...|||+..|..+-
T Consensus 283 ~LRd~AA~ll~~i~~~-f~~~y~~l~~ri~~tl~k~l~d~~~~~~~~YGAi~GL~~lG 339 (343)
T cd08050 283 ALRDYAARLLAQICRK-FSTSYNTLQPRITRTLLKALLDPKKPLTTHYGAIVGLSALG 339 (343)
T ss_pred HHHHHHHHHHHHHHHH-cCCCCCcHHHHHHHHHHHHHcCCCCCcchhhHHHHHHHHhC
Confidence 6999999999999977 6677777777776332 22333 44899999988764
No 93
>KOG1062|consensus
Probab=61.24 E-value=76 Score=27.25 Aligned_cols=101 Identities=15% Similarity=0.136 Sum_probs=67.0
Q ss_pred HHHhhhhhccccccccCCC-CC---CccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHH
Q psy16982 14 VVYLKNLITNQWVEKEVDP-GT---PLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDK 89 (116)
Q Consensus 14 ai~LKn~i~~~w~~~~~~~-~~---~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~ 89 (116)
-+++-|.+++.-+.++... |- ......++|--.-+-..+-..|....+.||+..+.|..++.+.+ |+--..+++.
T Consensus 105 llLltNslknDL~s~nq~vVglAL~alg~i~s~EmardlapeVe~Ll~~~~~~irKKA~Lca~r~irK~-P~l~e~f~~~ 183 (866)
T KOG1062|consen 105 LLLLTNSLKNDLNSSNQYVVGLALCALGNICSPEMARDLAPEVERLLQHRDPYIRKKAALCAVRFIRKV-PDLVEHFVIA 183 (866)
T ss_pred HHHHHHHHHhhccCCCeeehHHHHHHhhccCCHHHhHHhhHHHHHHHhCCCHHHHHHHHHHHHHHHHcC-chHHHHhhHH
Confidence 3456677777655443321 10 01123455544445555566667777899999999999999886 6666666666
Q ss_pred HHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 90 VSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 90 l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
.-+.|.+.+.....+++..+..+|+.
T Consensus 184 ~~~lL~ek~hGVL~~~l~l~~e~c~~ 209 (866)
T KOG1062|consen 184 FRKLLCEKHHGVLIAGLHLITELCKI 209 (866)
T ss_pred HHHHHhhcCCceeeeHHHHHHHHHhc
Confidence 67777777777777888888888874
No 94
>KOG2160|consensus
Probab=60.52 E-value=31 Score=26.43 Aligned_cols=59 Identities=17% Similarity=0.170 Sum_probs=43.4
Q ss_pred cCcHHHHHHHHHHHHHHHhccCC-Cc-h--hHHHHHHHHhhCCCChHHH-hHHHHHHHHHHhhC
Q psy16982 58 MAPEVIRVQLAVCVSNIVKHDFP-GK-W--TQIVDKVSIYLQNPDATPW-FGALLCLYQLVKNY 116 (116)
Q Consensus 58 ~~~~~i~~~l~~~i~~Ia~~d~P-~~-W--p~Ll~~l~~~l~~~~~~~~-~~~L~~L~~l~~~y 116 (116)
+.+..+|...+.+|+.++..--+ .+ = -+.++.|+..++++++..+ -.+|.++..+++++
T Consensus 135 ~~~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~~ls~~~~~~~r~kaL~AissLIRn~ 198 (342)
T KOG2160|consen 135 NSDAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLKILSSDDPNTVRTKALFAISSLIRNN 198 (342)
T ss_pred CCcHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHHHHccCCCchHHHHHHHHHHHHHhcC
Confidence 34578999999999999987522 10 0 1378888888888777655 78999998888753
No 95
>PF09047 MEF2_binding: MEF2 binding; InterPro: IPR015134 The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription []. ; PDB: 1N6J_G.
Probab=59.85 E-value=11 Score=18.58 Aligned_cols=18 Identities=33% Similarity=0.506 Sum_probs=12.3
Q ss_pred CCCHhHHHHHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAV 56 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l 56 (116)
.++++.|+.+|..++..-
T Consensus 8 siseetkqklk~~ils~q 25 (35)
T PF09047_consen 8 SISEETKQKLKSAILSAQ 25 (35)
T ss_dssp ---HHHHHHHHHHHHTT-
T ss_pred cccHHHHHHHHHHHhhhH
Confidence 599999999999887653
No 96
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=59.66 E-value=18 Score=23.43 Aligned_cols=36 Identities=8% Similarity=0.268 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC
Q psy16982 44 DKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF 79 (116)
Q Consensus 44 ~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~ 79 (116)
++-..|..++++|..+++.||.+.-.|+..+..+-|
T Consensus 83 ~~lg~K~~vM~Lm~h~d~eVr~eAL~avQklm~~~w 118 (119)
T PF11698_consen 83 EKLGAKERVMELMNHEDPEVRYEALLAVQKLMVNNW 118 (119)
T ss_dssp HHHSHHHHHHHHTS-SSHHHHHHHHHHHHHHHHHS-
T ss_pred HhcChHHHHHHHhcCCCHHHHHHHHHHHHHHHHhcC
Confidence 456679999999999999999999999999887643
No 97
>KOG2137|consensus
Probab=59.51 E-value=59 Score=27.37 Aligned_cols=53 Identities=15% Similarity=0.183 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHHHHHhc-cCCCchhHHHHHHHHh-hCCCChHHHhHHHHHHHHHH
Q psy16982 61 EVIRVQLAVCVSNIVKH-DFPGKWTQIVDKVSIY-LQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~-d~P~~Wp~Ll~~l~~~-l~~~~~~~~~~~L~~L~~l~ 113 (116)
..++..+...+..++.. |+|--=.-++|.|.+. +..++...+..+|.|+..+.
T Consensus 403 ~~iQ~~~L~~lptv~e~iD~~~vk~~ilP~l~~l~~~tt~~~vkvn~L~c~~~l~ 457 (700)
T KOG2137|consen 403 VQIQELALQILPTVAESIDVPFVKQAILPRLKNLAFKTTNLYVKVNVLPCLAGLI 457 (700)
T ss_pred hhhHHHHHHhhhHHHHhccHHHHHHHHHHHhhcchhcccchHHHHHHHHHHHHHH
Confidence 34444444444444433 4432222344444443 22334444455555555444
No 98
>KOG1248|consensus
Probab=59.50 E-value=1.3e+02 Score=26.92 Aligned_cols=77 Identities=10% Similarity=0.132 Sum_probs=59.1
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC-----chhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG-----KWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~-----~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
.++++.-..+=+.+-..|.+..+.|+......|...+. .||. .=|+||+.++.++..............|..++
T Consensus 819 ~ld~~~l~~li~~V~~~L~s~sreI~kaAI~fikvlv~-~~pe~~l~~~~~~LL~sll~ls~d~k~~~r~Kvr~LlekLi 897 (1176)
T KOG1248|consen 819 ILDDETLEKLISMVCLYLASNSREIAKAAIGFIKVLVY-KFPEECLSPHLEELLPSLLALSHDHKIKVRKKVRLLLEKLI 897 (1176)
T ss_pred cccHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH-cCCHHHHhhhHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH
Confidence 57778888888888888888888888776655555444 4784 34679999999888777777777888888888
Q ss_pred hhC
Q psy16982 114 KNY 116 (116)
Q Consensus 114 ~~y 116 (116)
++|
T Consensus 898 rkf 900 (1176)
T KOG1248|consen 898 RKF 900 (1176)
T ss_pred HHh
Confidence 886
No 99
>KOG0392|consensus
Probab=58.95 E-value=22 Score=31.96 Aligned_cols=102 Identities=15% Similarity=0.105 Sum_probs=68.3
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcC--c---HHHHHHHHHHHHHHHhccCC
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMA--P---EVIRVQLAVCVSNIVKHDFP 80 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~--~---~~i~~~l~~~i~~Ia~~d~P 80 (116)
...+|.-|+|-++-..+.|...-+. ..+.+-.-..++.-.+.-+.+- + ..+|.+.+.++....++.-+
T Consensus 90 ~we~rhg~~i~lrei~~~h~~~~~~-------~~led~~~rll~v~~Ldrf~dfisd~vvapVre~caq~L~~~l~~~~~ 162 (1549)
T KOG0392|consen 90 QWEIRHGAAIALREILKTHGDSLSY-------ELLEDLLIRLLCVLALDRFGDFISDNVVAPVREACAQALGAYLKHMDE 162 (1549)
T ss_pred hhhhhcCcchhhhhHHHHhcchhhH-------HHHHHHHHHHHHHHHHHHhcccccccchhhhHHHHHHHHHHHHHhhhh
Confidence 3467888888888888777654221 0111111122222222222221 1 47999999999999999888
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..-.+-++.+.+.+...+.+.++|+|..+++.+-
T Consensus 163 s~~~~~~~il~q~~~q~~w~ir~Ggll~iky~~a 196 (1549)
T KOG0392|consen 163 SLIKETLDILLQMLRQPNWEIRHGGLLGIKYNVA 196 (1549)
T ss_pred HhhHHHHHHHHHHHcCcchhheechHHHHHHHHH
Confidence 8888888899998888888999999998887653
No 100
>PF10274 ParcG: Parkin co-regulated protein; InterPro: IPR019399 This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system []. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
Probab=58.43 E-value=24 Score=24.59 Aligned_cols=67 Identities=15% Similarity=0.128 Sum_probs=40.4
Q ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC----chhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 48 IRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG----KWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 48 IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~----~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.--.+++.|......-+-.....+..+.....++ -=|.|+.-|-+.+++.|++....+|.+|..++.
T Consensus 39 ~Lpif~dGL~Et~~Py~flA~~g~~dll~~~~~~kilPvlPqLI~plk~AL~tr~~~V~~~~L~~Lq~Lv~ 109 (183)
T PF10274_consen 39 YLPIFFDGLRETEHPYRFLARQGIKDLLERGGGEKILPVLPQLIIPLKRALNTRDPEVFCATLKALQQLVT 109 (183)
T ss_pred HHHHHHhhhhccCccHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHH
Confidence 3334455555544443333334444444443332 335666666677888899999999999998843
No 101
>PF07571 DUF1546: Protein of unknown function (DUF1546); InterPro: IPR011442 These proteins are associated with IPR004823 from INTERPRO in transcription initiation factor TFIID subunit 6 (TAF6).; GO: 0051090 regulation of sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=57.64 E-value=41 Score=20.43 Aligned_cols=55 Identities=9% Similarity=0.068 Sum_probs=38.2
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVS 72 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~ 72 (116)
.+..+|-+||-.|...++++=. -.+.-|..|=..+++.+.++.+.+.++.+.+..
T Consensus 18 ~h~~LRd~AA~lL~~I~~~~~~-------------~~~~L~~Ri~~tl~k~l~d~~~~~~t~YGAi~g 72 (92)
T PF07571_consen 18 NHWALRDFAASLLAQICRKFSS-------------SYPTLQPRITRTLLKALLDPKKPLGTHYGAIVG 72 (92)
T ss_pred chHHHHHHHHHHHHHHHHHhcc-------------ccchHHHHHHHHHHHHHcCCCCCHHHHHHHHHH
Confidence 3568999999999998766432 234455677777888888777666666655443
No 102
>KOG4524|consensus
Probab=56.34 E-value=94 Score=27.28 Aligned_cols=71 Identities=15% Similarity=0.175 Sum_probs=45.2
Q ss_pred CHhHHHHHHHHHHHH---HhcCcHHHHHHHHHHHHHHHh--ccCC--------CchhHHHHHHHHhhCCCChHHHhHHHH
Q psy16982 41 HEQDKAMIRDAIVDA---VVMAPEVIRVQLAVCVSNIVK--HDFP--------GKWTQIVDKVSIYLQNPDATPWFGALL 107 (116)
Q Consensus 41 ~~~~k~~IK~~ll~~---l~~~~~~i~~~l~~~i~~Ia~--~d~P--------~~Wp~Ll~~l~~~l~~~~~~~~~~~L~ 107 (116)
.+..-..+++-+..+ |.+++..||-+.-.+++...- .+|| ..||.++.. +...|+-.+..++.
T Consensus 794 ~~~qv~iv~kIl~r~~~~LS~e~l~irvkaLdvl~~gl~~La~~~n~LlPlvhq~W~~vie~----~~~k~~L~v~~a~~ 869 (1014)
T KOG4524|consen 794 LPDQVKIVLKILGRGIHLLSHESLRIRVKALDVLSLGLPLLATYHNLLLPLVHQTWPSVIEC----LLCKDPLIVQRAFS 869 (1014)
T ss_pred CChHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHhccHHHhccchhHhHHHHhhhhHHHHH----HhcCchHHHHHHHH
Confidence 344445566555554 444556788887777765432 1233 267765544 44578999999999
Q ss_pred HHHHHHhh
Q psy16982 108 CLYQLVKN 115 (116)
Q Consensus 108 ~L~~l~~~ 115 (116)
|++.+++.
T Consensus 870 ~i~~m~~~ 877 (1014)
T KOG4524|consen 870 CIEQMGKY 877 (1014)
T ss_pred HHHHHHHH
Confidence 99998763
No 103
>KOG0414|consensus
Probab=55.73 E-value=1.2e+02 Score=27.27 Aligned_cols=68 Identities=13% Similarity=0.183 Sum_probs=53.4
Q ss_pred HHHHHHHHHh----cCcHHHHHHHHHHHHHHHhc--cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 48 IRDAIVDAVV----MAPEVIRVQLAVCVSNIVKH--DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 48 IK~~ll~~l~----~~~~~i~~~l~~~i~~Ia~~--d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+|..+++.|. +.+.-+|+..-++.+.|... .+++.|.+.+...+..+.+.+....+.++..+..+..+
T Consensus 356 ~r~~~le~l~erl~Dvsa~vRskVLqv~~~l~~~~s~p~~~~~eV~~la~grl~DkSslVRk~Ai~Ll~~~L~~ 429 (1251)
T KOG0414|consen 356 LRDELLELLRERLLDVSAYVRSKVLQVFRRLFQQHSIPLGSRTEVLELAIGRLEDKSSLVRKNAIQLLSSLLDR 429 (1251)
T ss_pred HHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHccCCCccHHHHHHHHHhcccccccHHHHHHHHHHHHHHHhc
Confidence 4445555554 45678999999999998754 45599999999999999998888888888888777643
No 104
>PF06012 DUF908: Domain of Unknown Function (DUF908); InterPro: IPR010309 This is a domain of unknown function found at the N terminus of a family of E3 ubiquitin protein ligases, including yeast TOM1, many of which appear to play a role in mRNA transcription and processing. This domain is found in association with and immediately N-terminal to another domain of unknown function: IPR010314 from INTERPRO.
Probab=55.04 E-value=22 Score=26.73 Aligned_cols=30 Identities=23% Similarity=0.382 Sum_probs=26.0
Q ss_pred HHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 87 VDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 87 l~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.+.|..+|.+.|.+.+.++|.++..+.++|
T Consensus 28 ~e~L~~LL~s~~~dVl~~aL~ll~~l~qr~ 57 (329)
T PF06012_consen 28 SEHLNSLLNSTDLDVLLAALRLLLRLAQRY 57 (329)
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHHhhh
Confidence 456778888999999999999999998886
No 105
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=54.82 E-value=81 Score=22.97 Aligned_cols=76 Identities=11% Similarity=0.135 Sum_probs=46.7
Q ss_pred CCCHhHHHHHHHHHHHHHhc---C--cHHHHHHHHHHHHHHHhcc-CCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVM---A--PEVIRVQLAVCVSNIVKHD-FPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~---~--~~~i~~~l~~~i~~Ia~~d-~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
..+.+.+..||..+-+.+.. . ++.++.....++..+.-.+ +-.--..-++.++.++.+++......+|.+|..+
T Consensus 82 s~~~en~~~Ik~~i~~Vc~~~~s~~lns~~Q~agLrlL~nLtv~~~~~~~l~~~i~~ll~LL~~G~~~~k~~vLk~L~nL 161 (254)
T PF04826_consen 82 SVNDENQEQIKMYIPQVCEETVSSPLNSEVQLAGLRLLTNLTVTNDYHHMLANYIPDLLSLLSSGSEKTKVQVLKVLVNL 161 (254)
T ss_pred CCChhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHccCCCcchhhhHHhhHHHHHHHHHcCChHHHHHHHHHHHHh
Confidence 34556666777665555442 2 2355544445555554332 2233345678888899999888888888887766
Q ss_pred Hh
Q psy16982 113 VK 114 (116)
Q Consensus 113 ~~ 114 (116)
.+
T Consensus 162 S~ 163 (254)
T PF04826_consen 162 SE 163 (254)
T ss_pred cc
Confidence 43
No 106
>KOG1059|consensus
Probab=54.13 E-value=1.3e+02 Score=25.81 Aligned_cols=58 Identities=14% Similarity=0.206 Sum_probs=40.3
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhcc
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHD 78 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d 78 (116)
+.|.+.|.++.+.+....+.|-+ .-+.-|+-++.||.+.+..||.-..+.+..|+..+
T Consensus 310 dsDqNLKYlgLlam~KI~ktHp~-----------------~Vqa~kdlIlrcL~DkD~SIRlrALdLl~gmVskk 367 (877)
T KOG1059|consen 310 DSDQNLKYLGLLAMSKILKTHPK-----------------AVQAHKDLILRCLDDKDESIRLRALDLLYGMVSKK 367 (877)
T ss_pred cCCccHHHHHHHHHHHHhhhCHH-----------------HHHHhHHHHHHHhccCCchhHHHHHHHHHHHhhhh
Confidence 34667777777777777766643 22456788899999888888877666666666554
No 107
>cd08818 CARD_MDA5_1 Caspase activation and recruitment domain found in MDA5, first repeat. Caspase activation and recruitment domain (CARD) found in MDA5 (melanoma-differentiation-associated gene 5), first repeat. MDA5, also known as IFIH1, contains two N-terminal CARD domains and a C-terminal RNA helicase domain. MDA5 is a cytoplasmic DEAD box RNA helicase that plays an important role in host antiviral response by sensing incoming viral RNA. Upon activation, the signal is transferred to downstream pathways via the adaptor molecule IPS-1 (MAVS, VISA, CARDIF), leading to the induction of type I interferons. Although very similar in sequence, MDA5 recognizes different sets of viruses compared to RIG-I, a related RNA helicase. MDA5 associates with IPS-1 through a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-
Probab=53.88 E-value=47 Score=20.37 Aligned_cols=48 Identities=21% Similarity=0.225 Sum_probs=30.9
Q ss_pred CCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCch-hHHHHHHH
Q psy16982 38 FSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW-TQIVDKVS 91 (116)
Q Consensus 38 ~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W-p~Ll~~l~ 91 (116)
..+++++|+.|+... .+..........+..+.+--.+..| .++++.|-
T Consensus 29 ~~L~~e~ke~I~a~~------~~~Gn~~AA~~LL~~l~~~~~~~GWf~~FldAL~ 77 (88)
T cd08818 29 TFLEAEVKERIRAAA------ATRGNIAAAELLLSTLEKGTWDPGWFREFVTALE 77 (88)
T ss_pred ccCCHHHHHHHHHHH------HccCcHHHHHHHHHHHHHhccCCchHHHHHHHHH
Confidence 368899999998652 2234555666677888772234467 66666654
No 108
>KOG2062|consensus
Probab=52.51 E-value=19 Score=30.54 Aligned_cols=62 Identities=16% Similarity=0.190 Sum_probs=44.4
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC-ChHHHhHHHHHHHHHH
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP-DATPWFGALLCLYQLV 113 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~ 113 (116)
.||..|=-+..+.+.-||+....+|.-|...| |+.++.++++|+.+ |+...+|+-.+|.-.|
T Consensus 555 air~lLh~aVsD~nDDVrRaAVialGFVl~~d-----p~~~~s~V~lLses~N~HVRyGaA~ALGIaC 617 (929)
T KOG2062|consen 555 AIRRLLHVAVSDVNDDVRRAAVIALGFVLFRD-----PEQLPSTVSLLSESYNPHVRYGAAMALGIAC 617 (929)
T ss_pred hHHHhhcccccccchHHHHHHHHHheeeEecC-----hhhchHHHHHHhhhcChhhhhhHHHHHhhhh
Confidence 34433322244445688998888888888777 57888888888765 8888899888887665
No 109
>PF08506 Cse1: Cse1; InterPro: IPR013713 The exchange of macromolecules between the nucleus and cytoplasm takes place through nuclear pore complexes within the nuclear membrane. Active transport of large molecules through these pore complexes require carrier proteins, called karyopherins (importins and exportins), which shuttle between the two compartments. This domain is found in exportin Cse1 (also known as importin-alpha re-exporter). Exportin Cse1 mediates nuclear transport of importin-alpha back into the cytosol, where importin-alpha functions as a transporter of proteins carrying nuclear localisation signals (NLS) from the cytoplasm into the nucleus [, , ]. This domain contains HEAT repeats. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0006886 intracellular protein transport; PDB: 1Z3H_B 1WA5_C.
Probab=51.66 E-value=1.1e+02 Score=23.58 Aligned_cols=63 Identities=10% Similarity=0.146 Sum_probs=42.1
Q ss_pred hHHHHHHHHHHHHHhc---CcHHHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCCh-HHHhHHH
Q psy16982 43 QDKAMIRDAIVDAVVM---APEVIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDA-TPWFGAL 106 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~---~~~~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~-~~~~~~L 106 (116)
+.-++..+.+++-|.. ..+-++.-.+..+..-... +| +.+.+++|.+++.|++++. -+.|+|.
T Consensus 301 ~v~~Ff~~~v~peL~~~~~~~piLka~aik~~~~Fr~~-l~~~~l~~~~~~l~~~L~~~~~vv~tyAA~ 368 (370)
T PF08506_consen 301 DVVDFFSQHVLPELQPDVNSHPILKADAIKFLYTFRNQ-LPKEQLLQIFPLLVNHLQSSSYVVHTYAAI 368 (370)
T ss_dssp -HHHHHHHHTCHHHH-SS-S-HHHHHHHHHHHHHHGGG-S-HHHHHHHHHHHHHHTTSS-HHHHHHHHH
T ss_pred cHHHHHHHHhHHHhcccCCCCcchHHHHHHHHHHHHhh-CCHHHHHHHHHHHHHHhCCCCcchhhhhhh
Confidence 4567777777777772 2245666666666666665 67 7999999999999998765 4455554
No 110
>PF02847 MA3: MA3 domain; InterPro: IPR003891 This entry represents the MI domain (after MA-3 and eIF4G), it is a protein-protein interaction module of ~130 amino acids [, , ]. It appears in several translation factors and is found in: One copy in plant and animal eIF4G 1 and 2 (DAP-5/NAT1/p97) Two copies in the animal programmed cell death protein 4 (PDCD4) or MA-3 that is induced during programmed cell death and inhibits neoplastic transformation Four tandem-repeated copies in a group of uncharacterised plant proteins The MI domain consists of seven alpha-helices, which pack into a globular form. The packing arrangement consists of repeating pairs of antiparallel helices packed one upon the other such that a superhelical axis is generated perpendicular to the alpha-helical axes []. The MI domain has also been named MA3 domain.; PDB: 2ION_A 2IOL_B 2NSZ_A 3EIQ_C 2HM8_A 2KZT_B 2IOS_A 2RG8_B 2ZU6_E 3EIJ_A ....
Probab=51.05 E-value=55 Score=19.98 Aligned_cols=45 Identities=11% Similarity=0.021 Sum_probs=21.8
Q ss_pred HHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 69 VCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 69 ~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
+++..|-....|...++++..++...-..+.........++..++
T Consensus 20 ea~~~l~el~~~~~~~~vv~~~l~~~le~~~~~r~~~~~Ll~~L~ 64 (113)
T PF02847_consen 20 EAVECLKELKLPSQHHEVVKVILECALEEKKSYREYYSKLLSHLC 64 (113)
T ss_dssp HHHHHHHHTT-GGGHHHHHHHHHHHHHTSSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCCccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHH
Confidence 344444444555566666666666544443333333444444444
No 111
>KOG2137|consensus
Probab=50.75 E-value=1.1e+02 Score=25.84 Aligned_cols=67 Identities=13% Similarity=0.184 Sum_probs=50.5
Q ss_pred HHHHHHHHHHHHhcC-----cHHHHHHHHHHHHHHHhccCCCchh--HHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 45 KAMIRDAIVDAVVMA-----PEVIRVQLAVCVSNIVKHDFPGKWT--QIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 45 k~~IK~~ll~~l~~~-----~~~i~~~l~~~i~~Ia~~d~P~~Wp--~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
-..||+.|+..+... .-.++..+..|++.++ +.-+.|- +.+..+..++...|+..+++.+.+...+.
T Consensus 422 ~~~vk~~ilP~l~~l~~~tt~~~vkvn~L~c~~~l~--q~lD~~~v~d~~lpi~~~~~~~dp~iv~~~~~i~~~l~ 495 (700)
T KOG2137|consen 422 VPFVKQAILPRLKNLAFKTTNLYVKVNVLPCLAGLI--QRLDKAAVLDELLPILKCIKTRDPAIVMGFLRIYEALA 495 (700)
T ss_pred HHHHHHHHHHHhhcchhcccchHHHHHHHHHHHHHH--HHHHHHHhHHHHHHHHHHhcCCCcHHHHHHHHHHHHHH
Confidence 578889988887663 2468888888998888 4446664 56666777888889999999888776553
No 112
>PF04802 SMK-1: Component of IIS longevity pathway SMK-1; InterPro: IPR006887 This is a conserved region which characterises a number of eukaryotic proteins of unknown function.
Probab=50.71 E-value=84 Score=21.97 Aligned_cols=63 Identities=8% Similarity=0.182 Sum_probs=39.7
Q ss_pred HHHHHHHHHHHhcCc--HHH----HHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC--ChHHHhHHHHHHHHHHh
Q psy16982 46 AMIRDAIVDAVVMAP--EVI----RVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP--DATPWFGALLCLYQLVK 114 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~--~~i----~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~--~~~~~~~~L~~L~~l~~ 114 (116)
+++|+.++.-..+++ +.+ -..-.+++..|...+ ++++.++..+.++ +.+....++..|+++|.
T Consensus 106 qYLkDvvL~r~lDd~~~s~L~s~I~~n~~~Iv~~l~~d~------~fL~~Lf~~l~~~~~~~~~r~d~v~fL~e~c~ 176 (193)
T PF04802_consen 106 QYLKDVVLPRFLDDNTFSTLNSLIFFNQVEIVNMLQDDE------NFLEELFAILKDPSTSDERRRDGVKFLHEFCS 176 (193)
T ss_pred HHHHHHHcccccccHHHHHHHHHHHHhHHHHHHHHHhCH------HHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHH
Confidence 567777777655554 222 233345555555443 6777777777553 45778889999999884
No 113
>KOG1020|consensus
Probab=50.09 E-value=1.1e+02 Score=28.40 Aligned_cols=58 Identities=21% Similarity=0.164 Sum_probs=27.7
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCChHHHhHHHHH
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDATPWFGALLC 108 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~ 108 (116)
.|+..+..+...+|+..-.|++.|+..|=- -.=|+.-..+.+.+..+......+++..
T Consensus 820 ~Il~~l~e~~ialRtkAlKclS~ive~Dp~vL~~~dvq~~Vh~R~~DssasVREAaldL 878 (1692)
T KOG1020|consen 820 LILSVLGENAIALRTKALKCLSMIVEADPSVLSRPDVQEAVHGRLNDSSASVREAALDL 878 (1692)
T ss_pred HHHHHhcCchHHHHHHHHHHHHHHHhcChHhhcCHHHHHHHHHhhccchhHHHHHHHHH
Confidence 455555544456666666666666665511 2334444444444443333333333333
No 114
>KOG1991|consensus
Probab=50.07 E-value=1.8e+02 Score=25.65 Aligned_cols=55 Identities=13% Similarity=0.153 Sum_probs=42.7
Q ss_pred HhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC--chhHHHHHHHHhhCC
Q psy16982 42 EQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG--KWTQIVDKVSIYLQN 96 (116)
Q Consensus 42 ~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~--~Wp~Ll~~l~~~l~~ 96 (116)
.+--.++-+++++.+.++.--+|...|.+++.++..||++ .--+.++-..+++.+
T Consensus 457 ~~mE~flv~hVfP~f~s~~g~Lrarac~vl~~~~~~df~d~~~l~~ale~t~~~l~~ 513 (1010)
T KOG1991|consen 457 SQMEYFLVNHVFPEFQSPYGYLRARACWVLSQFSSIDFKDPNNLSEALELTHNCLLN 513 (1010)
T ss_pred HHHHHHHHHHhhHhhcCchhHHHHHHHHHHHHHHhccCCChHHHHHHHHHHHHHhcc
Confidence 3444566666677776666789999999999999999984 556788888888774
No 115
>KOG1059|consensus
Probab=49.98 E-value=1.5e+02 Score=25.44 Aligned_cols=76 Identities=11% Similarity=0.188 Sum_probs=55.8
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
.++++--.-+=.-++.+|.++-+-+|.....++..+.-. ||+.--.-+|.+.+.|...|+..+.++..++=++.+|
T Consensus 136 fvTpdLARDLa~Dv~tLL~sskpYvRKkAIl~lykvFLk-YPeAlr~~FprL~EkLeDpDp~V~SAAV~VICELArK 211 (877)
T KOG1059|consen 136 IVTPDLARDLADDVFTLLNSSKPYVRKKAILLLYKVFLK-YPEALRPCFPRLVEKLEDPDPSVVSAAVSVICELARK 211 (877)
T ss_pred ccCchhhHHHHHHHHHHHhcCchHHHHHHHHHHHHHHHh-hhHhHhhhHHHHHHhccCCCchHHHHHHHHHHHHHhh
Confidence 345554445556677777777788888888888777665 6877766788888889888888888887776666554
No 116
>PF06086 Pox_A30L_A26L: Orthopoxvirus A26L/A30L protein; InterPro: IPR009285 This family consists of several Orthopoxvirus A26L and A30L proteins. The Vaccinia A30L gene is regulated by a late promoter and encodes a protein of approximately 9 kDa. It is thought that the A30L protein is needed for vaccinia virus morphogenesis, specifically the association of the dense viroplasm with viral membranes [].
Probab=49.49 E-value=24 Score=25.32 Aligned_cols=47 Identities=21% Similarity=0.375 Sum_probs=36.5
Q ss_pred HHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcC
Q psy16982 8 PVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMA 59 (116)
Q Consensus 8 ~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~ 59 (116)
.+|.+..-.||+.|.++|...=++. ..|+-++|..||+-|=..+.+.
T Consensus 17 d~~~~~I~~F~slV~~~W~~~L~~~-----sCI~R~~RKiIRnViR~ym~s~ 63 (220)
T PF06086_consen 17 DVNVASIREFKSLVSETWNKPLNEN-----SCIPREDRKIIRNVIREYMKSA 63 (220)
T ss_pred cCccchHHHHHHHHHHhcCCccccc-----cccCHHHHHHHHHHHHHHHHhC
Confidence 4566777889999999998653322 3699999999998888877764
No 117
>PF11262 Tho2: Transcription factor/nuclear export subunit protein 2; InterPro: IPR021418 THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability [],[]. Tho2, along with four other subunits forms THO []. This entry represents a conserved domain found towards the C terminus of these proteins.
Probab=49.16 E-value=33 Score=25.54 Aligned_cols=35 Identities=17% Similarity=0.345 Sum_probs=28.9
Q ss_pred chh-HHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 82 KWT-QIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 82 ~Wp-~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.|- .+...+..++.+++.-.+.+|+.+|..+...|
T Consensus 223 kWh~~i~~~l~~~L~s~eym~iRNai~vL~~i~~~F 258 (298)
T PF11262_consen 223 KWHKRITKALISCLESKEYMHIRNAIIVLKKISPVF 258 (298)
T ss_pred HHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHhhC
Confidence 453 45578888999999999999999999998765
No 118
>PF14852 Fis1_TPR_N: Fis1 N-terminal tetratricopeptide repeat; PDB: 1IYG_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A 1PC2_A 1NZN_A.
Probab=48.45 E-value=31 Score=17.20 Aligned_cols=29 Identities=7% Similarity=0.024 Sum_probs=22.4
Q ss_pred HHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 87 VDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 87 l~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..-..++.|.+...+.-|+.+|..+++.
T Consensus 5 FnyAw~Lv~S~~~~d~~~Gi~lLe~l~~~ 33 (35)
T PF14852_consen 5 FNYAWGLVKSNNREDQQEGIALLEELYRD 33 (35)
T ss_dssp HHHHHHHHHSSSHHHHHHHHHHHHHHCCC
T ss_pred hHHHHHHhcCCCHHHHHHHHHHHHHHHhc
Confidence 44455677888888889999999988763
No 119
>KOG1824|consensus
Probab=47.94 E-value=1.2e+02 Score=26.97 Aligned_cols=54 Identities=9% Similarity=0.174 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC-ChHHHhHHHHHHHHHHh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP-DATPWFGALLCLYQLVK 114 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~~ 114 (116)
..||+..+.+++.++..---+-+.++++.+...+.+. +...+...+.||..||+
T Consensus 188 ~aVrKkai~~l~~la~~~~~~ly~~li~~Ll~~L~~~~q~~~~rt~Iq~l~~i~r 242 (1233)
T KOG1824|consen 188 LAVRKKAITALGHLASSCNRDLYVELIEHLLKGLSNRTQMSATRTYIQCLAAICR 242 (1233)
T ss_pred HHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHhccCCCCchHHHHHHHHHHHHHHH
Confidence 5799999999999998842268899999999888764 56778889999999986
No 120
>KOG1061|consensus
Probab=47.47 E-value=1.5e+02 Score=25.21 Aligned_cols=69 Identities=19% Similarity=0.244 Sum_probs=52.3
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 46 AMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+=+-|..++.+..+-+|...+.+++..-..+-- ..=.+|++.+..++..+|+..+-+++.+|..|.+
T Consensus 120 ey~~~Pl~~~l~d~~~yvRktaa~~vakl~~~~~~~~~~~gl~~~L~~ll~D~~p~VVAnAlaaL~eI~e 189 (734)
T KOG1061|consen 120 EYLCDPLLKCLKDDDPYVRKTAAVCVAKLFDIDPDLVEDSGLVDALKDLLSDSNPMVVANALAALSEIHE 189 (734)
T ss_pred HHHHHHHHHhccCCChhHHHHHHHHHHHhhcCChhhccccchhHHHHHHhcCCCchHHHHHHHHHHHHHH
Confidence 44445566666666788999999999888766522 2334688899999988899999999999998875
No 121
>PF06371 Drf_GBD: Diaphanous GTPase-binding Domain; InterPro: IPR010473 Diaphanous-related formins (Drfs) are a family of formin homology (FH) proteins that act as effectors of Rho small GTPases during growth factor-induced cytoskeletal remodelling, stress fibre formation, and cell division []. Drf proteins are characterised by a variety of shared domains: an N-terminal GTPase-binding domain (GBD), formin-homology domains FH1, FH2 (IPR003104 from INTERPRO) and FH3 (IPR010472 from INTERPRO), and a C-terminal conserved Dia-autoregulatory domain (DAD) that binds the GBD. This entry represents the GBD, which is a bifunctional autoinhibitory domain that interacts with and is regulated by activated Rho family members. Mammalian Drf3 contains a CRIB-like motif within its GBD for binding to Cdc42, which is required for Cdc42 to activate and guide Drf3 towards the cell cortex where it remodels the actin skeleton [].; GO: 0003779 actin binding, 0017048 Rho GTPase binding, 0030036 actin cytoskeleton organization; PDB: 3OBV_A 2BNX_A 3EG5_D 2BAP_B 3O4X_B 1Z2C_B 2F31_A.
Probab=47.20 E-value=27 Score=23.33 Aligned_cols=52 Identities=17% Similarity=0.226 Sum_probs=40.5
Q ss_pred HHHHHHHHHHHHHHhccCC----CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 62 VIRVQLAVCVSNIVKHDFP----GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P----~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
.+...+..|+..|+...+- -.=|+.+..|...+.+.++.....++.+|..+|
T Consensus 131 ~~~~~~l~Clkal~n~~~G~~~v~~~~~~v~~i~~~L~s~~~~~r~~~leiL~~lc 186 (187)
T PF06371_consen 131 DIEHECLRCLKALMNTKYGLEAVLSHPDSVNLIALSLDSPNIKTRKLALEILAALC 186 (187)
T ss_dssp HHHHHHHHHHHHHTSSHHHHHHHHCSSSHHHHHHHT--TTSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHccHHHHHHHHcCcHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
Confidence 5677788888888876531 145788999999999999999999999999887
No 122
>PF07539 DRIM: Down-regulated in metastasis; InterPro: IPR011430 These eukaryotic proteins include DRIM (Down-Regulated In Metastasis) (O75691 from SWISSPROT), which is differentially expressed in metastatic and non-metastatic human breast carcinoma cells []. It is believed to be involved in processing of non-coding RNA [].
Probab=46.63 E-value=40 Score=22.31 Aligned_cols=30 Identities=23% Similarity=0.215 Sum_probs=25.4
Q ss_pred chhHHHHHHHHhhCCCChHHHhHHHHHHHH
Q psy16982 82 KWTQIVDKVSIYLQNPDATPWFGALLCLYQ 111 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~ 111 (116)
.++++-+.+..++.++|++....||.|+..
T Consensus 14 ~~~~l~~~~~~LL~~~d~~vQklAL~cll~ 43 (141)
T PF07539_consen 14 RSDELYDALLRLLSSRDPEVQKLALDCLLT 43 (141)
T ss_pred hHHHHHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 678888888899999999888888888754
No 123
>KOG2057|consensus
Probab=46.49 E-value=26 Score=26.96 Aligned_cols=35 Identities=14% Similarity=0.272 Sum_probs=27.1
Q ss_pred CchhHHHHHHHHhh-CCC--ChHHHhHHHHHHHHHHhh
Q psy16982 81 GKWTQIVDKVSIYL-QNP--DATPWFGALLCLYQLVKN 115 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l-~~~--~~~~~~~~L~~L~~l~~~ 115 (116)
+++|+|+.-|.+.+ ..+ +-.++|-.|..|.++++.
T Consensus 57 EdFpelmnmL~qRMLedNK~~WRRVYKSLiLLaYLikN 94 (499)
T KOG2057|consen 57 EDFPELMNMLFQRMLEDNKDAWRRVYKSLILLAYLIKN 94 (499)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhc
Confidence 67889999998864 333 336799999999999874
No 124
>PF14631 FancD2: Fanconi anaemia protein FancD2 nuclease; PDB: 3S4W_B.
Probab=46.13 E-value=78 Score=29.00 Aligned_cols=60 Identities=12% Similarity=0.158 Sum_probs=40.6
Q ss_pred HhcCcHHHHHHHHHHHHHHHhccCCCch--hHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 56 VVMAPEVIRVQLAVCVSNIVKHDFPGKW--TQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 56 l~~~~~~i~~~l~~~i~~Ia~~d~P~~W--p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
|..++...-..++..+...+-..|. .. .+.+..|+..+.+++...+.++|.+|..+++++
T Consensus 443 Ll~S~e~~v~~FG~~~Y~~lF~~fd-s~~qqeVv~~Lvthi~sg~~~ev~~aL~vL~~L~~~~ 504 (1426)
T PF14631_consen 443 LLRSKEPSVREFGSHLYKYLFKEFD-SYCQQEVVGALVTHIGSGNSQEVDAALDVLCELAEKN 504 (1426)
T ss_dssp HHTSSSHHHHHHHHHHHHHHHHSS--HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH-
T ss_pred HHhCCCHHHHHHHHHHHHHHHhhcc-chhHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHhcc
Confidence 3344444445566666666666563 33 689999999998888888999999999998764
No 125
>COG5656 SXM1 Importin, protein involved in nuclear import [Posttranslational modification, protein turnover, chaperones]
Probab=45.09 E-value=1.6e+02 Score=25.42 Aligned_cols=53 Identities=15% Similarity=0.217 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCC--chhHHHHHHHHhhCCCC
Q psy16982 45 KAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPG--KWTQIVDKVSIYLQNPD 98 (116)
Q Consensus 45 k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~--~Wp~Ll~~l~~~l~~~~ 98 (116)
..+|-..+++.+.+..--++++.|+.++.+ ..|||+ .=-++.+...+++.+++
T Consensus 458 e~fiv~hv~P~f~s~ygfL~Srace~is~~-eeDfkd~~ill~aye~t~ncl~nn~ 512 (970)
T COG5656 458 EYFIVNHVIPAFRSNYGFLKSRACEFISTI-EEDFKDNGILLEAYENTHNCLKNNH 512 (970)
T ss_pred HHHHHHHhhHhhcCcccchHHHHHHHHHHH-HHhcccchHHHHHHHHHHHHHhcCC
Confidence 345556666666666667999999999999 999984 34577888888888754
No 126
>KOG1020|consensus
Probab=44.74 E-value=1.2e+02 Score=28.11 Aligned_cols=73 Identities=14% Similarity=0.148 Sum_probs=49.4
Q ss_pred hHHHHHHHHHHHHHhc---C---------------------cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCC
Q psy16982 43 QDKAMIRDAIVDAVVM---A---------------------PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPD 98 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~---~---------------------~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~ 98 (116)
+.+..+|..-+.||.. . ...||-...+.+++-.-. .|+--++..+.|.+.+....
T Consensus 827 e~~ialRtkAlKclS~ive~Dp~vL~~~dvq~~Vh~R~~DssasVREAaldLvGrfvl~-~~e~~~qyY~~i~erIlDtg 905 (1692)
T KOG1020|consen 827 ENAIALRTKALKCLSMIVEADPSVLSRPDVQEAVHGRLNDSSASVREAALDLVGRFVLS-IPELIFQYYDQIIERILDTG 905 (1692)
T ss_pred CchHHHHHHHHHHHHHHHhcChHhhcCHHHHHHHHHhhccchhHHHHHHHHHHhhhhhc-cHHHHHHHHHHHHhhcCCCc
Confidence 5667889998888864 1 124555555555554444 35555666666666666556
Q ss_pred hHHHhHHHHHHHHHHhhC
Q psy16982 99 ATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 99 ~~~~~~~L~~L~~l~~~y 116 (116)
......++.+++.+|+.|
T Consensus 906 vsVRKRvIKIlrdic~e~ 923 (1692)
T KOG1020|consen 906 VSVRKRVIKILRDICEET 923 (1692)
T ss_pred hhHHHHHHHHHHHHHHhC
Confidence 667789999999999875
No 127
>KOG1242|consensus
Probab=44.65 E-value=1e+02 Score=25.38 Aligned_cols=65 Identities=11% Similarity=-0.036 Sum_probs=48.1
Q ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 48 IRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 48 IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
+.-.++....+..+.+|.....++..|.++-.|..=+.++|.++..+....-....+++..|+.+
T Consensus 217 ~lp~il~~~~d~~~~Vr~Aa~~a~kai~~~~~~~aVK~llpsll~~l~~~kWrtK~aslellg~m 281 (569)
T KOG1242|consen 217 ILPSILTNFGDKINKVREAAVEAAKAIMRCLSAYAVKLLLPSLLGSLLEAKWRTKMASLELLGAM 281 (569)
T ss_pred hHHHHHHHhhccchhhhHHHHHHHHHHHHhcCcchhhHhhhhhHHHHHHHhhhhHHHHHHHHHHH
Confidence 44455555555668999999999999999977788888998888877665445566666666643
No 128
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=44.63 E-value=1.2e+02 Score=23.78 Aligned_cols=51 Identities=16% Similarity=0.240 Sum_probs=28.8
Q ss_pred HHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHH
Q psy16982 53 VDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLY 110 (116)
Q Consensus 53 l~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~ 110 (116)
+.+|.+++..+|.....++..+.. ++.++.|...+.+.|+....+++..+.
T Consensus 153 ~~~L~d~d~~Vra~A~raLG~l~~-------~~a~~~L~~al~d~~~~VR~aA~~al~ 203 (410)
T TIGR02270 153 EAALTHEDALVRAAALRALGELPR-------RLSESTLRLYLRDSDPEVRFAALEAGL 203 (410)
T ss_pred HHHhcCCCHHHHHHHHHHHHhhcc-------ccchHHHHHHHcCCCHHHHHHHHHHHH
Confidence 333444445566665555555443 345566666677777766666665553
No 129
>KOG0414|consensus
Probab=44.39 E-value=1.2e+02 Score=27.35 Aligned_cols=90 Identities=19% Similarity=0.247 Sum_probs=59.5
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCCchh
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPGKWT 84 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~~Wp 84 (116)
++.++++|.+.|-++.. +|.+-.+.=-..++..|-.++ +.||..+...++-.|-. ||.--.
T Consensus 936 dp~Lq~AAtLaL~klM~-----------------iSa~fces~l~llftimeksp~p~IRsN~VvalgDlav~-fpnlie 997 (1251)
T KOG0414|consen 936 DPELQAAATLALGKLMC-----------------ISAEFCESHLPLLFTIMEKSPSPRIRSNLVVALGDLAVR-FPNLIE 997 (1251)
T ss_pred CHHHHHHHHHHHHHHhh-----------------hhHHHHHHHHHHHHHHHhcCCCceeeecchheccchhhh-cccccc
Confidence 46688888888776654 333322332345666666555 67888887777777766 674333
Q ss_pred HHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 85 QIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 85 ~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
..-+.+...+...++.....++.+|..++
T Consensus 998 ~~T~~Ly~rL~D~~~~vRkta~lvlshLI 1026 (1251)
T KOG0414|consen 998 PWTEHLYRRLRDESPSVRKTALLVLSHLI 1026 (1251)
T ss_pred hhhHHHHHHhcCccHHHHHHHHHHHHHHH
Confidence 34456677788888888888998888764
No 130
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=44.24 E-value=87 Score=20.29 Aligned_cols=36 Identities=25% Similarity=0.199 Sum_probs=29.3
Q ss_pred CCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 80 PGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 80 P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..=.+.+..|-..|.++|++....+|.+|..+++.
T Consensus 37 ~~~~kea~~~l~krl~~~~~~vq~~aL~lld~lvkN 72 (140)
T PF00790_consen 37 PDGAKEAARALRKRLKHGNPNVQLLALTLLDALVKN 72 (140)
T ss_dssp TTHHHHHHHHHHHHHTTSSHHHHHHHHHHHHHHHHH
T ss_pred CccHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHc
Confidence 444467888888888899999999999999888764
No 131
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=44.15 E-value=58 Score=30.97 Aligned_cols=63 Identities=10% Similarity=0.115 Sum_probs=47.7
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHhccCCCch----hHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW----TQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W----p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
.|++.|.+++..++...+.+++.|+...=...+ .+.+|.|++++.+++......+..+|.-++
T Consensus 450 ~LV~LL~s~s~~iQ~~A~~~L~nLa~~ndenr~aIieaGaIP~LV~LL~s~~~~iqeeAawAL~NLa 516 (2102)
T PLN03200 450 LLISLLGLSSEQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETGSQKAKEDSATVLWNLC 516 (2102)
T ss_pred HHHHHHcCCCHHHHHHHHHHHHHHHcCCHHHHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 456666666778888888899988864212455 578999999999988888888877777665
No 132
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=43.70 E-value=1.5e+02 Score=22.89 Aligned_cols=65 Identities=14% Similarity=0.238 Sum_probs=45.0
Q ss_pred HHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCC--CchhHHHHHHHHhh-CCCChHHHhHHHHHHHHHH
Q psy16982 48 IRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFP--GKWTQIVDKVSIYL-QNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 48 IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P--~~Wp~Ll~~l~~~l-~~~~~~~~~~~L~~L~~l~ 113 (116)
+-+.++....+.. ...|...+.+++.++.. || +...++++.+...+ .+++......++.++.-+.
T Consensus 190 ll~~l~~~~~~~~~~~~~~~~~~~la~LvNK-~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~Wi~ 258 (415)
T PF12460_consen 190 LLQSLLNLALSSEDEFSRLAALQLLASLVNK-WPDDDDLDEFLDSLLQSISSSEDSELRPQALEILIWIT 258 (415)
T ss_pred HHHHHHHHHHcCCChHHHHHHHHHHHHHHcC-CCChhhHHHHHHHHHhhhcccCCcchhHHHHHHHHHHH
Confidence 5556666665554 78999999999999999 77 35778888888777 3444445555555554433
No 133
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=43.11 E-value=68 Score=19.75 Aligned_cols=47 Identities=15% Similarity=0.367 Sum_probs=29.0
Q ss_pred HhHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHHHHhccCCCchhHHHHH
Q psy16982 42 EQDKAMIRDAIVDAVVM----APEVIRVQLAVCVSNIVKHDFPGKWTQIVDK 89 (116)
Q Consensus 42 ~~~k~~IK~~ll~~l~~----~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~ 89 (116)
..+-..+|..|-++|.+ +++.-+..+-.++..+... +|+.|..|+..
T Consensus 34 t~~~~~lK~~ipeal~t~C~KCt~kQK~~~~kv~~~l~~~-~P~~w~~l~~K 84 (95)
T PF03392_consen 34 TPEGKELKKVIPEALKTNCAKCTPKQKENARKVIKFLKKN-YPDEWEELVKK 84 (95)
T ss_dssp HHHHHHHHHHHHHHHHCTTTTS-HHHHHHHHHHHHHHHHH-HHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHccccCCCHHHHHHHHHHHHHHHHc-CHHHHHHHHHH
Confidence 44556788888888876 2334444444455555444 68899877653
No 134
>PF05397 Med15_fungi: Mediator complex subunit 15; InterPro: IPR008626 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This family represents subunit 15 of the Mediator complex in fungi. It contains Saccharomyces cerevisiae GAL11 (Med15) protein. Gal11 (Med15) and Sin4 (Med16) proteins are S. cerevisiae global transcription factors that regulate transcription of a variety of genes, both positively and negatively. Gal11, in a major part, functions in the activation of transcription, whereas Sin4 has an opposite role [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=43.07 E-value=74 Score=20.31 Aligned_cols=41 Identities=22% Similarity=0.182 Sum_probs=24.9
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHH
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDA 55 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~ 55 (116)
.+..+|++-.=.-||.+.-.-- ...+|+++|+.||+.|-.+
T Consensus 4 ~~~~ik~l~~e~~~~~~~l~~v----------~~~ls~eeK~~i~~~l~~~ 44 (115)
T PF05397_consen 4 LPERIKQLYEEVSRNPVRLSPV----------TNSLSPEEKAAIRQQLQEI 44 (115)
T ss_pred hHHHHHHHHHHHHhcCCCCCcc----------cccCCHHHHHHHHHHHHHH
Confidence 3445565555555555443322 1259999999999887443
No 135
>PF05085 DUF685: Protein of unknown function (DUF685); InterPro: IPR007777 This family consists of uncharacterised proteins from Borrelia species. There is some evidence to suggest that the proteins may be outer surface proteins.
Probab=42.93 E-value=92 Score=22.98 Aligned_cols=82 Identities=15% Similarity=0.202 Sum_probs=57.4
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhH
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQ 85 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~ 85 (116)
|.++--..||.+||+.+.-=..... | .=-.--|+.||+.+-+-|.....-|......+|..+...|- ....+
T Consensus 30 ddg~~s~~ait~~~fl~~~~~~tfk--~-----e~~~~fk~iik~~ia~el~~~~~fi~~iy~kii~kLi~n~s-~~~~~ 101 (265)
T PF05085_consen 30 DDGVASSNAITYKNFLKTTKDKTFK--G-----EGLGYFKEIIKSTIAEELAADKDFIEKIYIKIIDKLINNES-SKLSN 101 (265)
T ss_pred ccccccccceeHHHHHHHHhhhhhc--c-----CCcHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHhcccc-hhHHH
Confidence 3455566789999998753222111 1 11245789999999999988888888888889999988773 56777
Q ss_pred HHHHHHHhhC
Q psy16982 86 IVDKVSIYLQ 95 (116)
Q Consensus 86 Ll~~l~~~l~ 95 (116)
++..+-+.+.
T Consensus 102 ~f~Kiks~l~ 111 (265)
T PF05085_consen 102 LFSKIKSRLT 111 (265)
T ss_pred HHHHHHHHHh
Confidence 7777766553
No 136
>PF08767 CRM1_C: CRM1 C terminal; InterPro: IPR014877 CRM1 (also known as Exportin1) mediates the nuclear export of proteins bearing a leucine-rich nuclear export signal (NES). CRM1 forms a complex with the NES containing protein and the small GTPase Ran. This region forms an alpha helical structure formed by six helical hairpin motifs that are structurally similar to the HEAT repeat, but share little sequence similarity to the HEAT repeat []. ; PDB: 3M1I_C 3GB8_A 1W9C_A 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D.
Probab=42.63 E-value=86 Score=23.50 Aligned_cols=55 Identities=20% Similarity=0.271 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHHHHhccC------C-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 61 EVIRVQLAVCVSNIVKHDF------P-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~------P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..|..+-..+..|..+-| | +....+++.++-.+.+.+.+....||.+|..++++
T Consensus 134 Pe~r~~ff~LL~~i~~~~f~~l~~lp~~~f~~~idsi~wg~kh~~~~I~~~~L~~l~~ll~~ 195 (319)
T PF08767_consen 134 PEHRVNFFKLLRAINEHCFPALLQLPPEQFKLVIDSIVWGFKHTNREISETGLNILLELLNN 195 (319)
T ss_dssp HHHHHHHHHHHHHHHHHHTHHHHHS-HHHHHHHHHHHHHHHTSSSHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHhHHHHHcCCHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHH
Confidence 6778777777777766543 3 56778888888888888999999999999998875
No 137
>TIGR02548 casB_cse2 CRISPR system CASCADE complex protein CasB/Cse2. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This family is found in Ecoli subtype CRISPR/Cas regions of many bacteria, most of which are mesophiles, and not in Archaea. It was designated Cse2 originally, and renamed CasB based on its characterization in the CASCADE complex.
Probab=42.38 E-value=99 Score=20.41 Aligned_cols=48 Identities=8% Similarity=0.033 Sum_probs=38.4
Q ss_pred HHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhh
Q psy16982 47 MIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL 94 (116)
Q Consensus 47 ~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l 94 (116)
.+...+-.++...+ ..+-.++-.+|..+...+.|-+|+.|...|...-
T Consensus 94 ~~e~Rf~~Ll~a~~~~~l~~~Lrrlv~ll~~~~~~vd~~~La~dL~~W~ 142 (159)
T TIGR02548 94 ASEKRFRALLEADARDELLRRLRRLVSLLKSLAISVDYAQLARDLLRWN 142 (159)
T ss_pred hHHHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhc
Confidence 34556666665553 6888889999999999999999999999998874
No 138
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=41.78 E-value=94 Score=19.97 Aligned_cols=50 Identities=18% Similarity=0.113 Sum_probs=35.0
Q ss_pred HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
-.+--.+|+.|... +..=-+.+..|-..+.++|+.....+|.+|..+++.
T Consensus 18 ~~~il~icd~I~~~-----~~~~k~a~raL~krl~~~n~~vql~AL~lLd~~vkN 67 (133)
T cd03561 18 WALNLELCDLINLK-----PNGPKEAARAIRKKIKYGNPHVQLLALTLLELLVKN 67 (133)
T ss_pred HHHHHHHHHHHhCC-----CCCHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHh
Confidence 34555556655543 334456788888888888888888999888887763
No 139
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=41.68 E-value=81 Score=19.19 Aligned_cols=72 Identities=17% Similarity=0.188 Sum_probs=51.9
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchh
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWT 84 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp 84 (116)
-..++|--|-+.|++.|.++- .+......|-..++..|.++++-|--....+++.+|... |+
T Consensus 15 p~~PvRa~gL~~L~~Li~~~~--------------~~~~~~~~il~l~l~~L~d~DsyVYL~aI~~L~~La~~~-p~--- 76 (92)
T PF10363_consen 15 PLPPVRAHGLVLLRKLIESKS--------------EPVIDIPKILDLFLSQLKDEDSYVYLNAIKGLAALADRH-PD--- 76 (92)
T ss_pred CCcchHHHHHHHHHHHHHcCC--------------cchhhHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHHHC-hH---
Confidence 346799999999999999755 122334566667778888888888888888899998874 53
Q ss_pred HHHHHHHHhh
Q psy16982 85 QIVDKVSIYL 94 (116)
Q Consensus 85 ~Ll~~l~~~l 94 (116)
+.++.++...
T Consensus 77 ~vl~~L~~~y 86 (92)
T PF10363_consen 77 EVLPILLDEY 86 (92)
T ss_pred HHHHHHHHHH
Confidence 2555555443
No 140
>KOG1241|consensus
Probab=41.66 E-value=65 Score=27.55 Aligned_cols=100 Identities=15% Similarity=0.153 Sum_probs=66.1
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC--C
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP--G 81 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P--~ 81 (116)
+.+..+|++|-=.|-|...-.=. +.-.+.+|++|=+...++-..++..|+...-.|+-+|+...|- .
T Consensus 185 e~s~~vRLaa~~aL~nsLef~~~-----------nF~~E~ern~iMqvvcEatq~~d~~i~~aa~~ClvkIm~LyY~~m~ 253 (859)
T KOG1241|consen 185 ETSAAVRLAALNALYNSLEFTKA-----------NFNNEMERNYIMQVVCEATQSPDEEIQVAAFQCLVKIMSLYYEFME 253 (859)
T ss_pred CCchhHHHHHHHHHHHHHHHHHH-----------hhccHhhhceeeeeeeecccCCcHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55678999998777776542111 0123456666666666666666789999999999999887663 1
Q ss_pred ch--hHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 82 KW--TQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 82 ~W--p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.+ +.|+....+...|.+++....|+..-..||+
T Consensus 254 ~yM~~alfaitl~amks~~deValQaiEFWstice 288 (859)
T KOG1241|consen 254 PYMEQALFAITLAAMKSDNDEVALQAIEFWSTICE 288 (859)
T ss_pred HHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHH
Confidence 11 3466666777777777766666666666665
No 141
>KOG1967|consensus
Probab=41.09 E-value=41 Score=29.31 Aligned_cols=53 Identities=21% Similarity=0.127 Sum_probs=39.9
Q ss_pred HHHHH-H-HHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 62 VIRVQ-L-AVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 62 ~i~~~-l-~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.||++ + ..+-+.|++.+ .-=|.++|.+.+.++++++....-+|.||...|.+|
T Consensus 232 ~I~reDL~~sLr~al~stP--~Fa~~~lp~LlEKL~as~~~~K~DsL~~L~ec~~~y 286 (1030)
T KOG1967|consen 232 TIRREDLKASLRSALVSTP--SFAPFALPLLLEKLNASDPSAKVDSLDTLNECCLKY 286 (1030)
T ss_pred cccHHHHHHHHHHHHhcCc--cchhhHHHHHHHHhccccchhhhhHHHHHHHHHHHh
Confidence 46644 3 33344555654 345789999999999999988888999999999988
No 142
>TIGR01914 cas_Csa4 CRISPR-associated protein, Csa4 family. CRISPR loci appear to be mobile elements with a wide host range. This model represents a protein that tends to be found near CRISPR repeats. The species range for this species, so far, is exclusively archaeal. It is found so far in only four different species, and includes two tandem genes in Pyrococcus furiosus DSM 3638. This subfamily is found in a CRISPR/Cas locus we designate APERN, so the family is designated Csa4, for CRISPR/Cas Subtype Protein 4.
Probab=41.05 E-value=81 Score=24.26 Aligned_cols=45 Identities=24% Similarity=0.346 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHhccCCCchhHHHHHHHH-----------hhCCCChHHHhHHHHHHH
Q psy16982 65 VQLAVCVSNIVKHDFPGKWTQIVDKVSI-----------YLQNPDATPWFGALLCLY 110 (116)
Q Consensus 65 ~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~-----------~l~~~~~~~~~~~L~~L~ 110 (116)
..+-+.|..+=..+|+ .||-|++.|.. .+-.+|.+..++++..+.
T Consensus 278 ~~LmdfI~~lK~r~~y-~~~kfvd~L~r~d~e~~~~L~~ai~~~~~~~~Ysa~R~~k 333 (354)
T TIGR01914 278 GVLMDFIAYLKARDFY-SWPKFVDFLARRDPEISLQLTDAILNGDEEAFYTALRELK 333 (354)
T ss_pred hHHHHHHHHHhhhhhc-chHHHHHHHhccChHHHHHHHHHHHcCChhHHHHHHHHHh
Confidence 4566677777676887 49999988872 222356666777776543
No 143
>PF12765 Cohesin_HEAT: HEAT repeat associated with sister chromatid cohesion
Probab=40.95 E-value=53 Score=16.86 Aligned_cols=36 Identities=19% Similarity=0.214 Sum_probs=19.0
Q ss_pred HHHHHhccCC--CchhHHHHHHHHhhCCCChHHHhHHHH
Q psy16982 71 VSNIVKHDFP--GKWTQIVDKVSIYLQNPDATPWFGALL 107 (116)
Q Consensus 71 i~~Ia~~d~P--~~Wp~Ll~~l~~~l~~~~~~~~~~~L~ 107 (116)
++.|+..| | -.-|++...+...+..+++.-..+++.
T Consensus 3 l~~iv~~d-p~ll~~~~v~~~i~~rl~D~s~~VR~aav~ 40 (42)
T PF12765_consen 3 LSSIVEKD-PTLLDSSDVQSAIIRRLSDSSPSVREAAVD 40 (42)
T ss_pred HHHHHhcC-ccccchHHHHHHHHHHhcCCChHHHHHHHH
Confidence 44445444 3 234556666666666555555555554
No 144
>PF14664 RICTOR_N: Rapamycin-insensitive companion of mTOR, N-term
Probab=39.90 E-value=78 Score=24.39 Aligned_cols=65 Identities=17% Similarity=0.303 Sum_probs=48.1
Q ss_pred HHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhH
Q psy16982 10 RQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQ 85 (116)
Q Consensus 10 R~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~ 85 (116)
|..++...=-.+-+.|..- ..++-++...+|+ |+.+|..+...+|..+-+++-.+.+...| .|.+
T Consensus 213 ~l~~s~~ai~~~LrsW~GL---------l~l~~~~~~~lks-Lv~~L~~p~~~ir~~Ildll~dllrik~p-~w~~ 277 (371)
T PF14664_consen 213 RLQASAKAISTLLRSWPGL---------LYLSMNDFRGLKS-LVDSLRLPNPEIRKAILDLLFDLLRIKPP-SWTE 277 (371)
T ss_pred HHHHHHHHHHHHHhcCCce---------eeeecCCchHHHH-HHHHHcCCCHHHHHHHHHHHHHHHCCCCC-Cccc
Confidence 6666666666667888742 2345455466775 88899888888999999999999999877 5654
No 145
>KOG2149|consensus
Probab=39.78 E-value=1.9e+02 Score=22.78 Aligned_cols=99 Identities=12% Similarity=0.119 Sum_probs=59.8
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCc--
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGK-- 82 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~-- 82 (116)
.+..+|+-|.--||..++.|-. .-...+..+=+.+.+.+.+.+..+|..+...+..++...-|+.
T Consensus 70 hNakvRkdal~glkd~l~s~p~-------------~l~~~~~~ll~~~~~~i~D~~~~vR~~~~qll~~~i~~~~~e~~s 136 (393)
T KOG2149|consen 70 HNAKVRKDALNGLKDLLKSHPA-------------ELQSHLYALLQKLRELILDDDSLVRDALYQLLDSLILPACKEDQS 136 (393)
T ss_pred chHhhhHHHHHHHHHHHHhChH-------------HHHHHHHHHHHHhhhhhcCccccHHHHHHHHHHHHHhhcchhhhc
Confidence 4456777777777777666332 1122333344444555555666777777666666555554433
Q ss_pred --hhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 83 --WTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 83 --Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
-+=+++.+...+..-.+....-++..|.-+.+.|
T Consensus 137 p~~~l~~~yi~~AMThit~~i~~dslkfL~~Ll~~~ 172 (393)
T KOG2149|consen 137 PMVSLLMPYISSAMTHITPEIQEDSLKFLSLLLERY 172 (393)
T ss_pred chHHHHHHHHHHHHhhccHHHHHhhHHHHHHHHHHc
Confidence 3446666666666666777777777777777765
No 146
>KOG0213|consensus
Probab=39.41 E-value=2e+02 Score=25.10 Aligned_cols=31 Identities=23% Similarity=0.149 Sum_probs=24.2
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhc
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKH 77 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~ 77 (116)
+|-..++..|.+.+.++|.|.+..++.||..
T Consensus 799 qi~stiL~rLnnksa~vRqqaadlis~la~V 829 (1172)
T KOG0213|consen 799 QICSTILWRLNNKSAKVRQQAADLISSLAKV 829 (1172)
T ss_pred HHHHHHHHHhcCCChhHHHHHHHHHHHHHHH
Confidence 4556677777778889999999998888754
No 147
>PF01603 B56: Protein phosphatase 2A regulatory B subunit (B56 family); InterPro: IPR002554 Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits, this family is called the B56 family [].; GO: 0008601 protein phosphatase type 2A regulator activity, 0007165 signal transduction, 0000159 protein phosphatase type 2A complex; PDB: 2NYM_B 2NYL_B 2IAE_E 2NPP_B 3FGA_B 2JAK_A.
Probab=38.85 E-value=1.1e+02 Score=23.90 Aligned_cols=41 Identities=22% Similarity=0.396 Sum_probs=29.8
Q ss_pred CCCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhcc
Q psy16982 38 FSLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHD 78 (116)
Q Consensus 38 ~~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d 78 (116)
.++.++.+..++..|+.+..... .....+++.|+......|
T Consensus 208 ~plk~eh~~fl~~vllPLh~~~~~~~y~~~L~~~~~~f~~kd 249 (409)
T PF01603_consen 208 VPLKEEHKQFLRKVLLPLHKSPHLSSYHQQLSYCVVQFLEKD 249 (409)
T ss_dssp SS--HHHHHHHHHTTGGGGGSTGGGGTHHHHHHHHHHHHHH-
T ss_pred CCCcHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHhC
Confidence 36788999999999998887765 456777888887777766
No 148
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=38.66 E-value=2e+02 Score=22.82 Aligned_cols=63 Identities=11% Similarity=0.030 Sum_probs=40.5
Q ss_pred HHHHHhcCcHHHHHHHHHHHHHHHhccCCCch----hHHHHHHHHhhCCC-ChHHHhHHHHHHHHHHh
Q psy16982 52 IVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKW----TQIVDKVSIYLQNP-DATPWFGALLCLYQLVK 114 (116)
Q Consensus 52 ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~W----p~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~~ 114 (116)
++..|.+.+..|....+.+++.++.......= +-+++.+.+.+.++ +...+..|+.||..+++
T Consensus 106 fl~lL~~~d~~i~~~a~~iLt~l~~~~~~~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~v~~L~~LL~ 173 (429)
T cd00256 106 FFNLLNRQDQFIVHMSFSILAKLACFGLAKMEGSDLDYYFNWLKEQLNNITNNDYVQTAARCLQMLLR 173 (429)
T ss_pred HHHHHcCCchhHHHHHHHHHHHHHhcCccccchhHHHHHHHHHHHHhhccCCcchHHHHHHHHHHHhC
Confidence 34455556678899999999999876422111 22444555555554 45678888888887764
No 149
>KOG2056|consensus
Probab=38.43 E-value=83 Score=24.11 Aligned_cols=47 Identities=19% Similarity=0.255 Sum_probs=34.0
Q ss_pred HHHHHHHHhccCC-CchhHHHHHHHHhhCC--CChHHHhHHHHHHHHHHh
Q psy16982 68 AVCVSNIVKHDFP-GKWTQIVDKVSIYLQN--PDATPWFGALLCLYQLVK 114 (116)
Q Consensus 68 ~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~--~~~~~~~~~L~~L~~l~~ 114 (116)
+.+++.||..-|- ...-++|+.|...+.. .+-..++.+|.+|.++++
T Consensus 39 ~~lm~eIA~~ty~~~e~~eIm~vi~kRl~d~gknWR~VyKaLtlleyLl~ 88 (336)
T KOG2056|consen 39 GTLMAEIAQATYNFVEYQEIMDVLWKRLNDSGKNWRHVYKALTLLEYLLK 88 (336)
T ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHh
Confidence 3456666655443 5677788888887764 366789999999999876
No 150
>PF15601 Imm42: Immunity protein 42
Probab=37.67 E-value=1e+02 Score=20.38 Aligned_cols=52 Identities=12% Similarity=0.109 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHhccCC-CchhHHHHHHHH-hhCCCChHHHhHHHHHHHHHHhh
Q psy16982 64 RVQLAVCVSNIVKHDFP-GKWTQIVDKVSI-YLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 64 ~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~-~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
-+.+-.+|+.-...+-| .+||-|+..+.+ .+..++......=|..++..+++
T Consensus 18 l~sFFsti~~~lE~~~wGskfP~Lm~~LY~g~L~~~~~~~A~~eL~~I~~~l~~ 71 (134)
T PF15601_consen 18 LHSFFSTISYRLENEGWGSKFPLLMNELYRGYLRYEELEKALKELEEIRKELKK 71 (134)
T ss_pred HHHHHHHHHHHhhccCCCCcchHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHhc
Confidence 34455566666666666 588888888854 34444444444444444444443
No 151
>PF02944 BESS: BESS motif; InterPro: IPR004210 The BESS domain has been named after the three proteins that originally defined the domain: BEAF (Boundary element associated factor 32) [], Suvar(3)7 [] and Stonewall []). The BESS domain is 40 amino acid residues long and is predicted to be composed of three alpha helices, as such it might be related to the myb/SANT HTH domain. The BESS domain directs a variety of protein-protein interactions, including interactions with itself, with Dorsal, and with a TBP-associated factor. It is found in a single copy in Drosophila proteins and is often associated with the MADF domain [, , ]. Proteins known to contain a BESS domain include: Drosophila Boundary element associated factor 32 (BEAF-32). Drosophila Suppressor of variegation protein 3-7 (Su(var)3-7), which could play a role in chromosome condensation. Drosophila Ravus, which is homologous to the C-terminal part of Su(var)3-7 []. Drosophila Stonewall (Stwl), a putative transcription factor required for maintenance of female germline stem cells as well as oocyte differentiation. Drosophila Adf-1, a transcription factor first identified on the basis of its interaction with the alcohol dehydrogenase promoter but that binds the promoters of a diverse group of genes []. Drosophila Dorsal-interacting protein 3 (Dip3). It functions both as an activator to bind DNA in a sequence specific manner and a coactivator to stimulate synergistic activation by Dorsal and Twist []. ; GO: 0003677 DNA binding
Probab=37.65 E-value=56 Score=16.21 Aligned_cols=20 Identities=15% Similarity=0.184 Sum_probs=16.2
Q ss_pred CCCHhHHHHHHHHHHHHHhc
Q psy16982 39 SLHEQDKAMIRDAIVDAVVM 58 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~ 58 (116)
.+++..|..+|..+.+++.+
T Consensus 17 ~L~~~~k~~~k~~i~~ll~e 36 (37)
T PF02944_consen 17 RLPPKQKLKFKMKILQLLFE 36 (37)
T ss_pred hCCHHHHHHHHHHHHHHHHh
Confidence 58888889999888887753
No 152
>PF08785 Ku_PK_bind: Ku C terminal domain like; InterPro: IPR014893 The non-homologous end joining (NHEJ) pathway is one method by which double stranded breaks in chromosomal DNA are repaired. Ku is a component of a multi-protein complex that is involved in the NHEJ. Ku has affinity for DNA ends and recruits the DNA-dependent protein kinase catalytic subunit (DNA-PKcs). This domain is found at the C-terminal of Ku which binds to DNA-PKcs []. ; GO: 0016817 hydrolase activity, acting on acid anhydrides; PDB: 1RW2_A 1Q2Z_A 3ISM_C.
Probab=37.58 E-value=1.1e+02 Score=19.46 Aligned_cols=55 Identities=11% Similarity=0.224 Sum_probs=37.9
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHH----HhccCCCchhHHHHHHHHhhCCC
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNI----VKHDFPGKWTQIVDKVSIYLQNP 97 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~I----a~~d~P~~Wp~Ll~~l~~~l~~~ 97 (116)
+.-+.+++.|.+++.++...-..++.+||..+ ...+-|+.|.+++..|-..+.+.
T Consensus 22 ~A~~qM~~vI~~Lv~~s~~~~y~kalecl~~lR~~~i~~~ep~~yN~Fl~~LK~~~~~~ 80 (120)
T PF08785_consen 22 KAIQQMKNVIEQLVSDSGDQNYDKALECLRALREECIEEEEPDEYNDFLRKLKKKLLSK 80 (120)
T ss_dssp HHHHHHHHHHHHHHHCSHCHHHHHHHHHHHHHHHHHHHHT-CHHHHHHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhc
Confidence 44566777888888777334455566666655 35666899999999998876554
No 153
>KOG1061|consensus
Probab=37.40 E-value=1.2e+02 Score=25.74 Aligned_cols=55 Identities=7% Similarity=0.059 Sum_probs=37.8
Q ss_pred cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 60 PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 60 ~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
+....+....+|..+|..- .+|.+.++.++..+...-.-.+.-+..+++.++++|
T Consensus 363 D~~fvrkaIraig~~aik~--e~~~~cv~~lLell~~~~~yvvqE~~vvi~dilRky 417 (734)
T KOG1061|consen 363 DVDFVRKAVRAIGRLAIKA--EQSNDCVSILLELLETKVDYVVQEAIVVIRDILRKY 417 (734)
T ss_pred CHHHHHHHHHHhhhhhhhh--hhhhhhHHHHHHHHhhcccceeeehhHHHHhhhhcC
Confidence 3455566667777777663 467888999998888554344555667777777776
No 154
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=37.20 E-value=1.4e+02 Score=21.78 Aligned_cols=62 Identities=13% Similarity=0.138 Sum_probs=43.9
Q ss_pred HHHHHHHhcC-cHHHHHHHHHHHHHHHhccCC-Cchh--HHHHHHHHhhCCCChHHHhHHHHHHHH
Q psy16982 50 DAIVDAVVMA-PEVIRVQLAVCVSNIVKHDFP-GKWT--QIVDKVSIYLQNPDATPWFGALLCLYQ 111 (116)
Q Consensus 50 ~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~d~P-~~Wp--~Ll~~l~~~l~~~~~~~~~~~L~~L~~ 111 (116)
+.|+..|... ++.++.....++...+.+.+- +-+- +-++-+.+.+.++++.....+|.+|.-
T Consensus 15 ~~Ll~lL~~t~dp~i~e~al~al~n~aaf~~nq~~Ir~~Ggi~lI~~lL~~p~~~vr~~AL~aL~N 80 (254)
T PF04826_consen 15 QKLLCLLESTEDPFIQEKALIALGNSAAFPFNQDIIRDLGGISLIGSLLNDPNPSVREKALNALNN 80 (254)
T ss_pred HHHHHHHhcCCChHHHHHHHHHHHhhccChhHHHHHHHcCCHHHHHHHcCCCChHHHHHHHHHHHh
Confidence 4677777754 478888888888887765543 2332 457778888888888777777777764
No 155
>KOG0168|consensus
Probab=37.19 E-value=1.1e+02 Score=26.71 Aligned_cols=67 Identities=16% Similarity=0.216 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHHhc-----CcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHH----------hhCCCChHHHhHHHHHH
Q psy16982 45 KAMIRDAIVDAVVM-----APEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSI----------YLQNPDATPWFGALLCL 109 (116)
Q Consensus 45 k~~IK~~ll~~l~~-----~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~----------~l~~~~~~~~~~~L~~L 109 (116)
++++| .|+..|.+ ....||.....+|-++..+-= ++|+..++. .+++.|...+.+||.+.
T Consensus 550 q~F~~-~llpVLveVYsSsA~~~VR~kcL~Ailrlvy~s~----seli~slLk~~~vSS~lAG~lsskD~~vlVgALQvA 624 (1051)
T KOG0168|consen 550 QSFGK-DLLPVLVEVYSSSANPDVRYKCLSAILRLVYFSN----SELIGSLLKNTNVSSHLAGMLSSKDLTVLVGALQVA 624 (1051)
T ss_pred HHHHH-HHHHHHHHHHhccCCchhhHHHHHHHHHHHhhCC----HHHHHHHHhcchHHHHHHhhhhcCCCeeEeehHHHH
Confidence 34444 34555543 236899886666666654432 666666653 34456788889999999
Q ss_pred HHHHhhC
Q psy16982 110 YQLVKNY 116 (116)
Q Consensus 110 ~~l~~~y 116 (116)
.-|||||
T Consensus 625 EiLmeKl 631 (1051)
T KOG0168|consen 625 EILMEKL 631 (1051)
T ss_pred HHHHHHh
Confidence 9999987
No 156
>PF11625 DUF3253: Protein of unknown function (DUF3253); InterPro: IPR021660 This bacterial family of proteins has no known function. ; PDB: 2NS0_A.
Probab=36.68 E-value=20 Score=21.69 Aligned_cols=42 Identities=21% Similarity=0.416 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHhcCc--HHHHHHHHHHHHHHHhccCCCchhHHHHHHHH
Q psy16982 45 KAMIRDAIVDAVVMAP--EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSI 92 (116)
Q Consensus 45 k~~IK~~ll~~l~~~~--~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~ 92 (116)
.+.|...|+.+|..-. +.| |=+.+|+.-+|++|-+||+.+-.
T Consensus 5 ~~~l~~~Il~ll~~R~~~kti------CPSevARal~~~~WR~lm~~vR~ 48 (83)
T PF11625_consen 5 DARLEAAILALLAARGPGKTI------CPSEVARALGPDDWRDLMPPVRA 48 (83)
T ss_dssp HHHHHHHHHHHHHHS-TT--B-------HHHHHHHH-TTS-GGGHHHHHH
T ss_pred HHHHHHHHHHHHHhcCCCCcc------CHHHHHHHHCchhhHHHHHHHHH
Confidence 3567888888887632 232 22344444488889999998854
No 157
>PF14663 RasGEF_N_2: Rapamycin-insensitive companion of mTOR RasGEF_N domain
Probab=36.55 E-value=80 Score=19.94 Aligned_cols=29 Identities=17% Similarity=0.115 Sum_probs=21.6
Q ss_pred HHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 86 IVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 86 Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
-++-+++.+...+.+.+..|+.+|...|.
T Consensus 9 ~i~lLv~QL~D~~~~V~~~A~~iL~e~c~ 37 (115)
T PF14663_consen 9 GIELLVTQLYDPSPEVVAAALEILEEACE 37 (115)
T ss_pred HHHHHHHHhcCCCHHHHHHHHHHHHHHHh
Confidence 35667777777777778888888888775
No 158
>PF13001 Ecm29: Proteasome stabiliser; InterPro: IPR024372 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). Ecm29 tethers the proteasome core particle to the regulatory particle, stabilising the interaction between these two components [, , ].
Probab=36.50 E-value=1.9e+02 Score=23.10 Aligned_cols=73 Identities=11% Similarity=0.069 Sum_probs=46.3
Q ss_pred CCCHhHHHHHHHHHHHH----Hh--------cCcHHHHHHHHHHHHHHHhccCC---CchhHHHHHHHHhhCCCChHHHh
Q psy16982 39 SLHEQDKAMIRDAIVDA----VV--------MAPEVIRVQLAVCVSNIVKHDFP---GKWTQIVDKVSIYLQNPDATPWF 103 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~----l~--------~~~~~i~~~l~~~i~~Ia~~d~P---~~Wp~Ll~~l~~~l~~~~~~~~~ 103 (116)
.+++..-..++..|+.. +. +.....|...-++|+.|++.+.. ++| +++..++..+..++++..-
T Consensus 354 ~~~~~~l~~l~~~i~~~g~p~~~~~~~~~~~~~~~~lR~~aYe~lG~L~~~~p~l~~~d~-~li~~LF~sL~~~~~evr~ 432 (501)
T PF13001_consen 354 HISPQILKLLRPVILSQGWPLIQDSSSQSNSSEDIELRSLAYETLGLLAKRAPSLFSKDL-SLIEFLFDSLEDESPEVRV 432 (501)
T ss_pred hcCHHHHHHHHHHHHhcCccccccccccCCCcccHHHHHHHHHHHHHHHccCcccccccH-HHHHHHHHHhhCcchHHHH
Confidence 35555545555555544 41 01257899999999999998643 354 4999999999766665544
Q ss_pred HHHHHHHHH
Q psy16982 104 GALLCLYQL 112 (116)
Q Consensus 104 ~~L~~L~~l 112 (116)
+.-.+|..+
T Consensus 433 sIqeALssl 441 (501)
T PF13001_consen 433 SIQEALSSL 441 (501)
T ss_pred HHHHHHHHH
Confidence 444444443
No 159
>KOG1848|consensus
Probab=36.38 E-value=3.6e+02 Score=25.13 Aligned_cols=48 Identities=17% Similarity=0.256 Sum_probs=37.1
Q ss_pred HHHHHHHHHhc------------cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 67 LAVCVSNIVKH------------DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 67 l~~~i~~Ia~~------------d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.+.+|+.||+. +|-+.|.+|++++-..+.+++++...+++.++..+..
T Consensus 1073 ~~ltisgIaklf~e~fk~llnln~f~~vwe~ll~flkrl~s~~s~e~slsai~~~qell~ 1132 (1610)
T KOG1848|consen 1073 SCLTISGIAKLFSENFKLLLNLNGFLDVWEELLQFLKRLHSDISPEISLSAIKALQELLF 1132 (1610)
T ss_pred hhhhHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHhcCChHhHHHHHHHHHHHHH
Confidence 56667777754 2334899999999999999999988888888877653
No 160
>PF10130 PIN_2: PIN domain; InterPro: IPR019298 This entry represents a set of bacterial and archaeal proteins that are predicted to be RNases (from similarities to 5'-exonucleases).
Probab=35.56 E-value=1.1e+02 Score=20.03 Aligned_cols=49 Identities=16% Similarity=0.361 Sum_probs=27.4
Q ss_pred CCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCCchhHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPGKWTQIVD 88 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~ 88 (116)
.+++++-..+-..+.+.+.-.+ ..+......+...+...| |++||-+-=
T Consensus 51 ~l~~~~~~~~l~~l~~~I~iv~~~~~~~~~~~A~~~~~~~D-~~D~p~vAL 100 (133)
T PF10130_consen 51 KLSEEELEEVLNILFSRIKIVPEEIYSENIEEAREIIRDRD-PDDWPFVAL 100 (133)
T ss_pred CCCHHHHHHHHHHHHhheEEecHHHhHHHHHHHHHHhcCCC-cchHHHHHH
Confidence 3555554444444444433333 344556666777777777 788885433
No 161
>PF04439 Adenyl_transf: Streptomycin adenylyltransferase; InterPro: IPR007530 Also known as aminoglycoside 6-adenylyltransferase (2.7.7 from EC), this protein confers resistance to aminoglycoside antibiotics.; PDB: 2PBE_A.
Probab=35.05 E-value=96 Score=22.96 Aligned_cols=73 Identities=21% Similarity=0.308 Sum_probs=44.6
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHH-HHH-hccCC----------Cch--hHHHHHHHHhhCCCChHHHhHHHHH
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVS-NIV-KHDFP----------GKW--TQIVDKVSIYLQNPDATPWFGALLC 108 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~-~Ia-~~d~P----------~~W--p~Ll~~l~~~l~~~~~~~~~~~L~~ 108 (116)
-.|..-|..|+-+...-...+|..|-..|+ .|+ .++|. .+| |+....+......++...+..+|.+
T Consensus 166 VaKgL~R~El~yA~~~~~~~~r~~Ll~mL~W~ig~~~~f~~~~Gk~gK~l~~~L~~~~~~~l~~ty~~~~~~~iw~aL~~ 245 (282)
T PF04439_consen 166 VAKGLWRNELWYAKDMLDQIMRPELLKMLEWYIGIKTGFSVSVGKNGKYLKKYLPPEIWERLLSTYSDSDYEDIWQALFA 245 (282)
T ss_dssp HHHHHHTT-HHHHHHHHHHTHHHHHHHHHHHHHHHHH--EEE--GGGTTGGGTS-HHHHHHHHTT---SSHHHHHHHHHH
T ss_pred HHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCCCccchhHHHHCCHHHHHHHHHHhCCCCHHHHHHHHHH
Confidence 345566666666655544456777777777 443 33432 256 5677777777778899988999888
Q ss_pred HHHHHhh
Q psy16982 109 LYQLVKN 115 (116)
Q Consensus 109 L~~l~~~ 115 (116)
...++++
T Consensus 246 ~~~LF~~ 252 (282)
T PF04439_consen 246 MCDLFRE 252 (282)
T ss_dssp HHHHHHH
T ss_pred HHHHHHH
Confidence 8887764
No 162
>PF14620 YPEB: YpeB sporulation
Probab=34.54 E-value=2e+02 Score=22.18 Aligned_cols=71 Identities=14% Similarity=0.288 Sum_probs=54.2
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.+++++...++ -|-.....|..+|-.+-..|..-.+ +|-++-..+-......+...+.+++..+.+=++.|
T Consensus 111 ~Lt~~e~~tL~-----~L~~~s~~l~~~L~~~~~~v~~~~l--~w~d~~~~~~~~~~~~~~~~i~~~f~~v~~~~~~y 181 (361)
T PF14620_consen 111 PLTDEEYKTLK-----ELYEQSGELNKELQDVQNKVLSGNL--RWMDVEKALASELQPQDNTIIDGGFKTVEKQVQGY 181 (361)
T ss_pred CCCHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhCCC--chHHHHHhhhhccccCCCcccccHHHHHHHhhccC
Confidence 68888887755 3444556788888888888877765 89998888888777777777778888887766655
No 163
>KOG0413|consensus
Probab=34.27 E-value=2.3e+02 Score=25.49 Aligned_cols=105 Identities=18% Similarity=0.182 Sum_probs=59.2
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHH----------------------hcCcHHH
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAV----------------------VMAPEVI 63 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l----------------------~~~~~~i 63 (116)
+.-+|..+.|.|-|.+-.|-.--.-. -......+--.+-..||++++-++ .++..-|
T Consensus 944 ~~~vra~~vvTlakmcLah~~LaKr~-~P~lvkeLe~~~~~aiRnNiV~am~D~C~~YTam~d~YiP~I~~~L~Dp~~iV 1022 (1529)
T KOG0413|consen 944 SDKVRAVGVVTLAKMCLAHDRLAKRL-MPMLVKELEYNTAHAIRNNIVLAMGDICSSYTAMTDRYIPMIAASLCDPSVIV 1022 (1529)
T ss_pred chHHHHHHHHHHHHHHhhhhHHHHHH-HHHHHHHHHhhhHHHHhcceeeeehhhHHHHHHHHHHhhHHHHHHhcCchHHH
Confidence 45678888888888776654311000 000000122233455666554333 3344679
Q ss_pred HHHHHHHHHHHHhccCCCchhH--HHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 64 RVQLAVCVSNIVKHDFPGKWTQ--IVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 64 ~~~l~~~i~~Ia~~d~P~~Wp~--Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
|+|-...++.+...||- +|.+ ++.++..++.++ ++...-|=.++..++
T Consensus 1023 Rrqt~ilL~rLLq~~~v-Kw~G~Lf~Rf~l~l~D~~-edIr~~a~f~~~~vL 1072 (1529)
T KOG0413|consen 1023 RRQTIILLARLLQFGIV-KWNGELFIRFMLALLDAN-EDIRNDAKFYISEVL 1072 (1529)
T ss_pred HHHHHHHHHHHHhhhhh-hcchhhHHHHHHHHcccC-HHHHHHHHHHHHHHH
Confidence 99999999999999994 8854 556666666653 333333444444444
No 164
>PF12463 DUF3689: Protein of unknown function (DUF3689) ; InterPro: IPR022162 This family of proteins is found in eukaryotes. Proteins in this family are typically between 399 and 797 amino acids in length.
Probab=34.19 E-value=2e+02 Score=21.64 Aligned_cols=73 Identities=15% Similarity=0.228 Sum_probs=52.0
Q ss_pred HhHHHHHHHHHHHHHhcCc--HHHHHHHHHHHHHHHhccCC--C-ch---hHHHHHHHHhhCCC---ChHHHhHHHHHHH
Q psy16982 42 EQDKAMIRDAIVDAVVMAP--EVIRVQLAVCVSNIVKHDFP--G-KW---TQIVDKVSIYLQNP---DATPWFGALLCLY 110 (116)
Q Consensus 42 ~~~k~~IK~~ll~~l~~~~--~~i~~~l~~~i~~Ia~~d~P--~-~W---p~Ll~~l~~~l~~~---~~~~~~~~L~~L~ 110 (116)
...+..+=..|++.+.+++ ...|-.++.||....+--=| + .| -+|++.++..+-++ +....++...+|+
T Consensus 90 ~~~~~gLl~kIi~~l~~e~~~s~~RfwLa~cVESfLRg~t~~~~Q~fl~~~GLLe~lv~eil~~~~~~~~v~Q~~FDLLG 169 (303)
T PF12463_consen 90 CKEKKGLLSKIIHVLKKEPIDSSYRFWLARCVESFLRGATSYADQAFLAERGLLEHLVSEILSDGCMSQEVLQSNFDLLG 169 (303)
T ss_pred ccccccHHHHHHHHHHhCCCchhHHHHHHHHHHHHHcCCCcHHHHHHHHhcchHHHHHHHHhcCccchHHHHHHHHHHHH
Confidence 3445556678888888876 68899999999999876322 1 12 24888898876654 2356778888888
Q ss_pred HHHh
Q psy16982 111 QLVK 114 (116)
Q Consensus 111 ~l~~ 114 (116)
+++|
T Consensus 170 ELiK 173 (303)
T PF12463_consen 170 ELIK 173 (303)
T ss_pred HHHC
Confidence 8876
No 165
>KOG1291|consensus
Probab=34.01 E-value=78 Score=25.53 Aligned_cols=45 Identities=20% Similarity=0.423 Sum_probs=30.4
Q ss_pred HHHHHHHHhccCCCchhHHHHHHHHhhC--CCChHHHhHHHHHHHHHH
Q psy16982 68 AVCVSNIVKHDFPGKWTQIVDKVSIYLQ--NPDATPWFGALLCLYQLV 113 (116)
Q Consensus 68 ~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~--~~~~~~~~~~L~~L~~l~ 113 (116)
+.=++++++.+|| +||..+=-++.-+. ..|...+.|.-.+|+-++
T Consensus 88 G~hLAe~Cr~~Yp-k~~~~~Lwi~aEiAiI~sDiqEVIGTAiAlniL~ 134 (503)
T KOG1291|consen 88 GKHLAEICREEYP-KWPRMVLWIMAEIAIIASDIQEVIGTAIALNILS 134 (503)
T ss_pred cHHHHHHHHHHcc-ccHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHh
Confidence 3447899999998 89987776665443 356666666555555443
No 166
>KOG4500|consensus
Probab=33.97 E-value=1.3e+02 Score=24.30 Aligned_cols=66 Identities=12% Similarity=0.084 Sum_probs=48.7
Q ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-Cch--hHHHHHHHHhhCC-----CChHHHhHHHHHHHHHH
Q psy16982 48 IRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GKW--TQIVDKVSIYLQN-----PDATPWFGALLCLYQLV 113 (116)
Q Consensus 48 IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~W--p~Ll~~l~~~l~~-----~~~~~~~~~L~~L~~l~ 113 (116)
+-+.++..+-+.+..+...-+.+|+..|+.|-- -.. .++++.+++++.. +|....|++|.+|+.++
T Consensus 316 ~l~~~~sw~~S~d~~l~t~g~LaigNfaR~D~~ci~~v~~~~~nkL~~~l~~~~~vdgnV~~qhA~lsALRnl~ 389 (604)
T KOG4500|consen 316 FLDFLESWFRSDDSNLITMGSLAIGNFARRDDICIQLVQKDFLNKLISCLMQEKDVDGNVERQHACLSALRNLM 389 (604)
T ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhhhccchHHHHHHHHHHHHHHHHHHHHhcCCCccchhHHHHHHHHHhcc
Confidence 445666666667778888888999999999843 111 4677777777764 57889999999998764
No 167
>COG5181 HSH155 U2 snRNP spliceosome subunit [RNA processing and modification]
Probab=33.71 E-value=1.5e+02 Score=25.29 Aligned_cols=51 Identities=8% Similarity=0.003 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC-ChHHHhHHHHHHHHHHh
Q psy16982 61 EVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP-DATPWFGALLCLYQLVK 114 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~~ 114 (116)
.-+|+..+.+++.+|..- .-|+|+|+|-....|. +-+..+.|+.|..+||+
T Consensus 295 eYVRnvt~ra~~vva~al---gv~~llpfl~a~c~SrkSw~aRhTgiri~qqI~~ 346 (975)
T COG5181 295 EYVRNVTGRAVGVVADAL---GVEELLPFLEALCGSRKSWEARHTGIRIAQQICE 346 (975)
T ss_pred HHHHHHHHHHHHHHHHhh---CcHHHHHHHHHHhcCccchhhhchhhHHHHHHHH
Confidence 579999999999999883 6789999999988875 66788899999888885
No 168
>PF08158 NUC130_3NT: NUC130/3NT domain; InterPro: IPR012977 This N-terminal domain is found in a novel nucleolar protein family defined by NUC130/133 [].
Probab=33.28 E-value=49 Score=18.09 Aligned_cols=18 Identities=17% Similarity=0.318 Sum_probs=9.4
Q ss_pred CCCchhHHHHHHHHhhCC
Q psy16982 79 FPGKWTQIVDKVSIYLQN 96 (116)
Q Consensus 79 ~P~~Wp~Ll~~l~~~l~~ 96 (116)
||+.=.++-+++.+.+.+
T Consensus 9 Yp~~~~~Fp~~L~~lL~~ 26 (52)
T PF08158_consen 9 YPKETKDFPQELIDLLRN 26 (52)
T ss_pred cHHHHHHHHHHHHHHHHh
Confidence 565444555555555543
No 169
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=33.18 E-value=1.2e+02 Score=20.02 Aligned_cols=49 Identities=18% Similarity=0.150 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 62 VIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..--.+|+.|. .+ |..=-+-+..|...+++.|++....+|.+|..++++
T Consensus 20 ~~ileicD~In----~~-~~~~k~a~rai~krl~~~n~~v~l~AL~LLe~~vkN 68 (139)
T cd03567 20 EAIQAFCEQIN----KE-PEGPQLAVRLLAHKIQSPQEKEALQALTVLEACMKN 68 (139)
T ss_pred HHHHHHHHHHH----cC-CccHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence 33444555543 23 333346777888888888888888888888887763
No 170
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=33.14 E-value=3.6e+02 Score=26.13 Aligned_cols=63 Identities=10% Similarity=0.060 Sum_probs=48.5
Q ss_pred HHHHHhcCcHHHHHHHHHHHHHHHhccCCCchh-----HHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 52 IVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWT-----QIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 52 ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp-----~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
++..|...+..++...+.+++.+++.-.+++-. +.++-+++++.++|...+..++.+|..+++
T Consensus 656 LV~LLss~~~~v~keAA~AL~nL~~~~~~~q~~~~v~~GaV~pL~~LL~~~d~~v~e~Al~ALanLl~ 723 (2102)
T PLN03200 656 CIKLLTNNTEAVATQSARALAALSRSIKENRKVSYAAEDAIKPLIKLAKSSSIEVAEQAVCALANLLS 723 (2102)
T ss_pred HHHHHhcCChHHHHHHHHHHHHHHhCCCHHHHHHHHHcCCHHHHHHHHhCCChHHHHHHHHHHHHHHc
Confidence 455555566789999999999999743333322 468889999999999999999999988875
No 171
>PF14961 BROMI: Broad-minded protein
Probab=33.14 E-value=1.6e+02 Score=26.62 Aligned_cols=64 Identities=16% Similarity=0.245 Sum_probs=46.5
Q ss_pred HHHHHHHhcC-cHHHHHHHHHHHHHHHhccCC--CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 50 DAIVDAVVMA-PEVIRVQLAVCVSNIVKHDFP--GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 50 ~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~d~P--~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
+.+++-|... |..+|....+++-.+--.|-. +.||.|-..|...|+.+|+.....+|....+.+
T Consensus 164 q~i~d~ld~~~P~evR~eAlq~Lc~~p~SDVls~E~W~~L~~~L~~~LsDpD~~is~~~L~f~Ak~f 230 (1296)
T PF14961_consen 164 QLIADKLDPGQPKEVRLEALQILCSAPPSDVLSCESWSVLRENLTDALSDPDPEISDASLRFHAKMF 230 (1296)
T ss_pred HHHHHhcCCCCchHHHHHHHHHHhcCChhhccccccHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhc
Confidence 3445545443 367998877777666555555 589999999999999999888777777666554
No 172
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=33.08 E-value=33 Score=19.27 Aligned_cols=18 Identities=11% Similarity=0.122 Sum_probs=12.5
Q ss_pred hHHHHHHHHhhCCCChHH
Q psy16982 84 TQIVDKVSIYLQNPDATP 101 (116)
Q Consensus 84 p~Ll~~l~~~l~~~~~~~ 101 (116)
|++++.+++.+..+||..
T Consensus 21 P~lL~~lLqql~~~nP~l 38 (59)
T PF09280_consen 21 PQLLPPLLQQLGQSNPQL 38 (59)
T ss_dssp GGGHHHHHHHHHCCSHHH
T ss_pred HHHHHHHHHHHhccCHHH
Confidence 567777777777776653
No 173
>smart00544 MA3 Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press
Probab=33.00 E-value=1.2e+02 Score=18.53 Aligned_cols=20 Identities=10% Similarity=0.321 Sum_probs=9.0
Q ss_pred HHhccCCCchhHHHHHHHHh
Q psy16982 74 IVKHDFPGKWTQIVDKVSIY 93 (116)
Q Consensus 74 Ia~~d~P~~Wp~Ll~~l~~~ 93 (116)
+-....|+..++++..++..
T Consensus 25 l~~L~~~~~~~~vv~~~i~~ 44 (113)
T smart00544 25 LLELKLPEQHHEVVKVLLTC 44 (113)
T ss_pred HHHhCCCcchHHHHHHHHHH
Confidence 33333344445555555443
No 174
>cd03564 ANTH_AP180_CALM ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machine
Probab=32.85 E-value=1.3e+02 Score=18.84 Aligned_cols=34 Identities=18% Similarity=0.182 Sum_probs=21.6
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..-++++..|...+...|.-.+.-+|.+|+.+++
T Consensus 33 ~~~~~~~~~l~~Rl~~~~w~v~~K~LillH~llr 66 (117)
T cd03564 33 ASIPSFASALSRRLLDRNWVVVLKALILLHRLLR 66 (117)
T ss_pred CCHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHh
Confidence 4555666666666655666666667777776654
No 175
>PF10232 Med8: Mediator of RNA polymerase II transcription complex subunit 8; InterPro: IPR019364 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Arc32, or Med8, is one of the subunits of the Mediator complex of RNA polymerase II. The region conserved contains two alpha helices putatively necessary for binding to other subunits within the core of the Mediator complex. The N terminus of Med8 binds to the essential core Head part of Mediator and the C terminus hinges to Med18 on the non-essential part of the Head that also includes Med20 []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex; PDB: 3C0T_B 3RJ1_J 2HZS_I.
Probab=32.78 E-value=1.9e+02 Score=20.73 Aligned_cols=31 Identities=16% Similarity=0.491 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHH-hccCCCchhHHHHHHHH
Q psy16982 61 EVIRVQLAVCVSNIV-KHDFPGKWTQIVDKVSI 92 (116)
Q Consensus 61 ~~i~~~l~~~i~~Ia-~~d~P~~Wp~Ll~~l~~ 92 (116)
..+.+.+...+..|. ..+.| .||.|+....-
T Consensus 21 ~qL~~SL~~l~~~L~~~~~lp-~W~slq~qf~i 52 (226)
T PF10232_consen 21 AQLKHSLQSLIDKLEQSQPLP-PWPSLQDQFAI 52 (226)
T ss_dssp HHHHHHHHHHHHHHT-T-SS----HHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHccCCCC-CcHHHHHHHHH
Confidence 356677777777777 44444 99999876643
No 176
>KOG1851|consensus
Probab=32.68 E-value=4.3e+02 Score=24.93 Aligned_cols=74 Identities=23% Similarity=0.189 Sum_probs=51.0
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCC--CChHHHhHHHHHHHHHH
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQN--PDATPWFGALLCLYQLV 113 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~--~~~~~~~~~L~~L~~l~ 113 (116)
..+..|+-|+..+...+.+.+-.+|...+.+++.+.+.-|-..=++--.......++ .+....|||...|..++
T Consensus 1562 ~~~~~r~dI~~l~~s~l~D~~i~vre~Aa~~Lsgl~~~s~~~~~~~k~d~~~~~~~s~s~~~i~~HgavlgLgA~V 1637 (1710)
T KOG1851|consen 1562 SQELRRDDIRKLLESLLNDDQIEVREEAAKCLSGLLQGSKFQFVSDKRDTTSNILQSKSKDEIKAHGAVLGLGAIV 1637 (1710)
T ss_pred ccchhHHHHHHHHHHHHcchHHHHHHHHHHHHHHHHhccccccchHhhhhhhhhhhhcchHHHHhhhhHHHHHHHH
Confidence 466788889988888888887789999999999998775542222333333333332 36677888888887764
No 177
>PF12054 DUF3535: Domain of unknown function (DUF3535); InterPro: IPR022707 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 439 to 459 amino acids in length. This domain is found associated with PF00271 from PFAM, PF02985 from PFAM, and PF00176 from PFAM. This domain has two completely conserved residues (P and K) that may be functionally important.
Probab=32.65 E-value=2.1e+02 Score=22.59 Aligned_cols=52 Identities=25% Similarity=0.356 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHHhccCC-CchhHHHHH-HHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 63 IRVQLAVCVSNIVKHDFP-GKWTQIVDK-VSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 63 i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~-l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
-|-..+.+++.++++ +| ..+.+++.. +...|++........|-.++.+.++.
T Consensus 103 ~Ri~aA~ALG~l~~~-~~~~~~~~~~~~~L~~~L~S~sa~qR~~aalvl~ewa~~ 156 (441)
T PF12054_consen 103 ARIAAAKALGLLLSY-WPESSLQEIFQPLLLPYLNSPSATQRLLAALVLEEWAKA 156 (441)
T ss_pred HHHHHHHHHHHHHHh-cccchHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHh
Confidence 455567888888887 46 788888885 88999998888878887788777654
No 178
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=32.22 E-value=2e+02 Score=21.03 Aligned_cols=47 Identities=17% Similarity=0.140 Sum_probs=33.5
Q ss_pred cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHH
Q psy16982 60 PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQ 111 (116)
Q Consensus 60 ~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~ 111 (116)
...+|...+..+..+.... +++.+.+...++..+......++..|..
T Consensus 193 ~~~vr~~Aa~aL~~~~~~~-----~~~~~~l~~~~~~~~~~vr~~~~~~l~~ 239 (335)
T COG1413 193 DADVRRAAASALGQLGSEN-----VEAADLLVKALSDESLEVRKAALLALGE 239 (335)
T ss_pred hHHHHHHHHHHHHHhhcch-----hhHHHHHHHHhcCCCHHHHHHHHHHhcc
Confidence 3467888777887777665 5677888888888777776666666543
No 179
>KOG1967|consensus
Probab=32.18 E-value=2e+02 Score=25.37 Aligned_cols=69 Identities=19% Similarity=0.228 Sum_probs=48.3
Q ss_pred HHHHHHHHHHHHhc----CcHHHHHHHHHHHHHHHhccCC-----CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 45 KAMIRDAIVDAVVM----APEVIRVQLAVCVSNIVKHDFP-----GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 45 k~~IK~~ll~~l~~----~~~~i~~~l~~~i~~Ia~~d~P-----~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
|+.+=..+++.|.+ .+..+++.+-.+++.+... -| .+-|.|+|-|++.++=.|+......+.|++-+..
T Consensus 861 kQRfF~~ivP~l~~~~~t~~~~~K~~yl~~LshVl~~-vP~~vllp~~~~LlPLLLq~Ls~~D~~v~vstl~~i~~~l~ 938 (1030)
T KOG1967|consen 861 KQRFFCDIVPILVSKFETAPGSQKHNYLEALSHVLTN-VPKQVLLPQFPMLLPLLLQALSMPDVIVRVSTLRTIPMLLT 938 (1030)
T ss_pred HHHHHHhhHHHHHHHhccCCccchhHHHHHHHHHHhc-CCHHhhccchhhHHHHHHHhcCCCccchhhhHhhhhhHHHH
Confidence 34443444444443 4456677777777777664 45 3889999999999999998888888888876654
No 180
>KOG0168|consensus
Probab=32.02 E-value=81 Score=27.49 Aligned_cols=64 Identities=14% Similarity=0.135 Sum_probs=44.7
Q ss_pred HHHHHHhcC-c--HHHH--HHHHHHHHHHHhc---cCCCchhHHHHHHHHhhCCC-ChHHHhHHHHHHHHHHhhC
Q psy16982 51 AIVDAVVMA-P--EVIR--VQLAVCVSNIVKH---DFPGKWTQIVDKVSIYLQNP-DATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 51 ~ll~~l~~~-~--~~i~--~~l~~~i~~Ia~~---d~P~~Wp~Ll~~l~~~l~~~-~~~~~~~~L~~L~~l~~~y 116 (116)
.||+.|..+ + .++. .++|+.+..--.- -|| -..|+|.|+.+++.+ |.+....|..+|.++|+.|
T Consensus 171 kLL~gL~~~~Des~Qleal~Elce~L~mgnEesLs~fp--v~slvp~Lv~LL~~E~n~DIMl~AcRaltyl~evl 243 (1051)
T KOG0168|consen 171 KLLQGLQAESDESQQLEALTELCEMLSMGNEESLSGFP--VKSLVPVLVALLSHEHNFDIMLLACRALTYLCEVL 243 (1051)
T ss_pred HHHHhccccCChHHHHHHHHHHHHHHhhcchhhhcccc--HHHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhc
Confidence 566777665 3 2332 5577776543211 233 347999999999875 7888999999999999875
No 181
>COG5098 Chromosome condensation complex Condensin, subunit D2 [Chromatin structure and dynamics / Cell division and chromosome partitioning]
Probab=32.02 E-value=2.2e+02 Score=24.64 Aligned_cols=65 Identities=17% Similarity=0.195 Sum_probs=45.5
Q ss_pred HHHHHHHhcCcHHHHHHHHHHHHHHHhc---c------CCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 50 DAIVDAVVMAPEVIRVQLAVCVSNIVKH---D------FPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 50 ~~ll~~l~~~~~~i~~~l~~~i~~Ia~~---d------~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
+.+.++|.+++-.+|-.+.++.+.++.. | ||..-.+|+.-+++.++..+|-...-+|.++.+||.
T Consensus 302 ~~~~~LLdses~tlRc~~~EicaN~V~~~~~d~qm~e~~~~~~~~Lv~ll~ERl~D~~py~RtKalqv~~kifd 375 (1128)
T COG5098 302 EHFDELLDSESFTLRCCFLEICANLVEHFKKDGQMVEHYKQKLNDLVGLLVERLSDTYPYTRTKALQVLEKIFD 375 (1128)
T ss_pred HHHHHHhcccchhHHHHHHHHHHHHHHHHhcchhhHhhHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHh
Confidence 4556666666667777776666666533 2 333455677777778888888888899999999885
No 182
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=31.88 E-value=1.9e+02 Score=22.66 Aligned_cols=23 Identities=30% Similarity=0.215 Sum_probs=10.2
Q ss_pred HHHhhCCCChHHHhHHHHHHHHH
Q psy16982 90 VSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 90 l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
+...+++.|+.....++.+|.++
T Consensus 152 L~~~L~d~d~~Vra~A~raLG~l 174 (410)
T TIGR02270 152 LEAALTHEDALVRAAALRALGEL 174 (410)
T ss_pred HHHHhcCCCHHHHHHHHHHHHhh
Confidence 33334444444444444444443
No 183
>KOG2745|consensus
Probab=31.63 E-value=2.2e+02 Score=21.51 Aligned_cols=31 Identities=19% Similarity=0.138 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHhccCC-CchhHHHHHHHHhh
Q psy16982 64 RVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYL 94 (116)
Q Consensus 64 ~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l 94 (116)
-..++..|..|.+.|-+ +-.-.|+|.|+..+
T Consensus 175 Ytg~~~Sl~~I~kqEG~~GfFaGLvP~LLGdl 206 (321)
T KOG2745|consen 175 YTGLVGSLATIGKQEGIAGFFAGLVPRLLGDL 206 (321)
T ss_pred hhHHHHHHHHHHHhcchhhHHhhhhHHHHhhH
Confidence 35677889999999988 56677888777643
No 184
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=31.23 E-value=1.7e+02 Score=19.93 Aligned_cols=22 Identities=18% Similarity=0.377 Sum_probs=20.1
Q ss_pred CCCHhHHHHHHHHHHHHHhcCc
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAP 60 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~ 60 (116)
.|++++|+.++..+=.++.+.|
T Consensus 97 eLs~deke~~~~sl~dL~~d~P 118 (158)
T PF10083_consen 97 ELSPDEKEQFKESLPDLTKDTP 118 (158)
T ss_pred cCCHHHHHHHHhhhHHHhhcCC
Confidence 6999999999999999998876
No 185
>PF01603 B56: Protein phosphatase 2A regulatory B subunit (B56 family); InterPro: IPR002554 Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits, this family is called the B56 family [].; GO: 0008601 protein phosphatase type 2A regulator activity, 0007165 signal transduction, 0000159 protein phosphatase type 2A complex; PDB: 2NYM_B 2NYL_B 2IAE_E 2NPP_B 3FGA_B 2JAK_A.
Probab=30.99 E-value=2.6e+02 Score=21.78 Aligned_cols=57 Identities=5% Similarity=0.130 Sum_probs=32.3
Q ss_pred CCHhHHHHHHHHHHHHHhcCc---HHHHHHHHHHHHHHHh-ccCCCchhHHHHHHHHhhCC
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAP---EVIRVQLAVCVSNIVK-HDFPGKWTQIVDKVSIYLQN 96 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~---~~i~~~l~~~i~~Ia~-~d~P~~Wp~Ll~~l~~~l~~ 96 (116)
-++.+|+.+|+-+......-. +.||+.+...+..... .+-|..=.++|..+.+.++.
T Consensus 145 ~D~rER~~lk~~l~~iy~k~~~~r~~Ir~~i~~~~~~fi~e~~~~~gI~elLeil~sii~g 205 (409)
T PF01603_consen 145 PDPRERDYLKTILHRIYGKFPNLRSFIRKSINNIFYRFIYETERHNGIAELLEILGSIING 205 (409)
T ss_dssp STHHHHHHHHHHHHHHHHH-TTTHHHHHHHHHHHHHHHHHTTS--STHHHHHHHHHHHHTT
T ss_pred CCHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhcCcccccCHHHHHHHHHHHHhc
Confidence 456677777776666665532 4566666666665444 33334555666666665554
No 186
>PF12783 Sec7_N: Guanine nucleotide exchange factor in Golgi transport N-terminal
Probab=30.94 E-value=1.6e+02 Score=19.46 Aligned_cols=103 Identities=13% Similarity=0.080 Sum_probs=59.4
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHHHHhc---c
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVM----APEVIRVQLAVCVSNIVKH---D 78 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~----~~~~i~~~l~~~i~~Ia~~---d 78 (116)
+...|.+|-=.+.+.+..+-..-.. .. =.+.-...+|+.+..++.. ++..+-.....++..+... .
T Consensus 35 ~~~~k~l~LeLl~~iL~~~~~~f~~------~~-~~~~l~~~lk~~l~~~Ll~~~~~~~~~i~~~slri~~~l~~~~~~~ 107 (168)
T PF12783_consen 35 DERSKLLSLELLESILENHGSVFRS------SE-EHPSLINLLKDDLCPALLKNLSSSDFPIFSRSLRIFLTLLSRFRSH 107 (168)
T ss_pred hHHHHHHHHHHHHHHHHhCHHHHhC------Cc-chHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 4567777777777777765431100 00 0125556677776666654 3344444455555555433 2
Q ss_pred CCCchhHHHHHHHH-hhCCCC--hHHHhHHHHHHHHHHhh
Q psy16982 79 FPGKWTQIVDKVSI-YLQNPD--ATPWFGALLCLYQLVKN 115 (116)
Q Consensus 79 ~P~~Wp~Ll~~l~~-~l~~~~--~~~~~~~L~~L~~l~~~ 115 (116)
+..+=.-+++.+.. .+++++ ......+|.+++.+|+.
T Consensus 108 Lk~ele~~l~~i~~~il~~~~~~~~~k~~~Le~l~~l~~~ 147 (168)
T PF12783_consen 108 LKLELEVFLSHIILRILESDNSSLWQKELALEILRELCKD 147 (168)
T ss_pred HHHHHHHHHHHHHHHHHccCCCcHHHHHHHHHHHHHHHhC
Confidence 22344556777766 455443 46788899999999874
No 187
>PF09485 CRISPR_Cse2: CRISPR-associated protein Cse2 (CRISPR_cse2); InterPro: IPR013382 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents the Cse2 family of Cas proteins, which includes CT1973 from Chlorobium tepidum. These proteins are found in the CRISPR/Cas subtype Ecoli regions of many bacteria (most of which are mesophiles), and not in Archaea [].; PDB: 2ZCA_B.
Probab=30.78 E-value=82 Score=20.41 Aligned_cols=52 Identities=15% Similarity=0.243 Sum_probs=32.7
Q ss_pred HHHHHHHHHHHhcC-cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCC
Q psy16982 46 AMIRDAIVDAVVMA-PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNP 97 (116)
Q Consensus 46 ~~IK~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~ 97 (116)
..+...+-.++..+ ...+-.++-.+|..+...+.|-+|++|...|...-...
T Consensus 82 ~~~e~Rf~~Ll~a~~~~~l~~~Lrrlv~ll~~~~i~~d~~~La~dL~~W~~~~ 134 (146)
T PF09485_consen 82 DSSERRFRRLLRADTPEELYRHLRRLVRLLRSKGIPVDWAQLADDLYYWQDED 134 (146)
T ss_dssp TCHHHHHHHHHC---TTTHHHHHHHHHHHS-S----B-HHHHHHHHHTTT-TT
T ss_pred ccHHHHHHHHHhCCCcHHHHHHHHHHHHHhcccCCCCCHHHHHHHHHHHcCcc
Confidence 44556666666666 46788888899999998888899999999998765443
No 188
>KOG1077|consensus
Probab=30.67 E-value=3.6e+02 Score=23.37 Aligned_cols=54 Identities=20% Similarity=0.247 Sum_probs=32.7
Q ss_pred hHHHHHHHHHHHHHhcC-cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChH
Q psy16982 43 QDKAMIRDAIVDAVVMA-PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDAT 100 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~ 100 (116)
..|.+ ++.++..|..+ +..||+...+.+..++..+ +=.+++..+++.+.+.|+.
T Consensus 365 avK~h-~d~Ii~sLkterDvSirrravDLLY~mcD~~---Nak~IV~elLqYL~tAd~s 419 (938)
T KOG1077|consen 365 AVKKH-QDTIINSLKTERDVSIRRRAVDLLYAMCDVS---NAKQIVAELLQYLETADYS 419 (938)
T ss_pred HHHHH-HHHHHHHhccccchHHHHHHHHHHHHHhchh---hHHHHHHHHHHHHhhcchH
Confidence 33444 55666666643 3577777777777666553 4556777777777665543
No 189
>KOG3036|consensus
Probab=30.55 E-value=2.2e+02 Score=21.23 Aligned_cols=64 Identities=16% Similarity=0.073 Sum_probs=46.6
Q ss_pred HHHHHHHhcCc-----HHHHHHHHHHHHHHHhccCCC-----chhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 50 DAIVDAVVMAP-----EVIRVQLAVCVSNIVKHDFPG-----KWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 50 ~~ll~~l~~~~-----~~i~~~l~~~i~~Ia~~d~P~-----~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
-.+...|...+ ..+|-..--+|+.+++.|=.+ -|-|.+|..+.-+.+|+....-.+..++.+|+
T Consensus 123 lylYpfL~Tt~~~r~fEyLRLtsLGVIgaLvk~dd~eVi~fLl~TeIVPlCLrime~GSelSKtvA~fIlqKIl 196 (293)
T KOG3036|consen 123 LYLYPFLNTTSKSRPFEYLRLTSLGVIGALVKNDDQEVIRFLLTTEIVPLCLRIMESGSELSKTVATFILQKIL 196 (293)
T ss_pred hhhHHhhhccccCCchHHHhHHHHHHHHHHHhcCcHHHHHHHHHhhhHHHHHHHHhcccHHHHHHHHHHHHHHh
Confidence 34445554432 468888788888888887543 67889999999899888888777887776654
No 190
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=30.50 E-value=1.3e+02 Score=19.69 Aligned_cols=49 Identities=16% Similarity=0.075 Sum_probs=33.8
Q ss_pred HHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 62 VIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
..--.+|+.|.. + +..=-+.+..|-..+.++|++....+|.+|..+++.
T Consensus 23 ~~ileicD~In~----~-~~~~k~a~ral~krl~~~n~~vql~AL~LLe~~vkN 71 (142)
T cd03569 23 ASILEICDMIRS----K-DVQPKYAMRALKKRLLSKNPNVQLYALLLLESCVKN 71 (142)
T ss_pred HHHHHHHHHHhC----C-CCCHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHH
Confidence 444555555542 2 223347888888888888999989999988887764
No 191
>KOG0166|consensus
Probab=30.00 E-value=3.1e+02 Score=22.41 Aligned_cols=61 Identities=15% Similarity=0.139 Sum_probs=45.5
Q ss_pred HHHhcCcH-HHHHHHHHHHHHHHhccCC-Cch---hHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 54 DAVVMAPE-VIRVQLAVCVSNIVKHDFP-GKW---TQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 54 ~~l~~~~~-~i~~~l~~~i~~Ia~~d~P-~~W---p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+....+ .+.+++.=+++.+++.-=| -.| ..++|.|..++.+.|++..--+.-++.++.+
T Consensus 201 ~~l~~~~~~~~lRn~tW~LsNlcrgk~P~P~~~~v~~iLp~L~~ll~~~D~~Vl~Da~WAlsyLsd 266 (514)
T KOG0166|consen 201 RLLNKSDKLSMLRNATWTLSNLCRGKNPSPPFDVVAPILPALLRLLHSTDEEVLTDACWALSYLTD 266 (514)
T ss_pred HHhccccchHHHHHHHHHHHHHHcCCCCCCcHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhc
Confidence 33333333 5778889999999877534 444 4799999999999999888778888887753
No 192
>KOG4224|consensus
Probab=29.84 E-value=1.4e+02 Score=23.65 Aligned_cols=57 Identities=12% Similarity=0.026 Sum_probs=36.2
Q ss_pred HHHhcCcHHHHHHHHHHHHHHHhc-----cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHH
Q psy16982 54 DAVVMAPEVIRVQLAVCVSNIVKH-----DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLY 110 (116)
Q Consensus 54 ~~l~~~~~~i~~~l~~~i~~Ia~~-----d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~ 110 (116)
+++...+..++--.+.+|+.|+-. -.-+.=|.|++.++++..++++...-.+-.+|+
T Consensus 215 sll~s~d~dvqyycttaisnIaVd~~~Rk~Laqaep~lv~~Lv~Lmd~~s~kvkcqA~lALr 276 (550)
T KOG4224|consen 215 SLLKSGDLDVQYYCTTAISNIAVDRRARKILAQAEPKLVPALVDLMDDGSDKVKCQAGLALR 276 (550)
T ss_pred hhhccCChhHHHHHHHHhhhhhhhHHHHHHHHhcccchHHHHHHHHhCCChHHHHHHHHHHh
Confidence 333334456777777788877633 333456889999999999887754333333443
No 193
>PF08578 DUF1765: Protein of unknown function (DUF1765); InterPro: IPR013887 This entry represents a conserved region found in hypothetical proteins from fungi, mycetozoa and entamoebidae.
Probab=29.59 E-value=1.6e+02 Score=18.85 Aligned_cols=34 Identities=21% Similarity=0.361 Sum_probs=28.8
Q ss_pred CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 81 GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 81 ~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
=+|+-.+..+-..+++.+......+|..|+.+..
T Consensus 46 ~dw~fwl~~i~~ml~s~n~~~~~r~l~fLyn~w~ 79 (126)
T PF08578_consen 46 FDWDFWLDGIRMMLESDNVQSEIRALSFLYNIWD 79 (126)
T ss_pred cCHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHH
Confidence 3999999999999998888888888888876543
No 194
>cd07911 RNRR2_Rv0233_like Ribonucleotide Reductase R2-like protein, Mn/Fe-binding domain. Rv0233 is a Mycobacterium tuberculosis ribonucleotide reductase R2 protein with a heterodinuclear manganese/iron-carboxylate cofactor located in its metal center. The Rv0233-like family may represent a structural/functional counterpart of the evolutionary ancestor of the RNRR2's (Ribonucleotide Reductase, R2/beta subunit) and the bacterial multicomponent monooxygenases. RNRR2s belong to a broad superfamily of ferritin-like diiron-carboxylate proteins. The RNR protein catalyzes the conversion of ribonucleotides to deoxyribonucleotides and is found in prokaryotes and archaea. The catalytically active form of RNR is a proposed alpha2-beta2 tetramer. The homodimeric alpha subunit (R1) contains the active site and redox active cysteines as well as the allosteric binding sites.
Probab=29.11 E-value=1.2e+02 Score=22.16 Aligned_cols=41 Identities=10% Similarity=0.078 Sum_probs=27.0
Q ss_pred HHhhhhhcc-ccccccCCCC--CCccCCCCHhHHHHHHHHHHHH
Q psy16982 15 VYLKNLITN-QWVEKEVDPG--TPLAFSLHEQDKAMIRDAIVDA 55 (116)
Q Consensus 15 i~LKn~i~~-~w~~~~~~~~--~~~~~~i~~~~k~~IK~~ll~~ 55 (116)
=+.|+.... +|...+-... ..+|..+++++|..++..+-..
T Consensus 11 ~ly~~~~~~~~W~~~eid~s~D~~~w~~L~~~Er~~~~~~l~~f 54 (280)
T cd07911 11 KLFEKGKRKGFWNPADIDFSQDREDWEQLSEEERDLALRLCAGF 54 (280)
T ss_pred HHHHHHHccCCCCHHHcCccchHHHHHhCCHHHHHHHHHHHHHH
Confidence 356777788 8986542211 1246689999999999754443
No 195
>PF12243 CTK3: CTD kinase subunit gamma CTK3
Probab=28.99 E-value=1.8e+02 Score=19.30 Aligned_cols=61 Identities=8% Similarity=0.080 Sum_probs=40.8
Q ss_pred HHHHHHHHh---cCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 49 RDAIVDAVV---MAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 49 K~~ll~~l~---~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
|..+.+.|. .+...+++....++.. .+.=.+|.+.|++.+..++.+..-.-++.+-.+|+.
T Consensus 7 r~~F~~~L~~L~aS~qSi~kaa~fAlk~------~~~~edL~~cIle~le~~~lN~R~nI~~fID~l~e~ 70 (139)
T PF12243_consen 7 RMQFTQLLRRLNASQQSIQKAAQFALKN------RDMEEDLWSCILEQLEKENLNTRINIFYFIDSLCES 70 (139)
T ss_pred HHHHHHHHHHcchhHHHHHHHHHHHHHc------cccHHHHHHHHHHHHhccchhhHHHHHHHHHHHHHH
Confidence 344444443 3335677666666654 233457888888888888888888888888888764
No 196
>PF07346 DUF1477: Protein of unknown function (DUF1477); InterPro: IPR009946 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf19; it is a family of uncharacterised viral proteins.
Probab=28.66 E-value=1.5e+02 Score=19.19 Aligned_cols=31 Identities=16% Similarity=0.180 Sum_probs=20.0
Q ss_pred HHHHHHHhhC-CCCh--HHHhHHHHHHHHHHhhC
Q psy16982 86 IVDKVSIYLQ-NPDA--TPWFGALLCLYQLVKNY 116 (116)
Q Consensus 86 Ll~~l~~~l~-~~~~--~~~~~~L~~L~~l~~~y 116 (116)
++..+++.+- .+|- -...--+.+|.++++||
T Consensus 84 vLn~lV~FLi~~SDG~~lQc~in~~lLdylL~KY 117 (118)
T PF07346_consen 84 VLNFLVNFLIAHSDGDNLQCQINTQLLDYLLEKY 117 (118)
T ss_pred HHHHHHHHHHHcCCcchHHHHHHHHHHHHHHHhc
Confidence 5666666543 3333 33445678999999998
No 197
>PHA02855 anti-apoptotic membrane protein; Provisional
Probab=28.36 E-value=1.6e+02 Score=20.40 Aligned_cols=57 Identities=9% Similarity=0.209 Sum_probs=38.8
Q ss_pred CCHhHHHHHHHHHHHHHhc-CcHHHHHHHHHHHHHHHhccCCC----chhHHHHHHHHhhCC
Q psy16982 40 LHEQDKAMIRDAIVDAVVM-APEVIRVQLAVCVSNIVKHDFPG----KWTQIVDKVSIYLQN 96 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~-~~~~i~~~l~~~i~~Ia~~d~P~----~Wp~Ll~~l~~~l~~ 96 (116)
+.++.-..||+.++++|.. ..+.|.-+....++.|+..---+ -..+|++.+++.+++
T Consensus 69 i~~~nI~~IK~~iie~L~~D~rPSVKLA~iSLlSiIiek~~~kn~~~v~s~lid~I~~kiSe 130 (180)
T PHA02855 69 VTEENINDIKSQIIESLNNDNRPSVKLAIISLISMIAEKKGYKNNNIVMSDLINEIANKISE 130 (180)
T ss_pred hHHHHHHHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHhhh
Confidence 3344447899999999955 44778877777888887652211 245678888776665
No 198
>PF05004 IFRD: Interferon-related developmental regulator (IFRD); InterPro: IPR007701 Interferon-related developmental regulator (IFRD1) is the human homologue of the Rattus norvegicus early response protein PC4 and its murine homologue TIS7 []. The exact function of IFRD1 is unknown but it has been shown that PC4 is necessary for muscle differentiation and that it might have a role in signal transduction. This entry also contains IFRD2 and its murine equivalent SKMc15, which are highly expressed soon after gastrulation and in the hepatic primordium, suggesting an involvement in early hematopoiesis [].
Probab=28.18 E-value=2.6e+02 Score=20.90 Aligned_cols=94 Identities=12% Similarity=0.150 Sum_probs=55.9
Q ss_pred hHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhc-----cCCC
Q psy16982 7 MPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKH-----DFPG 81 (116)
Q Consensus 7 ~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~-----d~P~ 81 (116)
...|..|--.+.+.+..++... .-.+.+..+=+.++.++......-+...+.+++.++-. +-..
T Consensus 57 ~~~Re~aL~~l~~~l~~~~~~d-----------~v~~~~~tL~~~~~k~lkkg~~~E~~lA~~~l~Ll~ltlg~g~~~~e 125 (309)
T PF05004_consen 57 SSTREAALEALIRALSSRYLPD-----------FVEDRRETLLDALLKSLKKGKSEEQALAARALALLALTLGAGEDSEE 125 (309)
T ss_pred HHHHHHHHHHHHHHHHhcccHH-----------HHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhhhcCCCccHHH
Confidence 5678888888888888776521 22344566666777777665544444455556665544 1124
Q ss_pred chhHHHHHHHHhhCCCC--hHHHhHHHHHHHH
Q psy16982 82 KWTQIVDKVSIYLQNPD--ATPWFGALLCLYQ 111 (116)
Q Consensus 82 ~Wp~Ll~~l~~~l~~~~--~~~~~~~L~~L~~ 111 (116)
-+.++.|.|...+..+. +.....++.||.-
T Consensus 126 i~~~~~~~L~~~l~d~s~~~~~R~~~~~aLai 157 (309)
T PF05004_consen 126 IFEELKPVLKRILTDSSASPKARAACLEALAI 157 (309)
T ss_pred HHHHHHHHHHHHHhCCccchHHHHHHHHHHHH
Confidence 66788888888777543 2333444545443
No 199
>PF07923 N1221: N1221-like protein; InterPro: IPR012486 The sequences featured in this family are similar to a hypothetical protein product of ORF N1221 in the CPT1-SPC98 intergenic region of the yeast genome (P53917 from SWISSPROT). This encodes an acidic polypeptide with several possible transmembrane regions [].
Probab=27.21 E-value=2.4e+02 Score=20.77 Aligned_cols=59 Identities=14% Similarity=0.155 Sum_probs=40.5
Q ss_pred hhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC
Q psy16982 18 KNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF 79 (116)
Q Consensus 18 Kn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~ 79 (116)
|+.+...|...... ..|...+++.+...=..+++.|...+...|.....+|-.|+.=-|
T Consensus 34 k~~Fe~~~~~~~~~---~~W~~~~~~~~~~~i~~ll~~L~~~~~~~R~~al~~LlYi~~G~~ 92 (293)
T PF07923_consen 34 KKAFEQDWRSFCQD---PSWDELSFDQRKDFIEKLLDQLESSDSEDRLEALRALLYIAQGTW 92 (293)
T ss_pred HHHHHHHhhhhhcc---ccccccchhhHHHHHHHHHHhccccchhhHHHHHHHHHHHHcCCc
Confidence 67777777643221 236778887777777788888888777777777777777765444
No 200
>PF09133 SANTA: SANTA (SANT Associated); InterPro: IPR015216 The SANTA domain (SANT associated) is approximately 90 amino acids in length and is conserved in eukaryotes. It is sometimes found in association with the SANT domain (IPR001005 from INTERPRO, also known as the Myb-like DNA-binding domain) implying a putative function in regulating chromatin remodelling. Sequence analysis has showed that the SANTA domain is likely to form four central beta-sheets with three flanking alpha-helices. Many conserved hydrophobic residues are present which implies a possible role in protein-protein interactions.
Probab=27.11 E-value=33 Score=21.03 Aligned_cols=13 Identities=23% Similarity=0.961 Sum_probs=10.6
Q ss_pred ccCCCchhHHHHH
Q psy16982 77 HDFPGKWTQIVDK 89 (116)
Q Consensus 77 ~d~P~~Wp~Ll~~ 89 (116)
.-||.+|.+++..
T Consensus 79 ~GFP~~W~~~~~~ 91 (93)
T PF09133_consen 79 NGFPENWEEYINE 91 (93)
T ss_pred cCCCHHHHHHHHh
Confidence 3799999998765
No 201
>PF09241 Herp-Cyclin: Herpesviridae viral cyclin; InterPro: IPR015322 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. This entry represents a domain found in a family of viral cyclins that specifically activate CDK6 of host cells to a very high degree []. This domain adopts a helical structure consisting of five alpha-helices, with one helix surrounded by the others.; PDB: 1XO2_A 1JOW_A 2F2C_A 2EUF_A 1BU2_A.
Probab=27.09 E-value=96 Score=18.88 Aligned_cols=35 Identities=26% Similarity=0.445 Sum_probs=18.7
Q ss_pred cCC-CchhHHHHHHH----HhhCCCChHHHhHHHHHHHHH
Q psy16982 78 DFP-GKWTQIVDKVS----IYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 78 d~P-~~Wp~Ll~~l~----~~l~~~~~~~~~~~L~~L~~l 112 (116)
..| +-||.|++.-. ..+-..|....--+|.|-..+
T Consensus 16 kipe~~wpql~e~~s~tickaliqpniall~p~licaggl 55 (106)
T PF09241_consen 16 KIPEDFWPQLFEATSITICKALIQPNIALLPPCLICAGGL 55 (106)
T ss_dssp T--GGGHHHHHHHHHHHHHHHTTSGGGGGS-HHHHHHHHH
T ss_pred cCcHHHhHHHHHHHHHHHHHHHcCCCccccCcceeecccc
Confidence 346 68999887543 334445555555555555544
No 202
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=27.03 E-value=2.6e+02 Score=20.51 Aligned_cols=55 Identities=15% Similarity=0.061 Sum_probs=27.6
Q ss_pred HHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhC-CCChHHHhHHHHHHHHH
Q psy16982 51 AIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQ-NPDATPWFGALLCLYQL 112 (116)
Q Consensus 51 ~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~-~~~~~~~~~~L~~L~~l 112 (116)
.+...+.+....+|.+.+.++ +....| +-++.++..++ +.|......+...|..+
T Consensus 78 ~l~~~l~d~~~~vr~~a~~aL---g~~~~~----~a~~~li~~l~~d~~~~vR~~aa~aL~~~ 133 (335)
T COG1413 78 LLRELLSDEDPRVRDAAADAL---GELGDP----EAVPPLVELLENDENEGVRAAAARALGKL 133 (335)
T ss_pred HHHHHhcCCCHHHHHHHHHHH---HccCCh----hHHHHHHHHHHcCCcHhHHHHHHHHHHhc
Confidence 334444444456666655533 333333 34455555555 35555555666665544
No 203
>KOG2160|consensus
Probab=27.03 E-value=3e+02 Score=21.25 Aligned_cols=95 Identities=16% Similarity=0.214 Sum_probs=60.6
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCCC-
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFPG- 81 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P~- 81 (116)
+.+..+|-.||=.+-..+.++-..... +-+ ..-.+.|+..+..++ ..+|++.-.+|+...+..-|+
T Consensus 135 ~~~~~lR~~Aa~Vigt~~qNNP~~Qe~---------v~E---~~~L~~Ll~~ls~~~~~~~r~kaL~AissLIRn~~~g~ 202 (342)
T KOG2160|consen 135 NSDAELRELAARVIGTAVQNNPKSQEQ---------VIE---LGALSKLLKILSSDDPNTVRTKALFAISSLIRNNKPGQ 202 (342)
T ss_pred CCcHHHHHHHHHHHHHHHhcCHHHHHH---------HHH---cccHHHHHHHHccCCCchHHHHHHHHHHHHHhcCcHHH
Confidence 457789999998888888776653211 111 114567788887655 588899999999988875442
Q ss_pred -------chhHHHHHHHHhhCC--CChHHHhHHHHHHHHHHh
Q psy16982 82 -------KWTQIVDKVSIYLQN--PDATPWFGALLCLYQLVK 114 (116)
Q Consensus 82 -------~Wp~Ll~~l~~~l~~--~~~~~~~~~L~~L~~l~~ 114 (116)
.|.- |...+++ .+.....-++..+..+.+
T Consensus 203 ~~fl~~~G~~~----L~~vl~~~~~~~~lkrK~~~Ll~~Ll~ 240 (342)
T KOG2160|consen 203 DEFLKLNGYQV----LRDVLQSNNTSVKLKRKALFLLSLLLQ 240 (342)
T ss_pred HHHHhcCCHHH----HHHHHHcCCcchHHHHHHHHHHHHHHH
Confidence 3433 3344444 344556667777766654
No 204
>PF09492 Pec_lyase: Pectic acid lyase; InterPro: IPR012669 Members of this family are isozymes of pectate lyase (4.2.2.2 from EC), also called polygalacturonic transeliminase and alpha-1,4-D-endopolygalacturonic acid lyase.; PDB: 1R76_A 1GXM_B 1GXN_A 1GXO_A.
Probab=26.74 E-value=1e+02 Score=23.12 Aligned_cols=64 Identities=14% Similarity=0.213 Sum_probs=35.7
Q ss_pred HHHHHHhcCc-HHHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCC--CChHHHhHHHHHHHHHHh
Q psy16982 51 AIVDAVVMAP-EVIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQN--PDATPWFGALLCLYQLVK 114 (116)
Q Consensus 51 ~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~--~~~~~~~~~L~~L~~l~~ 114 (116)
.|..+..... +..+..+-..|..|...-|| +.||...|.--..-.. =|.+..-..|.+|..+.+
T Consensus 47 fLa~~y~~t~d~~y~~A~~kgl~ylL~aQypnGGWPQ~yP~~~~Y~~~ITfNDdam~~vl~lL~~v~~ 114 (289)
T PF09492_consen 47 FLARVYQATKDPRYREAFLKGLDYLLKAQYPNGGWPQFYPLRGGYHDHITFNDDAMVNVLELLRDVAE 114 (289)
T ss_dssp HHHHHHHHCG-HHHHHHHHHHHHHHHHHS-TTS--BSECS--SGGGGSEE-GGGHHHHHHHHHHHHHC
T ss_pred HHHHHHHHhCChHHHHHHHHHHHHHHHhhCCCCCCCccCCCCCCCCCceEEccHHHHHHHHHHHHHHh
Confidence 4444444433 68899999999999999999 8999865532221111 034444555555555543
No 205
>PF09337 zf-H2C2: His(2)-Cys(2) zinc finger; InterPro: IPR015416 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents an H2C2-type zinc finger that binds to histone upstream activating sequence (UAS) elements found in histone gene promoters []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].
Probab=26.38 E-value=84 Score=16.01 Aligned_cols=25 Identities=12% Similarity=0.303 Sum_probs=17.5
Q ss_pred HHHHHHHhccCCCchhHHHHHHHHhhCC
Q psy16982 69 VCVSNIVKHDFPGKWTQIVDKVSIYLQN 96 (116)
Q Consensus 69 ~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~ 96 (116)
.+++.|+..- -||.+-..+.+.+.+
T Consensus 10 kT~~~i~~~y---~W~gm~~~V~~~ir~ 34 (39)
T PF09337_consen 10 KTTAKISQRY---HWPGMKKDVRRVIRS 34 (39)
T ss_pred HHHHHHHHhh---eecCHHHHHHHHHhc
Confidence 4667777764 488888888776653
No 206
>PRK08326 ribonucleotide-diphosphate reductase subunit beta; Validated
Probab=26.23 E-value=1.3e+02 Score=22.47 Aligned_cols=42 Identities=7% Similarity=0.094 Sum_probs=27.3
Q ss_pred HhhhhhccccccccCCCCC--CccCCCCHhHHHHHHHHHHHHHhc
Q psy16982 16 YLKNLITNQWVEKEVDPGT--PLAFSLHEQDKAMIRDAIVDAVVM 58 (116)
Q Consensus 16 ~LKn~i~~~w~~~~~~~~~--~~~~~i~~~~k~~IK~~ll~~l~~ 58 (116)
+-|+....+|...+-.... .+|..+++++|..++..+ ..+..
T Consensus 30 ly~~~~~~fW~peEidls~D~~dw~~Lt~~Er~~~~~il-a~f~~ 73 (311)
T PRK08326 30 LFAKGNAKFWNPADIDFSRDAEDWEKLSDEERDYATRLC-AQFIA 73 (311)
T ss_pred HHHHHHHcCCCHHhcCccchHHHHHhCCHHHHHHHHHHH-HHHHh
Confidence 5677788899865322111 246689999999998554 44433
No 207
>PF14500 MMS19_N: Dos2-interacting transcription regulator of RNA-Pol-II
Probab=26.11 E-value=2.6e+02 Score=20.36 Aligned_cols=101 Identities=12% Similarity=0.109 Sum_probs=62.0
Q ss_pred CCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-Cc
Q psy16982 4 EVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GK 82 (116)
Q Consensus 4 ~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~ 82 (116)
+.+..+|.-|--+|=..+.+--. ..+++++-+.+=+-...-|. ....+...+.-+.+.+....|+ +.
T Consensus 10 sed~~~R~ka~~~Ls~vL~~lp~-----------~~L~~~ev~~L~~F~~~rl~-D~~~~~~~l~gl~~L~~~~~~~~~~ 77 (262)
T PF14500_consen 10 SEDPIIRAKALELLSEVLERLPP-----------DFLSRQEVQVLLDFFCSRLD-DHACVQPALKGLLALVKMKNFSPES 77 (262)
T ss_pred CCCHHHHHHHHHHHHHHHHhCCH-----------hhccHHHHHHHHHHHHHHhc-cHhhHHHHHHHHHHHHhCcCCChhh
Confidence 35778888888777777766443 14666666655555555552 2245555555555555445555 45
Q ss_pred hhHHHHHHHHhhCCC--ChHHHhHHHHHHHHHHhhC
Q psy16982 83 WTQIVDKVSIYLQNP--DATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 83 Wp~Ll~~l~~~l~~~--~~~~~~~~L~~L~~l~~~y 116 (116)
=..++..+++...-+ -...++.++.+|..++++|
T Consensus 78 ~~~i~~~l~~~~~~q~~~q~~R~~~~~ll~~l~~~~ 113 (262)
T PF14500_consen 78 AVKILRSLFQNVDVQSLPQSTRYAVYQLLDSLLENH 113 (262)
T ss_pred HHHHHHHHHHhCChhhhhHHHHHHHHHHHHHHHHHh
Confidence 566777777654432 3355777888888887764
No 208
>KOG1248|consensus
Probab=26.06 E-value=5e+02 Score=23.57 Aligned_cols=73 Identities=15% Similarity=0.172 Sum_probs=54.8
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC----CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP----GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P----~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
+..+.-..|.+.+.+.+.+.+...+..-..|+..|...- | +-=|-++++++=.+...|....+.++.||..++
T Consensus 690 ~~~q~i~~I~n~L~ds~qs~~~~~~~~rl~~L~~L~~~~-~~e~~~~i~k~I~EvIL~~Ke~n~~aR~~Af~lL~~i~ 766 (1176)
T KOG1248|consen 690 LVEQRIDDIFNSLLDSFQSSSSPAQASRLKCLKRLLKLL-SAEHCDLIPKLIPEVILSLKEVNVKARRNAFALLVFIG 766 (1176)
T ss_pred HHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHhc-cHHHHHHHHHHHHHHHHhcccccHHHHhhHHHHHHHHH
Confidence 445555778888888888888888888888888887662 3 233556666665566668888999999999988
No 209
>COG1907 Predicted archaeal sugar kinases [General function prediction only]
Probab=26.04 E-value=19 Score=27.03 Aligned_cols=12 Identities=42% Similarity=0.969 Sum_probs=10.5
Q ss_pred HHHhccCCCchh
Q psy16982 73 NIVKHDFPGKWT 84 (116)
Q Consensus 73 ~Ia~~d~P~~Wp 84 (116)
.|++.|||++|-
T Consensus 153 ~I~R~dfPedW~ 164 (312)
T COG1907 153 LIFRLDFPEDWR 164 (312)
T ss_pred eeeeecCCCceE
Confidence 589999999994
No 210
>PF10273 WGG: Pre-rRNA-processing protein TSR2; InterPro: IPR019398 The pre-rRNA-processing protein TSR2 is required for 20S pre-rRNA processing []. This family contains a distinctive WGG motif.
Probab=25.94 E-value=1.5e+02 Score=17.59 Aligned_cols=52 Identities=17% Similarity=0.261 Sum_probs=29.4
Q ss_pred hhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccC
Q psy16982 17 LKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDF 79 (116)
Q Consensus 17 LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~ 79 (116)
|+-.|.+.|..+ -+.+-+.-+...++..+.+....-...+...|..+...+|
T Consensus 16 L~lAVen~wGG~-----------~s~~K~~~l~~~i~~~f~~~~~~~~~~le~~L~~~m~~eF 67 (82)
T PF10273_consen 16 LQLAVENGWGGP-----------DSQEKADWLAEVIVDWFTENKDPDADDLEDFLEDIMDDEF 67 (82)
T ss_pred HHHHHHhccCCc-----------cHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHhHc
Confidence 455566666532 2445556777777777766443335555555555555554
No 211
>PF12252 SidE: Dot/Icm substrate protein; InterPro: IPR021014 This entry represents bacterial proteins that are typically between 397 and 1543 amino acids in length including SidE protein in the Dot/Icm pathway of Legionella pneumophila bacteria. There is little literature describing the family.
Probab=25.82 E-value=1.8e+02 Score=26.22 Aligned_cols=48 Identities=10% Similarity=0.321 Sum_probs=37.1
Q ss_pred CCCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC--CchhHH
Q psy16982 38 FSLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP--GKWTQI 86 (116)
Q Consensus 38 ~~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P--~~Wp~L 86 (116)
..++.+-|.++++.+..+|.++-..++..-...+...... || .+|..+
T Consensus 956 s~is~~yKnFLne~ViPvLEeCl~aL~~nn~~~L~kaLA~-FP~d~qWSaF 1005 (1439)
T PF12252_consen 956 SKISQEYKNFLNEKVIPVLEECLNALRENNMDMLQKALAA-FPSDKQWSAF 1005 (1439)
T ss_pred ccccHHHHHHHHhccHHHHHHHHHHHHhcCHHHHHHHHHh-CCCcccchhc
Confidence 4689999999999999999998777776655555555444 78 699864
No 212
>KOG2956|consensus
Probab=25.66 E-value=2.2e+02 Score=23.07 Aligned_cols=44 Identities=16% Similarity=0.207 Sum_probs=31.5
Q ss_pred HHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 71 VSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 71 i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
+..|-.-+.-.-=|++.|.+++..++........+.+||-++..
T Consensus 434 ~e~l~~EeL~~ll~diaP~~iqay~S~SS~VRKtaVfCLVamv~ 477 (516)
T KOG2956|consen 434 FERLSAEELLNLLPDIAPCVIQAYDSTSSTVRKTAVFCLVAMVN 477 (516)
T ss_pred HhhcCHHHHHHhhhhhhhHHHHHhcCchHHhhhhHHHhHHHHHH
Confidence 33333334435678999999999998877777888888877664
No 213
>KOG0212|consensus
Probab=25.65 E-value=4e+02 Score=22.32 Aligned_cols=64 Identities=8% Similarity=0.010 Sum_probs=39.0
Q ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHh---hCCCChHHHhHHHHHHHHHH
Q psy16982 47 MIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIY---LQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 47 ~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~---l~~~~~~~~~~~L~~L~~l~ 113 (116)
.|=..|+..|.+.+..+-.+.-..++.|+... +=|++.+++.++ ...+.--.-..|-.+++++|
T Consensus 377 ~if~tLL~tLsd~sd~vvl~~L~lla~i~~s~---~~~~~~~fl~sLL~~f~e~~~~l~~Rg~lIIRqlC 443 (675)
T KOG0212|consen 377 SIFLTLLKTLSDRSDEVVLLALSLLASICSSS---NSPNLRKFLLSLLEMFKEDTKLLEVRGNLIIRQLC 443 (675)
T ss_pred HHHHHHHHhhcCchhHHHHHHHHHHHHHhcCc---ccccHHHHHHHHHHHHhhhhHHHHhhhhHHHHHHH
Confidence 34567777888877777777778888888763 223445544444 44433333345555666666
No 214
>smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
Probab=25.36 E-value=1.9e+02 Score=18.56 Aligned_cols=36 Identities=28% Similarity=0.156 Sum_probs=26.1
Q ss_pred CCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhh
Q psy16982 80 PGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKN 115 (116)
Q Consensus 80 P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~ 115 (116)
+..=-+.+..|-..+.++|+.....+|.+|..+++.
T Consensus 32 ~~~~k~a~r~l~krl~~~n~~v~l~AL~lLe~~vkN 67 (133)
T smart00288 32 PDGPKDAVRLLKKRLNNKNPHVALLALTLLDACVKN 67 (133)
T ss_pred CccHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHH
Confidence 333446677777778888888888888888877763
No 215
>KOG2025|consensus
Probab=25.09 E-value=4.5e+02 Score=22.75 Aligned_cols=70 Identities=13% Similarity=0.161 Sum_probs=52.0
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCC-CChHHHhHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQN-PDATPWFGALLCL 109 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~-~~~~~~~~~L~~L 109 (116)
.++++.-+.+...++.-+.+-.+.||.|...+++..-.-+--++-| ....+...+++ ++++...++|.++
T Consensus 118 eidd~vfn~l~e~l~~Rl~Drep~VRiqAv~aLsrlQ~d~~dee~~-v~n~l~~liqnDpS~EVRRaaLsnI 188 (892)
T KOG2025|consen 118 EIDDDVFNKLNEKLLIRLKDREPNVRIQAVLALSRLQGDPKDEECP-VVNLLKDLIQNDPSDEVRRAALSNI 188 (892)
T ss_pred ccCHHHHHHHHHHHHHHHhccCchHHHHHHHHHHHHhcCCCCCccc-HHHHHHHHHhcCCcHHHHHHHHHhh
Confidence 6899999999999999999988999999988888876543334554 55555566655 3556666777654
No 216
>PF07778 CENP-I: Mis6 ; InterPro: IPR012485 Centromere protein Cenp-I (also known as Mis6) is an essential centromere connector protein acting during G1-S phase of the cell cycle. Mis6 is thought to be required for recruiting Cenp-A, the centromere- specific histone H3 variant; an important event for centromere function and chromosome segregation during mitosis [, ].
Probab=24.94 E-value=1.3e+02 Score=24.43 Aligned_cols=55 Identities=15% Similarity=0.128 Sum_probs=45.2
Q ss_pred HHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 62 VIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.-|.++-..++.|--.+|.+--+.++.-|-+.+-++++...-+-|.++..++.+|
T Consensus 439 ~~R~~IL~LLs~iPl~~f~eL~~~ll~pLe~lf~~ss~~~kcslL~~y~~LL~nW 493 (511)
T PF07778_consen 439 DCRSQILQLLSWIPLSSFSELYPLLLQPLEQLFLTSSPYFKCSLLQFYTNLLQNW 493 (511)
T ss_pred ccHHHHHHHHhhccCCCHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHH
Confidence 4677888888888888887778888888888888888888888898988888764
No 217
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=24.61 E-value=72 Score=17.26 Aligned_cols=14 Identities=14% Similarity=0.228 Sum_probs=11.9
Q ss_pred CCCHhHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAI 52 (116)
Q Consensus 39 ~i~~~~k~~IK~~l 52 (116)
++++++|+.||-.+
T Consensus 30 PltdevK~a~k~i~ 43 (49)
T PF06543_consen 30 PLTDEVKEAMKLIF 43 (49)
T ss_pred eCCHHHHHHHHHHH
Confidence 79999999998654
No 218
>KOG0213|consensus
Probab=24.04 E-value=2.6e+02 Score=24.45 Aligned_cols=42 Identities=10% Similarity=0.204 Sum_probs=31.9
Q ss_pred HHhcCcHHHHHHHHHHHHHHHhccCC---CchhHHHHHHHHhhCC
Q psy16982 55 AVVMAPEVIRVQLAVCVSNIVKHDFP---GKWTQIVDKVSIYLQN 96 (116)
Q Consensus 55 ~l~~~~~~i~~~l~~~i~~Ia~~d~P---~~Wp~Ll~~l~~~l~~ 96 (116)
+|.+++.+||.-.+.+++.+|..-.| +...+.+.-|-..+++
T Consensus 561 gl~De~qkVR~itAlalsalaeaa~Pygie~fDsVlkpLwkgir~ 605 (1172)
T KOG0213|consen 561 GLKDEQQKVRTITALALSALAEAATPYGIEQFDSVLKPLWKGIRQ 605 (1172)
T ss_pred hhcccchhhhhHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHH
Confidence 34445679999999999999999999 4666666666665554
No 219
>COG4168 SapB ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]
Probab=23.67 E-value=1.8e+02 Score=21.64 Aligned_cols=44 Identities=14% Similarity=0.206 Sum_probs=27.5
Q ss_pred HHHHHHHHHH--HHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHH
Q psy16982 61 EVIRVQLAVC--VSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGAL 106 (116)
Q Consensus 61 ~~i~~~l~~~--i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L 106 (116)
+++-.|++.. .+.|...-| .||.+=.=+.+.+.+.|.+...+|.
T Consensus 243 Pq~g~qfs~mLTlamitEs~f--~WPGiGRWLi~Ai~qqDy~sIsaGV 288 (321)
T COG4168 243 PQLGLQFSTMLTLAMITESVF--GWPGIGRWLINAIRQQDYASISAGV 288 (321)
T ss_pred cchhHHHHHHHHHHHHHHHHc--CCCchhHHHHHHHHhhhhHHhhcce
Confidence 3444444443 344444444 7888888888888888887766543
No 220
>KOG0166|consensus
Probab=23.54 E-value=4.1e+02 Score=21.71 Aligned_cols=74 Identities=15% Similarity=0.101 Sum_probs=54.0
Q ss_pred CCCHhHHHHHHHHHHHHHhcCc-HHHHHHHHHHHHHHHhccCC--Cch--hHHHHHHHHhhCCCChHHHhHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAP-EVIRVQLAVCVSNIVKHDFP--GKW--TQIVDKVSIYLQNPDATPWFGALLCLYQL 112 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~-~~i~~~l~~~i~~Ia~~d~P--~~W--p~Ll~~l~~~l~~~~~~~~~~~L~~L~~l 112 (116)
.+.+-.+..+=..++++|.... +.++-..+-++..||+..-- +.- .+-+|.++.++.+++.+....+.-+|..|
T Consensus 101 pi~~vi~~G~v~~lV~~l~~~~~~~lq~eAAWaLTnIAsgtse~T~~vv~agavp~fi~Ll~s~~~~v~eQavWALgNI 179 (514)
T KOG0166|consen 101 PIDEVIQSGVVPRLVEFLSRDDNPTLQFEAAWALTNIASGTSEQTKVVVDAGAVPIFIQLLSSPSADVREQAVWALGNI 179 (514)
T ss_pred CHHHHHHcCcHHHHHHHHccCCChhHHHHHHHHHHHHhcCchhhccccccCCchHHHHHHhcCCcHHHHHHHHHHHhcc
Confidence 5666666677778888887544 78999999999999985432 111 24677889999998887777777776654
No 221
>PF10654 DUF2481: Protein of unknown function (DUF2481) ; InterPro: IPR018916 This entry is represented by Bacteriophage A500, Gp59. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=23.43 E-value=1.6e+02 Score=19.05 Aligned_cols=31 Identities=13% Similarity=0.295 Sum_probs=21.8
Q ss_pred HHHHHHHhccCC-CchhHHHHHHHHhhCCCCh
Q psy16982 69 VCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDA 99 (116)
Q Consensus 69 ~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~ 99 (116)
++|+.|+..|.| ++=.+|...|-++...+..
T Consensus 14 eIIsyl~n~dl~~~~~k~LqkeLn~Lm~~nTE 45 (126)
T PF10654_consen 14 EIISYLVNNDLSFSKRKELQKELNQLMNENTE 45 (126)
T ss_pred HHHHHHHhCCCChHHHHHHHHHHHHHHhcchH
Confidence 467777888887 6667777777777665543
No 222
>PLN03060 inositol phosphatase-like protein; Provisional
Probab=23.37 E-value=2.2e+02 Score=20.29 Aligned_cols=35 Identities=17% Similarity=0.134 Sum_probs=26.5
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHH
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNI 74 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~I 74 (116)
-|+++|..|-+.+++++...+.++|.-...+....
T Consensus 66 ~Pee~~~~IF~Alc~a~~~dp~~~r~dA~~l~~~a 100 (206)
T PLN03060 66 PNATDRDAIFKAYIEALGEDPDQYRKDAKKLEEWA 100 (206)
T ss_pred CChHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHH
Confidence 48899999999999999888777776555444433
No 223
>PF13979 SopA_C: SopA-like catalytic domain; PDB: 3NB2_B 3NAW_B 3SQV_B 2QZA_B 3SY2_B 2QYU_A.
Probab=23.32 E-value=2.6e+02 Score=19.34 Aligned_cols=71 Identities=14% Similarity=0.280 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHhcCcH-------HHHHHHHHHHHHHHhc-cCCCch---hHHHHHHHHhhC--CCChHHHhHHHHHHHHH
Q psy16982 46 AMIRDAIVDAVVMAPE-------VIRVQLAVCVSNIVKH-DFPGKW---TQIVDKVSIYLQ--NPDATPWFGALLCLYQL 112 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~-------~i~~~l~~~i~~Ia~~-d~P~~W---p~Ll~~l~~~l~--~~~~~~~~~~L~~L~~l 112 (116)
..+|+.++.++.+... .=+..|..+++.++.. +++..| |+..+.|++... +.++...--.|.||..+
T Consensus 4 ~~l~~~f~~a~~~~~~~ikl~~~~~q~~L~~if~~~~~~~~~~~~~~Lk~~Hy~~il~~~~L~~~~~~~kA~~Lf~La~v 83 (172)
T PF13979_consen 4 GELKDAFIAAIKSNKSDIKLVDDQQQQKLNKIFASYMDDEDFENRWNLKDKHYEQILNAYDLSSASEQKKAEILFSLAAV 83 (172)
T ss_dssp HHHHHHHHHHCCSSS-S--TTSCHHHHHHHHHH------GGGCCCTCC-HHHHHHHHCCTTTTTS-HHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHhcCcchhcccCcHHHHHHHHHHHhhhccccccccCCCCHHHHHHHHHHhCcccCCHHHHHHHHHHHHHH
Confidence 3566666666655321 1235566666666432 344566 467777776543 34555566678888888
Q ss_pred HhhC
Q psy16982 113 VKNY 116 (116)
Q Consensus 113 ~~~y 116 (116)
+-+|
T Consensus 84 f~ky 87 (172)
T PF13979_consen 84 FTKY 87 (172)
T ss_dssp HHHH
T ss_pred HHHh
Confidence 8765
No 224
>PRK14626 hypothetical protein; Provisional
Probab=23.20 E-value=1.5e+02 Score=18.79 Aligned_cols=37 Identities=14% Similarity=0.210 Sum_probs=30.8
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHH
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIV 75 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia 75 (116)
.+.+++++.+.+.|+.++.+.-.+++.....-++.+.
T Consensus 61 ll~~ed~e~LeDLI~aA~N~A~~k~~~~~~e~m~~~t 97 (110)
T PRK14626 61 LLNEDEYEVLKDLLIAAFNEASRRSKEVMGEKMTQAA 97 (110)
T ss_pred HcCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4677889999999999999888788877777777776
No 225
>COG5181 HSH155 U2 snRNP spliceosome subunit [RNA processing and modification]
Probab=22.64 E-value=3.5e+02 Score=23.16 Aligned_cols=54 Identities=17% Similarity=0.091 Sum_probs=29.5
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCC
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQN 96 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~ 96 (116)
++.-+=-.|-=.|++.|.+-.+.||+...+.+..|++.--| .+.+..|++.|..
T Consensus 723 i~~rEWMRIcfeLvd~Lks~nKeiRR~A~~tfG~Is~aiGP---qdvL~~LlnnLkv 776 (975)
T COG5181 723 IGVREWMRICFELVDSLKSWNKEIRRNATETFGCISRAIGP---QDVLDILLNNLKV 776 (975)
T ss_pred CCHHHHHHHHHHHHHHHHHhhHHHHHhhhhhhhhHHhhcCH---HHHHHHHHhcchH
Confidence 44444455555566666655556666666666666665544 2345555554443
No 226
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.48 E-value=2.4e+02 Score=18.64 Aligned_cols=39 Identities=13% Similarity=0.286 Sum_probs=27.2
Q ss_pred CCCHhHHHHHHHHHHHHHhcCcH-HHHH-HHHHHHHHHHhc
Q psy16982 39 SLHEQDKAMIRDAIVDAVVMAPE-VIRV-QLAVCVSNIVKH 77 (116)
Q Consensus 39 ~i~~~~k~~IK~~ll~~l~~~~~-~i~~-~l~~~i~~Ia~~ 77 (116)
.+++++++++|..|-++..++++ ++.. -+-.+++.++.+
T Consensus 97 ~l~pdevqqf~tdlt~ltkdspktqva~lrfkkvmskv~t~ 137 (160)
T COG4306 97 NLNPDEVQQFRTDLTDLTKDSPKTQVATLRFKKVMSKVATS 137 (160)
T ss_pred cCCHHHHHHHHhhHHHHhhcCchhHHHHHHHHHHHHHHHHH
Confidence 68999999999999999988874 4332 244455555443
No 227
>cd01049 RNRR2 Ribonucleotide Reductase, R2/beta subunit, ferritin-like diiron-binding domain. Ribonucleotide Reductase, R2/beta subunit (RNRR2) is a member of a broad superfamily of ferritin-like diiron-carboxylate proteins. The RNR protein catalyzes the conversion of ribonucleotides to deoxyribonucleotides and is found in all eukaryotes, many prokaryotes, several viruses, and few archaea. The catalytically active form of RNR is a proposed alpha2-beta2 tetramer. The homodimeric alpha subunit (R1) contains the active site and redox active cysteines as well as the allosteric binding sites. The beta subunit (R2) contains a diiron cluster that, in its reduced state, reacts with dioxygen to form a stable tyrosyl radical and a diiron(III) cluster. This essential tyrosyl radical is proposed to generate a thiyl radical, located on a cysteine residue in the R1 active site that initiates ribonucleotide reduction. The beta subunit is composed of 10-13 helices, the 8 longest helices form an alpha-
Probab=22.44 E-value=1.6e+02 Score=21.36 Aligned_cols=40 Identities=20% Similarity=0.252 Sum_probs=25.2
Q ss_pred HhhhhhccccccccCCCC--CCccCCCCHhHHHHHHHHHHHH
Q psy16982 16 YLKNLITNQWVEKEVDPG--TPLAFSLHEQDKAMIRDAIVDA 55 (116)
Q Consensus 16 ~LKn~i~~~w~~~~~~~~--~~~~~~i~~~~k~~IK~~ll~~ 55 (116)
+.|+....+|...+-... ..+|..+++++|..++..+-..
T Consensus 14 ly~~~~~~~W~p~ei~~~~D~~~~~~l~~~er~~~~~~la~~ 55 (288)
T cd01049 14 LYKKAEANFWTPEEIDLSKDLKDWEKLTEAERHFIKRVLAFL 55 (288)
T ss_pred HHHHHHHcCCChhhcchhhhHHHHhHCCHHHHHHHHHHHHHH
Confidence 345566678986542211 1346678999999998665443
No 228
>KOG3077|consensus
Probab=22.01 E-value=58 Score=24.00 Aligned_cols=22 Identities=27% Similarity=0.625 Sum_probs=18.2
Q ss_pred HHhccCCCchhHHHHHHHHhhC
Q psy16982 74 IVKHDFPGKWTQIVDKVSIYLQ 95 (116)
Q Consensus 74 Ia~~d~P~~Wp~Ll~~l~~~l~ 95 (116)
+..+|.-+.||.|+++.++...
T Consensus 228 lSnYDeegAWP~liDeFVe~~r 249 (260)
T KOG3077|consen 228 LSNYDEEGAWPVLIDEFVEYLR 249 (260)
T ss_pred ccCccccccchHHHHHHHHHHH
Confidence 3567777999999999998765
No 229
>PRK09614 nrdF ribonucleotide-diphosphate reductase subunit beta; Reviewed
Probab=22.00 E-value=1.4e+02 Score=22.19 Aligned_cols=39 Identities=21% Similarity=0.237 Sum_probs=25.2
Q ss_pred HhhhhhccccccccCCCC--CCccCCCCHhHHHHHHHHHHH
Q psy16982 16 YLKNLITNQWVEKEVDPG--TPLAFSLHEQDKAMIRDAIVD 54 (116)
Q Consensus 16 ~LKn~i~~~w~~~~~~~~--~~~~~~i~~~~k~~IK~~ll~ 54 (116)
+-|+....+|...+-... ..+|..+++++|..++..|--
T Consensus 25 ~y~~~~~~fW~peEi~~s~D~~dw~~Lt~~Er~~~~~~l~~ 65 (324)
T PRK09614 25 AWKRLTANFWLPEEVPLSNDLKDWKKLSDEEKNLYTRVFGG 65 (324)
T ss_pred HHHHHHhCCCCCccccccchHHHHHhCCHHHHHHHHHHHHH
Confidence 445667778976432211 124678999999998865443
No 230
>PF08167 RIX1: rRNA processing/ribosome biogenesis
Probab=21.92 E-value=2.5e+02 Score=18.69 Aligned_cols=75 Identities=13% Similarity=0.069 Sum_probs=45.7
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhcc---CC-CchhHHHHHHHHhhCCCCh-HHHhHHHHHHHHHHh
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHD---FP-GKWTQIVDKVSIYLQNPDA-TPWFGALLCLYQLVK 114 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d---~P-~~Wp~Ll~~l~~~l~~~~~-~~~~~~L~~L~~l~~ 114 (116)
.+..+-...+..+...|.+.+..-|-.-...+..++... -- ..-...+..+++.+++.++ .....+..+|..++.
T Consensus 18 ~~~~~l~~l~~ri~~LL~s~~~~~rw~G~~Ll~~~~~~~~~e~l~~~~~~W~~~Ll~~L~~~~~~~~~~~ai~~L~~l~~ 97 (165)
T PF08167_consen 18 PSKSALHKLVTRINSLLQSKSAYSRWAGLCLLKVTVEQCSWEILLSHGSQWLRALLSILEKPDPPSVLEAAIITLTRLFD 97 (165)
T ss_pred cCHHHHHHHHHHHHHHhCCCChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHH
Confidence 356777888999999888765444433223333333321 11 2334456677777776544 567788888888875
No 231
>PF08623 TIP120: TATA-binding protein interacting (TIP20); InterPro: IPR013932 TIP120 (also known as cullin-associated and neddylation-dissociated protein 1) is a TATA binding protein interacting protein that enhances transcription []. ; PDB: 4A0C_A 1U6G_C.
Probab=21.70 E-value=2.8e+02 Score=19.03 Aligned_cols=52 Identities=15% Similarity=0.209 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 62 VIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 62 ~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
.+|+..-+|+..+...-+- -...++++.+...+.. +.+...-+...|.+++.
T Consensus 42 elRK~ayE~lytlLd~~~~~~~~~~~~~~v~~GL~D-~~DIk~L~~~~l~kl~~ 94 (169)
T PF08623_consen 42 ELRKAAYECLYTLLDTCLSRIDISEFLDRVEAGLKD-EHDIKMLCHLMLSKLAQ 94 (169)
T ss_dssp HHHHHHHHHHHHHHHSTCSSS-HHHHHHHHHHTTSS--HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHhhcCC-cHHHHHHHHHHHHHHHH
Confidence 5889989999888765333 4688999999998877 56666666666666553
No 232
>PF11784 DUF3320: Protein of unknown function (DUF3320); InterPro: IPR021754 This family is conserved in Proteobacteria and Chlorobi families. Many members are annotated as being putative DNA helicase-related proteins.
Probab=21.63 E-value=1.5e+02 Score=15.93 Aligned_cols=31 Identities=23% Similarity=0.239 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHhccCCCchhHHHHHHHHhh
Q psy16982 64 RVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL 94 (116)
Q Consensus 64 ~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l 94 (116)
..++...+..|+..|.|-....|..-+....
T Consensus 10 ~~~L~~~i~~Iv~~EgPI~~~~L~~Ri~~a~ 40 (52)
T PF11784_consen 10 RPQLARMIRQIVEVEGPIHEDELARRIARAW 40 (52)
T ss_pred HHHHHHHHHHHHHHcCCccHHHHHHHHHHHc
Confidence 4567777888888888877777777776543
No 233
>PLN00047 photosystem II biogenesis protein Psb29; Provisional
Probab=21.45 E-value=2.1e+02 Score=21.42 Aligned_cols=34 Identities=24% Similarity=0.189 Sum_probs=25.8
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHH
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSN 73 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~ 73 (116)
-|+++|..|-+.+++++...+.++|.-...+...
T Consensus 119 ~Pee~~~~IF~Alc~a~g~Dp~qyr~dA~~l~~~ 152 (283)
T PLN00047 119 PSDEDRDAIFKAYIKALGEDPEQYRKDAAKLEEW 152 (283)
T ss_pred CChHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 4789999999999999988877776554444333
No 234
>COG2124 CypX Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=21.34 E-value=2.3e+02 Score=21.74 Aligned_cols=44 Identities=20% Similarity=0.364 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHH
Q psy16982 46 AMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVS 91 (116)
Q Consensus 46 ~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~ 91 (116)
+.|++.++-.|........+.++-++..++++ |+.|..|..+.-
T Consensus 235 ~Ei~~~~~~ll~AGheTTa~~l~~a~~~L~~~--P~~~~~l~~e~~ 278 (411)
T COG2124 235 DEIRDELITLLVAGHETTANALAWALYALLRH--PDQLAKLRAEPD 278 (411)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHC--chHHHHHHhCcc
Confidence 45788889999988889999999999999887 777776655543
No 235
>KOG0159|consensus
Probab=21.22 E-value=3.2e+02 Score=22.35 Aligned_cols=55 Identities=7% Similarity=0.182 Sum_probs=43.4
Q ss_pred hHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCCh
Q psy16982 43 QDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDA 99 (116)
Q Consensus 43 ~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~ 99 (116)
-++..+|..+++.+...-......+.-++..+|+. |+.=+.|..++...+.+++.
T Consensus 312 l~~k~~~~~~~dll~aGvDTTs~tl~~~Ly~Larn--P~~Q~~L~~Ei~~~~p~~~~ 366 (519)
T KOG0159|consen 312 LSRKDAKANVMDLLAAGVDTTSNTLLWALYELARN--PEVQQRLREEILAVLPSGNS 366 (519)
T ss_pred CCHHHHHHHHHHHHHHhccchHHHHHHHHHHHhcC--hHHHHHHHHHHHhhCCCccc
Confidence 35677788888888777677778888889999988 77778888899888887543
No 236
>TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29. Psp29, originally designated sll1414 in Synechocystis 6803, is found universally in Cyanobacteria and in Arabidopsis. It was isolated and partially sequenced from purified photosystem II (PS II) in Synechocystis. While its function is unknown, mutant studies show an impairment in photosystem II biogenesis and/or stability, rather than in PS II core function.
Probab=20.92 E-value=2.6e+02 Score=20.04 Aligned_cols=35 Identities=17% Similarity=0.206 Sum_probs=26.4
Q ss_pred CCHhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHH
Q psy16982 40 LHEQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNI 74 (116)
Q Consensus 40 i~~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~I 74 (116)
-|+++|..|-+.+++++.-.+.++|.-...+....
T Consensus 68 ~Pee~~~~IF~Alc~a~~~dp~~~r~dA~~l~~~a 102 (214)
T TIGR03060 68 RPEEHLDALFDALCNSNGFDPEQLREDAKQLLEQA 102 (214)
T ss_pred CChHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHH
Confidence 48899999999999999888777776555444433
No 237
>PF05918 API5: Apoptosis inhibitory protein 5 (API5); InterPro: IPR008383 This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterised by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in Homo sapiens chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors [,].; PDB: 3U0R_A 3V6A_A.
Probab=20.84 E-value=2.7e+02 Score=22.95 Aligned_cols=54 Identities=20% Similarity=0.165 Sum_probs=42.1
Q ss_pred cHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 60 PEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 60 ~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
+.+.+.-.++.|..-.++ ||+-=.+-++.++.+...+|......|+.-|..+|+
T Consensus 35 ~~k~K~Laaq~I~kffk~-FP~l~~~Ai~a~~DLcEDed~~iR~~aik~lp~~ck 88 (556)
T PF05918_consen 35 SPKEKRLAAQFIPKFFKH-FPDLQEEAINAQLDLCEDEDVQIRKQAIKGLPQLCK 88 (556)
T ss_dssp -HHHHHHHHHHHHHHHCC--GGGHHHHHHHHHHHHT-SSHHHHHHHHHHGGGG--
T ss_pred CHHHHHHHHHHHHHHHhh-ChhhHHHHHHHHHHHHhcccHHHHHHHHHhHHHHHH
Confidence 467788888888888776 898888899999999999999888899998888886
No 238
>PF13001 Ecm29: Proteasome stabiliser; InterPro: IPR024372 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis on sequence similarities into two groups, alpha (A) and beta (B). Ecm29 tethers the proteasome core particle to the regulatory particle, stabilising the interaction between these two components [, , ].
Probab=20.71 E-value=4.5e+02 Score=21.06 Aligned_cols=40 Identities=13% Similarity=0.308 Sum_probs=32.0
Q ss_pred CChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcCc
Q psy16982 5 VDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMAP 60 (116)
Q Consensus 5 ~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~~ 60 (116)
....+|.++.+|++..+.| +++++|..+=-.+++.+..-+
T Consensus 76 ~s~~vrnfsliyi~~g~~R----------------l~~~e~~~llP~ll~~is~~~ 115 (501)
T PF13001_consen 76 DSSFVRNFSLIYIEMGFDR----------------LDDEERRELLPSLLKGISKKP 115 (501)
T ss_pred CchHHHHHHHHHHHHhhhc----------------CCHHHHHHHHHHHHHhhccCc
Confidence 4578999999999877655 788888888888888887543
No 239
>PF08542 Rep_fac_C: Replication factor C C-terminal domain; InterPro: IPR013748 Replication factor C (RFC) is a multimeric AAA+ protein complex that loads the DNA polymerase processivity clamp PCNA (Proliferating Cell Nuclear Antigen) onto DNA using ATP to drive the reaction []. PCNA functions at multiple levels in directing DNA metabolic pathways []. When bound to DNA, PCNA organises various proteins involved in DNA replication, DNA repair, DNA modification, and chromatin modelling. Replication factor C consists of five subunits in a spiral arrangement: Rfc1, Rfc2, Rfc3, Rfc4, and Rfc5 subunits. Rfc1 and Rfc2 load the PCNA sliding clamp onto DNA, while Rfc3 binds ATP and also acts as a checkpoint sensor. The RFC complex contains four ATP sites (sites A, B, C, and D) located at subunit interfaces. In each ATP site, an arginine residue from one subunit is located near the gamma-phosphate of ATP bound in the adjacent subunit. These arginine residues act as "arginine fingers" that can potentially perform two functions: sensing that ATP is bound and catalyzing ATP hydrolysis []. This entry represents the core domain found in Rfc1-5.; GO: 0003689 DNA clamp loader activity, 0005524 ATP binding, 0006260 DNA replication, 0005663 DNA replication factor C complex; PDB: 1SXJ_B 2CHG_B 2CHV_F 2CHQ_C 1IQP_A.
Probab=20.69 E-value=1.3e+02 Score=17.44 Aligned_cols=9 Identities=0% Similarity=0.305 Sum_probs=3.4
Q ss_pred HHHHHHHHh
Q psy16982 85 QIVDKVSIY 93 (116)
Q Consensus 85 ~Ll~~l~~~ 93 (116)
+.+..+++.
T Consensus 6 ~~i~~i~~~ 14 (89)
T PF08542_consen 6 EVIEEILES 14 (89)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 333333333
No 240
>PF08732 HIM1: HIM1; InterPro: IPR014843 HIM1 (high induction of mutagenesis protein 1) plays a role in the control of spontaneous and induced mutagenesis []. It is thought to participate in the control of processing of mutational intermediates appearing during error-prone bypass of DNA damage.
Probab=20.62 E-value=53 Score=25.80 Aligned_cols=12 Identities=17% Similarity=0.562 Sum_probs=11.0
Q ss_pred CchhHHHHHHHH
Q psy16982 81 GKWTQIVDKVSI 92 (116)
Q Consensus 81 ~~Wp~Ll~~l~~ 92 (116)
..||+|+|.|+.
T Consensus 173 ~~Wp~Llp~iF~ 184 (410)
T PF08732_consen 173 TKWPDLLPDIFS 184 (410)
T ss_pred cchHHHHHHHhc
Confidence 599999999987
No 241
>KOG0211|consensus
Probab=20.55 E-value=3.7e+02 Score=23.14 Aligned_cols=72 Identities=15% Similarity=0.108 Sum_probs=46.4
Q ss_pred HhHHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 42 EQDKAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 42 ~~~k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
+..|..+|...-++-....+.+|...+.-...+|+.--- ..|.++++...+++..+.....+.+.-.+..+.
T Consensus 232 ~~vk~elr~~~~~lc~d~~~~Vr~~~a~~l~~~a~~~~~~~~~s~v~~~~~~L~~DdqdsVr~~a~~~~~~l~ 304 (759)
T KOG0211|consen 232 DAVKRELRPIVQSLCQDDTPMVRRAVASNLGNIAKVLESEIVKSEVLPTLIQLLRDDQDSVREAAVESLVSLL 304 (759)
T ss_pred HHHHHHHHHHHHhhccccchhhHHHHHhhhHHHHHHHHHHHHHhhccHHHhhhhhcchhhHHHHHHHHHHHHH
Confidence 455666665544444445578898888888888877322 578888888888877654444555555544443
No 242
>PLN03205 ATR interacting protein; Provisional
Probab=20.43 E-value=4.4e+02 Score=21.31 Aligned_cols=50 Identities=14% Similarity=0.080 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHhccCC-CchhHHHHHHHHhhCCCChHHHhHHHHHHHHHHh
Q psy16982 65 VQLAVCVSNIVKHDFP-GKWTQIVDKVSIYLQNPDATPWFGALLCLYQLVK 114 (116)
Q Consensus 65 ~~l~~~i~~Ia~~d~P-~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~ 114 (116)
..+..+.+.+.+.-+- -.-..|++.++.+..-++...++.+|.+|+.+.+
T Consensus 302 ekVshlYs~~tKiS~G~V~lqtLlEaLLdLC~v~n~a~V~RsLRvLh~vLq 352 (652)
T PLN03205 302 EKVYHLYSAVTKISYGFVNLKSLVEPLLDLCKAETAVLVHRSLRVLHVLLE 352 (652)
T ss_pred HHHHHHHHHHHHhhCCeeeHHHHHHHHHHHHhcCchhhhHHHHHHHHHHHH
Confidence 3344455555544443 4677899999999999999999999999998875
No 243
>KOG2256|consensus
Probab=20.27 E-value=4.5e+02 Score=22.23 Aligned_cols=79 Identities=14% Similarity=0.216 Sum_probs=43.3
Q ss_pred cCCCCHhHHHHHHHHHHHHHhcC-cHHHHHHHHHHHHHHHhc--cCCCchhHHHHHHHHhhCCCChHHHhHHHHHHHHHH
Q psy16982 37 AFSLHEQDKAMIRDAIVDAVVMA-PEVIRVQLAVCVSNIVKH--DFPGKWTQIVDKVSIYLQNPDATPWFGALLCLYQLV 113 (116)
Q Consensus 37 ~~~i~~~~k~~IK~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~--d~P~~Wp~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~ 113 (116)
|..+....|.+..+. +.++..- +..+-......+...+-+ -||.-=..|+..++...++++......|+.||..+|
T Consensus 232 w~k~~~~vKsYl~s~-l~Ll~~~t~te~~~~~L~~l~~l~~~~~~f~k~lk~liK~~V~vWstge~~~rv~Afl~l~~l~ 310 (661)
T KOG2256|consen 232 WYKLRVLVKSYLGSS-LHLLNQLTDTEVLAFTLRHLTVLVPFLATFPKLLKKLIKAVVHVWSTGEESLRVLAFLCLIDLC 310 (661)
T ss_pred chhhhHHHHHHhHHH-HHHHHHcchHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHheeeccCCcchhhHHHHHHHHHH
Confidence 556666666666543 3333332 222222222222222211 234444455555666667888888899999999998
Q ss_pred hhC
Q psy16982 114 KNY 116 (116)
Q Consensus 114 ~~y 116 (116)
.++
T Consensus 311 ~~~ 313 (661)
T KOG2256|consen 311 RKF 313 (661)
T ss_pred HHh
Confidence 763
No 244
>KOG1293|consensus
Probab=20.27 E-value=1.6e+02 Score=24.72 Aligned_cols=93 Identities=16% Similarity=0.122 Sum_probs=56.7
Q ss_pred ChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHH----HHHHhcCc--HHHHHHHHHHHHHHHhcc-
Q psy16982 6 DMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAI----VDAVVMAP--EVIRVQLAVCVSNIVKHD- 78 (116)
Q Consensus 6 ~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~l----l~~l~~~~--~~i~~~l~~~i~~Ia~~d- 78 (116)
|.+.-.-|.+.+||.+..+-+... ..||..- +..+..++ .....+.+.++...+..+
T Consensus 22 dpe~lvrai~~~kN~vig~~~~K~----------------~~ik~GAv~~Ll~L~s~e~~s~~~k~~~~~llns~f~~eq 85 (678)
T KOG1293|consen 22 DPEQLVRAIYMSKNLVIGFTDNKE----------------TNIKLGAVELLLALLSLEDGSTELKNGFAVLLNSLFLGEQ 85 (678)
T ss_pred CHHHHHHHHHHhcchhhcCCCccc----------------hhhhhcchHHHHhhccccCCchhhhhhHHHHHHhHHhhcc
Confidence 455667788899998887665322 1222222 22222222 345566666666655432
Q ss_pred -CCCch--hHHHHHHHHhhCCCC-hHHHhHHHHHHHHHHh
Q psy16982 79 -FPGKW--TQIVDKVSIYLQNPD-ATPWFGALLCLYQLVK 114 (116)
Q Consensus 79 -~P~~W--p~Ll~~l~~~l~~~~-~~~~~~~L~~L~~l~~ 114 (116)
+-+.- -.-++.+++++++.| ...+.+++.|++.+++
T Consensus 86 d~v~svL~~~~ll~Ll~LLs~sD~~~~le~~l~~lR~Ife 125 (678)
T KOG1293|consen 86 DKVDSVLRIIELLKLLQLLSESDSLNVLEKTLRCLRTIFE 125 (678)
T ss_pred chHHHHHHHhhHHHHHHHhcCcchHhHHHHHHHHHHHHHh
Confidence 11111 135678889999988 6788899999999886
No 245
>PF12921 ATP13: Mitochondrial ATPase expression; InterPro: IPR024319 ATPase expression protein 2 (also known as ATP13 in some species) is necessary for the expression of subunit 9 of mitochondrial ATPase. The protein has a basic amino terminal signal sequence that is cleaved upon import into mitochondria [].
Probab=20.25 E-value=1.5e+02 Score=19.04 Aligned_cols=30 Identities=13% Similarity=0.156 Sum_probs=16.3
Q ss_pred HHHHHHHHhhCCCChHHHhHHHHHHHHHHhhC
Q psy16982 85 QIVDKVSIYLQNPDATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 85 ~Ll~~l~~~l~~~~~~~~~~~L~~L~~l~~~y 116 (116)
.||..|+.....+ .....|+.++..+.++|
T Consensus 53 ~lL~AIv~sf~~n--~~i~~al~~vd~fs~~Y 82 (126)
T PF12921_consen 53 RLLIAIVHSFGYN--GDIFSALKLVDFFSRKY 82 (126)
T ss_pred HHHHHHHHHHHhc--ccHHHHHHHHHHHHHHc
Confidence 4555555544222 33456667766666665
No 246
>PF10390 ELL: RNA polymerase II elongation factor ELL ; InterPro: IPR019464 ELL is a family of RNA polymerase II elongation factors. It is bound stably to elongation-associated factors 1 and 2, EAFs, and together these act as a strong regulator of transcription activity. by direct interaction with Pol II. ELL binds to pol II on its own but the affinity is greatly increased by the cooperation of EAF []. Some members carry an occludin domain (IPR010844 from INTERPRO) just downstream. There is no Saccharomyces cerevisiae (Baker's yeast) member. ; GO: 0006368 transcription elongation from RNA polymerase II promoter, 0008023 transcription elongation factor complex; PDB: 2E5N_A 2DOA_A.
Probab=20.24 E-value=68 Score=23.77 Aligned_cols=21 Identities=5% Similarity=0.116 Sum_probs=14.7
Q ss_pred CccCCCCHhHHHHHHHHHHHH
Q psy16982 35 PLAFSLHEQDKAMIRDAIVDA 55 (116)
Q Consensus 35 ~~~~~i~~~~k~~IK~~ll~~ 55 (116)
.+|+.+++++|+.||..+..-
T Consensus 262 ~dWP~yse~erq~l~r~l~~~ 282 (284)
T PF10390_consen 262 KDWPGYSEEERQLLKRRLSRK 282 (284)
T ss_dssp TT-TT--TCHHHHHHHHHHHH
T ss_pred cCCCCCCHHHHHHHHHHHHhh
Confidence 458899999999999887654
No 247
>PLN02169 fatty acid (omega-1)-hydroxylase/midchain alkane hydroxylase
Probab=20.23 E-value=4.1e+02 Score=20.93 Aligned_cols=48 Identities=21% Similarity=0.295 Sum_probs=35.4
Q ss_pred HHHHHHHHHHHHhcCcHHHHHHHHHHHHHHHhccCCCchhHHHHHHHHhh
Q psy16982 45 KAMIRDAIVDAVVMAPEVIRVQLAVCVSNIVKHDFPGKWTQIVDKVSIYL 94 (116)
Q Consensus 45 k~~IK~~ll~~l~~~~~~i~~~l~~~i~~Ia~~d~P~~Wp~Ll~~l~~~l 94 (116)
.+.|++.++..+........+.++-++..++.+ |+-..-+..++-+.+
T Consensus 299 ~~~i~~~~~~~l~AG~dTTa~tl~w~l~~La~~--Pevq~kl~~Ei~~v~ 346 (500)
T PLN02169 299 DKFIRDVIFSLVLAGRDTTSSALTWFFWLLSKH--PQVMAKIRHEINTKF 346 (500)
T ss_pred hHHHHHHHHHHHHhchhHHHHHHHHHHHHHHCC--HHHHHHHHHHHHhhC
Confidence 346888888888877777888888888888886 566666666665544
No 248
>KOG1943|consensus
Probab=20.07 E-value=6.6e+02 Score=22.74 Aligned_cols=99 Identities=12% Similarity=0.129 Sum_probs=62.4
Q ss_pred CCCCChHHHHHHHHHhhhhhccccccccCCCCCCccCCCCHhHHHHHHHHHHHHHhcC-cHHHHHHHHHHHHHHHhc--c
Q psy16982 2 MSEVDMPVRQAGVVYLKNLITNQWVEKEVDPGTPLAFSLHEQDKAMIRDAIVDAVVMA-PEVIRVQLAVCVSNIVKH--D 78 (116)
Q Consensus 2 ~~~~~~~vR~~Aai~LKn~i~~~w~~~~~~~~~~~~~~i~~~~k~~IK~~ll~~l~~~-~~~i~~~l~~~i~~Ia~~--d 78 (116)
-++.|..||..||=-+++...+- |.+--..+=.++++++.-. +...=+..+.+++++|.. .
T Consensus 350 l~d~dt~VrWSaAKg~grvt~rl----------------p~~Lad~vi~svid~~~p~e~~~aWHgacLaLAELA~rGlL 413 (1133)
T KOG1943|consen 350 LSDTDTVVRWSAAKGLGRVTSRL----------------PPELADQVIGSVIDLFNPAEDDSAWHGACLALAELALRGLL 413 (1133)
T ss_pred ccCCcchhhHHHHHHHHHHHccC----------------cHHHHHHHHHHHHHhcCcCCchhHHHHHHHHHHHHHhcCCc
Confidence 35677889999997777766552 3333334444556644322 356667889999999876 5
Q ss_pred CCCchhHHHHHHHHhhCCC--------ChHHHhHHHHHHHHHHhhC
Q psy16982 79 FPGKWTQIVDKVSIYLQNP--------DATPWFGALLCLYQLVKNY 116 (116)
Q Consensus 79 ~P~~Wp~Ll~~l~~~l~~~--------~~~~~~~~L~~L~~l~~~y 116 (116)
.|..-++++|-|...+.=+ -.....++.++...+.+.|
T Consensus 414 lps~l~dVvplI~kaL~Yd~~~G~~s~G~~VRDaAcY~~WAf~Ray 459 (1133)
T KOG1943|consen 414 LPSLLEDVVPLILKALHYDVRRGQHSVGQHVRDAACYVCWAFARAY 459 (1133)
T ss_pred chHHHHHHHHHHHHHhhhhhhhcccccccchHHHHHHHHHHHHhcC
Confidence 5677778888887766421 1233456666666666554
Done!