Query 032259
Match_columns 144
No_of_seqs 112 out of 701
Neff 6.0
Searched_HMMs 46136
Date Fri Mar 29 11:43:52 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/032259.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/032259hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03200 cellulose synthase-in 99.2 1.4E-10 3.1E-15 115.2 12.5 100 5-108 1372-1473(2102)
2 PLN03200 cellulose synthase-in 99.2 3.4E-10 7.4E-15 112.6 12.7 101 5-108 460-561 (2102)
3 cd00020 ARM Armadillo/beta-cat 99.1 3.7E-09 8E-14 73.3 12.7 97 6-105 22-119 (120)
4 KOG4224 Armadillo repeat prote 98.8 3.3E-08 7.1E-13 85.6 9.2 101 4-108 180-282 (550)
5 cd00020 ARM Armadillo/beta-cat 98.6 9.5E-07 2.1E-11 61.0 9.9 79 27-108 2-80 (120)
6 KOG4224 Armadillo repeat prote 98.5 6E-07 1.3E-11 77.9 8.8 99 7-107 308-406 (550)
7 PF05804 KAP: Kinesin-associat 98.3 5E-06 1.1E-10 76.8 11.4 96 6-107 305-400 (708)
8 PF00514 Arm: Armadillo/beta-c 98.3 1.6E-06 3.5E-11 51.6 5.2 41 21-64 1-41 (41)
9 KOG0166 Karyopherin (importin) 98.2 1.7E-05 3.6E-10 71.0 10.5 102 5-109 124-227 (514)
10 PF05804 KAP: Kinesin-associat 98.1 3.6E-05 7.8E-10 71.3 10.9 99 5-107 263-361 (708)
11 PF04826 Arm_2: Armadillo-like 98.0 0.00011 2.3E-09 60.4 11.3 100 3-107 25-164 (254)
12 COG5064 SRP1 Karyopherin (impo 97.7 0.00013 2.8E-09 63.2 7.8 96 10-108 134-232 (526)
13 PF05536 Neurochondrin: Neuroc 97.7 0.00016 3.4E-09 65.1 8.3 99 6-108 114-212 (543)
14 smart00185 ARM Armadillo/beta- 97.6 0.0002 4.4E-09 41.3 5.2 40 22-64 2-41 (41)
15 KOG0166 Karyopherin (importin) 97.4 0.0015 3.1E-08 58.7 10.0 101 6-108 294-395 (514)
16 KOG2122 Beta-catenin-binding p 97.3 0.00037 8E-09 68.7 6.1 101 5-108 499-603 (2195)
17 PF00514 Arm: Armadillo/beta-c 97.3 0.00075 1.6E-08 39.9 5.0 40 67-106 2-41 (41)
18 PF09759 Atx10homo_assoc: Spin 97.3 0.003 6.5E-08 45.5 8.7 69 7-76 2-71 (102)
19 KOG1048 Neural adherens juncti 97.2 0.0031 6.8E-08 58.5 10.0 99 6-107 248-350 (717)
20 KOG4199 Uncharacterized conser 97.2 0.0071 1.5E-07 52.5 11.4 101 7-108 303-405 (461)
21 PF04826 Arm_2: Armadillo-like 97.1 0.0065 1.4E-07 49.9 10.4 100 6-111 110-210 (254)
22 KOG2160 Armadillo/beta-catenin 97.0 0.013 2.9E-07 50.3 11.7 103 4-108 137-242 (342)
23 KOG2122 Beta-catenin-binding p 97.0 0.00056 1.2E-08 67.5 3.7 116 3-118 450-571 (2195)
24 KOG4646 Uncharacterized conser 97.0 0.0022 4.7E-08 49.3 5.9 98 4-104 71-168 (173)
25 PF03224 V-ATPase_H_N: V-ATPas 97.0 0.0086 1.9E-07 49.7 10.0 101 6-108 165-271 (312)
26 PF05536 Neurochondrin: Neuroc 96.9 0.0066 1.4E-07 54.8 9.6 99 5-108 71-170 (543)
27 PF10508 Proteasom_PSMB: Prote 96.9 0.014 3E-07 51.9 11.4 103 3-108 131-233 (503)
28 KOG2160 Armadillo/beta-catenin 96.7 0.028 6.2E-07 48.2 11.1 97 8-107 100-197 (342)
29 KOG1048 Neural adherens juncti 96.5 0.026 5.6E-07 52.6 10.5 104 3-107 578-685 (717)
30 PF12031 DUF3518: Domain of un 96.5 0.011 2.5E-07 48.7 7.3 85 8-94 141-233 (257)
31 smart00185 ARM Armadillo/beta- 96.4 0.013 2.9E-07 33.4 5.1 37 69-105 4-40 (41)
32 COG5064 SRP1 Karyopherin (impo 96.2 0.027 5.8E-07 49.2 8.1 116 4-127 170-289 (526)
33 KOG3036 Protein involved in ce 96.2 0.093 2E-06 43.8 10.7 99 8-106 96-197 (293)
34 KOG4199 Uncharacterized conser 96.1 0.097 2.1E-06 45.6 11.2 104 5-111 256-364 (461)
35 cd00256 VATPase_H VATPase_H, r 95.9 0.15 3.2E-06 45.1 11.4 102 5-108 158-260 (429)
36 KOG4646 Uncharacterized conser 95.8 0.032 7E-07 43.0 6.2 99 5-107 31-129 (173)
37 PF13513 HEAT_EZ: HEAT-like re 94.9 0.07 1.5E-06 32.9 4.6 54 49-103 1-54 (55)
38 KOG1789 Endocytosis protein RM 94.5 0.5 1.1E-05 46.7 11.0 100 3-107 1784-1884(2235)
39 PF13513 HEAT_EZ: HEAT-like re 94.3 0.11 2.3E-06 32.0 4.5 53 7-62 3-55 (55)
40 PF04078 Rcd1: Cell differenti 94.2 0.44 9.5E-06 39.7 9.0 98 7-106 66-168 (262)
41 PF04063 DUF383: Domain of unk 94.2 0.82 1.8E-05 36.1 10.2 103 4-106 8-132 (192)
42 PF13646 HEAT_2: HEAT repeats; 93.8 0.14 3.1E-06 33.6 4.5 59 34-105 1-59 (88)
43 PF10508 Proteasom_PSMB: Prote 93.7 0.72 1.6E-05 41.1 10.1 99 5-108 52-150 (503)
44 PF13646 HEAT_2: HEAT repeats; 92.9 0.31 6.7E-06 32.0 5.0 56 32-101 31-87 (88)
45 KOG0168 Putative ubiquitin fus 92.6 0.91 2E-05 43.6 9.3 101 3-108 224-325 (1051)
46 PF03224 V-ATPase_H_N: V-ATPas 92.6 0.89 1.9E-05 37.7 8.4 77 8-86 215-292 (312)
47 COG5209 RCD1 Uncharacterized p 91.8 0.59 1.3E-05 38.9 6.3 99 8-106 117-218 (315)
48 KOG4500 Rho/Rac GTPase guanine 91.8 0.95 2.1E-05 40.8 7.9 86 4-89 100-190 (604)
49 PF08045 CDC14: Cell division 91.6 4 8.7E-05 33.9 11.0 97 8-106 108-207 (257)
50 PF12755 Vac14_Fab1_bd: Vacuol 91.5 1.4 3E-05 31.1 7.2 69 33-105 28-96 (97)
51 KOG1293 Proteins containing ar 90.6 3.6 7.7E-05 38.4 10.6 113 5-122 433-552 (678)
52 PF09759 Atx10homo_assoc: Spin 90.4 1.6 3.4E-05 31.4 6.7 57 52-108 3-61 (102)
53 PF11841 DUF3361: Domain of un 89.6 3.3 7.1E-05 32.1 8.3 58 4-64 73-131 (160)
54 PF11841 DUF3361: Domain of un 88.9 4.7 0.0001 31.2 8.7 106 6-112 32-137 (160)
55 PF04063 DUF383: Domain of unk 88.1 3 6.4E-05 33.0 7.4 80 5-87 72-157 (192)
56 PF10165 Ric8: Guanine nucleot 87.7 4.3 9.2E-05 35.8 8.9 96 11-108 1-110 (446)
57 KOG1222 Kinesin associated pro 87.4 2.1 4.6E-05 39.2 6.8 99 7-113 320-418 (791)
58 PF10165 Ric8: Guanine nucleot 87.4 6.8 0.00015 34.5 10.0 84 4-87 45-132 (446)
59 PRK09687 putative lyase; Provi 86.1 2.8 6.1E-05 34.7 6.5 59 33-103 160-218 (280)
60 PF12348 CLASP_N: CLASP N term 86.1 13 0.00027 28.8 9.9 63 47-109 143-209 (228)
61 PF02985 HEAT: HEAT repeat; I 85.4 2.1 4.5E-05 23.5 3.8 28 34-64 2-29 (31)
62 KOG1222 Kinesin associated pro 85.1 2.8 6E-05 38.5 6.3 88 3-96 357-444 (791)
63 KOG2171 Karyopherin (importin) 84.6 6.6 0.00014 38.6 8.9 97 9-107 407-505 (1075)
64 KOG1293 Proteins containing ar 84.4 4.9 0.00011 37.5 7.7 108 2-114 388-496 (678)
65 KOG4500 Rho/Rac GTPase guanine 83.7 6.5 0.00014 35.6 7.9 102 6-108 330-433 (604)
66 KOG3036 Protein involved in ce 83.6 20 0.00042 30.3 10.2 98 6-106 140-247 (293)
67 PF01602 Adaptin_N: Adaptin N 83.5 9.8 0.00021 32.9 9.0 85 7-104 95-179 (526)
68 PF02985 HEAT: HEAT repeat; I 83.3 2.6 5.7E-05 23.1 3.6 29 78-106 1-29 (31)
69 PF06371 Drf_GBD: Diaphanous G 82.9 4.8 0.0001 30.2 6.1 56 6-64 131-187 (187)
70 PF11698 V-ATPase_H_C: V-ATPas 82.4 5.2 0.00011 29.5 5.8 71 33-105 44-114 (119)
71 KOG0946 ER-Golgi vesicle-tethe 81.2 14 0.00031 35.5 9.5 97 7-106 138-241 (970)
72 cd03567 VHS_GGA VHS domain fam 80.8 17 0.00036 27.2 8.3 79 33-114 39-124 (139)
73 cd03568 VHS_STAM VHS domain fa 79.0 18 0.00039 27.1 8.0 77 33-112 38-116 (144)
74 KOG1789 Endocytosis protein RM 78.9 8.4 0.00018 38.7 7.4 75 9-87 2065-2140(2235)
75 PF05004 IFRD: Interferon-rela 78.7 20 0.00043 30.1 9.0 98 3-105 55-159 (309)
76 PRK09687 putative lyase; Provi 78.6 9.9 0.00021 31.4 7.0 70 33-104 91-186 (280)
77 PF06371 Drf_GBD: Diaphanous G 78.5 2.3 5.1E-05 31.9 3.0 80 25-104 100-185 (187)
78 PF00790 VHS: VHS domain; Int 78.5 14 0.00031 27.1 7.2 73 33-108 43-120 (140)
79 cd03569 VHS_Hrs_Vps27p VHS dom 77.5 22 0.00048 26.5 8.1 75 33-110 42-118 (142)
80 KOG0168 Putative ubiquitin fus 77.3 14 0.0003 35.9 8.2 91 6-104 270-362 (1051)
81 KOG2171 Karyopherin (importin) 76.4 4.9 0.00011 39.4 5.1 113 4-125 816-929 (1075)
82 cd00256 VATPase_H VATPase_H, r 75.3 17 0.00038 32.2 7.9 95 10-106 206-308 (429)
83 TIGR03504 FimV_Cterm FimV C-te 73.3 5.4 0.00012 24.2 3.1 34 71-104 11-44 (44)
84 KOG2759 Vacuolar H+-ATPase V1 72.1 40 0.00087 30.2 9.3 79 9-89 175-253 (442)
85 PF12348 CLASP_N: CLASP N term 69.3 33 0.00071 26.4 7.6 95 7-109 69-163 (228)
86 PF06012 DUF908: Domain of Unk 68.5 36 0.00079 28.6 8.2 70 9-78 240-311 (329)
87 cd03561 VHS VHS domain family; 67.7 45 0.00097 24.2 8.2 73 33-108 38-114 (133)
88 PF14664 RICTOR_N: Rapamycin-i 66.5 42 0.00092 29.0 8.3 93 8-107 85-178 (371)
89 PRK13800 putative oxidoreducta 64.6 44 0.00096 32.0 8.7 70 32-104 652-741 (897)
90 PRK13800 putative oxidoreducta 62.5 45 0.00098 31.9 8.4 73 30-105 619-712 (897)
91 PF14663 RasGEF_N_2: Rapamycin 61.4 31 0.00066 24.8 5.6 52 33-90 9-60 (115)
92 KOG1517 Guanine nucleotide bin 61.0 71 0.0015 32.1 9.3 96 11-108 577-673 (1387)
93 PF06543 Lac_bphage_repr: Lact 58.7 4.9 0.00011 25.2 0.8 19 110-128 14-32 (49)
94 smart00288 VHS Domain present 55.6 78 0.0017 23.0 7.7 73 33-108 38-113 (133)
95 PF08216 CTNNBL: Catenin-beta- 53.9 30 0.00065 25.2 4.4 36 8-43 63-98 (108)
96 KOG2973 Uncharacterized conser 53.4 78 0.0017 27.5 7.5 92 7-107 19-112 (353)
97 PF04078 Rcd1: Cell differenti 53.3 52 0.0011 27.5 6.3 54 8-64 113-168 (262)
98 KOG2611 Neurochondrin/leucine- 51.7 1.1E+02 0.0023 28.5 8.3 74 9-87 81-162 (698)
99 PTZ00429 beta-adaptin; Provisi 51.2 90 0.002 29.7 8.2 71 30-108 177-247 (746)
100 PF06685 DUF1186: Protein of u 50.2 99 0.0021 25.5 7.4 86 30-116 71-171 (249)
101 cd03572 ENTH_epsin_related ENT 48.5 1E+02 0.0022 22.7 6.6 69 36-107 42-120 (122)
102 PF11698 V-ATPase_H_C: V-ATPas 46.7 66 0.0014 23.7 5.3 45 15-62 68-113 (119)
103 PF14726 RTTN_N: Rotatin, an a 46.2 84 0.0018 22.2 5.7 36 7-42 46-81 (98)
104 PF14668 RICTOR_V: Rapamycin-i 45.2 92 0.002 20.8 6.7 56 10-67 6-61 (73)
105 PF01602 Adaptin_N: Adaptin N 43.5 59 0.0013 28.0 5.5 91 7-107 130-222 (526)
106 COG1413 FOG: HEAT repeat [Ener 43.0 75 0.0016 25.9 5.8 62 33-108 75-137 (335)
107 KOG2973 Uncharacterized conser 42.6 90 0.0019 27.1 6.2 62 8-73 59-120 (353)
108 PF05918 API5: Apoptosis inhib 42.3 65 0.0014 29.7 5.7 16 49-64 54-69 (556)
109 COG5096 Vesicle coat complex, 42.1 1.2E+02 0.0026 29.1 7.5 69 33-107 128-196 (757)
110 COG1413 FOG: HEAT repeat [Ener 41.3 1.1E+02 0.0023 25.0 6.4 59 32-102 180-238 (335)
111 PF12717 Cnd1: non-SMC mitotic 40.7 1.3E+02 0.0028 22.7 6.4 52 49-105 2-53 (178)
112 PF13764 E3_UbLigase_R4: E3 ub 39.9 1.2E+02 0.0026 29.2 7.3 78 28-106 113-197 (802)
113 KOG2999 Regulator of Rac1, req 38.1 3.5E+02 0.0076 25.5 10.8 101 11-111 103-247 (713)
114 PF12717 Cnd1: non-SMC mitotic 37.7 1.3E+02 0.0028 22.7 6.0 90 6-108 3-94 (178)
115 PRK00927 tryptophanyl-tRNA syn 37.0 1.1E+02 0.0024 25.9 6.0 70 34-106 256-326 (333)
116 PF12936 Kri1_C: KRI1-like fam 36.8 16 0.00034 25.8 0.7 29 110-138 11-40 (93)
117 PTZ00429 beta-adaptin; Provisi 36.6 1.2E+02 0.0027 28.8 6.7 67 36-108 144-210 (746)
118 PHA02102 hypothetical protein 35.9 45 0.00097 22.3 2.7 16 109-124 35-50 (72)
119 KOG1242 Protein containing ada 35.4 1.2E+02 0.0027 28.1 6.3 60 48-108 267-326 (569)
120 KOG3431 Apoptosis-related prot 35.3 22 0.00047 26.6 1.2 28 58-86 75-102 (129)
121 PF08167 RIX1: rRNA processing 35.2 1.9E+02 0.0042 21.7 8.0 82 21-106 53-143 (165)
122 KOG1242 Protein containing ada 35.1 1.5E+02 0.0032 27.6 6.7 102 7-111 270-406 (569)
123 PF10954 DUF2755: Protein of u 34.2 64 0.0014 23.0 3.4 47 48-94 20-66 (100)
124 COG5209 RCD1 Uncharacterized p 33.3 3E+02 0.0064 23.2 8.0 87 7-96 162-258 (315)
125 PF06025 DUF913: Domain of Unk 33.0 3.3E+02 0.0071 23.6 8.5 81 6-87 124-206 (379)
126 KOG3678 SARM protein (with ste 32.0 2.5E+02 0.0055 26.2 7.6 69 8-80 199-267 (832)
127 PF12031 DUF3518: Domain of un 31.9 78 0.0017 26.4 4.1 43 2-44 184-228 (257)
128 PF06345 Drf_DAD: DRF Autoregu 31.1 50 0.0011 15.7 1.7 14 77-90 1-14 (15)
129 TIGR02270 conserved hypothetic 28.6 1.8E+02 0.0038 25.6 6.0 26 79-104 149-174 (410)
130 KOG1981 SOK1 kinase belonging 28.2 3.3E+02 0.0071 25.0 7.7 89 19-107 148-238 (513)
131 PF11701 UNC45-central: Myosin 27.8 2.6E+02 0.0056 20.8 8.0 88 11-101 63-154 (157)
132 KOG1248 Uncharacterized conser 27.7 3.1E+02 0.0068 27.7 7.9 61 48-108 840-900 (1176)
133 COG5231 VMA13 Vacuolar H+-ATPa 27.0 4.5E+02 0.0097 23.3 8.5 77 11-88 169-246 (432)
134 PRK12556 tryptophanyl-tRNA syn 26.7 1.3E+02 0.0028 25.6 4.8 56 48-106 271-327 (332)
135 PF08045 CDC14: Cell division 26.7 2E+02 0.0044 23.8 5.7 50 14-64 158-207 (257)
136 cd00197 VHS_ENTH_ANTH VHS, ENT 26.4 2.2E+02 0.0048 19.6 8.2 69 33-104 38-113 (115)
137 KOG2611 Neurochondrin/leucine- 26.4 1.6E+02 0.0034 27.4 5.3 56 6-63 126-181 (698)
138 KOG2259 Uncharacterized conser 26.0 2.3E+02 0.0049 27.3 6.4 88 6-106 388-475 (823)
139 PRK12282 tryptophanyl-tRNA syn 25.5 2.5E+02 0.0054 23.9 6.3 70 34-106 255-325 (333)
140 KOG1517 Guanine nucleotide bin 24.2 1.2E+02 0.0026 30.6 4.4 97 17-116 539-636 (1387)
141 PF03130 HEAT_PBS: PBS lyase H 23.8 79 0.0017 16.6 1.9 10 78-87 17-26 (27)
142 PF12460 MMS19_C: RNAPII trans 23.0 4.7E+02 0.01 22.5 7.6 72 33-107 324-395 (415)
143 KOG2759 Vacuolar H+-ATPase V1 22.4 4.9E+02 0.011 23.5 7.5 50 15-66 224-273 (442)
144 KOG2734 Uncharacterized conser 22.1 4.5E+02 0.0097 24.1 7.3 77 6-83 197-274 (536)
145 KOG1077 Vesicle coat complex A 22.0 7.5E+02 0.016 24.2 9.1 83 3-96 341-423 (938)
146 cd03565 VHS_Tom1 VHS domain fa 21.9 3.3E+02 0.0072 20.1 8.4 77 33-111 39-120 (141)
147 TIGR02270 conserved hypothetic 21.6 5.6E+02 0.012 22.5 9.4 26 34-62 149-174 (410)
148 PF04064 DUF384: Domain of unk 21.3 2.4E+02 0.0051 18.1 6.0 52 58-110 1-53 (58)
No 1
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=99.20 E-value=1.4e-10 Score=115.15 Aligned_cols=100 Identities=13% Similarity=0.124 Sum_probs=89.5
Q ss_pred cch-HHHHHHHHHHhcCChhhHHHHHhcCc-HHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 5 GES-TSRVLLILCNLAASNEGRSAILDANG-VSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 5 ~~~-~E~AlavL~~La~~~egr~aI~~~g~-I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
... .|.|+.+|..|+.|++||+++.++++ ||.++ ++..+ |+..+|.|+.+||.||...+.++..+.+.|++..|
T Consensus 1372 ~~~~~E~Al~vLd~Lc~~eegre~~~~h~a~vplV~-~ilrv---S~~a~E~AV~aL~kl~~~~~~v~~Emv~~G~~~kl 1447 (2102)
T PLN03200 1372 SSTAQEAGVCALDRLLDDEQLAELVAAHGAVVPLVG-LVVGT---NYVLHEAAISALIKLGKDRPPCKLDMVKAGIIERV 1447 (2102)
T ss_pred CchHHHHHHHHHHHHhcCHhhHHHHHHcCChhhHHH-HHHcC---CHHHHHHHHHHHHHHhCCChHHHHHHHHhCHHHHH
Confidence 344 89999999999999999999998775 55555 55554 89999999999999998888889999999999999
Q ss_pred HHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 83 REVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 83 ~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+.+++..+.+.|+||+.+|++|+...
T Consensus 1448 lllLQ~c~~~lkekAaeLLrlL~~~~ 1473 (2102)
T PLN03200 1448 LDILPEAPDSLCSAIAELLRILTNNS 1473 (2102)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhccch
Confidence 99999999999999999999999865
No 2
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=99.16 E-value=3.4e-10 Score=112.57 Aligned_cols=101 Identities=22% Similarity=0.288 Sum_probs=93.0
Q ss_pred cchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 5 GESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 5 ~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
....+.|+.+|.+|+. +++.+..|.++|+||+|+++|+++ +++.||+|+.+|++||.+++..+.++.+.|++|+|+
T Consensus 460 ~~iQ~~A~~~L~nLa~~ndenr~aIieaGaIP~LV~LL~s~---~~~iqeeAawAL~NLa~~~~qir~iV~~aGAIppLV 536 (2102)
T PLN03200 460 EQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETG---SQKAKEDSATVLWNLCCHSEDIRACVESAGAVPALL 536 (2102)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHCCCHHHHHHHHcCC---CHHHHHHHHHHHHHHhCCcHHHHHHHHHCCCHHHHH
Confidence 4567899999999996 788999999999999999999987 799999999999999998778888888999999999
Q ss_pred HHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 84 EVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+++.+|+++.|+.|..+|..|....
T Consensus 537 ~LL~sgd~~~q~~Aa~AL~nLi~~~ 561 (2102)
T PLN03200 537 WLLKNGGPKGQEIAAKTLTKLVRTA 561 (2102)
T ss_pred HHHhCCCHHHHHHHHHHHHHHHhcc
Confidence 9999999999999999999997653
No 3
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=99.10 E-value=3.7e-09 Score=73.27 Aligned_cols=97 Identities=25% Similarity=0.286 Sum_probs=89.2
Q ss_pred chHHHHHHHHHHhcCC-hhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 6 ESTSRVLLILCNLAAS-NEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 6 ~~~E~AlavL~~La~~-~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
.+.+.++..|.+++.. ++.+..+.+.+++|.|+++|.+. +++.+++|+.+|.+||...+.........|+++.|+.
T Consensus 22 ~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~---~~~v~~~a~~~L~~l~~~~~~~~~~~~~~g~l~~l~~ 98 (120)
T cd00020 22 NVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSE---DEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVN 98 (120)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCC---CHHHHHHHHHHHHHHccCcHHHHHHHHHCCChHHHHH
Confidence 5789999999999986 99999999999999999999986 7999999999999999977677777889999999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhh
Q 032259 85 VEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
++..+....++.|..+|..|.
T Consensus 99 ~l~~~~~~~~~~a~~~l~~l~ 119 (120)
T cd00020 99 LLDSSNEDIQKNATGALSNLA 119 (120)
T ss_pred HHhcCCHHHHHHHHHHHHHhh
Confidence 999999999999999998874
No 4
>KOG4224 consensus Armadillo repeat protein VAC8 required for vacuole fusion, inheritance and cytosol-to-vacuole protein targeting [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.78 E-value=3.3e-08 Score=85.62 Aligned_cols=101 Identities=21% Similarity=0.281 Sum_probs=89.4
Q ss_pred CcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcC--cHHH
Q 032259 4 SGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEAR--AAEV 81 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~G--ai~~ 81 (144)
+-+..-.|..+|.++....|+|..++.+|++|.||.++..| .+..|++|..++-++..+... |.+-.+.+ +||.
T Consensus 180 dirvqrnatgaLlnmThs~EnRr~LV~aG~lpvLVsll~s~---d~dvqyycttaisnIaVd~~~-Rk~Laqaep~lv~~ 255 (550)
T KOG4224|consen 180 DIRVQRNATGALLNMTHSRENRRVLVHAGGLPVLVSLLKSG---DLDVQYYCTTAISNIAVDRRA-RKILAQAEPKLVPA 255 (550)
T ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhhhccC---ChhHHHHHHHHhhhhhhhHHH-HHHHHhcccchHHH
Confidence 34556789999999999999999999999999999999998 789999999999999986544 44445666 9999
Q ss_pred HHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 82 LREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 82 L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
|++|..+|++++|-.|.-+|++|....
T Consensus 256 Lv~Lmd~~s~kvkcqA~lALrnlasdt 282 (550)
T KOG4224|consen 256 LVDLMDDGSDKVKCQAGLALRNLASDT 282 (550)
T ss_pred HHHHHhCCChHHHHHHHHHHhhhcccc
Confidence 999999999999999999999998764
No 5
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=98.57 E-value=9.5e-07 Score=61.02 Aligned_cols=79 Identities=24% Similarity=0.236 Sum_probs=71.2
Q ss_pred HHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 27 AILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 27 aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
.+.+.|.|+.|+++|.++ ++..++.|+.+|.++|..++.......+.|++|.|+.++.+..++.++.|..+|..|..
T Consensus 2 ~~~~~~~i~~l~~~l~~~---~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~ 78 (120)
T cd00020 2 AVIQAGGLPALVSLLSSS---DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAA 78 (120)
T ss_pred hHHHcCChHHHHHHHHcC---CHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHcc
Confidence 467789999999999987 68999999999999999877667777788999999999999999999999999999977
Q ss_pred CC
Q 032259 107 RE 108 (144)
Q Consensus 107 ~~ 108 (144)
..
T Consensus 79 ~~ 80 (120)
T cd00020 79 GP 80 (120)
T ss_pred Cc
Confidence 53
No 6
>KOG4224 consensus Armadillo repeat protein VAC8 required for vacuole fusion, inheritance and cytosol-to-vacuole protein targeting [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.50 E-value=6e-07 Score=77.93 Aligned_cols=99 Identities=18% Similarity=0.151 Sum_probs=86.5
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
++-..++-+.|++.||-+..-|.++|.+.+||++|+.| .+.+.|=||+.+||+|+-.++..+..+...|+||.|.+|+
T Consensus 308 lilasVaCIrnisihplNe~lI~dagfl~pLVrlL~~~--dnEeiqchAvstLrnLAasse~n~~~i~esgAi~kl~eL~ 385 (550)
T KOG4224|consen 308 LILASVACIRNISIHPLNEVLIADAGFLRPLVRLLRAG--DNEEIQCHAVSTLRNLAASSEHNVSVIRESGAIPKLIELL 385 (550)
T ss_pred HHHHHHHHHhhcccccCcccceecccchhHHHHHHhcC--CchhhhhhHHHHHHHHhhhhhhhhHHHhhcCchHHHHHHH
Confidence 44556777889999999999999999999999999998 4888999999999999987777788888999999999999
Q ss_pred HhCCHHHHHHHHHHHHHhhcC
Q 032259 87 ERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L~~~ 107 (144)
.+|.-..+..-...+..|.=.
T Consensus 386 lD~pvsvqseisac~a~Lal~ 406 (550)
T KOG4224|consen 386 LDGPVSVQSEISACIAQLALN 406 (550)
T ss_pred hcCChhHHHHHHHHHHHHHhc
Confidence 999888887777777777654
No 7
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=98.34 E-value=5e-06 Score=76.84 Aligned_cols=96 Identities=19% Similarity=0.158 Sum_probs=83.3
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
++.-.++..|.+|+...+++..|.+.|.||.|++++.+. +....+.|..+|++|+.+. ..|..+.+.|++|.|+.+
T Consensus 305 ellil~v~fLkkLSi~~ENK~~m~~~giV~kL~kLl~s~---~~~l~~~aLrlL~NLSfd~-~~R~~mV~~GlIPkLv~L 380 (708)
T PF05804_consen 305 ELLILAVTFLKKLSIFKENKDEMAESGIVEKLLKLLPSE---NEDLVNVALRLLFNLSFDP-ELRSQMVSLGLIPKLVEL 380 (708)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHcCCHHHHHHHhcCC---CHHHHHHHHHHHHHhCcCH-HHHHHHHHCCCcHHHHHH
Confidence 455668899999999999999999999999999999986 6889999999999999765 458888899999999999
Q ss_pred HHhCCHHHHHHHHHHHHHhhcC
Q 032259 86 EERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 86 l~~Gt~raK~kA~~lL~~L~~~ 107 (144)
+.+.+ -+..+..+|.+|+..
T Consensus 381 L~d~~--~~~val~iLy~LS~d 400 (708)
T PF05804_consen 381 LKDPN--FREVALKILYNLSMD 400 (708)
T ss_pred hCCCc--hHHHHHHHHHHhccC
Confidence 98765 445688888888773
No 8
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=98.32 E-value=1.6e-06 Score=51.59 Aligned_cols=41 Identities=34% Similarity=0.440 Sum_probs=38.2
Q ss_pred ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 21 SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 21 ~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
+++++..|.+.|+||+|+++|++. ++..+++|+.+|.+||.
T Consensus 1 ~~~~~~~i~~~g~i~~Lv~ll~~~---~~~v~~~a~~al~nl~~ 41 (41)
T PF00514_consen 1 SPENKQAIVEAGGIPPLVQLLKSP---DPEVQEEAAWALGNLAA 41 (41)
T ss_dssp SHHHHHHHHHTTHHHHHHHHTTSS---SHHHHHHHHHHHHHHHT
T ss_pred CHHHHHHHHHcccHHHHHHHHcCC---CHHHHHHHHHHHHHHhC
Confidence 578999999999999999999986 89999999999999984
No 9
>KOG0166 consensus Karyopherin (importin) alpha [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.17 E-value=1.7e-05 Score=70.95 Aligned_cols=102 Identities=20% Similarity=0.220 Sum_probs=91.3
Q ss_pred cchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 5 GESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 5 ~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
..+..+|+=+|.|+|. ..+-...++++|+||.++.+|.+. +..-+|.|+-+|-++.-+++.+|..+...|++++|+
T Consensus 124 ~~lq~eAAWaLTnIAsgtse~T~~vv~agavp~fi~Ll~s~---~~~v~eQavWALgNIagds~~~Rd~vl~~g~l~pLl 200 (514)
T KOG0166|consen 124 PTLQFEAAWALTNIASGTSEQTKVVVDAGAVPIFIQLLSSP---SADVREQAVWALGNIAGDSPDCRDYVLSCGALDPLL 200 (514)
T ss_pred hhHHHHHHHHHHHHhcCchhhccccccCCchHHHHHHhcCC---cHHHHHHHHHHHhccccCChHHHHHHHhhcchHHHH
Confidence 4567789999999997 677788889999999999999986 788999999999999999999999999999999999
Q ss_pred HHHHhCCH-HHHHHHHHHHHHhhcCCC
Q 032259 84 EVEERGSQ-RAKEKAKRILGMLKGRED 109 (144)
Q Consensus 84 ~Ll~~Gt~-raK~kA~~lL~~L~~~~~ 109 (144)
.++....+ .-.+.+..+|.+|.+..|
T Consensus 201 ~~l~~~~~~~~lRn~tW~LsNlcrgk~ 227 (514)
T KOG0166|consen 201 RLLNKSDKLSMLRNATWTLSNLCRGKN 227 (514)
T ss_pred HHhccccchHHHHHHHHHHHHHHcCCC
Confidence 99988865 778999999999988763
No 10
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=98.07 E-value=3.6e-05 Score=71.25 Aligned_cols=99 Identities=21% Similarity=0.096 Sum_probs=87.8
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
..+.--++.+|.|||.++..+..+.+.|.|+.||++|.++ +.+-.-.++..|..||...++ +..+.+.|++++|..
T Consensus 263 eqLlrv~~~lLlNLAed~~ve~kM~~~~iV~~Lv~~Ldr~---n~ellil~v~fLkkLSi~~EN-K~~m~~~giV~kL~k 338 (708)
T PF05804_consen 263 EQLLRVAFYLLLNLAEDPRVELKMVNKGIVSLLVKCLDRE---NEELLILAVTFLKKLSIFKEN-KDEMAESGIVEKLLK 338 (708)
T ss_pred HHHHHHHHHHHHHHhcChHHHHHHHhcCCHHHHHHHHcCC---CHHHHHHHHHHHHHHcCCHHH-HHHHHHcCCHHHHHH
Confidence 3455668889999999999999999999999999999886 788899999999999987654 667779999999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhhcC
Q 032259 85 VEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
++..+.....+.|..+|.+|+-.
T Consensus 339 Ll~s~~~~l~~~aLrlL~NLSfd 361 (708)
T PF05804_consen 339 LLPSENEDLVNVALRLLFNLSFD 361 (708)
T ss_pred HhcCCCHHHHHHHHHHHHHhCcC
Confidence 99999999999999999998764
No 11
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=98.00 E-value=0.00011 Score=60.43 Aligned_cols=100 Identities=22% Similarity=0.271 Sum_probs=81.2
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChH--------------
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLR-------------- 68 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~-------------- 68 (144)
++..+.|.|+.+|.+.+..+.++..|.+.|+|+.+..+|.+. ++..|+.|+-+|-+|+...+.
T Consensus 25 ~dp~i~e~al~al~n~aaf~~nq~~Ir~~Ggi~lI~~lL~~p---~~~vr~~AL~aL~Nls~~~en~~~Ik~~i~~Vc~~ 101 (254)
T PF04826_consen 25 EDPFIQEKALIALGNSAAFPFNQDIIRDLGGISLIGSLLNDP---NPSVREKALNALNNLSVNDENQEQIKMYIPQVCEE 101 (254)
T ss_pred CChHHHHHHHHHHHhhccChhHHHHHHHcCCHHHHHHHcCCC---ChHHHHHHHHHHHhcCCChhhHHHHHHHHHHHHHH
Confidence 455688999999999999999999999999999999999986 788889998888877632211
Q ss_pred --------------------------HHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcC
Q 032259 69 --------------------------FKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 69 --------------------------~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
...+. ...+|.|+.|+..|+..+|..+-.+|-.|+..
T Consensus 102 ~~s~~lns~~Q~agLrlL~nLtv~~~~~~~l--~~~i~~ll~LL~~G~~~~k~~vLk~L~nLS~n 164 (254)
T PF04826_consen 102 TVSSPLNSEVQLAGLRLLTNLTVTNDYHHML--ANYIPDLLSLLSSGSEKTKVQVLKVLVNLSEN 164 (254)
T ss_pred HhcCCCCCHHHHHHHHHHHccCCCcchhhhH--HhhHHHHHHHHHcCChHHHHHHHHHHHHhccC
Confidence 01111 23578899999999999999999988888875
No 12
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=97.73 E-value=0.00013 Score=63.24 Aligned_cols=96 Identities=19% Similarity=0.211 Sum_probs=81.4
Q ss_pred HHHHHHHHhcCChhhHH-HHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHh
Q 032259 10 RVLLILCNLAASNEGRS-AILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEER 88 (144)
Q Consensus 10 ~AlavL~~La~~~egr~-aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~ 88 (144)
+|.=+|.|+++-..... .++++|+||.++.+|.++ +...+|.||=+|.+++-+++.+|..+...|++.+|+.|+.+
T Consensus 134 EAaWalTNiaSGtt~QTkvVvd~~AVPlfiqlL~s~---~~~V~eQavWALGNiAGDS~~~RD~vL~~galeplL~ll~s 210 (526)
T COG5064 134 EAAWALTNIASGTTQQTKVVVDAGAVPLFIQLLSST---EDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLS 210 (526)
T ss_pred HHHHHHhhhccCcccceEEEEeCCchHHHHHHHcCc---hHHHHHHHHHHhccccCCchhHHHHHHhcCchHHHHHHHHh
Confidence 56778899997544444 447899999999999987 67889999999999999889999999999999999999987
Q ss_pred CCH--HHHHHHHHHHHHhhcCC
Q 032259 89 GSQ--RAKEKAKRILGMLKGRE 108 (144)
Q Consensus 89 Gt~--raK~kA~~lL~~L~~~~ 108 (144)
-.+ .--++|..+|..|.+-.
T Consensus 211 s~~~ismlRn~TWtLSNlcRGk 232 (526)
T COG5064 211 SAIHISMLRNATWTLSNLCRGK 232 (526)
T ss_pred ccchHHHHHHhHHHHHHhhCCC
Confidence 655 55688999998888755
No 13
>PF05536 Neurochondrin: Neurochondrin
Probab=97.70 E-value=0.00016 Score=65.09 Aligned_cols=99 Identities=19% Similarity=0.149 Sum_probs=74.5
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
+++++|+.+|..++.+|+|+.++.+.|+|+.|++++.++ +..+|.|..+|.+++.............+....+..|
T Consensus 114 ~~v~dalqcL~~Ias~~~G~~aLl~~g~v~~L~ei~~~~----~~~~E~Al~lL~~Lls~~~~~~~~~~~~~l~~il~~L 189 (543)
T PF05536_consen 114 ETVDDALQCLLAIASSPEGAKALLESGAVPALCEIIPNQ----SFQMEIALNLLLNLLSRLGQKSWAEDSQLLHSILPSL 189 (543)
T ss_pred hHHHHHHHHHHHHHcCcHhHHHHHhcCCHHHHHHHHHhC----cchHHHHHHHHHHHHHhcchhhhhhhHHHHHHHHHHH
Confidence 689999999999999999999999999999999999984 8999999999999988433111112233444455555
Q ss_pred HHhCCHHHHHHHHHHHHHhhcCC
Q 032259 86 EERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 86 l~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
...-+......+-.++..|..+-
T Consensus 190 a~~fs~~~~~~kfell~~L~~~L 212 (543)
T PF05536_consen 190 ARDFSSFHGEDKFELLEFLSAFL 212 (543)
T ss_pred HHHHHhhccchHHHHHHHHHHhc
Confidence 55555555666666676666644
No 14
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=97.62 E-value=0.0002 Score=41.27 Aligned_cols=40 Identities=33% Similarity=0.469 Sum_probs=35.8
Q ss_pred hhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 22 NEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 22 ~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
++++..|.+.|+||+|+++|..+ ++..+++|+.+|.+|+.
T Consensus 2 ~~~~~~i~~~g~i~~L~~ll~~~---~~~i~~~a~~aL~nl~~ 41 (41)
T smart00185 2 DEQKQAVVDAGGLPALVELLKSE---DEEVVKEAAWALSNLSS 41 (41)
T ss_pred cHHHHHHHHCCCHHHHHHHHcCC---CHHHHHHHHHHHHHHcC
Confidence 46888999999999999999975 79999999999999973
No 15
>KOG0166 consensus Karyopherin (importin) alpha [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.39 E-value=0.0015 Score=58.72 Aligned_cols=101 Identities=15% Similarity=0.224 Sum_probs=85.0
Q ss_pred chHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 6 ESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 6 ~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
.+.--|+..+.|+++ .++-...+.+.|++|.|..+|... ....-|+.|+=++.+++.++......+.++|++|.|+.
T Consensus 294 ~v~~PaLRaiGNIvtG~d~QTq~vi~~~~L~~l~~ll~~s--~~~~ikkEAcW~iSNItAG~~~qiqaVida~l~p~Li~ 371 (514)
T KOG0166|consen 294 KVVTPALRAIGNIVTGSDEQTQVVINSGALPVLSNLLSSS--PKESIKKEACWTISNITAGNQEQIQAVIDANLIPVLIN 371 (514)
T ss_pred ccccHHHhhccceeeccHHHHHHHHhcChHHHHHHHhccC--cchhHHHHHHHHHHHhhcCCHHHHHHHHHcccHHHHHH
Confidence 345568999999987 556677778999999999999863 13335788999999999998876777789999999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 85 VEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+++++.-+.|+.|+.++.++-...
T Consensus 372 ~l~~~ef~~rKEAawaIsN~ts~g 395 (514)
T KOG0166|consen 372 LLQTAEFDIRKEAAWAISNLTSSG 395 (514)
T ss_pred HHhccchHHHHHHHHHHHhhcccC
Confidence 999999999999999999887765
No 16
>KOG2122 consensus Beta-catenin-binding protein APC, contains ARM repeats [Signal transduction mechanisms; Cytoskeleton]
Probab=97.35 E-value=0.00037 Score=68.67 Aligned_cols=101 Identities=17% Similarity=0.096 Sum_probs=84.6
Q ss_pred cchHHHHHHHHHHhc----CChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHH
Q 032259 5 GESTSRVLLILCNLA----ASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAE 80 (144)
Q Consensus 5 ~~~~E~AlavL~~La----~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~ 80 (144)
..++|.|=.||.|++ .|++.|..+.+++||..|+..|++. |-...-||+.+||+|+..++.-++..-+.|+|+
T Consensus 499 LaIIEsaGGILRNVSS~IAt~E~yRQILR~~NCLq~LLQ~LKS~---SLTiVSNaCGTLWNLSAR~p~DQq~LwD~gAv~ 575 (2195)
T KOG2122|consen 499 LAIIESAGGILRNVSSLIATCEDYRQILRRHNCLQTLLQHLKSH---SLTIVSNACGTLWNLSARSPEDQQMLWDDGAVP 575 (2195)
T ss_pred hhhhhcCccHHHHHHhHhhccchHHHHHHHhhHHHHHHHHhhhc---ceEEeecchhhhhhhhcCCHHHHHHHHhcccHH
Confidence 357888889998865 5889999999999999999999986 888899999999999998888889999999999
Q ss_pred HHHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 81 VLREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 81 ~L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
-|..|+.+-..---+-++.+|++|-.++
T Consensus 576 mLrnLIhSKhkMIa~GSaaALrNLln~R 603 (2195)
T KOG2122|consen 576 MLRNLIHSKHKMIAMGSAAALRNLLNFR 603 (2195)
T ss_pred HHHHHHhhhhhhhhhhHHHHHHHHhcCC
Confidence 9999999875444444555666665555
No 17
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=97.30 E-value=0.00075 Score=39.86 Aligned_cols=40 Identities=23% Similarity=0.146 Sum_probs=35.8
Q ss_pred hHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 67 LRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 67 ~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
+..+..+.+.|++|+|+.|+.++.+..++.|..+|..|.+
T Consensus 2 ~~~~~~i~~~g~i~~Lv~ll~~~~~~v~~~a~~al~nl~~ 41 (41)
T PF00514_consen 2 PENKQAIVEAGGIPPLVQLLKSPDPEVQEEAAWALGNLAA 41 (41)
T ss_dssp HHHHHHHHHTTHHHHHHHHTTSSSHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHcccHHHHHHHHcCCCHHHHHHHHHHHHHHhC
Confidence 4567778899999999999999999999999999998863
No 18
>PF09759 Atx10homo_assoc: Spinocerebellar ataxia type 10 protein domain; InterPro: IPR019156 This is the conserved C-terminal 100 residues of Ataxin-10. Ataxin-10 belongs to the family of armadillo repeat proteins and in solution it tends to form homotrimeric complexes, which associate via a tip-to-tip association in a horseshoe-shaped contact with the concave sides of the molecules facing each other. This domain may represent the homo-association site since that is located near the C terminus of Ataxin-10. The protein does not contain a signal sequence for secretion or any subcellular compartment confirming its cytoplasmic localisation, specifically to the olivocerebellar region [].
Probab=97.27 E-value=0.003 Score=45.47 Aligned_cols=69 Identities=14% Similarity=0.233 Sum_probs=57.1
Q ss_pred hHHHHHHHHHHhc-CChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHc
Q 032259 7 STSRVLLILCNLA-ASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEA 76 (144)
Q Consensus 7 ~~E~AlavL~~La-~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~ 76 (144)
+.-..+.+|+||+ .+++....+.+.|+||.++..-.-. ..+|--||+|+-++-+||.++++.+..+...
T Consensus 2 ~K~~lvrlianl~~~~~~~Qd~vr~~~Gi~liL~~c~iD-~~nP~irEwai~aiRnL~e~n~eNQ~~I~~L 71 (102)
T PF09759_consen 2 FKRDLVRLIANLCYKNKEVQDLVRELGGIPLILSCCNID-DHNPFIREWAIFAIRNLCEGNPENQEFIAQL 71 (102)
T ss_pred cHHHHHHHHHHHHhCCHHHHHHHHHcCChHHHHHhcCCC-cccHHHHHHHHHHHHHHHhCCHHHHHHHHhc
Confidence 4457789999999 5999999999999999998865322 1379999999999999999888777776543
No 19
>KOG1048 consensus Neural adherens junction protein Plakophilin and related Armadillo repeat proteins [Signal transduction mechanisms; Extracellular structures]
Probab=97.18 E-value=0.0031 Score=58.49 Aligned_cols=99 Identities=21% Similarity=0.208 Sum_probs=86.3
Q ss_pred chHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh--HHHHHHHHcCcHHHH
Q 032259 6 ESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNL--RFKGLAKEARAAEVL 82 (144)
Q Consensus 6 ~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~--~~~~~a~~~Gai~~L 82 (144)
...-.|.+-|..|+- +.+.|..+-+-|+||.||.+|+.- +++-+.+|+++|-+|..++. ...-.+.+.+.||.|
T Consensus 248 ~~qsnaaaylQHlcfgd~~ik~~vrqlggI~kLv~Ll~~~---~~evq~~acgaLRNLvf~~~~~~NKlai~~~~Gv~~l 324 (717)
T KOG1048|consen 248 SVQSNAAAYLQHLCFGDNKIKSRVRQLGGIPKLVALLDHR---NDEVQRQACGALRNLVFGKSTDSNKLAIKELNGVPTL 324 (717)
T ss_pred hhhHHHHHHHHHHHhhhHHHHHHHHHhccHHHHHHHhcCC---cHHHHHHHHHHHHhhhcccCCcccchhhhhcCChHHH
Confidence 456788899999994 888899888899999999999985 78999999999999998443 366777899999999
Q ss_pred HHHHHh-CCHHHHHHHHHHHHHhhcC
Q 032259 83 REVEER-GSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 83 ~~Ll~~-Gt~raK~kA~~lL~~L~~~ 107 (144)
+.+++. +....|+....+|++|+..
T Consensus 325 ~~~Lr~t~D~ev~e~iTg~LWNLSS~ 350 (717)
T KOG1048|consen 325 VRLLRHTQDDEVRELITGILWNLSSN 350 (717)
T ss_pred HHHHHhhcchHHHHHHHHHHhcccch
Confidence 999975 8999999999999999885
No 20
>KOG4199 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.17 E-value=0.0071 Score=52.48 Aligned_cols=101 Identities=19% Similarity=0.206 Sum_probs=84.8
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
+.-.++.+|..|+.+++.+..|++.|+.+.++.++..- ..+|.-.+.+.+++.-||...+..-..+.+.|+-..-++-+
T Consensus 303 l~k~~lslLralAG~DsvKs~IV~~gg~~~ii~l~~~h-~~~p~Vi~~~~a~i~~l~LR~pdhsa~~ie~G~a~~avqAm 381 (461)
T KOG4199|consen 303 LAKTCLSLLRALAGSDSVKSTIVEKGGLDKIITLALRH-SDDPLVIQEVMAIISILCLRSPDHSAKAIEAGAADLAVQAM 381 (461)
T ss_pred HHHHHHHHHHHHhCCCchHHHHHHhcChHHHHHHHHHc-CCChHHHHHHHHHHHHHHhcCcchHHHHHhcchHHHHHHHH
Confidence 45678999999999999999999999999999998664 25788899999999999997776677788888887777777
Q ss_pred HhCC--HHHHHHHHHHHHHhhcCC
Q 032259 87 ERGS--QRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 87 ~~Gt--~raK~kA~~lL~~L~~~~ 108 (144)
+.-. ...+++|..+++++-..+
T Consensus 382 kahP~~a~vQrnac~~IRNiv~rs 405 (461)
T KOG4199|consen 382 KAHPVAAQVQRNACNMIRNIVVRS 405 (461)
T ss_pred HhCcHHHHHHHHHHHHHHHHHHhh
Confidence 6653 455799999999987654
No 21
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=97.12 E-value=0.0065 Score=49.91 Aligned_cols=100 Identities=16% Similarity=0.144 Sum_probs=80.1
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
.+...++.+|.+|+..++.+..+. ..||.++++|..| +...|.++.-+|++|+..... ......+.+.+.|+.|
T Consensus 110 ~~Q~agLrlL~nLtv~~~~~~~l~--~~i~~ll~LL~~G---~~~~k~~vLk~L~nLS~np~~-~~~Ll~~q~~~~~~~L 183 (254)
T PF04826_consen 110 EVQLAGLRLLTNLTVTNDYHHMLA--NYIPDLLSLLSSG---SEKTKVQVLKVLVNLSENPDM-TRELLSAQVLSSFLSL 183 (254)
T ss_pred HHHHHHHHHHHccCCCcchhhhHH--hhHHHHHHHHHcC---ChHHHHHHHHHHHHhccCHHH-HHHHHhccchhHHHHH
Confidence 455678999999999888877775 4799999999999 899999999999999976544 4444566789999999
Q ss_pred HHhC-CHHHHHHHHHHHHHhhcCCCCC
Q 032259 86 EERG-SQRAKEKAKRILGMLKGREDDD 111 (144)
Q Consensus 86 l~~G-t~raK~kA~~lL~~L~~~~~~~ 111 (144)
.... +..--..+-.+...+......+
T Consensus 184 f~~~~~~~~l~~~l~~~~ni~~~~~~~ 210 (254)
T PF04826_consen 184 FNSSESKENLLRVLTFFENINENIKKE 210 (254)
T ss_pred HccCCccHHHHHHHHHHHHHHHhhCcc
Confidence 8776 4555788888888887766333
No 22
>KOG2160 consensus Armadillo/beta-catenin-like repeat-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=97.03 E-value=0.013 Score=50.25 Aligned_cols=103 Identities=18% Similarity=0.225 Sum_probs=87.2
Q ss_pred CcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 4 SGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
+..+.+.|+-||...+. +|...+.+.+.|+.+.|..++... .+...+-+|..++.+|-++++.-...-...+....|
T Consensus 137 ~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~~ls~~--~~~~~r~kaL~AissLIRn~~~g~~~fl~~~G~~~L 214 (342)
T KOG2160|consen 137 DAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLKILSSD--DPNTVRTKALFAISSLIRNNKPGQDEFLKLNGYQVL 214 (342)
T ss_pred cHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHHHHccC--CCchHHHHHHHHHHHHHhcCcHHHHHHHhcCCHHHH
Confidence 45788999999999996 899999999999999999999764 366778999999999988776555555666779999
Q ss_pred HHHHHhC--CHHHHHHHHHHHHHhhcCC
Q 032259 83 REVEERG--SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 83 ~~Ll~~G--t~raK~kA~~lL~~L~~~~ 108 (144)
..+++++ +.+.|+||..+|..|-...
T Consensus 215 ~~vl~~~~~~~~lkrK~~~Ll~~Ll~~~ 242 (342)
T KOG2160|consen 215 RDVLQSNNTSVKLKRKALFLLSLLLQED 242 (342)
T ss_pred HHHHHcCCcchHHHHHHHHHHHHHHHhh
Confidence 9999985 7999999999999887754
No 23
>KOG2122 consensus Beta-catenin-binding protein APC, contains ARM repeats [Signal transduction mechanisms; Cytoskeleton]
Probab=97.02 E-value=0.00056 Score=67.46 Aligned_cols=116 Identities=22% Similarity=0.276 Sum_probs=95.7
Q ss_pred CCcchHHHHHHHHHHhcC-ChhhHHHHHh-cCcHHHHHHHhh-cCCCCChHHHHHHHHHHHHHhc---cChHHHHHHHHc
Q 032259 3 KSGESTSRVLLILCNLAA-SNEGRSAILD-ANGVSILVGMLK-ESGSDSEATRENCVAALFALGH---GNLRFKGLAKEA 76 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~-~~egr~aI~~-~g~I~~LV~lL~-~G~~~S~~~kE~Ava~L~~Lc~---~~~~~~~~a~~~ 76 (144)
|+...+-..|..|.||+. |.+|+++|.+ .|++..||.+|. .|...+..-.|+|-.||-++++ ..+-+|++-.+.
T Consensus 450 ~kEsTLKavLSALWNLSAHcteNKA~iCaVDGALaFLVg~LSY~~qs~tLaIIEsaGGILRNVSS~IAt~E~yRQILR~~ 529 (2195)
T KOG2122|consen 450 KKESTLKAVLSALWNLSAHCTENKAEICAVDGALAFLVGTLSYEGQSNTLAIIESAGGILRNVSSLIATCEDYRQILRRH 529 (2195)
T ss_pred cccchHHHHHHHHhhhhhcccccchhhhcccchHHHHHhhccccCCcchhhhhhcCccHHHHHHhHhhccchHHHHHHHh
Confidence 345666778899999998 8999999977 689999999995 3432456679999999999987 445679999999
Q ss_pred CcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCCCCCCcchhhh
Q 032259 77 RAAEVLREVEERGSQRAKEKAKRILGMLKGREDDDEDVDWEG 118 (144)
Q Consensus 77 Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~~~~~~~~ 118 (144)
..+..|++.|.+.+--.--+|...|++|+.-..+|-..=|++
T Consensus 530 NCLq~LLQ~LKS~SLTiVSNaCGTLWNLSAR~p~DQq~LwD~ 571 (2195)
T KOG2122|consen 530 NCLQTLLQHLKSHSLTIVSNACGTLWNLSARSPEDQQMLWDD 571 (2195)
T ss_pred hHHHHHHHHhhhcceEEeecchhhhhhhhcCCHHHHHHHHhc
Confidence 999999999999988888899999999999876664444543
No 24
>KOG4646 consensus Uncharacterized conserved protein, contains ARM repeats [Function unknown]
Probab=96.98 E-value=0.0022 Score=49.33 Aligned_cols=98 Identities=17% Similarity=0.147 Sum_probs=73.9
Q ss_pred CcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 4 SGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
..-++|-+.+-|+|||..+.+.+-|.++++||.++..+.+- ....--.|+..|..||-++...+.....+-++....
T Consensus 71 ne~LvefgIgglCNlC~d~~n~~~I~ea~g~plii~~lssp---~e~tv~sa~~~l~~l~~~~Rt~r~ell~p~Vv~~v~ 147 (173)
T KOG4646|consen 71 NELLVEFGIGGLCNLCLDKTNAKFIREALGLPLIIFVLSSP---PEITVHSAALFLQLLEFGERTERDELLSPAVVRTVQ 147 (173)
T ss_pred cHHHHHHhHHHHHhhccChHHHHHHHHhcCCceEEeecCCC---hHHHHHHHHHHHHHhcCcccchhHHhccHHHHHHHH
Confidence 34578999999999999999999999999999999998773 334567778889999987666677777776666666
Q ss_pred HHHHhCCHHHHHHHHHHHHHh
Q 032259 84 EVEERGSQRAKEKAKRILGML 104 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L 104 (144)
.-..+-+-.-+--|...|..+
T Consensus 148 r~~~s~s~~~rnLa~~fl~~~ 168 (173)
T KOG4646|consen 148 RWRESKSHDERNLASAFLDKH 168 (173)
T ss_pred HHHHHhhHHHHHHHHHHHHhh
Confidence 655454555555555555443
No 25
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=96.97 E-value=0.0086 Score=49.74 Aligned_cols=101 Identities=19% Similarity=0.183 Sum_probs=78.3
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhh-----cCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLK-----ESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAE 80 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~-----~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~ 80 (144)
+..+-|+..|.+|..+++.|..+.+.++++.|+++|+ .+. ....-+=+++-++|.|+...+. .......++||
T Consensus 165 ~~~~~av~~L~~LL~~~~~R~~f~~~~~v~~l~~iL~~~~~~~~~-~~~Ql~Y~~ll~lWlLSF~~~~-~~~~~~~~~i~ 242 (312)
T PF03224_consen 165 ELQYIAVQCLQNLLRSKEYRQVFWKSNGVSPLFDILRKQATNSNS-SGIQLQYQALLCLWLLSFEPEI-AEELNKKYLIP 242 (312)
T ss_dssp ---HHHHHHHHHHHTSHHHHHHHHTHHHHHHHHHHHH----------HHHHHHHHHHHHHHHTTSHHH-HHHHHTTSHHH
T ss_pred chHHHHHHHHHHHhCcchhHHHHHhcCcHHHHHHHHHhhcccCCC-CchhHHHHHHHHHHHHhcCHHH-HHHHhccchHH
Confidence 4457889999999999999999999999999999992 221 2355688999999999976544 44456777999
Q ss_pred HHHHHHHhC-CHHHHHHHHHHHHHhhcCC
Q 032259 81 VLREVEERG-SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 81 ~L~~Ll~~G-t~raK~kA~~lL~~L~~~~ 108 (144)
.|+++++.- -++.-|-+-.+|++|-...
T Consensus 243 ~L~~i~~~~~KEKvvRv~la~l~Nl~~~~ 271 (312)
T PF03224_consen 243 LLADILKDSIKEKVVRVSLAILRNLLSKA 271 (312)
T ss_dssp HHHHHHHH--SHHHHHHHHHHHHHTTSSS
T ss_pred HHHHHHHhcccchHHHHHHHHHHHHHhcc
Confidence 999999876 5777788888888887764
No 26
>PF05536 Neurochondrin: Neurochondrin
Probab=96.94 E-value=0.0066 Score=54.76 Aligned_cols=99 Identities=23% Similarity=0.204 Sum_probs=78.7
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCCh-HHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSE-ATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~-~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
..+..-|+.||..++..|+-...=--.+-||.|+++|..+ +. ...+.|..+|..++...+..+.+ .+.|+++.|.
T Consensus 71 ~~~~~LavsvL~~f~~~~~~a~~~~~~~~IP~Lle~l~~~---s~~~~v~dalqcL~~Ias~~~G~~aL-l~~g~v~~L~ 146 (543)
T PF05536_consen 71 EEYLSLAVSVLAAFCRDPELASSPQMVSRIPLLLEILSSS---SDLETVDDALQCLLAIASSPEGAKAL-LESGAVPALC 146 (543)
T ss_pred HHHHHHHHHHHHHHcCChhhhcCHHHHHHHHHHHHHHHcC---CchhHHHHHHHHHHHHHcCcHhHHHH-HhcCCHHHHH
Confidence 3467889999999999776653322236799999999886 55 88999999999999765554444 5789999999
Q ss_pred HHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 84 EVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+...+ .+..++.|..+|..+-...
T Consensus 147 ei~~~-~~~~~E~Al~lL~~Lls~~ 170 (543)
T PF05536_consen 147 EIIPN-QSFQMEIALNLLLNLLSRL 170 (543)
T ss_pred HHHHh-CcchHHHHHHHHHHHHHhc
Confidence 99988 7778999999998876643
No 27
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=96.91 E-value=0.014 Score=51.91 Aligned_cols=103 Identities=17% Similarity=0.150 Sum_probs=91.4
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
.+..+.+.|..+|.+|+.++.+.+.+...+.++.|..++... ++..|=.+..++..++..++.........|+++.|
T Consensus 131 ~d~~Va~~A~~~L~~l~~~~~~~~~l~~~~~~~~L~~l~~~~---~~~vR~Rv~el~v~i~~~S~~~~~~~~~sgll~~l 207 (503)
T PF10508_consen 131 PDLSVAKAAIKALKKLASHPEGLEQLFDSNLLSKLKSLMSQS---SDIVRCRVYELLVEIASHSPEAAEAVVNSGLLDLL 207 (503)
T ss_pred CcHHHHHHHHHHHHHHhCCchhHHHHhCcchHHHHHHHHhcc---CHHHHHHHHHHHHHHHhcCHHHHHHHHhccHHHHH
Confidence 355778999999999999999999998888899999999874 56777788899999999888888888899999999
Q ss_pred HHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 83 REVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 83 ~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+..+.+.....|-.|..+|..|....
T Consensus 208 l~eL~~dDiLvqlnalell~~La~~~ 233 (503)
T PF10508_consen 208 LKELDSDDILVQLNALELLSELAETP 233 (503)
T ss_pred HHHhcCccHHHHHHHHHHHHHHHcCh
Confidence 99999988889999999999998843
No 28
>KOG2160 consensus Armadillo/beta-catenin-like repeat-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=96.69 E-value=0.028 Score=48.24 Aligned_cols=97 Identities=18% Similarity=0.152 Sum_probs=84.7
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEE 87 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~ 87 (144)
.+.|+.-|..++-+=++..-|...|+.++|+.+|.++ .+.-|+.|+-+|-...++++..+..+.+.|+.+.|+.++.
T Consensus 100 ke~ald~Le~lve~iDnAndl~~~ggl~~ll~~l~~~---~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~~ls 176 (342)
T KOG2160|consen 100 KEDALDNLEELVEDIDNANDLISLGGLVPLLGYLENS---DAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLKILS 176 (342)
T ss_pred HHHHHHHHHHHHHhhhhHHhHhhccCHHHHHHHhcCC---cHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHHHHc
Confidence 5778888888888888888899999999999999886 6888999999999999999999999999999999999986
Q ss_pred h-CCHHHHHHHHHHHHHhhcC
Q 032259 88 R-GSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 88 ~-Gt~raK~kA~~lL~~L~~~ 107 (144)
. ++..+|.||--++-.|=+.
T Consensus 177 ~~~~~~~r~kaL~AissLIRn 197 (342)
T KOG2160|consen 177 SDDPNTVRTKALFAISSLIRN 197 (342)
T ss_pred cCCCchHHHHHHHHHHHHHhc
Confidence 4 5788889998888777665
No 29
>KOG1048 consensus Neural adherens junction protein Plakophilin and related Armadillo repeat proteins [Signal transduction mechanisms; Extracellular structures]
Probab=96.53 E-value=0.026 Score=52.56 Aligned_cols=104 Identities=20% Similarity=0.125 Sum_probs=84.6
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCC---CChHHHHHHHHHHHHHhccChHHHHHHHHcCcH
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGS---DSEATRENCVAALFALGHGNLRFKGLAKEARAA 79 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~---~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai 79 (144)
.+..++..+..+|.||+.++.++..|. -.+||-||+.|..+.. -+...--.++.+|+++...+.....-..+.+.+
T Consensus 578 ~~~~vv~s~a~~LrNls~d~rnk~lig-k~a~~~lv~~Lp~~~~~~~~sedtv~~vc~tl~niv~~~~~nAkdl~~~~g~ 656 (717)
T KOG1048|consen 578 DDSDVVRSAAGALRNLSRDIRNKELIG-KYAIPDLVRCLPGSGPSTSLSEDTVRAVCHTLNNIVRKNVLNAKDLLEIKGI 656 (717)
T ss_pred CCchHHHHHHHHHhhhccCchhhhhhh-cchHHHHHHhCcCCCCCcCchHHHHHHHHHhHHHHHHHhHHHHHHHHhccCh
Confidence 355788999999999999999999999 5799999999987631 124556677789999998777667777899999
Q ss_pred HHHHHHHHhC-CHHHHHHHHHHHHHhhcC
Q 032259 80 EVLREVEERG-SQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 80 ~~L~~Ll~~G-t~raK~kA~~lL~~L~~~ 107 (144)
+.|+.|...- +++.-+-|..+|..|-..
T Consensus 657 ~kL~~I~~s~~S~k~~kaAs~vL~~lW~y 685 (717)
T KOG1048|consen 657 PKLRLISKSQHSPKEFKAASSVLDVLWQY 685 (717)
T ss_pred HHHHHHhcccCCHHHHHHHHHHHHHHHHH
Confidence 9999998764 788888888888776554
No 30
>PF12031 DUF3518: Domain of unknown function (DUF3518); InterPro: IPR021906 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 260 amino acids in length. This domain is found associated with PF01388 from PFAM.
Probab=96.52 E-value=0.011 Score=48.73 Aligned_cols=85 Identities=24% Similarity=0.271 Sum_probs=67.5
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCc-------HHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh-HHHHHHHHcCcH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANG-------VSILVGMLKESGSDSEATRENCVAALFALGHGNL-RFKGLAKEARAA 79 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~-------I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~-~~~~~a~~~Gai 79 (144)
..-|+.+|++|+..+.|-.-|...+. +..|+++|..- ..+-.||-|+.+|.+||..++ ..+.++.+.+.|
T Consensus 141 qrlaLEaLcKLsV~e~NVDliLaTpp~sRlE~l~~~L~r~l~~~--e~~v~REfAvvlL~~La~~~~~~~r~iA~q~~~i 218 (257)
T PF12031_consen 141 QRLALEALCKLSVIENNVDLILATPPFSRLERLFHTLVRLLGMR--EDQVCREFAVVLLSNLAQGDEAAARAIAMQKPCI 218 (257)
T ss_pred HHHHHHHHHHhheeccCcceeeeCCCHHHHHHHHHHHHHHhccc--cchhHHHHHHHHHHHHhcccHHHHHHHHHhhchH
Confidence 45789999999999999888877553 34566666432 268889999999999999765 566889999999
Q ss_pred HHHHHHHHhCCHHHH
Q 032259 80 EVLREVEERGSQRAK 94 (144)
Q Consensus 80 ~~L~~Ll~~Gt~raK 94 (144)
..|+..++++..-+.
T Consensus 219 ~~Li~FiE~a~~~~~ 233 (257)
T PF12031_consen 219 SHLIAFIEDAEQNAH 233 (257)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999999998744433
No 31
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=96.37 E-value=0.013 Score=33.41 Aligned_cols=37 Identities=19% Similarity=0.128 Sum_probs=33.4
Q ss_pred HHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhh
Q 032259 69 FKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 69 ~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
.+..+.+.|++++|+.|+..+.+..++.|..+|+.|.
T Consensus 4 ~~~~i~~~g~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 4 QKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred HHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 5667779999999999999989999999999999875
No 32
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=96.19 E-value=0.027 Score=49.22 Aligned_cols=116 Identities=22% Similarity=0.282 Sum_probs=88.8
Q ss_pred CcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccC-h--HHHHHHHHcCcH
Q 032259 4 SGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGN-L--RFKGLAKEARAA 79 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~-~--~~~~~a~~~Gai 79 (144)
+..+.|.|+=.|.|+|. .+..|.-+.+.|++.+|+.+|.+.. ..-.--.||.=+|-+||++. + .+..+ .-++
T Consensus 170 ~~~V~eQavWALGNiAGDS~~~RD~vL~~galeplL~ll~ss~-~~ismlRn~TWtLSNlcRGknP~P~w~~i---sqal 245 (526)
T COG5064 170 EDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSA-IHISMLRNATWTLSNLCRGKNPPPDWSNI---SQAL 245 (526)
T ss_pred hHHHHHHHHHHhccccCCchhHHHHHHhcCchHHHHHHHHhcc-chHHHHHHhHHHHHHhhCCCCCCCchHHH---HHHH
Confidence 34678999999999998 6667888899999999999998741 11356889999999999943 2 33333 2368
Q ss_pred HHHHHHHHhCCHHHHHHHHHHHHHhhcCCCCCCcchhhhhcccCCCcc
Q 032259 80 EVLREVEERGSQRAKEKAKRILGMLKGREDDDEDVDWEGVLDSGGPTR 127 (144)
Q Consensus 80 ~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~~~~~~~~~~~~~~~~~ 127 (144)
|.|..|.-.-.+..--.|..++..|+.-.+ ++++ -|+|.|-|+|
T Consensus 246 piL~KLiys~D~evlvDA~WAiSYlsDg~~--E~i~--avld~g~~~R 289 (526)
T COG5064 246 PILAKLIYSRDPEVLVDACWAISYLSDGPN--EKIQ--AVLDVGIPGR 289 (526)
T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHhccCcH--HHHH--HHHhcCCcHH
Confidence 999999988888898999999999887542 3433 4677777766
No 33
>KOG3036 consensus Protein involved in cell differentiation/sexual development [General function prediction only]
Probab=96.16 E-value=0.093 Score=43.79 Aligned_cols=99 Identities=19% Similarity=0.150 Sum_probs=76.3
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCCh-H-HHHHHHHHHHHHhccC-hHHHHHHHHcCcHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSE-A-TRENCVAALFALGHGN-LRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~-~-~kE~Ava~L~~Lc~~~-~~~~~~a~~~Gai~~L~~ 84 (144)
+=.|+++|-.+|+||+-|..+.++-.=--|-..|..+.+..+ + -|=.+.+++-.|...+ .++-.--...++||.++.
T Consensus 96 VcnaL~LlQcvASHpdTr~~FL~A~iPlylYpfL~Tt~~~r~fEyLRLtsLGVIgaLvk~dd~eVi~fLl~TeIVPlCLr 175 (293)
T KOG3036|consen 96 VCNALALLQCVASHPDTRRAFLRAHIPLYLYPFLNTTSKSRPFEYLRLTSLGVIGALVKNDDQEVIRFLLTTEIVPLCLR 175 (293)
T ss_pred HHHHHHHHHHHhcCcchHHHHHHccChhhhHHhhhccccCCchHHHhHHHHHHHHHHHhcCcHHHHHHHHHhhhHHHHHH
Confidence 457999999999999999999987533345556655421222 2 3778999999999844 454444568899999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhhc
Q 032259 85 VEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
..+.|+...|.-|.-+|+.+--
T Consensus 176 ime~GSelSKtvA~fIlqKIll 197 (293)
T KOG3036|consen 176 IMESGSELSKTVATFILQKILL 197 (293)
T ss_pred HHhcccHHHHHHHHHHHHHHhh
Confidence 9999999999999999987654
No 34
>KOG4199 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.15 E-value=0.097 Score=45.64 Aligned_cols=104 Identities=21% Similarity=0.265 Sum_probs=77.7
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHH---HHHHHHHHhccChHHHHHHHHcCcHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATREN---CVAALFALGHGNLRFKGLAKEARAAEV 81 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~---Ava~L~~Lc~~~~~~~~~a~~~Gai~~ 81 (144)
.+..-++...|..|+...|-...|.+.|++..|+++|.+.+ -...|+- +.+.|-.|+ +++.++..+.+.|..+.
T Consensus 256 p~~L~~l~~tl~~lAVr~E~C~~I~e~GGl~tl~~~i~d~n--~~~~r~l~k~~lslLralA-G~DsvKs~IV~~gg~~~ 332 (461)
T KOG4199|consen 256 PDSLVSLSTTLKALAVRDEICKSIAESGGLDTLLRCIDDSN--EQGNRTLAKTCLSLLRALA-GSDSVKSTIVEKGGLDK 332 (461)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHccCHHHHHHHHhhhc--hhhHHHHHHHHHHHHHHHh-CCCchHHHHHHhcChHH
Confidence 45667788899999999999999999999999999998731 3444544 445555555 55567888889999999
Q ss_pred HHHHHHh--CCHHHHHHHHHHHHHhhcCCCCC
Q 032259 82 LREVEER--GSQRAKEKAKRILGMLKGREDDD 111 (144)
Q Consensus 82 L~~Ll~~--Gt~raK~kA~~lL~~L~~~~~~~ 111 (144)
++.++.. .+|..-+.+..++..|.=..+|-
T Consensus 333 ii~l~~~h~~~p~Vi~~~~a~i~~l~LR~pdh 364 (461)
T KOG4199|consen 333 IITLALRHSDDPLVIQEVMAIISILCLRSPDH 364 (461)
T ss_pred HHHHHHHcCCChHHHHHHHHHHHHHHhcCcch
Confidence 9999753 46777777777777665544333
No 35
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=95.88 E-value=0.15 Score=45.15 Aligned_cols=102 Identities=16% Similarity=0.137 Sum_probs=81.0
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
.+.+.-|+..|..|...++.|..+.+.+++++|+++|+.-. ....-+=+++-++|.|+...+. .......+.|+.|+.
T Consensus 158 ~~~~~~~v~~L~~LL~~~~~R~~f~~~~~v~~L~~~L~~~~-~~~Ql~Y~~ll~lWlLSF~~~~-~~~~~~~~~i~~l~~ 235 (429)
T cd00256 158 NDYVQTAARCLQMLLRVDEYRFAFVLADGVPTLVKLLSNAT-LGFQLQYQSIFCIWLLTFNPHA-AEVLKRLSLIQDLSD 235 (429)
T ss_pred cchHHHHHHHHHHHhCCchHHHHHHHccCHHHHHHHHhhcc-ccHHHHHHHHHHHHHHhccHHH-HHhhccccHHHHHHH
Confidence 45567788899999999999999998889999999998741 1346688899999999986543 344467899999999
Q ss_pred HHHhC-CHHHHHHHHHHHHHhhcCC
Q 032259 85 VEERG-SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 85 Ll~~G-t~raK~kA~~lL~~L~~~~ 108 (144)
+++.- -++.-|-+-.+|++|-...
T Consensus 236 i~k~s~KEKvvRv~l~~l~Nll~~~ 260 (429)
T cd00256 236 ILKESTKEKVIRIVLAIFRNLISKR 260 (429)
T ss_pred HHHhhhhHHHHHHHHHHHHHHhhcc
Confidence 99876 4666677888888887754
No 36
>KOG4646 consensus Uncharacterized conserved protein, contains ARM repeats [Function unknown]
Probab=95.84 E-value=0.032 Score=43.00 Aligned_cols=99 Identities=19% Similarity=0.120 Sum_probs=84.8
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
.+..|..++=|+|+|=.|-+=.-+.+...+...|+.|..- ...-+|++++.|.++|.+. ...+.+..++.+|..+.
T Consensus 31 ~eakeqv~ANLANFAYDP~Nys~Lrql~vLdlFvdsl~e~---ne~LvefgIgglCNlC~d~-~n~~~I~ea~g~plii~ 106 (173)
T KOG4646|consen 31 IEAKEQVTANLANFAYDPINYSHLRQLDVLDLFVDSLEEQ---NELLVEFGIGGLCNLCLDK-TNAKFIREALGLPLIIF 106 (173)
T ss_pred HHHHHHHHHHHHhhccCcchHHHHHHhhHHHHHHHHhhcc---cHHHHHHhHHHHHhhccCh-HHHHHHHHhcCCceEEe
Confidence 3456788888999999999999999999999999999885 6788999999999999865 45677788999999888
Q ss_pred HHHhCCHHHHHHHHHHHHHhhcC
Q 032259 85 VEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
.+.+-...+-..|...|+.|.-.
T Consensus 107 ~lssp~e~tv~sa~~~l~~l~~~ 129 (173)
T KOG4646|consen 107 VLSSPPEITVHSAALFLQLLEFG 129 (173)
T ss_pred ecCCChHHHHHHHHHHHHHhcCc
Confidence 88888888888899888888764
No 37
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=94.92 E-value=0.07 Score=32.85 Aligned_cols=54 Identities=17% Similarity=0.076 Sum_probs=42.6
Q ss_pred hHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHH
Q 032259 49 EATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGM 103 (144)
Q Consensus 49 ~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~ 103 (144)
|+.|+.|+-+|-+++......... ....++|.|..++++.++..|..|...|..
T Consensus 1 p~vR~~A~~aLg~l~~~~~~~~~~-~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~ 54 (55)
T PF13513_consen 1 PRVRRAAAWALGRLAEGCPELLQP-YLPELLPALIPLLQDDDDSVRAAAAWALGN 54 (55)
T ss_dssp HHHHHHHHHHHHCTTTTTHHHHHH-HHHHHHHHHHHHTTSSSHHHHHHHHHHHHC
T ss_pred CHHHHHHHHHHhhHhcccHHHHHH-HHHHHHHHHHHHHcCCCHHHHHHHHHHHhc
Confidence 467899999999888755433333 467799999999999999999999988864
No 38
>KOG1789 consensus Endocytosis protein RME-8, contains DnaJ domain [Intracellular trafficking, secretion, and vesicular transport; Posttranslational modification, protein turnover, chaperones]
Probab=94.49 E-value=0.5 Score=46.74 Aligned_cols=100 Identities=17% Similarity=0.210 Sum_probs=81.6
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
+++.+.-.|+.++..+..+.++-..|+..+.+..|..+|-+ -|.-++-+.-+|+.|++.. .....+++.|+..-+
T Consensus 1784 ~~~~iq~LaL~Vi~~~Tan~~Cv~~~a~~~vL~~LL~lLHS----~PS~R~~vL~vLYAL~S~~-~i~keA~~hg~l~yi 1858 (2235)
T KOG1789|consen 1784 KHPKLQILALQVILLATANKECVTDLATCNVLTTLLTLLHS----QPSMRARVLDVLYALSSNG-QIGKEALEHGGLMYI 1858 (2235)
T ss_pred CCchHHHHHHHHHHHHhcccHHHHHHHhhhHHHHHHHHHhc----ChHHHHHHHHHHHHHhcCc-HHHHHHHhcCchhhh
Confidence 34557788999999999999999999999988999999987 4899999999999999764 455666677665555
Q ss_pred HHH-HHhCCHHHHHHHHHHHHHhhcC
Q 032259 83 REV-EERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 83 ~~L-l~~Gt~raK~kA~~lL~~L~~~ 107 (144)
..+ +.+-++..|..|+.+|.-|...
T Consensus 1859 l~~~c~~~~~QqRAqaAeLlaKl~Ad 1884 (2235)
T KOG1789|consen 1859 LSILCLTNSDQQRAQAAELLAKLQAD 1884 (2235)
T ss_pred hHHHhccCcHHHHHHHHHHHHHhhhc
Confidence 554 5566899999999999888764
No 39
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=94.34 E-value=0.11 Score=31.99 Aligned_cols=53 Identities=17% Similarity=0.103 Sum_probs=40.2
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFAL 62 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~L 62 (144)
+.+.|+..|.+++.+......-.....+|.|+.+|++. ++.-|++|+.+|-+|
T Consensus 3 vR~~A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~---~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 3 VRRAAAWALGRLAEGCPELLQPYLPELLPALIPLLQDD---DDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSS---SHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCC---CHHHHHHHHHHHhcC
Confidence 46788899999886544444444456899999999875 678899999988654
No 40
>PF04078 Rcd1: Cell differentiation family, Rcd1-like ; InterPro: IPR007216 Rcd1 (Required cell differentiation 1) -like proteins are found among a wide range of organisms []. Rcd1 was initially identified as an essential factor in nitrogen starvation-invoked differentiation in fission yeast. This results largely from a defect in nitrogen starvation-invoked induction of ste11+, a key transcriptional factor gene required for the onset of sexual development. It is one of the most conserved proteins in eukaryotes, and its mammalian homologue is expressed in a variety of differentiating tissues [, ]. The mammalian Rcd1 is a novel transcriptional cofactor and is critical for retinoic acid-induced differentiation of F9 mouse teratocarcinoma cells, at least in part, via forming complexes with retinoic acid receptor and activation transcription factor-2 (ATF-2) []. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes [, ].; PDB: 2FV2_B.
Probab=94.22 E-value=0.44 Score=39.67 Aligned_cols=98 Identities=19% Similarity=0.143 Sum_probs=72.1
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHH----HHHHHHHHHHHhc-cChHHHHHHHHcCcHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEAT----RENCVAALFALGH-GNLRFKGLAKEARAAEV 81 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~----kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~ 81 (144)
=+-.|+++|-.+|.|||-|..+.++.-.--|...|.... ..+. |=.+.+++..|.. ++.++-.--....++|.
T Consensus 66 RVcnaLaLlQ~vAshpetr~~Fl~a~iplyLyPfL~tt~--k~r~~E~LRLtsLGVIgaLvK~d~~evi~fLl~tEiipl 143 (262)
T PF04078_consen 66 RVCNALALLQCVASHPETRMPFLKAHIPLYLYPFLNTTS--KTRPFEYLRLTSLGVIGALVKTDDPEVISFLLQTEIIPL 143 (262)
T ss_dssp HHHHHHHHHHHHHH-TTTHHHHHHTTGGGGGHHHHH------SHHHHHHHHHHHHHHHHHHTT--HHHHHHHHCTTHHHH
T ss_pred HHHHHHHHHHHHHcChHHHHHHHHcCchhhehhhhhccc--cccccchhhHhHHHHHHHHHcCCcHHHHHHHHhhchHHH
Confidence 356899999999999999999999876556667776531 2222 4556777778877 44455566678899999
Q ss_pred HHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 82 LREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 82 L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
++..++.|+...|-.|.-+++.+=-
T Consensus 144 cLr~me~GselSKtvAtfIlqKIL~ 168 (262)
T PF04078_consen 144 CLRIMEFGSELSKTVATFILQKILL 168 (262)
T ss_dssp HHHHHHHS-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHhccHHHHHHHHHHHHHHHc
Confidence 9999999999999999999986644
No 41
>PF04063 DUF383: Domain of unknown function (DUF383); InterPro: IPR007205 This is a protein of unknown function. It is found N-terminal to another domain of unknown function (IPR007206 from INTERPRO).
Probab=94.19 E-value=0.82 Score=36.14 Aligned_cols=103 Identities=12% Similarity=0.071 Sum_probs=72.9
Q ss_pred CcchHHHHHHHHHHhcCChhhHHHHHhc--C--------------cHHHHHHHhhcCCC---CChHHHHHHHHHHHHHhc
Q 032259 4 SGESTSRVLLILCNLAASNEGRSAILDA--N--------------GVSILVGMLKESGS---DSEATRENCVAALFALGH 64 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~~~egr~aI~~~--g--------------~I~~LV~lL~~G~~---~S~~~kE~Ava~L~~Lc~ 64 (144)
+....+.+.++|+||+.++++...+.+. . .|..|++..-.|.. .....-++.+.+|.++++
T Consensus 8 ~~~~adl~~MLLsNlT~~~~~~~~ll~~~~~~~~~~~~~~~~~~~~l~~Ll~~F~~g~~~~~n~~~~~~yla~vl~NlS~ 87 (192)
T PF04063_consen 8 KSPLADLACMLLSNLTRSDSGCEKLLQLKRESSSQAPKEVSLSGFYLDKLLDLFVKGADPSYNKKDNYDYLASVLANLSQ 87 (192)
T ss_pred CcchHHHHHHHHHHhccchHHHHHHHhcccccccccccccchhHHHHHHHHHHHHcCCcccCCCCcchhHHHHHHHHhcC
Confidence 3457899999999999999999977542 2 57788888877421 134468999999999998
Q ss_pred cChHHHHHHH-HcCc--HHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 65 GNLRFKGLAK-EARA--AEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 65 ~~~~~~~~a~-~~Ga--i~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
..+..+-... +.+. +..|+-..+.-+..-|+=++++||++.=
T Consensus 88 ~~~gR~~~l~~~~~~~~l~kLl~ft~~~s~iRR~Gva~~IrNccF 132 (192)
T PF04063_consen 88 LPEGRQFFLDPQRYDGPLQKLLPFTEHKSVIRRGGVAGTIRNCCF 132 (192)
T ss_pred CHHHHHHHhCchhhhhHHHHHHHHhccCcHHHHHHHHHHHHHhhc
Confidence 5543233322 3344 6677776666677777888888887653
No 42
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=93.84 E-value=0.14 Score=33.65 Aligned_cols=59 Identities=24% Similarity=0.343 Sum_probs=45.1
Q ss_pred HHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhh
Q 032259 34 VSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 34 I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
||.|++.|.+. .++..|.+|+.+|..+. .+.++|.|..++.+..+..|..|...|..+.
T Consensus 1 i~~L~~~l~~~--~~~~vr~~a~~~L~~~~-----------~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i~ 59 (88)
T PF13646_consen 1 IPALLQLLQND--PDPQVRAEAARALGELG-----------DPEAIPALIELLKDEDPMVRRAAARALGRIG 59 (88)
T ss_dssp HHHHHHHHHTS--SSHHHHHHHHHHHHCCT-----------HHHHHHHHHHHHTSSSHHHHHHHHHHHHCCH
T ss_pred CHHHHHHHhcC--CCHHHHHHHHHHHHHcC-----------CHhHHHHHHHHHcCCCHHHHHHHHHHHHHhC
Confidence 68899999433 27888988887777332 1135899999998889999999988888764
No 43
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=93.71 E-value=0.72 Score=41.10 Aligned_cols=99 Identities=14% Similarity=0.029 Sum_probs=79.7
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHH
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~ 84 (144)
.+.++.+..+|..+-........ .....+.|.+.|... ++.-|+.|+..|.++...++....+..+.++++.++.
T Consensus 52 ~e~v~~~~~iL~~~l~~~~~~~l--~~~~~~~L~~gL~h~---~~~Vr~l~l~~l~~~~~~~~~~~~~~~~~~l~~~i~~ 126 (503)
T PF10508_consen 52 REQVELICDILKRLLSALSPDSL--LPQYQPFLQRGLTHP---SPKVRRLALKQLGRIARHSEGAAQLLVDNELLPLIIQ 126 (503)
T ss_pred hHHHHHHHHHHHHHHhccCHHHH--HHHHHHHHHHHhcCC---CHHHHHHHHHHHHHHhcCCHHHHHHhcCccHHHHHHH
Confidence 34567777778777765544444 334567777777764 7999999999999998877777888889999999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 85 VEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
.+.+......+.|..+|+.|....
T Consensus 127 ~L~~~d~~Va~~A~~~L~~l~~~~ 150 (503)
T PF10508_consen 127 CLRDPDLSVAKAAIKALKKLASHP 150 (503)
T ss_pred HHcCCcHHHHHHHHHHHHHHhCCc
Confidence 999999999999999999998865
No 44
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=92.88 E-value=0.31 Score=31.98 Aligned_cols=56 Identities=21% Similarity=0.341 Sum_probs=42.9
Q ss_pred CcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhC-CHHHHHHHHHHH
Q 032259 32 NGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERG-SQRAKEKAKRIL 101 (144)
Q Consensus 32 g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~G-t~raK~kA~~lL 101 (144)
..+|.|++++.+. ++.-|..|+.+|-.+- ...+++.|..++.+. ....|+.|...|
T Consensus 31 ~~~~~L~~~l~d~---~~~vr~~a~~aL~~i~-----------~~~~~~~L~~~l~~~~~~~vr~~a~~aL 87 (88)
T PF13646_consen 31 EAIPALIELLKDE---DPMVRRAAARALGRIG-----------DPEAIPALIKLLQDDDDEVVREAAAEAL 87 (88)
T ss_dssp HHHHHHHHHHTSS---SHHHHHHHHHHHHCCH-----------HHHTHHHHHHHHTC-SSHHHHHHHHHHH
T ss_pred hHHHHHHHHHcCC---CHHHHHHHHHHHHHhC-----------CHHHHHHHHHHHcCCCcHHHHHHHHhhc
Confidence 3689999999764 7899999999888762 344899999999886 455577777665
No 45
>KOG0168 consensus Putative ubiquitin fusion degradation protein [Posttranslational modification, protein turnover, chaperones]
Probab=92.65 E-value=0.91 Score=43.59 Aligned_cols=101 Identities=14% Similarity=0.024 Sum_probs=82.8
Q ss_pred CCcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHH
Q 032259 3 KSGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEV 81 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~ 81 (144)
-.-++.-.|...|.+|+. +|..-+.+++.++||.|++.|..= .-----|.+..+|-.+++..+ ..+.++|++.+
T Consensus 224 ~n~DIMl~AcRaltyl~evlP~S~a~vV~~~aIPvl~~kL~~I--eyiDvAEQ~LqALE~iSR~H~---~AiL~AG~l~a 298 (1051)
T KOG0168|consen 224 HNFDIMLLACRALTYLCEVLPRSSAIVVDEHAIPVLLEKLLTI--EYIDVAEQSLQALEKISRRHP---KAILQAGALSA 298 (1051)
T ss_pred ccHHHHHHHHHHHHHHHhhccchhheeecccchHHHHHhhhhh--hhhHHHHHHHHHHHHHHhhcc---HHHHhcccHHH
Confidence 344567789999999995 999999999999999999988762 134558899999999998554 35679999999
Q ss_pred HHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 82 LREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 82 L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
.+..+.==+-.++|+|-++..++.+.-
T Consensus 299 ~LsylDFFSi~aQR~AlaiaaN~Cksi 325 (1051)
T KOG0168|consen 299 VLSYLDFFSIHAQRVALAIAANCCKSI 325 (1051)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 999887778899999999888877755
No 46
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=92.56 E-value=0.89 Score=37.69 Aligned_cols=77 Identities=19% Similarity=0.094 Sum_probs=59.1
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh-HHHHHHHHcCcHHHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNL-RFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~-~~~~~a~~~Gai~~L~~Ll 86 (144)
.=.++-.|..|+-+++....+.+.+.||.|+++++.- ...+..--++++|.+|+.... .....+...|+.+.|..|.
T Consensus 215 ~Y~~ll~lWlLSF~~~~~~~~~~~~~i~~L~~i~~~~--~KEKvvRv~la~l~Nl~~~~~~~~~~~mv~~~~l~~l~~L~ 292 (312)
T PF03224_consen 215 QYQALLCLWLLSFEPEIAEELNKKYLIPLLADILKDS--IKEKVVRVSLAILRNLLSKAPKSNIELMVLCGLLKTLQNLS 292 (312)
T ss_dssp HHHHHHHHHHHTTSHHHHHHHHTTSHHHHHHHHHHH----SHHHHHHHHHHHHHTTSSSSTTHHHHHHHH-HHHHHHHHH
T ss_pred HHHHHHHHHHHhcCHHHHHHHhccchHHHHHHHHHhc--ccchHHHHHHHHHHHHHhccHHHHHHHHHHccHHHHHHHHh
Confidence 3467888999999999999999988999999999984 356778889999999998443 3556666666666555554
No 47
>COG5209 RCD1 Uncharacterized protein involved in cell differentiation/sexual development [General function prediction only]
Probab=91.83 E-value=0.59 Score=38.92 Aligned_cols=99 Identities=23% Similarity=0.188 Sum_probs=73.6
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCCh--HHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSE--ATRENCVAALFALGH-GNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~--~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~ 84 (144)
+=.|+.+|-.|++||+-|..+.++..---|-..|....+.++ --|-.+.+++.+|-. ++..+-.--....+||.++.
T Consensus 117 vcnaL~lLQclaShPetk~~Fl~AhiplflypfLntss~~~~fEyLRltsLGVIgaLvkNdsq~vi~fLltTeivPLcLr 196 (315)
T COG5209 117 VCNALNLLQCLASHPETKKVFLDAHIPLFLYPFLNTSSSNSKFEYLRLTSLGVIGALVKNDSQYVIKFLLTTEIVPLCLR 196 (315)
T ss_pred HHHHHHHHHHHhcCcchheeeeecccceeeHhhhhccccCCccceeeehHHHHHHHHHhCCCHHHHHHHHhhhHHHHHHH
Confidence 457899999999999999999886432233444433210122 237789999999998 44555555677889999999
Q ss_pred HHHhCCHHHHHHHHHHHHHhhc
Q 032259 85 VEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
+...|++..|--|.-+++.+-.
T Consensus 197 Ime~gSElSktvaifI~qkil~ 218 (315)
T COG5209 197 IMELGSELSKTVAIFIFQKILG 218 (315)
T ss_pred HHHhhhHHHHHHHHHHHHHHhc
Confidence 9999999999999888887655
No 48
>KOG4500 consensus Rho/Rac GTPase guanine nucleotide exchange factor smgGDS/Vimar [Signal transduction mechanisms]
Probab=91.79 E-value=0.95 Score=40.78 Aligned_cols=86 Identities=21% Similarity=0.234 Sum_probs=65.1
Q ss_pred CcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCC-CChHHHHH---HHHHHHHHhccChHHHHHHHHcCc
Q 032259 4 SGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGS-DSEATREN---CVAALFALGHGNLRFKGLAKEARA 78 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~-~S~~~kE~---Ava~L~~Lc~~~~~~~~~a~~~Ga 78 (144)
+..+-+.....|+|+|- +.|+|++|.+.||-..++++|+.-.+ ..|.+-|. +.++|.+-..+++.++.-+.+.|+
T Consensus 100 d~ev~~Q~~RaLgNiCydn~E~R~a~~~lgGaqivid~L~~~cs~d~~ane~~~~v~~g~l~Ny~l~~~~l~aq~~~~gV 179 (604)
T KOG4500|consen 100 DTEVHEQCFRALGNICYDNNENRAAFFNLGGAQIVIDVLKPYCSKDNPANEEYSAVAFGVLHNYILDSRELRAQVADAGV 179 (604)
T ss_pred cccHHHHHHHHHhhhhccCchhHHHHHhcCCceehHhhhccccccCCccHHHHHHHHHHHHHHhhCCcHHHHHHHHhccc
Confidence 34677889999999995 89999999999998888888876310 12333333 345666666688888999999999
Q ss_pred HHHHHHHHHhC
Q 032259 79 AEVLREVEERG 89 (144)
Q Consensus 79 i~~L~~Ll~~G 89 (144)
++.|.-+..=|
T Consensus 180 l~tL~~~~~I~ 190 (604)
T KOG4500|consen 180 LNTLAITYWID 190 (604)
T ss_pred HHHHHHHhhcc
Confidence 99988877544
No 49
>PF08045 CDC14: Cell division control protein 14, SIN component; InterPro: IPR012535 Cdc14 is a component of the septation initiation network (SIN) and is required for the localisation and activity of Sid1. Sid1 is a protein kinase that localises asymmetrically to one spindle pole body (SPB) in anaphase disappears prior to cell separation [], [].
Probab=91.56 E-value=4 Score=33.87 Aligned_cols=97 Identities=13% Similarity=0.148 Sum_probs=76.1
Q ss_pred HHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 8 TSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 8 ~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
+-.|+.+|.-++- ||..|..+.+...+..|+++|... .++..+-.++-+|..+..+++.....=.+.+.+..+..++
T Consensus 108 i~~aL~vLQGl~LLHp~Sr~lF~r~~~m~lll~LL~~~--~~~~i~~a~L~tLv~iLld~p~N~r~FE~~~Gl~~v~~ll 185 (257)
T PF08045_consen 108 IALALRVLQGLCLLHPPSRKLFHREQNMELLLDLLSPS--NPPAIQSACLDTLVCILLDSPENQRDFEELNGLSTVCSLL 185 (257)
T ss_pred HHHHHHHHHHHHHcCchHHHHHhhhhhHHHHHHHhccC--CCchHHHHHHHHHHHHHHcChHHHHHHHHhCCHHHHHHHH
Confidence 4557888988884 999999999999999999999653 2577788888889888888876667778899999999999
Q ss_pred HhC--CHHHHHHHHHHHHHhhc
Q 032259 87 ERG--SQRAKEKAKRILGMLKG 106 (144)
Q Consensus 87 ~~G--t~raK~kA~~lL~~L~~ 106 (144)
.+- +...|=|...-|...--
T Consensus 186 k~~~~~~~~r~K~~EFL~fyl~ 207 (257)
T PF08045_consen 186 KSKSTDRELRLKCIEFLYFYLM 207 (257)
T ss_pred ccccccHHHhHHHHHHHHHHHc
Confidence 765 45556666655554443
No 50
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=91.54 E-value=1.4 Score=31.07 Aligned_cols=69 Identities=12% Similarity=0.117 Sum_probs=52.7
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhh
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
-||+++..+.+. .++.|..|+-.|++++..... ..+---.-+++.|..++.+-.+..|.-|.-+-++|+
T Consensus 28 Il~pVL~~~~D~---d~rVRy~AcEaL~ni~k~~~~-~~l~~f~~IF~~L~kl~~D~d~~Vr~~a~~Ld~llk 96 (97)
T PF12755_consen 28 ILPPVLKCFDDQ---DSRVRYYACEALYNISKVARG-EILPYFNEIFDALCKLSADPDENVRSAAELLDRLLK 96 (97)
T ss_pred HHHHHHHHcCCC---cHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCchhHHHHHHHHHHHhc
Confidence 578888888875 799999999999999973221 111134667899999998888888888877777765
No 51
>KOG1293 consensus Proteins containing armadillo/beta-catenin-like repeat [General function prediction only]
Probab=90.59 E-value=3.6 Score=38.38 Aligned_cols=113 Identities=23% Similarity=0.260 Sum_probs=85.8
Q ss_pred cchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcC-cHHHH
Q 032259 5 GESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEAR-AAEVL 82 (144)
Q Consensus 5 ~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~G-ai~~L 82 (144)
+.+.--++.+|+||.- ...-|..+.+.|+|..|..++.+= .+..+.+++-+|..+..+.++......-.- -...|
T Consensus 433 ~~i~~~~lgai~NlVmefs~~kskfl~~ngId~l~s~~~~~---~~n~r~~~~~~Lr~l~f~~de~~k~~~~~ki~a~~i 509 (678)
T KOG1293|consen 433 IMIMGITLGAICNLVMEFSNLKSKFLRNNGIDILESMLTDP---DFNSRANSLWVLRHLMFNCDEEEKFQLLAKIPANLI 509 (678)
T ss_pred hhHHHHHHHHHHHHHhhcccHHHHHHHcCcHHHHHHHhcCC---CchHHHHHHHHHHHHHhcchHHHHHHHHHHhhHHHH
Confidence 3445678899999994 788899999999999999999873 567889999999999886554333332222 24667
Q ss_pred HHHHHhCCHHHHHHHHHHHHHhhcCCCCCCcchh-----hhhccc
Q 032259 83 REVEERGSQRAKEKAKRILGMLKGREDDDEDVDW-----EGVLDS 122 (144)
Q Consensus 83 ~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~~~~~~-----~~~~~~ 122 (144)
+.+..+--+..++.+-.+||+|-- |..+++|| .|+++.
T Consensus 510 ~~l~nd~d~~Vqeq~fqllRNl~c--~~~~svdfll~~~~~~ld~ 552 (678)
T KOG1293|consen 510 LDLINDPDWAVQEQCFQLLRNLTC--NSRKSVDFLLEKFKDVLDK 552 (678)
T ss_pred HHHHhCCCHHHHHHHHHHHHHhhc--CcHHHHHHHHHhhhHHHHH
Confidence 778888899999998888888744 56688887 556554
No 52
>PF09759 Atx10homo_assoc: Spinocerebellar ataxia type 10 protein domain; InterPro: IPR019156 This is the conserved C-terminal 100 residues of Ataxin-10. Ataxin-10 belongs to the family of armadillo repeat proteins and in solution it tends to form homotrimeric complexes, which associate via a tip-to-tip association in a horseshoe-shaped contact with the concave sides of the molecules facing each other. This domain may represent the homo-association site since that is located near the C terminus of Ataxin-10. The protein does not contain a signal sequence for secretion or any subcellular compartment confirming its cytoplasmic localisation, specifically to the olivocerebellar region [].
Probab=90.42 E-value=1.6 Score=31.38 Aligned_cols=57 Identities=14% Similarity=0.061 Sum_probs=50.0
Q ss_pred HHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHH--hCCHHHHHHHHHHHHHhhcCC
Q 032259 52 RENCVAALFALGHGNLRFKGLAKEARAAEVLREVEE--RGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 52 kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~--~Gt~raK~kA~~lL~~L~~~~ 108 (144)
|-..+.+|-+||..+..++..+.+.|.+|.++.-.. ...|..|+.|--.+|.|-...
T Consensus 3 K~~lvrlianl~~~~~~~Qd~vr~~~Gi~liL~~c~iD~~nP~irEwai~aiRnL~e~n 61 (102)
T PF09759_consen 3 KRDLVRLIANLCYKNKEVQDLVRELGGIPLILSCCNIDDHNPFIREWAIFAIRNLCEGN 61 (102)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHcCChHHHHHhcCCCcccHHHHHHHHHHHHHHHhCC
Confidence 556788999999988888999999999999999874 448999999999999998864
No 53
>PF11841 DUF3361: Domain of unknown function (DUF3361)
Probab=89.62 E-value=3.3 Score=32.09 Aligned_cols=58 Identities=19% Similarity=0.171 Sum_probs=48.8
Q ss_pred CcchHHHHHHHHHHhcCChhh-HHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 4 SGESTSRVLLILCNLAASNEG-RSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~~~eg-r~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
+..+...|+++|.++..++.. ...+.+.=.++.|+..|... +++-+.||++.+-.|..
T Consensus 73 d~~i~q~sLaILEs~Vl~S~~ly~~V~~evt~~~Li~hLq~~---~~~iq~naiaLinAL~~ 131 (160)
T PF11841_consen 73 DASILQRSLAILESIVLNSPKLYQLVEQEVTLESLIRHLQVS---NQEIQTNAIALINALFL 131 (160)
T ss_pred cchHHHHHHHHHHHHHhCCHHHHHHHhccCCHHHHHHHHHcC---CHHHHHHHHHHHHHHHh
Confidence 457889999999999987666 45555556899999999986 78899999999999977
No 54
>PF11841 DUF3361: Domain of unknown function (DUF3361)
Probab=88.86 E-value=4.7 Score=31.25 Aligned_cols=106 Identities=14% Similarity=0.106 Sum_probs=79.5
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
++...++..+..|-.|.---+.......|..++..+.... ..+...+-|.++|-++...++..-..+.++=-++.|+..
T Consensus 32 ~~La~~L~af~eLMeHg~vsWd~l~~~FI~Kia~~Vn~~~-~d~~i~q~sLaILEs~Vl~S~~ly~~V~~evt~~~Li~h 110 (160)
T PF11841_consen 32 EILAYALTAFVELMEHGIVSWDTLSDSFIKKIASYVNSSA-MDASILQRSLAILESIVLNSPKLYQLVEQEVTLESLIRH 110 (160)
T ss_pred HHHHHHHHHHHHHHhcCcCchhhccHHHHHHHHHHHcccc-ccchHHHHHHHHHHHHHhCCHHHHHHHhccCCHHHHHHH
Confidence 3444466666666666555566666667788888887531 235678899999999999888767777777779999999
Q ss_pred HHhCCHHHHHHHHHHHHHhhcCCCCCC
Q 032259 86 EERGSQRAKEKAKRILGMLKGREDDDE 112 (144)
Q Consensus 86 l~~Gt~raK~kA~~lL~~L~~~~~~~~ 112 (144)
++...+.-+.+|-.++--|=...|++.
T Consensus 111 Lq~~~~~iq~naiaLinAL~~kA~~~~ 137 (160)
T PF11841_consen 111 LQVSNQEIQTNAIALINALFLKADDSK 137 (160)
T ss_pred HHcCCHHHHHHHHHHHHHHHhcCChHH
Confidence 999999999999999987766655543
No 55
>PF04063 DUF383: Domain of unknown function (DUF383); InterPro: IPR007205 This is a protein of unknown function. It is found N-terminal to another domain of unknown function (IPR007206 from INTERPRO).
Probab=88.14 E-value=3 Score=32.97 Aligned_cols=80 Identities=16% Similarity=0.096 Sum_probs=56.5
Q ss_pred cchHHHHHHHHHHhcCChhhHHHHHhcC-cH---HHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHH--cCc
Q 032259 5 GESTSRVLLILCNLAASNEGRSAILDAN-GV---SILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKE--ARA 78 (144)
Q Consensus 5 ~~~~E~AlavL~~La~~~egr~aI~~~g-~I---~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~--~Ga 78 (144)
..-.+-...+|.||+.+++||..+.+.. .+ ..|+..+.. .|..-|.-++++|-|.|-+.+....+-.. ..+
T Consensus 72 ~~~~~yla~vl~NlS~~~~gR~~~l~~~~~~~~l~kLl~ft~~---~s~iRR~Gva~~IrNccFd~~~H~~LL~~~~~~i 148 (192)
T PF04063_consen 72 KDNYDYLASVLANLSQLPEGRQFFLDPQRYDGPLQKLLPFTEH---KSVIRRGGVAGTIRNCCFDTDSHEWLLSDDEVDI 148 (192)
T ss_pred CcchhHHHHHHHHhcCCHHHHHHHhCchhhhhHHHHHHHHhcc---CcHHHHHHHHHHHHHhhccHhHHHHhcCchhhhh
Confidence 3445678899999999999999998753 44 444444443 36777999999999999976655555432 456
Q ss_pred HHHHHHHHH
Q 032259 79 AEVLREVEE 87 (144)
Q Consensus 79 i~~L~~Ll~ 87 (144)
.|.|+.-+-
T Consensus 149 Lp~LLlPLa 157 (192)
T PF04063_consen 149 LPYLLLPLA 157 (192)
T ss_pred HHHHHhhcc
Confidence 666665554
No 56
>PF10165 Ric8: Guanine nucleotide exchange factor synembryn; InterPro: IPR019318 Ric8 is involved in the EGL-30 neurotransmitter signalling pathway []. It is a guanine nucleotide exchange factor [] that regulates neurotransmitter secretion.
Probab=87.72 E-value=4.3 Score=35.81 Aligned_cols=96 Identities=18% Similarity=0.107 Sum_probs=80.7
Q ss_pred HHHHHHHhcCChhhHHHHHhcCcHHHHHHHh---------hcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHH
Q 032259 11 VLLILCNLAASNEGRSAILDANGVSILVGML---------KESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEV 81 (144)
Q Consensus 11 AlavL~~La~~~egr~aI~~~g~I~~LV~lL---------~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~ 81 (144)
|+..|..|+..+.+-..|....+|..|.++- ... .++.....|.-.|.|+...++..|....+.|..+.
T Consensus 1 ~L~~LRiLsRd~~~~~~l~~~~~l~~L~~~a~l~~~~~~~~~~--~~~~v~~EALKCL~N~lf~s~~aR~~~~~~~~~~~ 78 (446)
T PF10165_consen 1 CLETLRILSRDPTGLDPLFTEEGLSTLLKHAGLSESDEDEFES--PDPDVSREALKCLCNALFLSPSARQIFVDLGLAEK 78 (446)
T ss_pred CHHHHHHHccCcccchhhccHHHHHHHHHhcCCcccccccccC--CChHHHHHHHHHHHHHHhCCHHHHHHHHHcCcHHH
Confidence 4678899999999999998877888888877 222 46778889999999999888888899999999999
Q ss_pred HHHHHHhC-----CHHHHHHHHHHHHHhhcCC
Q 032259 82 LREVEERG-----SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 82 L~~Ll~~G-----t~raK~kA~~lL~~L~~~~ 108 (144)
|+..+... .+..+=-..++|.++....
T Consensus 79 l~~~Lk~~~~~~~~~d~~Fl~~RLLFLlTa~~ 110 (446)
T PF10165_consen 79 LCERLKNYSDSSQPSDVEFLDSRLLFLLTALR 110 (446)
T ss_pred HHHHHHcccccCCChhHHHHHHHHHHHHhcCC
Confidence 99999988 5667777888888888765
No 57
>KOG1222 consensus Kinesin associated protein KAP [Intracellular trafficking, secretion, and vesicular transport]
Probab=87.45 E-value=2.1 Score=39.21 Aligned_cols=99 Identities=17% Similarity=0.166 Sum_probs=77.3
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
+.-.....|..|+-..|++-.+.+.|.|..|+++.-.. .+.-+.--.-.||+|+.+.. .+..+.+.|.+|.|..++
T Consensus 320 Ll~lv~~FLkKLSIf~eNK~~M~~~~iveKL~klfp~~---h~dL~~~tl~LlfNlSFD~g-lr~KMv~~GllP~l~~ll 395 (791)
T KOG1222|consen 320 LLTLVIKFLKKLSIFDENKIVMEQNGIVEKLLKLFPIQ---HPDLRKATLMLLFNLSFDSG-LRPKMVNGGLLPHLASLL 395 (791)
T ss_pred HHHHHHHHHHHhhhhccchHHHHhccHHHHHHHhcCCC---CHHHHHHHHHHhhhcccccc-ccHHHhhccchHHHHHHh
Confidence 34455677889999999999999999999999999875 57778888899999998654 566677999999999999
Q ss_pred HhCCHHHHHHHHHHHHHhhcCCCCCCc
Q 032259 87 ERGSQRAKEKAKRILGMLKGREDDDED 113 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L~~~~~~~~~ 113 (144)
.+.+.. .-|..+|..++= ||+.+
T Consensus 396 ~~d~~~--~iA~~~lYh~S~--dD~~K 418 (791)
T KOG1222|consen 396 DSDTKH--GIALNMLYHLSC--DDDAK 418 (791)
T ss_pred CCcccc--hhhhhhhhhhcc--CcHHH
Confidence 877643 456666666643 45533
No 58
>PF10165 Ric8: Guanine nucleotide exchange factor synembryn; InterPro: IPR019318 Ric8 is involved in the EGL-30 neurotransmitter signalling pathway []. It is a guanine nucleotide exchange factor [] that regulates neurotransmitter secretion.
Probab=87.44 E-value=6.8 Score=34.53 Aligned_cols=84 Identities=24% Similarity=0.185 Sum_probs=64.4
Q ss_pred CcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCC--CChHHHHHHHHHHHHHhccC-hHHHHHHHHcCcH
Q 032259 4 SGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGS--DSEATRENCVAALFALGHGN-LRFKGLAKEARAA 79 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~--~S~~~kE~Ava~L~~Lc~~~-~~~~~~a~~~Gai 79 (144)
+..+..+|+..|+|+-- ++..|..+.+.|..+.+++.|+.... .++...=...-+||.++... .....++.+.+++
T Consensus 45 ~~~v~~EALKCL~N~lf~s~~aR~~~~~~~~~~~l~~~Lk~~~~~~~~~d~~Fl~~RLLFLlTa~~~~~~~~L~~e~~~~ 124 (446)
T PF10165_consen 45 DPDVSREALKCLCNALFLSPSARQIFVDLGLAEKLCERLKNYSDSSQPSDVEFLDSRLLFLLTALRPDDRKKLIEEHHGV 124 (446)
T ss_pred ChHHHHHHHHHHHHHHhCCHHHHHHHHHcCcHHHHHHHHHcccccCCChhHHHHHHHHHHHHhcCChhHHHHHHHHhhhH
Confidence 44677899999999995 89999999999999999999988621 14556667778888888744 4566777777887
Q ss_pred HHHHHHHH
Q 032259 80 EVLREVEE 87 (144)
Q Consensus 80 ~~L~~Ll~ 87 (144)
..|..-+.
T Consensus 125 ~~l~~~L~ 132 (446)
T PF10165_consen 125 ELLTEALE 132 (446)
T ss_pred HHHHHHHH
Confidence 77777553
No 59
>PRK09687 putative lyase; Provisional
Probab=86.12 E-value=2.8 Score=34.65 Aligned_cols=59 Identities=17% Similarity=0.185 Sum_probs=37.0
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHH
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGM 103 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~ 103 (144)
+|+.|+++|.+. ++.-+..|+.+|..++..++ .+++.|+.++.+..+..|..|...|..
T Consensus 160 ai~~L~~~L~d~---~~~VR~~A~~aLg~~~~~~~---------~~~~~L~~~L~D~~~~VR~~A~~aLg~ 218 (280)
T PRK09687 160 AIPLLINLLKDP---NGDVRNWAAFALNSNKYDNP---------DIREAFVAMLQDKNEEIRIEAIIGLAL 218 (280)
T ss_pred HHHHHHHHhcCC---CHHHHHHHHHHHhcCCCCCH---------HHHHHHHHHhcCCChHHHHHHHHHHHc
Confidence 577888888764 56778888888877743222 244555666655566666665555543
No 60
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=86.08 E-value=13 Score=28.78 Aligned_cols=63 Identities=16% Similarity=0.097 Sum_probs=45.3
Q ss_pred CChHHHHHHHHHHHHHhccC----hHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCCC
Q 032259 47 DSEATRENCVAALFALGHGN----LRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGRED 109 (144)
Q Consensus 47 ~S~~~kE~Ava~L~~Lc~~~----~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~~ 109 (144)
.++..|+.++..|..+.... +.......-..+++.+...+.++.+..|+.|..++..|....+
T Consensus 143 Kn~~vR~~~~~~l~~~l~~~~~~~~~l~~~~~~~~l~~~l~~~l~D~~~~VR~~Ar~~~~~l~~~~~ 209 (228)
T PF12348_consen 143 KNPQVREECAEWLAIILEKWGSDSSVLQKSAFLKQLVKALVKLLSDADPEVREAARECLWALYSHFP 209 (228)
T ss_dssp S-HHHHHHHHHHHHHHHTT-----GGG--HHHHHHHHHHHHHHHTSS-HHHHHHHHHHHHHHHHHH-
T ss_pred CCHHHHHHHHHHHHHHHHHccchHhhhcccchHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCC
Confidence 37899999999888887622 2222222225578999999999999999999999999988763
No 61
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=85.35 E-value=2.1 Score=23.52 Aligned_cols=28 Identities=25% Similarity=0.424 Sum_probs=23.9
Q ss_pred HHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 34 VSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 34 I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
+|.+++++.+- +++-|+.|+..|..++.
T Consensus 2 lp~l~~~l~D~---~~~VR~~a~~~l~~i~~ 29 (31)
T PF02985_consen 2 LPILLQLLNDP---SPEVRQAAAECLGAIAE 29 (31)
T ss_dssp HHHHHHHHT-S---SHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHcCCC---CHHHHHHHHHHHHHHHh
Confidence 68899999885 89999999999998874
No 62
>KOG1222 consensus Kinesin associated protein KAP [Intracellular trafficking, secretion, and vesicular transport]
Probab=85.13 E-value=2.8 Score=38.51 Aligned_cols=88 Identities=14% Similarity=0.190 Sum_probs=70.3
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
++.+++.-.+.+|-||+-....|-.+++.|-+|.|+.+|.+ ..-+.-|+-+|+.++.++. .+.+-.-...|+.|
T Consensus 357 ~h~dL~~~tl~LlfNlSFD~glr~KMv~~GllP~l~~ll~~-----d~~~~iA~~~lYh~S~dD~-~K~MfayTdci~~l 430 (791)
T KOG1222|consen 357 QHPDLRKATLMLLFNLSFDSGLRPKMVNGGLLPHLASLLDS-----DTKHGIALNMLYHLSCDDD-AKAMFAYTDCIKLL 430 (791)
T ss_pred CCHHHHHHHHHHhhhccccccccHHHhhccchHHHHHHhCC-----cccchhhhhhhhhhccCcH-HHHHHHHHHHHHHH
Confidence 45667778889999999999999999999999999999965 4566789999999998664 46666678889988
Q ss_pred HHHHHhCCHHHHHH
Q 032259 83 REVEERGSQRAKEK 96 (144)
Q Consensus 83 ~~Ll~~Gt~raK~k 96 (144)
...+-.|+..--..
T Consensus 431 mk~v~~~~~~~vdl 444 (791)
T KOG1222|consen 431 MKDVLSGTGSEVDL 444 (791)
T ss_pred HHHHHhcCCceecH
Confidence 88887775443333
No 63
>KOG2171 consensus Karyopherin (importin) beta 3 [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=84.56 E-value=6.6 Score=38.59 Aligned_cols=97 Identities=10% Similarity=0.027 Sum_probs=61.2
Q ss_pred HHHHHHHHHhcCC-hhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHH-HHHHHH
Q 032259 9 SRVLLILCNLAAS-NEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAE-VLREVE 86 (144)
Q Consensus 9 E~AlavL~~La~~-~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~-~L~~Ll 86 (144)
=.|+-+|..+++. +..-+.-.+.--+|.|+..+.+- .+++-+-||+++|.++....+.-.-.=--.+++. .|..|.
T Consensus 407 ~AA~naigQ~stdl~p~iqk~~~e~l~~aL~~~ld~~--~~~rV~ahAa~al~nf~E~~~~~~l~pYLd~lm~~~l~~L~ 484 (1075)
T KOG2171|consen 407 YAALNAIGQMSTDLQPEIQKKHHERLPPALIALLDST--QNVRVQAHAAAALVNFSEECDKSILEPYLDGLMEKKLLLLL 484 (1075)
T ss_pred HHHHHHHHhhhhhhcHHHHHHHHHhccHHHHHHhccc--CchHHHHHHHHHHHHHHHhCcHHHHHHHHHHHHHHHHHHHh
Confidence 3455555555542 22222222233456888888664 3799999999999999763322111111355666 677778
Q ss_pred HhCCHHHHHHHHHHHHHhhcC
Q 032259 87 ERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L~~~ 107 (144)
+++++..|+.|.++|.-....
T Consensus 485 ~~~~~~v~e~vvtaIasvA~A 505 (1075)
T KOG2171|consen 485 QSSKPYVQEQAVTAIASVADA 505 (1075)
T ss_pred cCCchhHHHHHHHHHHHHHHH
Confidence 899999999999988765543
No 64
>KOG1293 consensus Proteins containing armadillo/beta-catenin-like repeat [General function prediction only]
Probab=84.43 E-value=4.9 Score=37.49 Aligned_cols=108 Identities=15% Similarity=0.121 Sum_probs=76.3
Q ss_pred cCCcchHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHH
Q 032259 2 VKSGESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAE 80 (144)
Q Consensus 2 ~~~~~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~ 80 (144)
+|+...+..|+.-+.+++. ...-|..+....++.+||+++..+ ...-+-.+.++|.+|...-...+.-....|.|.
T Consensus 388 ~kd~~~~aaa~l~~~s~srsV~aL~tg~~~~dv~~plvqll~dp---~~~i~~~~lgai~NlVmefs~~kskfl~~ngId 464 (678)
T KOG1293|consen 388 IKDHDFVAAALLCLKSFSRSVSALRTGLKRNDVAQPLVQLLMDP---EIMIMGITLGAICNLVMEFSNLKSKFLRNNGID 464 (678)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHcCCccchhHHHHHHHhhCc---chhHHHHHHHHHHHHHhhcccHHHHHHHcCcHH
Confidence 4444444444444555543 223333344455788999999876 677788888999999884445677888999999
Q ss_pred HHHHHHHhCCHHHHHHHHHHHHHhhcCCCCCCcc
Q 032259 81 VLREVEERGSQRAKEKAKRILGMLKGREDDDEDV 114 (144)
Q Consensus 81 ~L~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~~~~ 114 (144)
.|..++.+-.+-.|+++...|+.+.= |+|+..
T Consensus 465 ~l~s~~~~~~~n~r~~~~~~Lr~l~f--~~de~~ 496 (678)
T KOG1293|consen 465 ILESMLTDPDFNSRANSLWVLRHLMF--NCDEEE 496 (678)
T ss_pred HHHHHhcCCCchHHHHHHHHHHHHHh--cchHHH
Confidence 99999999988889998888888764 445433
No 65
>KOG4500 consensus Rho/Rac GTPase guanine nucleotide exchange factor smgGDS/Vimar [Signal transduction mechanisms]
Probab=83.66 E-value=6.5 Score=35.62 Aligned_cols=102 Identities=16% Similarity=0.088 Sum_probs=79.6
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhc--CCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKE--SGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~--G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
++.-...-.|+|++..++....+++.+.+..|+++|.. +.+..-+-+--+.++|-+|...-+ .+.....+|+....+
T Consensus 330 ~l~t~g~LaigNfaR~D~~ci~~v~~~~~nkL~~~l~~~~~vdgnV~~qhA~lsALRnl~IPv~-nka~~~~aGvteaIL 408 (604)
T KOG4500|consen 330 NLITMGSLAIGNFARRDDICIQLVQKDFLNKLISCLMQEKDVDGNVERQHACLSALRNLMIPVS-NKAHFAPAGVTEAIL 408 (604)
T ss_pred hHHHHHHHHHHhhhccchHHHHHHHHHHHHHHHHHHHHhcCCCccchhHHHHHHHHHhccccCC-chhhccccchHHHHH
Confidence 44556677899999999999999999999999998854 211133445555699999987433 356667899999999
Q ss_pred HHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 84 EVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
..+....|+..-|=-..|++++...
T Consensus 409 ~~lk~~~ppv~fkllgTlrM~~d~q 433 (604)
T KOG4500|consen 409 LQLKLASPPVTFKLLGTLRMIRDSQ 433 (604)
T ss_pred HHHHhcCCcchHHHHHHHHHHHhch
Confidence 9999998888888888888888764
No 66
>KOG3036 consensus Protein involved in cell differentiation/sexual development [General function prediction only]
Probab=83.57 E-value=20 Score=30.25 Aligned_cols=98 Identities=14% Similarity=0.069 Sum_probs=67.2
Q ss_pred chHHHHHHHHHHhcCCh--hhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccC---hHHHHHHHHcCcHH
Q 032259 6 ESTSRVLLILCNLAASN--EGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGN---LRFKGLAKEARAAE 80 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~--egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~---~~~~~~a~~~Gai~ 80 (144)
-+.=.+++|++.|.... +-..-+...+.||..+++|..| |...|--|..+|.-+-.++ .+.++...+.-+|.
T Consensus 140 yLRLtsLGVIgaLvk~dd~eVi~fLl~TeIVPlCLrime~G---SelSKtvA~fIlqKIlldD~GL~YiCqt~eRF~av~ 216 (293)
T KOG3036|consen 140 YLRLTSLGVIGALVKNDDQEVIRFLLTTEIVPLCLRIMESG---SELSKTVATFILQKILLDDVGLYYICQTAERFSAVA 216 (293)
T ss_pred HHhHHHHHHHHHHHhcCcHHHHHHHHHhhhHHHHHHHHhcc---cHHHHHHHHHHHHHHhhccccHHHHHHhHHHHHHHH
Confidence 34557899999999743 3333446788999999999999 8999999999988765533 25556665555543
Q ss_pred H-----HHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 81 V-----LREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 81 ~-----L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
. +..|+..+++|.-+.+-+....|..
T Consensus 217 ~~L~kmv~~l~~~ps~RllKhviRcYlrLsd 247 (293)
T KOG3036|consen 217 LVLGKMVFQLVSMPSPRLLKHVIRCYLRLSD 247 (293)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHhcC
Confidence 2 3345567788876665555444433
No 67
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=83.50 E-value=9.8 Score=32.89 Aligned_cols=85 Identities=15% Similarity=0.204 Sum_probs=64.1
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
+...|+..|+++.. ++....+ ++.+.++|.+. +|--|..|+.++..+...++. ..... .++.|..++
T Consensus 95 ~~~lAL~~l~~i~~-~~~~~~l-----~~~v~~ll~~~---~~~VRk~A~~~l~~i~~~~p~---~~~~~-~~~~l~~lL 161 (526)
T PF01602_consen 95 IRGLALRTLSNIRT-PEMAEPL-----IPDVIKLLSDP---SPYVRKKAALALLKIYRKDPD---LVEDE-LIPKLKQLL 161 (526)
T ss_dssp HHHHHHHHHHHH-S-HHHHHHH-----HHHHHHHHHSS---SHHHHHHHHHHHHHHHHHCHC---CHHGG-HHHHHHHHT
T ss_pred HHHHHHhhhhhhcc-cchhhHH-----HHHHHHHhcCC---chHHHHHHHHHHHHHhccCHH---HHHHH-HHHHHhhhc
Confidence 56778888888883 4444433 56788888875 789999999999999885543 22223 688999999
Q ss_pred HhCCHHHHHHHHHHHHHh
Q 032259 87 ERGSQRAKEKAKRILGML 104 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L 104 (144)
.+..+..+..|..+|..+
T Consensus 162 ~d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 162 SDKDPSVVSAALSLLSEI 179 (526)
T ss_dssp THSSHHHHHHHHHHHHHH
T ss_pred cCCcchhHHHHHHHHHHH
Confidence 888899999998888888
No 68
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=83.32 E-value=2.6 Score=23.10 Aligned_cols=29 Identities=17% Similarity=0.243 Sum_probs=24.6
Q ss_pred cHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 78 AAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 78 ai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
++|.|++++++.+++.|..|...|..+.+
T Consensus 1 llp~l~~~l~D~~~~VR~~a~~~l~~i~~ 29 (31)
T PF02985_consen 1 LLPILLQLLNDPSPEVRQAAAECLGAIAE 29 (31)
T ss_dssp HHHHHHHHHT-SSHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHHh
Confidence 47899999999999999999999887754
No 69
>PF06371 Drf_GBD: Diaphanous GTPase-binding Domain; InterPro: IPR010473 Diaphanous-related formins (Drfs) are a family of formin homology (FH) proteins that act as effectors of Rho small GTPases during growth factor-induced cytoskeletal remodelling, stress fibre formation, and cell division []. Drf proteins are characterised by a variety of shared domains: an N-terminal GTPase-binding domain (GBD), formin-homology domains FH1, FH2 (IPR003104 from INTERPRO) and FH3 (IPR010472 from INTERPRO), and a C-terminal conserved Dia-autoregulatory domain (DAD) that binds the GBD. This entry represents the GBD, which is a bifunctional autoinhibitory domain that interacts with and is regulated by activated Rho family members. Mammalian Drf3 contains a CRIB-like motif within its GBD for binding to Cdc42, which is required for Cdc42 to activate and guide Drf3 towards the cell cortex where it remodels the actin skeleton [].; GO: 0003779 actin binding, 0017048 Rho GTPase binding, 0030036 actin cytoskeleton organization; PDB: 3OBV_A 2BNX_A 3EG5_D 2BAP_B 3O4X_B 1Z2C_B 2F31_A.
Probab=82.93 E-value=4.8 Score=30.18 Aligned_cols=56 Identities=23% Similarity=0.245 Sum_probs=46.4
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHh-cCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILD-ANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~-~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
.....++.-|..|..++.|+..+.. ++.|..|+..|.+. +++.+..|.-+|-.||.
T Consensus 131 ~~~~~~l~Clkal~n~~~G~~~v~~~~~~v~~i~~~L~s~---~~~~r~~~leiL~~lc~ 187 (187)
T PF06371_consen 131 DIEHECLRCLKALMNTKYGLEAVLSHPDSVNLIALSLDSP---NIKTRKLALEILAALCL 187 (187)
T ss_dssp HHHHHHHHHHHHHTSSHHHHHHHHCSSSHHHHHHHT--TT---SHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHccHHHHHHHHcCcHHHHHHHHHHCCC---CHHHHHHHHHHHHHHHC
Confidence 3456678888999999999999987 67899999999874 79999999999999984
No 70
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=82.39 E-value=5.2 Score=29.53 Aligned_cols=71 Identities=14% Similarity=0.078 Sum_probs=54.2
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhh
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
.|..|+++|... ..+...--|+.=|..+++..+.-+.++.+.|+-..+..|+.+..+..|..|-..++.|-
T Consensus 44 llk~L~~lL~~s--~d~~~laVac~Dig~~vr~~p~gr~ii~~lg~K~~vM~Lm~h~d~eVr~eAL~avQklm 114 (119)
T PF11698_consen 44 LLKKLIKLLDKS--DDPTTLAVACHDIGEFVRHYPNGRNIIEKLGAKERVMELMNHEDPEVRYEALLAVQKLM 114 (119)
T ss_dssp HHHHHHHHH-SH--HHHHHHHHHHHHHHHHHHH-GGGHHHHHHHSHHHHHHHHTS-SSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHccC--CCcceeehhhcchHHHHHHChhHHHHHHhcChHHHHHHHhcCCCHHHHHHHHHHHHHHH
Confidence 478899999652 13445555666677888866777888899999999999999999999999988887653
No 71
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=81.21 E-value=14 Score=35.55 Aligned_cols=97 Identities=25% Similarity=0.257 Sum_probs=73.0
Q ss_pred hHHHHHHHHHHhcCC--hhhHHHHHh-cCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 7 STSRVLLILCNLAAS--NEGRSAILD-ANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 7 ~~E~AlavL~~La~~--~egr~aI~~-~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
+.=-++.+|.+|-.| ++-+.+|.. +-+|..||.+|++. -.--|-.|+-.|..|..+++.++.+++-+.++..|.
T Consensus 138 VR~~aIqLlsalls~r~~e~q~~ll~~P~gIS~lmdlL~Ds---rE~IRNe~iLlL~eL~k~n~~IQKlVAFENaFerLf 214 (970)
T KOG0946|consen 138 VRLYAIQLLSALLSCRPTELQDALLVSPMGISKLMDLLRDS---REPIRNEAILLLSELVKDNSSIQKLVAFENAFERLF 214 (970)
T ss_pred hhhHHHHHHHHHHhcCCHHHHHHHHHCchhHHHHHHHHhhh---hhhhchhHHHHHHHHHccCchHHHHHHHHHHHHHHH
Confidence 344567777777764 688888876 45999999999983 233577777888889998889999999999999999
Q ss_pred HHHHh-CC---HHHHHHHHHHHHHhhc
Q 032259 84 EVEER-GS---QRAKEKAKRILGMLKG 106 (144)
Q Consensus 84 ~Ll~~-Gt---~raK~kA~~lL~~L~~ 106 (144)
.+++. |. .-.-..+-.+|..|=+
T Consensus 215 sIIeeEGg~dGgIVveDCL~ll~NLLK 241 (970)
T KOG0946|consen 215 SIIEEEGGLDGGIVVEDCLILLNNLLK 241 (970)
T ss_pred HHHHhcCCCCCcchHHHHHHHHHHHHh
Confidence 99975 32 2345666666665544
No 72
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=80.82 E-value=17 Score=27.24 Aligned_cols=79 Identities=16% Similarity=0.124 Sum_probs=58.6
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHh------CCHHHHHHHHHHHHHhh
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEER------GSQRAKEKAKRILGMLK 105 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~------Gt~raK~kA~~lL~~L~ 105 (144)
++..|.+.|..+ ++..+-.|..+|-.+.. ....+...+.....+.-|+.++.. ..+..|+|...+|+.-.
T Consensus 39 a~rai~krl~~~---n~~v~l~AL~LLe~~vkNCG~~fh~evas~~Fl~el~kl~~~k~~~~~~~~~Vk~kil~li~~W~ 115 (139)
T cd03567 39 AVRLLAHKIQSP---QEKEALQALTVLEACMKNCGERFHSEVGKFRFLNELIKLVSPKYLGSRTSEKVKTKIIELLYSWT 115 (139)
T ss_pred HHHHHHHHHcCC---CHHHHHHHHHHHHHHHHHcCHHHHHHHHhHHHHHHHHHHhccccCCCCCCHHHHHHHHHHHHHHH
Confidence 567888888876 78888889888887766 445566666677788889999853 25788999888888777
Q ss_pred cCCCCCCcc
Q 032259 106 GREDDDEDV 114 (144)
Q Consensus 106 ~~~~~~~~~ 114 (144)
....+++.+
T Consensus 116 ~~f~~~p~~ 124 (139)
T cd03567 116 LELPHEPKI 124 (139)
T ss_pred HHhcccchH
Confidence 655555443
No 73
>cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries.
Probab=79.02 E-value=18 Score=27.14 Aligned_cols=77 Identities=13% Similarity=0.134 Sum_probs=58.7
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHh-CCHHHHHHHHHHHHHhhcCCCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEER-GSQRAKEKAKRILGMLKGREDD 110 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~-Gt~raK~kA~~lL~~L~~~~~~ 110 (144)
++..|.+.|..+ ++..+-.|..+|-.+.. ....+...+.....+..|..++.. ..+..|+|...+|+.......+
T Consensus 38 a~ral~KRl~~~---n~~v~l~AL~LLe~~vkNCG~~fh~evask~Fl~eL~kl~~~~~~~~Vk~kil~li~~W~~~f~~ 114 (144)
T cd03568 38 CLKAIMKRLNHK---DPNVQLRALTLLDACAENCGKRFHQEVASRDFTQELKKLINDRVHPTVKEKLREVVKQWADEFKN 114 (144)
T ss_pred HHHHHHHHHcCC---CHHHHHHHHHHHHHHHHHCCHHHHHHHhhHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHhCC
Confidence 677888888876 78888888888877766 445566666677788999999987 6888999988888887765544
Q ss_pred CC
Q 032259 111 DE 112 (144)
Q Consensus 111 ~~ 112 (144)
+.
T Consensus 115 ~~ 116 (144)
T cd03568 115 DP 116 (144)
T ss_pred Cc
Confidence 43
No 74
>KOG1789 consensus Endocytosis protein RME-8, contains DnaJ domain [Intracellular trafficking, secretion, and vesicular transport; Posttranslational modification, protein turnover, chaperones]
Probab=78.89 E-value=8.4 Score=38.69 Aligned_cols=75 Identities=11% Similarity=0.045 Sum_probs=61.9
Q ss_pred HHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHH
Q 032259 9 SRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEE 87 (144)
Q Consensus 9 E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~ 87 (144)
-.|+.+|..|+.+.-+..++....+|.+++..|+.. +..---|+.+|-.+-. ...+...-+.+.|.||-|+.|+.
T Consensus 2065 ~SaiRVlH~Lsen~~C~~AMA~l~~i~~~m~~mkK~----~~~~GLA~EalkR~~~r~~~eLVAQ~LK~gLvpyLL~LLd 2140 (2235)
T KOG1789|consen 2065 RSAIRVLHELSENQFCCDAMAQLPCIDGIMKSMKKQ----PSLMGLAAEALKRLMKRNTGELVAQMLKCGLVPYLLQLLD 2140 (2235)
T ss_pred HHHHHHHHHHhhccHHHHHHhccccchhhHHHHHhc----chHHHHHHHHHHHHHHHhHHHHHHHHhccCcHHHHHHHhc
Confidence 468899999999999999999988998899999874 3444478888888876 33466777889999999999985
No 75
>PF05004 IFRD: Interferon-related developmental regulator (IFRD); InterPro: IPR007701 Interferon-related developmental regulator (IFRD1) is the human homologue of the Rattus norvegicus early response protein PC4 and its murine homologue TIS7 []. The exact function of IFRD1 is unknown but it has been shown that PC4 is necessary for muscle differentiation and that it might have a role in signal transduction. This entry also contains IFRD2 and its murine equivalent SKMc15, which are highly expressed soon after gastrulation and in the hepatic primordium, suggesting an involvement in early hematopoiesis [].
Probab=78.71 E-value=20 Score=30.09 Aligned_cols=98 Identities=18% Similarity=0.147 Sum_probs=60.6
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhc-C-cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc--cC-hHHHHHHHHcC
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDA-N-GVSILVGMLKESGSDSEATRENCVAALFALGH--GN-LRFKGLAKEAR 77 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~-g-~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~--~~-~~~~~~a~~~G 77 (144)
|+....+.|+..|.++-.+.---..+.+. . -+..+.+.++.| +++.+..|+.++-.||. +. +....+ -..
T Consensus 55 K~~~~Re~aL~~l~~~l~~~~~~d~v~~~~~tL~~~~~k~lkkg---~~~E~~lA~~~l~Ll~ltlg~g~~~~ei--~~~ 129 (309)
T PF05004_consen 55 KSSSTREAALEALIRALSSRYLPDFVEDRRETLLDALLKSLKKG---KSEEQALAARALALLALTLGAGEDSEEI--FEE 129 (309)
T ss_pred cCHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHhccC---CHHHHHHHHHHHHHHhhhcCCCccHHHH--HHH
Confidence 55566788888887777543333444332 2 357788888888 55777777776666665 32 121222 345
Q ss_pred cHHHHHHHHHhCC--HHHHHHHHHHHHHhh
Q 032259 78 AAEVLREVEERGS--QRAKEKAKRILGMLK 105 (144)
Q Consensus 78 ai~~L~~Ll~~Gt--~raK~kA~~lL~~L~ 105 (144)
+.|.|..++.+++ +.+|..+...|-++.
T Consensus 130 ~~~~L~~~l~d~s~~~~~R~~~~~aLai~~ 159 (309)
T PF05004_consen 130 LKPVLKRILTDSSASPKARAACLEALAICT 159 (309)
T ss_pred HHHHHHHHHhCCccchHHHHHHHHHHHHHH
Confidence 8899999999885 445555554555543
No 76
>PRK09687 putative lyase; Provisional
Probab=78.57 E-value=9.9 Score=31.40 Aligned_cols=70 Identities=16% Similarity=0.057 Sum_probs=45.7
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh-------------------HHHHHHH-------HcCcHHHHHHHH
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNL-------------------RFKGLAK-------EARAAEVLREVE 86 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~-------------------~~~~~a~-------~~Gai~~L~~Ll 86 (144)
+++.|..++... ..+.-+..|+.+|..+|.... .+|..+. ..-+++.|+.++
T Consensus 91 a~~~L~~l~~~D--~d~~VR~~A~~aLG~~~~~~~~~~~~a~~~l~~~~~D~~~~VR~~a~~aLg~~~~~~ai~~L~~~L 168 (280)
T PRK09687 91 VFNILNNLALED--KSACVRASAINATGHRCKKNPLYSPKIVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLINLL 168 (280)
T ss_pred HHHHHHHHHhcC--CCHHHHHHHHHHHhcccccccccchHHHHHHHHHhhCCCHHHHHHHHHHHhccCCHHHHHHHHHHh
Confidence 467777774443 267889999999988875211 1222211 123477788888
Q ss_pred HhCCHHHHHHHHHHHHHh
Q 032259 87 ERGSQRAKEKAKRILGML 104 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L 104 (144)
.+..+..|..|...|-.+
T Consensus 169 ~d~~~~VR~~A~~aLg~~ 186 (280)
T PRK09687 169 KDPNGDVRNWAAFALNSN 186 (280)
T ss_pred cCCCHHHHHHHHHHHhcC
Confidence 777777888887777776
No 77
>PF06371 Drf_GBD: Diaphanous GTPase-binding Domain; InterPro: IPR010473 Diaphanous-related formins (Drfs) are a family of formin homology (FH) proteins that act as effectors of Rho small GTPases during growth factor-induced cytoskeletal remodelling, stress fibre formation, and cell division []. Drf proteins are characterised by a variety of shared domains: an N-terminal GTPase-binding domain (GBD), formin-homology domains FH1, FH2 (IPR003104 from INTERPRO) and FH3 (IPR010472 from INTERPRO), and a C-terminal conserved Dia-autoregulatory domain (DAD) that binds the GBD. This entry represents the GBD, which is a bifunctional autoinhibitory domain that interacts with and is regulated by activated Rho family members. Mammalian Drf3 contains a CRIB-like motif within its GBD for binding to Cdc42, which is required for Cdc42 to activate and guide Drf3 towards the cell cortex where it remodels the actin skeleton [].; GO: 0003779 actin binding, 0017048 Rho GTPase binding, 0030036 actin cytoskeleton organization; PDB: 3OBV_A 2BNX_A 3EG5_D 2BAP_B 3O4X_B 1Z2C_B 2F31_A.
Probab=78.50 E-value=2.3 Score=31.89 Aligned_cols=80 Identities=15% Similarity=0.269 Sum_probs=56.2
Q ss_pred HHHHHhcCcHHHHHHHhhcCCC------CChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHH
Q 032259 25 RSAILDANGVSILVGMLKESGS------DSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAK 98 (144)
Q Consensus 25 r~aI~~~g~I~~LV~lL~~G~~------~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~ 98 (144)
-..+.+.||+..|+++|..=.. ........++..|-.|+........+...++.+..|...+.+...++|..|.
T Consensus 100 v~~Fl~~~G~~~L~~~L~~~~~~~~~~~~~~~~~~~~l~Clkal~n~~~G~~~v~~~~~~v~~i~~~L~s~~~~~r~~~l 179 (187)
T PF06371_consen 100 VQEFLELGGLEALLNVLSKLNKKKEKSEEDIDIEHECLRCLKALMNTKYGLEAVLSHPDSVNLIALSLDSPNIKTRKLAL 179 (187)
T ss_dssp HHHH-HHHHHHHHHHHHHHHHTHHCTCTTCHHHHHHHHHHHHHHTSSHHHHHHHHCSSSHHHHHHHT--TTSHHHHHHHH
T ss_pred HHHhccCCCHHHHHHHHHHhhhhhhhcchhHHHHHHHHHHHHHHHccHHHHHHHHcCcHHHHHHHHHHCCCCHHHHHHHH
Confidence 4456678999999998865100 0112344566777777765556677777899999999999999999999999
Q ss_pred HHHHHh
Q 032259 99 RILGML 104 (144)
Q Consensus 99 ~lL~~L 104 (144)
.+|..+
T Consensus 180 eiL~~l 185 (187)
T PF06371_consen 180 EILAAL 185 (187)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 988765
No 78
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=78.49 E-value=14 Score=27.09 Aligned_cols=73 Identities=18% Similarity=0.085 Sum_probs=56.2
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHhC-CHH---HHHHHHHHHHHhhcC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEERG-SQR---AKEKAKRILGMLKGR 107 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~G-t~r---aK~kA~~lL~~L~~~ 107 (144)
++..|.+.|..+ +|..+-.|+.+|-.+.. +.+.+...+.....+..|..|+.+. +.. .|+|+..+|+.....
T Consensus 43 a~~~l~krl~~~---~~~vq~~aL~lld~lvkNcg~~f~~ev~~~~fl~~l~~l~~~~~~~~~~~Vk~k~l~ll~~W~~~ 119 (140)
T PF00790_consen 43 AARALRKRLKHG---NPNVQLLALTLLDALVKNCGPRFHREVASKEFLDELVKLIKSKKTDPETPVKEKILELLQEWAEA 119 (140)
T ss_dssp HHHHHHHHHTTS---SHHHHHHHHHHHHHHHHHSHHHHHHHHTSHHHHHHHHHHHHHTTTHHHSHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhCC---CHHHHHHHHHHHHHHHHcCCHHHHHHHhHHHHHHHHHHHHccCCCCchhHHHHHHHHHHHHHHHH
Confidence 456788888886 89999999888887776 3356666666667889999988765 544 799988888887775
Q ss_pred C
Q 032259 108 E 108 (144)
Q Consensus 108 ~ 108 (144)
.
T Consensus 120 f 120 (140)
T PF00790_consen 120 F 120 (140)
T ss_dssp T
T ss_pred H
Confidence 5
No 79
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=77.53 E-value=22 Score=26.50 Aligned_cols=75 Identities=8% Similarity=0.045 Sum_probs=57.4
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHhC-CHHHHHHHHHHHHHhhcCCCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEERG-SQRAKEKAKRILGMLKGREDD 110 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~G-t~raK~kA~~lL~~L~~~~~~ 110 (144)
++..|.+.|..+ ++..+-.|..+|-.+.. ....+...+.....+..|..++... .+..|+|+..+++.......+
T Consensus 42 a~ral~krl~~~---n~~vql~AL~LLe~~vkNCG~~fh~evas~~fl~~l~~l~~~~~~~~Vk~kil~li~~W~~~f~~ 118 (142)
T cd03569 42 AMRALKKRLLSK---NPNVQLYALLLLESCVKNCGTHFHDEVASREFMDELKDLIKTTKNEEVRQKILELIQAWALAFRN 118 (142)
T ss_pred HHHHHHHHHcCC---ChHHHHHHHHHHHHHHHHCCHHHHHHHhhHHHHHHHHHHHcccCCHHHHHHHHHHHHHHHHHhCC
Confidence 677888888887 78888888777766655 3455666667888999999999754 678899988888887765533
No 80
>KOG0168 consensus Putative ubiquitin fusion degradation protein [Posttranslational modification, protein turnover, chaperones]
Probab=77.30 E-value=14 Score=35.92 Aligned_cols=91 Identities=16% Similarity=0.113 Sum_probs=70.2
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc--cChHHHHHHHHcCcHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH--GNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~--~~~~~~~~a~~~Gai~~L~ 83 (144)
++.|.++.+|..|+. +.-.+|.++|+|...+..|.-= |..+|..|.++--+.|. ..+...-++ .++|.|.
T Consensus 270 DvAEQ~LqALE~iSR--~H~~AiL~AG~l~a~LsylDFF---Si~aQR~AlaiaaN~Cksi~sd~f~~v~---ealPlL~ 341 (1051)
T KOG0168|consen 270 DVAEQSLQALEKISR--RHPKAILQAGALSAVLSYLDFF---SIHAQRVALAIAANCCKSIRSDEFHFVM---EALPLLT 341 (1051)
T ss_pred HHHHHHHHHHHHHHh--hccHHHHhcccHHHHHHHHHHH---HHHHHHHHHHHHHHHHhcCCCccchHHH---HHHHHHH
Confidence 578999999999996 2235788999999999999773 78899999999999998 444444443 3789999
Q ss_pred HHHHhCCHHHHHHHHHHHHHh
Q 032259 84 EVEERGSQRAKEKAKRILGML 104 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L 104 (144)
.|++.-+.+.-+.+.--+..+
T Consensus 342 ~lLs~~D~k~ies~~ic~~ri 362 (1051)
T KOG0168|consen 342 PLLSYQDKKPIESVCICLTRI 362 (1051)
T ss_pred HHHhhccchhHHHHHHHHHHH
Confidence 999988777766555444333
No 81
>KOG2171 consensus Karyopherin (importin) beta 3 [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=76.39 E-value=4.9 Score=39.45 Aligned_cols=113 Identities=18% Similarity=0.140 Sum_probs=80.5
Q ss_pred CcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHH
Q 032259 4 SGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 4 ~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L 82 (144)
+..+...+..++..|+....+.-...=.+..|.++.++..+ .+-.|++|+.++-.+.. ..+.... -..+.+|.+
T Consensus 816 d~~ll~~i~~i~~~l~k~~k~~f~p~f~~~~p~iv~~l~~~---~~~~r~~av~~~~d~ie~~~~a~~~--~~~~~~p~~ 890 (1075)
T KOG2171|consen 816 DAYLLDAISDILAALAKALKGSFLPFFENFLPLIVKLLKSK---KTVARQWAVCIFDDLIEGCGEASAK--YKERFLPLV 890 (1075)
T ss_pred hHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHhcC---CcHHHHHHHHHHHHHHHhcccccch--HHHHHHHHH
Confidence 44566777778877776444433333346789999999886 45559999999888765 2221111 246789999
Q ss_pred HHHHHhCCHHHHHHHHHHHHHhhcCCCCCCcchhhhhcccCCC
Q 032259 83 REVEERGSQRAKEKAKRILGMLKGREDDDEDVDWEGVLDSGGP 125 (144)
Q Consensus 83 ~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~~~~~~~~~~~~~~~ 125 (144)
++.+++-.|+.|+.|+--+-.+.++..++ |+.|+.+..+
T Consensus 891 ~~~~~d~~pEVRqaAsYGiGvlaq~~g~~----y~~v~~~~l~ 929 (1075)
T KOG2171|consen 891 LEALQDSDPEVRQAAAYGMGVLAQFGGED----YAPVCSEALP 929 (1075)
T ss_pred HHHcCCCCHHHHHHHHHHHHHHHHHcCcc----hHHHHHHHHH
Confidence 99999999999999999999999987443 6666655443
No 82
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=75.30 E-value=17 Score=32.23 Aligned_cols=95 Identities=16% Similarity=0.127 Sum_probs=61.4
Q ss_pred HHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccC------hHHHHHHHHcCcHHHHH
Q 032259 10 RVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGN------LRFKGLAKEARAAEVLR 83 (144)
Q Consensus 10 ~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~------~~~~~~a~~~Gai~~L~ 83 (144)
.++-+|..|+-++++.+.+.+.+.||.|+++++.. ...+..--++++|.+|...+ ..+...+...|+.+.|.
T Consensus 206 ~~ll~lWlLSF~~~~~~~~~~~~~i~~l~~i~k~s--~KEKvvRv~l~~l~Nll~~~~~~~~~~~~~~~mv~~~l~~~l~ 283 (429)
T cd00256 206 QSIFCIWLLTFNPHAAEVLKRLSLIQDLSDILKES--TKEKVIRIVLAIFRNLISKRVDREVKKTAALQMVQCKVLKTLQ 283 (429)
T ss_pred HHHHHHHHHhccHHHHHhhccccHHHHHHHHHHhh--hhHHHHHHHHHHHHHHhhcccccchhhhHHHHHHHcChHHHHH
Confidence 56677888998888777777778999999999984 23455778889999998732 12344455666666555
Q ss_pred HHHHhC--CHHHHHHHHHHHHHhhc
Q 032259 84 EVEERG--SQRAKEKAKRILGMLKG 106 (144)
Q Consensus 84 ~Ll~~G--t~raK~kA~~lL~~L~~ 106 (144)
.|.... .+.-.+.-..+-..|.+
T Consensus 284 ~L~~rk~~DedL~edl~~L~e~L~~ 308 (429)
T cd00256 284 SLEQRKYDDEDLTDDLKFLTEELKN 308 (429)
T ss_pred HHhcCCCCcHHHHHHHHHHHHHHHH
Confidence 554433 33334444444444433
No 83
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=73.32 E-value=5.4 Score=24.25 Aligned_cols=34 Identities=29% Similarity=0.467 Sum_probs=27.9
Q ss_pred HHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHh
Q 032259 71 GLAKEARAAEVLREVEERGSQRAKEKAKRILGML 104 (144)
Q Consensus 71 ~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L 104 (144)
.+....|+-..|-+++..|++..|..|..+|.-|
T Consensus 11 e~Gd~e~Ar~lL~evl~~~~~~q~~eA~~LL~~l 44 (44)
T TIGR03504 11 EMGDLEGARELLEEVIEEGDEAQRQEARALLAQL 44 (44)
T ss_pred HcCChHHHHHHHHHHHHcCCHHHHHHHHHHHhcC
Confidence 3344577888899999999999999999998753
No 84
>KOG2759 consensus Vacuolar H+-ATPase V1 sector, subunit H [Energy production and conversion]
Probab=72.13 E-value=40 Score=30.16 Aligned_cols=79 Identities=18% Similarity=0.148 Sum_probs=61.3
Q ss_pred HHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHh
Q 032259 9 SRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEER 88 (144)
Q Consensus 9 E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~ 88 (144)
--|+..|-.+...++-|-+++.+.++..++.+|.++ .+.-.-+=+..-.+|.|.-+.. ..+.+...+.|+.|..+++.
T Consensus 175 ~~~~rcLQ~ll~~~eyR~~~v~adg~~~l~~~l~s~-~~~~QlQYqsifciWlLtFn~~-~ae~~~~~~li~~L~~Ivk~ 252 (442)
T KOG2759|consen 175 QFAARCLQTLLRVDEYRYAFVIADGVSLLIRILAST-KCGFQLQYQSIFCIWLLTFNPH-AAEKLKRFDLIQDLSDIVKE 252 (442)
T ss_pred HHHHHHHHHHhcCcchhheeeecCcchhhHHHHhcc-CcchhHHHHHHHHHHHhhcCHH-HHHHHhhccHHHHHHHHHHH
Confidence 446668888999999999999999999999999543 2455667788888999886543 34666788888888888765
Q ss_pred C
Q 032259 89 G 89 (144)
Q Consensus 89 G 89 (144)
.
T Consensus 253 ~ 253 (442)
T KOG2759|consen 253 S 253 (442)
T ss_pred H
Confidence 4
No 85
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=69.34 E-value=33 Score=26.42 Aligned_cols=95 Identities=16% Similarity=0.090 Sum_probs=57.7
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
++-.|+.+|..|+..-...-.-.-.-.+|.|++.+.++ +.--++.|..+|..++..-.....+ .++.|....
T Consensus 69 v~~~A~~~l~~l~~~l~~~~~~~~~~~l~~Ll~~~~~~---~~~i~~~a~~~L~~i~~~~~~~~~~-----~~~~l~~~~ 140 (228)
T PF12348_consen 69 VSKTACQLLSDLARQLGSHFEPYADILLPPLLKKLGDS---KKFIREAANNALDAIIESCSYSPKI-----LLEILSQGL 140 (228)
T ss_dssp HHHHHHHHHHHHHHHHGGGGHHHHHHHHHHHHHGGG------HHHHHHHHHHHHHHHTTS-H--HH-----HHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHccc---cHHHHHHHHHHHHHHHHHCCcHHHH-----HHHHHHHHH
Confidence 45677777777776333321111123678888888775 5677999999999998743311111 255666666
Q ss_pred HhCCHHHHHHHHHHHHHhhcCCC
Q 032259 87 ERGSQRAKEKAKRILGMLKGRED 109 (144)
Q Consensus 87 ~~Gt~raK~kA~~lL~~L~~~~~ 109 (144)
.+-++..|..+...|..+-..++
T Consensus 141 ~~Kn~~vR~~~~~~l~~~l~~~~ 163 (228)
T PF12348_consen 141 KSKNPQVREECAEWLAIILEKWG 163 (228)
T ss_dssp T-S-HHHHHHHHHHHHHHHTT--
T ss_pred hCCCHHHHHHHHHHHHHHHHHcc
Confidence 77799999998888888766664
No 86
>PF06012 DUF908: Domain of Unknown Function (DUF908); InterPro: IPR010309 This is a domain of unknown function found at the N terminus of a family of E3 ubiquitin protein ligases, including yeast TOM1, many of which appear to play a role in mRNA transcription and processing. This domain is found in association with and immediately N-terminal to another domain of unknown function: IPR010314 from INTERPRO.
Probab=68.54 E-value=36 Score=28.64 Aligned_cols=70 Identities=17% Similarity=0.167 Sum_probs=52.3
Q ss_pred HHHHHHHHHhcCChhhHHHHHhcC--cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCc
Q 032259 9 SRVLLILCNLAASNEGRSAILDAN--GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARA 78 (144)
Q Consensus 9 E~AlavL~~La~~~egr~aI~~~g--~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Ga 78 (144)
=.|+++|..+...++.-..+...+ .+.-|+++|+.+..++..-|--|..+|-.++........++...|+
T Consensus 240 llAi~~l~~~~~~~~~~~~~l~~dp~l~~eL~eLi~~~~~v~~~i~~~Al~~L~ai~~~~~~~~~V~~aLg~ 311 (329)
T PF06012_consen 240 LLAIANLVYIHPESQFSSKLLEQDPELVNELVELISPEEKVPMDIQTAALRALEAISHKRPRCSDVLRALGA 311 (329)
T ss_pred HHHHHHHHhhCCCHHHHHHHHhcChHHHHHHHHHhCCCCCCCHHHHHHHHHHHHHHHhccccHHHHHHHhcC
Confidence 346667777777888888887765 8999999998865466778999999999999865544555444443
No 87
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=67.66 E-value=45 Score=24.24 Aligned_cols=73 Identities=11% Similarity=0.025 Sum_probs=52.8
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHh--C-CHHHHHHHHHHHHHhhcCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEER--G-SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~--G-t~raK~kA~~lL~~L~~~~ 108 (144)
++..|-+.|..+ +|..+-.|+.+|-.+.. ....+...+....-+..|+.++.. . .+..|+|+..+|+......
T Consensus 38 a~raL~krl~~~---n~~vql~AL~lLd~~vkNcg~~f~~~i~s~~fl~~l~~l~~~~~~~~~~Vk~kil~ll~~W~~~f 114 (133)
T cd03561 38 AARAIRKKIKYG---NPHVQLLALTLLELLVKNCGKPFHLQVADKEFLLELVKIAKNSPKYDPKVREKALELILAWSESF 114 (133)
T ss_pred HHHHHHHHHcCC---CHHHHHHHHHHHHHHHHhCChHHHHHHhhHHHHHHHHHHhCCCCCCCHHHHHHHHHHHHHHHHHh
Confidence 567888888887 79999999888877766 334455555454444458888875 3 6788999888888877654
No 88
>PF14664 RICTOR_N: Rapamycin-insensitive companion of mTOR, N-term
Probab=66.48 E-value=42 Score=29.00 Aligned_cols=93 Identities=20% Similarity=0.247 Sum_probs=68.4
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEE 87 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~ 87 (144)
.|.|+..+..+...+.|...|- .+-+.+||.+...- .++=+.-|+.+|..++..++. ++...|++..|...+.
T Consensus 85 R~QALkliR~~l~~~~~~~~~~-~~vvralvaiae~~---~D~lr~~cletL~El~l~~P~---lv~~~gG~~~L~~~l~ 157 (371)
T PF14664_consen 85 REQALKLIRAFLEIKKGPKEIP-RGVVRALVAIAEHE---DDRLRRICLETLCELALLNPE---LVAECGGIRVLLRALI 157 (371)
T ss_pred HHHHHHHHHHHHHhcCCcccCC-HHHHHHHHHHHhCC---chHHHHHHHHHHHHHHhhCHH---HHHHcCCHHHHHHHHH
Confidence 4578888888877766655552 24567788877763 678899999999999997764 5567999999999998
Q ss_pred hCC-HHHHHHHHHHHHHhhcC
Q 032259 88 RGS-QRAKEKAKRILGMLKGR 107 (144)
Q Consensus 88 ~Gt-~raK~kA~~lL~~L~~~ 107 (144)
+|+ +.+.--...+|.+|...
T Consensus 158 d~~~~~~~~l~~~lL~lLd~p 178 (371)
T PF14664_consen 158 DGSFSISESLLDTLLYLLDSP 178 (371)
T ss_pred hccHhHHHHHHHHHHHHhCCc
Confidence 885 44445566666666554
No 89
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=64.58 E-value=44 Score=31.98 Aligned_cols=70 Identities=20% Similarity=0.155 Sum_probs=44.6
Q ss_pred CcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc--------------cChHHHHHHHH------cCcHHHHHHHHHhCCH
Q 032259 32 NGVSILVGMLKESGSDSEATRENCVAALFALGH--------------GNLRFKGLAKE------ARAAEVLREVEERGSQ 91 (144)
Q Consensus 32 g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~--------------~~~~~~~~a~~------~Gai~~L~~Ll~~Gt~ 91 (144)
..++.|++.|.+. .+.-|..|+.+|..+.. .++.+|..+.+ .+-...|+..+.+..+
T Consensus 652 ~~~~~L~~aL~D~---d~~VR~~Aa~aL~~l~~~~~~~~~L~~~L~~~d~~VR~~A~~aL~~~~~~~~~~l~~~L~D~d~ 728 (897)
T PRK13800 652 GFGPALVAALGDG---AAAVRRAAAEGLRELVEVLPPAPALRDHLGSPDPVVRAAALDVLRALRAGDAALFAAALGDPDH 728 (897)
T ss_pred hHHHHHHHHHcCC---CHHHHHHHHHHHHHHHhccCchHHHHHHhcCCCHHHHHHHHHHHHhhccCCHHHHHHHhcCCCH
Confidence 4678888888774 57778888888865521 22334443322 2344666777777777
Q ss_pred HHHHHHHHHHHHh
Q 032259 92 RAKEKAKRILGML 104 (144)
Q Consensus 92 raK~kA~~lL~~L 104 (144)
..|..|...|..+
T Consensus 729 ~VR~~Av~aL~~~ 741 (897)
T PRK13800 729 RVRIEAVRALVSV 741 (897)
T ss_pred HHHHHHHHHHhcc
Confidence 7777777777654
No 90
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=62.51 E-value=45 Score=31.93 Aligned_cols=73 Identities=25% Similarity=0.257 Sum_probs=43.1
Q ss_pred hcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-------------cChHHHHHHHH-c-------CcHHHHHHHHHh
Q 032259 30 DANGVSILVGMLKESGSDSEATRENCVAALFALGH-------------GNLRFKGLAKE-A-------RAAEVLREVEER 88 (144)
Q Consensus 30 ~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-------------~~~~~~~~a~~-~-------Gai~~L~~Ll~~ 88 (144)
....++.|++.|.+. .+.-|..|+.+|..+.. .++.+|..+.. . ...+.|..++.+
T Consensus 619 ~~~~~~~L~~~L~D~---d~~VR~~Av~~L~~~~~~~~~~~L~~aL~D~d~~VR~~Aa~aL~~l~~~~~~~~~L~~~L~~ 695 (897)
T PRK13800 619 DAPSVAELAPYLADP---DPGVRRTAVAVLTETTPPGFGPALVAALGDGAAAVRRAAAEGLRELVEVLPPAPALRDHLGS 695 (897)
T ss_pred cchhHHHHHHHhcCC---CHHHHHHHHHHHhhhcchhHHHHHHHHHcCCCHHHHHHHHHHHHHHHhccCchHHHHHHhcC
Confidence 345788999999875 78889999888866542 11222222211 1 122455556666
Q ss_pred CCHHHHHHHHHHHHHhh
Q 032259 89 GSQRAKEKAKRILGMLK 105 (144)
Q Consensus 89 Gt~raK~kA~~lL~~L~ 105 (144)
..++.|..|..+|..+.
T Consensus 696 ~d~~VR~~A~~aL~~~~ 712 (897)
T PRK13800 696 PDPVVRAAALDVLRALR 712 (897)
T ss_pred CCHHHHHHHHHHHHhhc
Confidence 66666666666665544
No 91
>PF14663 RasGEF_N_2: Rapamycin-insensitive companion of mTOR RasGEF_N domain
Probab=61.43 E-value=31 Score=24.76 Aligned_cols=52 Identities=19% Similarity=0.195 Sum_probs=36.5
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGS 90 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt 90 (144)
+|+.||+-|-+- +++-...|+.+|...|........++.. -|.|..|-..|.
T Consensus 9 ~i~lLv~QL~D~---~~~V~~~A~~iL~e~c~~~~~le~~v~~---~p~l~~L~~~g~ 60 (115)
T PF14663_consen 9 GIELLVTQLYDP---SPEVVAAALEILEEACEDKEYLEYLVSL---RPSLDHLGDIGS 60 (115)
T ss_pred HHHHHHHHhcCC---CHHHHHHHHHHHHHHHhchhhHHHHHHc---CcHHHHHHHcCH
Confidence 688999999774 7889999999999999876443333322 255555544443
No 92
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=61.01 E-value=71 Score=32.10 Aligned_cols=96 Identities=17% Similarity=0.106 Sum_probs=81.2
Q ss_pred HHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhC
Q 032259 11 VLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERG 89 (144)
Q Consensus 11 AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~G 89 (144)
|+=||+.+.. .+-|.++..+.+-|..=.+.|..+ .-+--+++++-.|..|=.+.+..|-...+.+++..|..++.+-
T Consensus 577 aAFVLAviv~nf~lGQ~acl~~~li~iCle~lnd~--~~pLLrQW~~icLG~LW~d~~~Arw~G~r~~AhekL~~~LsD~ 654 (1387)
T KOG1517|consen 577 AAFVLAVIVRNFKLGQKACLNGNLIGICLEHLNDD--PEPLLRQWLCICLGRLWEDYDEARWSGRRDNAHEKLILLLSDP 654 (1387)
T ss_pred HHHHHHHHHcccchhHHHhccccHHHHHHHHhcCC--ccHHHHHHHHHHHHHHhhhcchhhhccccccHHHHHHHHhcCc
Confidence 5557777774 788999999888888778888875 1366799999999999887777788889999999999999999
Q ss_pred CHHHHHHHHHHHHHhhcCC
Q 032259 90 SQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 90 t~raK~kA~~lL~~L~~~~ 108 (144)
.++.|.-|.-+|..|-...
T Consensus 655 vpEVRaAAVFALgtfl~~~ 673 (1387)
T KOG1517|consen 655 VPEVRAAAVFALGTFLSNG 673 (1387)
T ss_pred cHHHHHHHHHHHHHHhccc
Confidence 9999999999999987753
No 93
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=58.71 E-value=4.9 Score=25.23 Aligned_cols=19 Identities=32% Similarity=0.622 Sum_probs=15.2
Q ss_pred CCCcchhhhhcccCCCccc
Q 032259 110 DDEDVDWEGVLDSGGPTRS 128 (144)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~ 128 (144)
||+++||+..||.++-..+
T Consensus 14 Dd~kvdWd~wvSf~GrPlt 32 (49)
T PF06543_consen 14 DDPKVDWDKWVSFDGRPLT 32 (49)
T ss_pred CCcccchHHheeeCCeeCC
Confidence 7788999999998875443
No 94
>smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
Probab=55.56 E-value=78 Score=23.05 Aligned_cols=73 Identities=11% Similarity=0.005 Sum_probs=53.8
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHhC-C-HHHHHHHHHHHHHhhcCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEERG-S-QRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~G-t-~raK~kA~~lL~~L~~~~ 108 (144)
++..|-+.|..+ +|..+-.|+.+|-.+.. ....+...+.....+..|..++..- + +..|+|+..+++......
T Consensus 38 a~r~l~krl~~~---n~~v~l~AL~lLe~~vkNcg~~f~~ev~s~~fl~~L~~l~~~~~~~~~Vk~kil~li~~W~~~f 113 (133)
T smart00288 38 AVRLLKKRLNNK---NPHVALLALTLLDACVKNCGSKFHLEVASKEFLNELVKLIKPKYPLPLVKKRILELIQEWADAF 113 (133)
T ss_pred HHHHHHHHHcCC---CHHHHHHHHHHHHHHHHHCCHHHHHHHHhHHHHHHHHHHHcCCCCcHHHHHHHHHHHHHHHHHH
Confidence 566788888876 79999999888877766 3456666667777888999888653 3 447888887777766644
No 95
>PF08216 CTNNBL: Catenin-beta-like, Arm-motif containing nuclear; InterPro: IPR013180 This domain is found in eukaryotic proteins. A human nuclear protein with this domain (Q8WYA6 from SWISSPROT) is thought to have a role in apoptosis [].
Probab=53.88 E-value=30 Score=25.15 Aligned_cols=36 Identities=22% Similarity=0.297 Sum_probs=31.8
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhc
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKE 43 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~ 43 (144)
.+..+.-|..||++|+-=..+++.|+++.|+.+|.-
T Consensus 63 Ld~~Ik~l~~La~~P~LYp~lv~l~~v~sL~~LL~H 98 (108)
T PF08216_consen 63 LDEEIKKLSVLATAPELYPELVELGAVPSLLGLLSH 98 (108)
T ss_pred HHHHHHHHHHccCChhHHHHHHHcCCHHHHHHHHCC
Confidence 356788899999999999999999999999999954
No 96
>KOG2973 consensus Uncharacterized conserved protein [Function unknown]
Probab=53.37 E-value=78 Score=27.48 Aligned_cols=92 Identities=17% Similarity=0.095 Sum_probs=58.6
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHh-cCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 7 STSRVLLILCNLAASNEGRSAILD-ANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~-~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
+.-.|+..|..|..+ .++..... .-.|+.|.+++..- +. -+.|+.+|.+++... .++...... .+..|+..
T Consensus 19 v~~~AV~~l~~lt~~-~~~~~~~~~~~~lk~l~qL~~~~----~~-~~~a~~alVnlsq~~-~l~~~ll~~-~~k~l~~~ 90 (353)
T KOG2973|consen 19 VRKAAVEHLLGLTGR-GLQSLSKYSEALLKDLTQLLKDL----DP-AEPAATALVNLSQKE-ELRKKLLQD-LLKVLMDM 90 (353)
T ss_pred HHHHHHHHHhhcccc-chhhhccchhhhHHHHHHHccCc----cc-ccHHHHHHHHHHhhH-HHHHHHHHH-HHHHHHHH
Confidence 345566677777777 44444433 34688889988773 22 788999999999754 344554444 67777777
Q ss_pred HHhC-CHHHHHHHHHHHHHhhcC
Q 032259 86 EERG-SQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 86 l~~G-t~raK~kA~~lL~~L~~~ 107 (144)
+-+- ++.++ -...+|.+|.+.
T Consensus 91 ~~~p~~~lad-~~cmlL~NLs~~ 112 (353)
T KOG2973|consen 91 LTDPQSPLAD-LICMLLSNLSRD 112 (353)
T ss_pred hcCcccchHH-HHHHHHHHhccC
Confidence 7666 44444 344566666553
No 97
>PF04078 Rcd1: Cell differentiation family, Rcd1-like ; InterPro: IPR007216 Rcd1 (Required cell differentiation 1) -like proteins are found among a wide range of organisms []. Rcd1 was initially identified as an essential factor in nitrogen starvation-invoked differentiation in fission yeast. This results largely from a defect in nitrogen starvation-invoked induction of ste11+, a key transcriptional factor gene required for the onset of sexual development. It is one of the most conserved proteins in eukaryotes, and its mammalian homologue is expressed in a variety of differentiating tissues [, ]. The mammalian Rcd1 is a novel transcriptional cofactor and is critical for retinoic acid-induced differentiation of F9 mouse teratocarcinoma cells, at least in part, via forming complexes with retinoic acid receptor and activation transcription factor-2 (ATF-2) []. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes [, ].; PDB: 2FV2_B.
Probab=53.32 E-value=52 Score=27.48 Aligned_cols=54 Identities=15% Similarity=0.098 Sum_probs=41.7
Q ss_pred HHHHHHHHHHhcC--ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 8 TSRVLLILCNLAA--SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 8 ~E~AlavL~~La~--~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
.=..+.|++.|.+ .+|.-.-+.+...||.-++.|..| |.-.|--|.-++..+=.
T Consensus 113 RLtsLGVIgaLvK~d~~evi~fLl~tEiiplcLr~me~G---selSKtvAtfIlqKIL~ 168 (262)
T PF04078_consen 113 RLTSLGVIGALVKTDDPEVISFLLQTEIIPLCLRIMEFG---SELSKTVATFILQKILL 168 (262)
T ss_dssp HHHHHHHHHHHHTT--HHHHHHHHCTTHHHHHHHHHHHS----HHHHHHHHHHHHHHHH
T ss_pred hHhHHHHHHHHHcCCcHHHHHHHHhhchHHHHHHHHHhc---cHHHHHHHHHHHHHHHc
Confidence 3457889999997 445555557788999999999998 88999999998876544
No 98
>KOG2611 consensus Neurochondrin/leucine-rich protein (Neurochondrin) [Function unknown]
Probab=51.72 E-value=1.1e+02 Score=28.52 Aligned_cols=74 Identities=12% Similarity=0.257 Sum_probs=53.6
Q ss_pred HHHHHHHHHhcCChhh--HHHHHhcCcHHHHHHHhhcCCCCChH------HHHHHHHHHHHHhccChHHHHHHHHcCcHH
Q 032259 9 SRVLLILCNLAASNEG--RSAILDANGVSILVGMLKESGSDSEA------TRENCVAALFALGHGNLRFKGLAKEARAAE 80 (144)
Q Consensus 9 E~AlavL~~La~~~eg--r~aI~~~g~I~~LV~lL~~G~~~S~~------~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~ 80 (144)
.-++++|+-+|..||- ...|+ ..||.|.++|..|. .+- --|.+-..|..++......+.++ ..|.++
T Consensus 81 ~i~itvLacFC~~pElAsh~~~v--~~IP~llev~~~~~--d~d~e~~~~m~~d~Y~cL~~Va~~e~G~~~Li-a~G~~~ 155 (698)
T KOG2611|consen 81 QISITVLACFCRVPELASHEEMV--SRIPLLLEVMSKGI--DTDYEDNLIMLEDCYECLYLVATAEAGLMTLI-ASGGLR 155 (698)
T ss_pred HHHHHHHHHHhCChhhccCHHHH--HhhhHHHHHHHhcC--CCchhhhHHHHHHHHHHHHHHhcCCchhHHHH-hcCchH
Confidence 3467888888887764 23333 47999999998763 222 47889999999988655545544 789999
Q ss_pred HHHHHHH
Q 032259 81 VLREVEE 87 (144)
Q Consensus 81 ~L~~Ll~ 87 (144)
.+.++-.
T Consensus 156 ~~~Q~y~ 162 (698)
T KOG2611|consen 156 VIAQMYE 162 (698)
T ss_pred HHHHHHh
Confidence 9988753
No 99
>PTZ00429 beta-adaptin; Provisional
Probab=51.23 E-value=90 Score=29.70 Aligned_cols=71 Identities=13% Similarity=0.079 Sum_probs=50.2
Q ss_pred hcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 30 DANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 30 ~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+.+-++.|.++|.+. ++...-||+.+|..++..++.. +-.....+..|+..+...++..+-. +|++|....
T Consensus 177 ~~~~~~~L~~LL~D~---dp~Vv~nAl~aL~eI~~~~~~~--l~l~~~~~~~Ll~~L~e~~EW~Qi~---IL~lL~~y~ 247 (746)
T PTZ00429 177 QQDFKKDLVELLNDN---NPVVASNAAAIVCEVNDYGSEK--IESSNEWVNRLVYHLPECNEWGQLY---ILELLAAQR 247 (746)
T ss_pred ccchHHHHHHHhcCC---CccHHHHHHHHHHHHHHhCchh--hHHHHHHHHHHHHHhhcCChHHHHH---HHHHHHhcC
Confidence 346678889988875 7999999999999998754321 2234555677777777778888875 555555544
No 100
>PF06685 DUF1186: Protein of unknown function (DUF1186); InterPro: IPR010602 This family consists of several hypothetical bacterial proteins of around 250 residues in length and is found in several Chlamydia and Anabaena species. The function of this family is unknown.
Probab=50.19 E-value=99 Score=25.49 Aligned_cols=86 Identities=16% Similarity=0.278 Sum_probs=51.2
Q ss_pred hcCcHHHHHHHhhcCCCCC------hHHHHHHHHHHHHHhccCh-HHHHHHHHc--------CcHHHHHHHHHhCCHHHH
Q 032259 30 DANGVSILVGMLKESGSDS------EATRENCVAALFALGHGNL-RFKGLAKEA--------RAAEVLREVEERGSQRAK 94 (144)
Q Consensus 30 ~~g~I~~LV~lL~~G~~~S------~~~kE~Ava~L~~Lc~~~~-~~~~~a~~~--------Gai~~L~~Ll~~Gt~raK 94 (144)
+..+.|+|+++++... .. .-..|.-..+|.++|.++. .....+.++ +++.+|..+...|...-.
T Consensus 71 e~~A~~~li~l~~~~~-~~~~~l~GD~~tE~l~~ilasv~~G~~~~L~~li~~~~~~~yvR~aa~~aL~~l~~~~~~~Re 149 (249)
T PF06685_consen 71 EERALPPLIRLFSQDD-DFLEDLFGDFITEDLPRILASVGDGDIEPLKELIEDPDADEYVRMAAISALAFLVHEGPISRE 149 (249)
T ss_pred hhhhHHHHHHHHcCCc-chHHHHHcchhHhHHHHHHHHHhCCCHHHHHHHHhCCcHHHHHHHHHHHHHHHHHHcCCCCHH
Confidence 3568999999997421 01 2457888888888888663 455555444 347788888877754434
Q ss_pred HHHHHHHHHhhcCCCCCCcchh
Q 032259 95 EKAKRILGMLKGREDDDEDVDW 116 (144)
Q Consensus 95 ~kA~~lL~~L~~~~~~~~~~~~ 116 (144)
+-..-.-.+|...-+.+++.-|
T Consensus 150 ~vi~~f~~ll~~~l~~~~~~~~ 171 (249)
T PF06685_consen 150 EVIQYFRELLNYFLERNPSFLW 171 (249)
T ss_pred HHHHHHHHHHHHHhccCchHHH
Confidence 4444444444443333334333
No 101
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=48.54 E-value=1e+02 Score=22.70 Aligned_cols=69 Identities=19% Similarity=0.280 Sum_probs=48.7
Q ss_pred HHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHH-HHHcCcHHHHHHHHH-----hC---CHHHHHHHHHHHHHhh
Q 032259 36 ILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGL-AKEARAAEVLREVEE-----RG---SQRAKEKAKRILGMLK 105 (144)
Q Consensus 36 ~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~-a~~~Gai~~L~~Ll~-----~G---t~raK~kA~~lL~~L~ 105 (144)
.|.+-|... ++-.|-.|.-+|-.||. +++.++.. ....-+|..+.+... .| ....|..|..++.++-
T Consensus 42 ~L~kRL~~~---~~hVK~K~Lrilk~l~~~G~~~f~~~~~~~~~~Ik~~~~f~g~~Dp~~Gd~~~~~VR~~A~El~~~if 118 (122)
T cd03572 42 YLLKRLKRS---SPHVKLKVLKIIKHLCEKGNSDFKRELQRNSAQIRECANYKGPPDPLKGDSLNEKVREEAQELIKAIF 118 (122)
T ss_pred HHHHHhcCC---CCcchHHHHHHHHHHHhhCCHHHHHHHHHhHHHHHHHHHcCCCCCcccCcchhHHHHHHHHHHHHHHh
Confidence 556666553 68889999999999999 66554444 455666777776654 44 3566888999988876
Q ss_pred cC
Q 032259 106 GR 107 (144)
Q Consensus 106 ~~ 107 (144)
..
T Consensus 119 ~~ 120 (122)
T cd03572 119 SY 120 (122)
T ss_pred cc
Confidence 53
No 102
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=46.71 E-value=66 Score=23.67 Aligned_cols=45 Identities=13% Similarity=0.217 Sum_probs=33.2
Q ss_pred HHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHH
Q 032259 15 LCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFAL 62 (144)
Q Consensus 15 L~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~L 62 (144)
|..++. +|+||..+-+.|+=..++++|... .++-|.+|.-++..|
T Consensus 68 ig~~vr~~p~gr~ii~~lg~K~~vM~Lm~h~---d~eVr~eAL~avQkl 113 (119)
T PF11698_consen 68 IGEFVRHYPNGRNIIEKLGAKERVMELMNHE---DPEVRYEALLAVQKL 113 (119)
T ss_dssp HHHHHHH-GGGHHHHHHHSHHHHHHHHTS-S---SHHHHHHHHHHHHHH
T ss_pred hHHHHHHChhHHHHHHhcChHHHHHHHhcCC---CHHHHHHHHHHHHHH
Confidence 444554 789998888888888889999875 688888888777655
No 103
>PF14726 RTTN_N: Rotatin, an armadillo repeat protein, centriole functioning
Probab=46.24 E-value=84 Score=22.23 Aligned_cols=36 Identities=22% Similarity=0.214 Sum_probs=31.2
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhh
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLK 42 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~ 42 (144)
..++++++|..|..+|.+...+.+-|++..|-++-.
T Consensus 46 ~~~~VL~Ll~~L~~~~~a~~~l~~iG~~~fL~klr~ 81 (98)
T PF14726_consen 46 MKEEVLALLLRLLKSPYAAQILRDIGAVRFLSKLRP 81 (98)
T ss_pred cHHHHHHHHHHHHhCcHHHHHHHHccHHHHHHHHHh
Confidence 568999999999999999999999999988655544
No 104
>PF14668 RICTOR_V: Rapamycin-insensitive companion of mTOR, domain 5
Probab=45.16 E-value=92 Score=20.83 Aligned_cols=56 Identities=16% Similarity=0.171 Sum_probs=46.1
Q ss_pred HHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh
Q 032259 10 RVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNL 67 (144)
Q Consensus 10 ~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~ 67 (144)
.|+=.+.++++.+.|-.-+.+...|+.++++.... ....-|--|..+|-.+++..+
T Consensus 6 aaLWaighIgss~~G~~lL~~~~iv~~iv~~a~~s--~v~siRGT~fy~Lglis~T~~ 61 (73)
T PF14668_consen 6 AALWAIGHIGSSPLGIQLLDESDIVEDIVKIAENS--PVLSIRGTCFYVLGLISSTEE 61 (73)
T ss_pred HHHHHHHhHhcChHHHHHHhhcCHHHHHHHHHHhC--CccchHHHHHHHHHHHhCCHH
Confidence 46778999999999999999889999999999874 245567788888888887543
No 105
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=43.49 E-value=59 Score=28.05 Aligned_cols=91 Identities=11% Similarity=0.078 Sum_probs=58.5
Q ss_pred hHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHH-hccChHHHHHHHHcCcHHHHHH
Q 032259 7 STSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFAL-GHGNLRFKGLAKEARAAEVLRE 84 (144)
Q Consensus 7 ~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~L-c~~~~~~~~~a~~~Gai~~L~~ 84 (144)
+.-+|+..+..+.. +|+- +-.. .++.|.++|.+. ++..+.+|+.+|..+ +...... -.-...++.|..
T Consensus 130 VRk~A~~~l~~i~~~~p~~---~~~~-~~~~l~~lL~d~---~~~V~~~a~~~l~~i~~~~~~~~---~~~~~~~~~L~~ 199 (526)
T PF01602_consen 130 VRKKAALALLKIYRKDPDL---VEDE-LIPKLKQLLSDK---DPSVVSAALSLLSEIKCNDDSYK---SLIPKLIRILCQ 199 (526)
T ss_dssp HHHHHHHHHHHHHHHCHCC---HHGG-HHHHHHHHTTHS---SHHHHHHHHHHHHHHHCTHHHHT---THHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhccCHHH---HHHH-HHHHHhhhccCC---cchhHHHHHHHHHHHccCcchhh---hhHHHHHHHhhh
Confidence 44556666666653 3432 1112 478899999775 688999999999999 3222111 223556777777
Q ss_pred HHHhCCHHHHHHHHHHHHHhhcC
Q 032259 85 VEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 85 Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
++...+|..+.+...+|..+...
T Consensus 200 ~l~~~~~~~q~~il~~l~~~~~~ 222 (526)
T PF01602_consen 200 LLSDPDPWLQIKILRLLRRYAPM 222 (526)
T ss_dssp HHTCCSHHHHHHHHHHHTTSTSS
T ss_pred cccccchHHHHHHHHHHHhcccC
Confidence 77788999998866666555443
No 106
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=42.99 E-value=75 Score=25.95 Aligned_cols=62 Identities=27% Similarity=0.379 Sum_probs=43.1
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHH-hCCHHHHHHHHHHHHHhhcCC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEE-RGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~-~Gt~raK~kA~~lL~~L~~~~ 108 (144)
++|.|.++|.+. ++.-+..|+.+|..+- .+-+++.|+.++. +.....|..|...|..+....
T Consensus 75 av~~l~~~l~d~---~~~vr~~a~~aLg~~~-----------~~~a~~~li~~l~~d~~~~vR~~aa~aL~~~~~~~ 137 (335)
T COG1413 75 AVPLLRELLSDE---DPRVRDAAADALGELG-----------DPEAVPPLVELLENDENEGVRAAAARALGKLGDER 137 (335)
T ss_pred HHHHHHHHhcCC---CHHHHHHHHHHHHccC-----------ChhHHHHHHHHHHcCCcHhHHHHHHHHHHhcCchh
Confidence 567777777664 5666666666333221 2346888999888 589999999999988877653
No 107
>KOG2973 consensus Uncharacterized conserved protein [Function unknown]
Probab=42.59 E-value=90 Score=27.13 Aligned_cols=62 Identities=21% Similarity=0.246 Sum_probs=46.4
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLA 73 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a 73 (144)
.+.|+.+|.||+..++-|..|.+. -+..+++.+-+- .++-+ +-.+..|.+|++.+..+..+.
T Consensus 59 ~~~a~~alVnlsq~~~l~~~ll~~-~~k~l~~~~~~p--~~~la-d~~cmlL~NLs~~~~~~~~ll 120 (353)
T KOG2973|consen 59 AEPAATALVNLSQKEELRKKLLQD-LLKVLMDMLTDP--QSPLA-DLICMLLSNLSRDDDEVAALL 120 (353)
T ss_pred ccHHHHHHHHHHhhHHHHHHHHHH-HHHHHHHHhcCc--ccchH-HHHHHHHHHhccCchHHHHHH
Confidence 678999999999999999999877 777888888653 14444 445678889998665544443
No 108
>PF05918 API5: Apoptosis inhibitory protein 5 (API5); InterPro: IPR008383 This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterised by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in Homo sapiens chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors [,].; PDB: 3U0R_A 3V6A_A.
Probab=42.29 E-value=65 Score=29.70 Aligned_cols=16 Identities=19% Similarity=0.254 Sum_probs=8.6
Q ss_pred hHHHHHHHHHHHHHhc
Q 032259 49 EATRENCVAALFALGH 64 (144)
Q Consensus 49 ~~~kE~Ava~L~~Lc~ 64 (144)
|.-+|.|.-+++-||.
T Consensus 54 P~l~~~Ai~a~~DLcE 69 (556)
T PF05918_consen 54 PDLQEEAINAQLDLCE 69 (556)
T ss_dssp GGGHHHHHHHHHHHHT
T ss_pred hhhHHHHHHHHHHHHh
Confidence 4555555555555555
No 109
>COG5096 Vesicle coat complex, various subunits [Intracellular trafficking and secretion]
Probab=42.09 E-value=1.2e+02 Score=29.13 Aligned_cols=69 Identities=14% Similarity=0.122 Sum_probs=57.0
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
.++++.+.+..+ ++..|.+|+-++..+-+-+. ....+.|.+..|..|+.+-.|..++.|-..|..+...
T Consensus 128 ~~~~ik~~l~d~---~ayVRk~Aalav~kly~ld~---~l~~~~g~~~~l~~l~~D~dP~Vi~nAl~sl~~i~~e 196 (757)
T COG5096 128 IIDPIKKLLTDP---HAYVRKTAALAVAKLYRLDK---DLYHELGLIDILKELVADSDPIVIANALASLAEIDPE 196 (757)
T ss_pred HHHHHHHHccCC---cHHHHHHHHHHHHHHHhcCH---hhhhcccHHHHHHHHhhCCCchHHHHHHHHHHHhchh
Confidence 567888888876 79999999999999987443 3445789999999999999999999998888876543
No 110
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=41.29 E-value=1.1e+02 Score=25.02 Aligned_cols=59 Identities=25% Similarity=0.385 Sum_probs=39.4
Q ss_pred CcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHH
Q 032259 32 NGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILG 102 (144)
Q Consensus 32 g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~ 102 (144)
..++.+.+.++.. ....+..|+.+|..+...+ ..+.+.+...+.+.....+.++...|-
T Consensus 180 ~~~~~l~~~l~~~---~~~vr~~Aa~aL~~~~~~~---------~~~~~~l~~~~~~~~~~vr~~~~~~l~ 238 (335)
T COG1413 180 EAIPLLIELLEDE---DADVRRAAASALGQLGSEN---------VEAADLLVKALSDESLEVRKAALLALG 238 (335)
T ss_pred hhhHHHHHHHhCc---hHHHHHHHHHHHHHhhcch---------hhHHHHHHHHhcCCCHHHHHHHHHHhc
Confidence 4788889998875 5678888888888887653 334455555555666666555544443
No 111
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=40.74 E-value=1.3e+02 Score=22.71 Aligned_cols=52 Identities=17% Similarity=0.071 Sum_probs=27.4
Q ss_pred hHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhh
Q 032259 49 EATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLK 105 (144)
Q Consensus 49 ~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~ 105 (144)
|.-|-|++.+|.-||..-+... .-.+|.|...+++..+..|+.|...|..|-
T Consensus 2 ~~vR~n~i~~l~DL~~r~~~~v-----e~~~~~l~~~L~D~~~~VR~~al~~Ls~Li 53 (178)
T PF12717_consen 2 PSVRNNAIIALGDLCIRYPNLV-----EPYLPNLYKCLRDEDPLVRKTALLVLSHLI 53 (178)
T ss_pred HHHHHHHHHHHHHHHHhCcHHH-----HhHHHHHHHHHCCCCHHHHHHHHHHHHHHH
Confidence 3445566666666665322211 223555666666666666666655555543
No 112
>PF13764 E3_UbLigase_R4: E3 ubiquitin-protein ligase UBR4
Probab=39.93 E-value=1.2e+02 Score=29.20 Aligned_cols=78 Identities=18% Similarity=0.166 Sum_probs=47.9
Q ss_pred HHhcCcHHHHHHHhhcCCCCChHHHHHHH--HHHHHHhccChHHHHHHHHcCcHHHHHHHHH----hCC-HHHHHHHHHH
Q 032259 28 ILDANGVSILVGMLKESGSDSEATRENCV--AALFALGHGNLRFKGLAKEARAAEVLREVEE----RGS-QRAKEKAKRI 100 (144)
Q Consensus 28 I~~~g~I~~LV~lL~~G~~~S~~~kE~Av--a~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~----~Gt-~raK~kA~~l 100 (144)
|.+.|++..|+++|..=. ...+.++... --|+..|..-...|+...+.|+++.|+..+. .+. ...-.-+..+
T Consensus 113 ~~~~gGL~~ll~~l~~~~-~~~~~~~ll~~llkLL~~c~Kv~~NR~~Ll~~~al~~LL~~L~~~l~~~~~~~~~~i~E~L 191 (802)
T PF13764_consen 113 LAECGGLEVLLSRLDSIR-DFSRGRELLQVLLKLLRYCCKVKVNRRALLELNALNRLLSVLNRALQANQNSSQAEIAEQL 191 (802)
T ss_pred hhcCCCHHHHHHHHHhhc-cccCcHHHHHHHHHHHHHHHhhHHHHHHHHHcCCHHHHHHHHHHHHhCccccccchHHHHH
Confidence 345789999999996521 1223444444 3455556644555666667999999999885 333 2223555556
Q ss_pred HHHhhc
Q 032259 101 LGMLKG 106 (144)
Q Consensus 101 L~~L~~ 106 (144)
|.++..
T Consensus 192 L~IiE~ 197 (802)
T PF13764_consen 192 LEIIES 197 (802)
T ss_pred HHHHHH
Confidence 665544
No 113
>KOG2999 consensus Regulator of Rac1, required for phagocytosis and cell migration [Signal transduction mechanisms]
Probab=38.15 E-value=3.5e+02 Score=25.54 Aligned_cols=101 Identities=20% Similarity=0.191 Sum_probs=74.3
Q ss_pred HHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCC------------------------------------------
Q 032259 11 VLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDS------------------------------------------ 48 (144)
Q Consensus 11 AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S------------------------------------------ 48 (144)
|+.-|..|+..+.-..++....++..|..++++|..+.
T Consensus 103 a~k~l~sls~d~~fa~efi~~~gl~~L~~liedg~~~~~~~~L~~~L~af~elmehgvvsW~~~~~~fV~~~a~~V~~~~ 182 (713)
T KOG2999|consen 103 ALKELDSLSLDPTFAEEFIRCSGLELLFSLIEDGRVCMSSELLSTSLRAFSELMEHGVVSWESVSNDFVVSMASYVNAKR 182 (713)
T ss_pred HHHHHhhccccHHHHHHHHhcchHHHHHHHHHcCccchHHHHHHHHHHHHHHHHhhceeeeeecccHHHHHHHHHHhhhh
Confidence 66677788888888888888788888888888774300
Q ss_pred --hHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCCCCC
Q 032259 49 --EATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGREDDD 111 (144)
Q Consensus 49 --~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~~~~ 111 (144)
...-+.|..+|-++..++...+..+.++--+..|+..++.+..+-..+|-.+|..|=.-.++|
T Consensus 183 ~~a~~~~~AL~~LE~~vl~s~~~~~~v~eev~i~~li~hlq~~n~~i~~~aial~nal~~~a~~~ 247 (713)
T KOG2999|consen 183 EDANTLLAALQMLESLVLGSDTLRQLVAEEVPIETLIRHLQVSNQRIQTCAIALLNALFRKAPDD 247 (713)
T ss_pred hcccchHHHHHHHHHHHhCChHHHHHHHhcCcHHHHHHHHHhcchHHHHHHHHHHHHHHhhCChH
Confidence 012345666666776677666777778888999999999998888888999888766655454
No 114
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=37.70 E-value=1.3e+02 Score=22.71 Aligned_cols=90 Identities=11% Similarity=0.108 Sum_probs=65.9
Q ss_pred chHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCc-HHHHH
Q 032259 6 ESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARA-AEVLR 83 (144)
Q Consensus 6 ~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Ga-i~~L~ 83 (144)
.+.-.++.+|+-|+. +|.- ++ .-+|.+...|++. ++.-|++|+.+|-.|-..+- ++--|- +..++
T Consensus 3 ~vR~n~i~~l~DL~~r~~~~----ve-~~~~~l~~~L~D~---~~~VR~~al~~Ls~Li~~d~-----ik~k~~l~~~~l 69 (178)
T PF12717_consen 3 SVRNNAIIALGDLCIRYPNL----VE-PYLPNLYKCLRDE---DPLVRKTALLVLSHLILEDM-----IKVKGQLFSRIL 69 (178)
T ss_pred HHHHHHHHHHHHHHHhCcHH----HH-hHHHHHHHHHCCC---CHHHHHHHHHHHHHHHHcCc-----eeehhhhhHHHH
Confidence 345677788888875 4322 22 3578899999986 89999999999999976432 111233 36677
Q ss_pred HHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 84 EVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 84 ~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
.++.+-.+.-|..|...+.-+....
T Consensus 70 ~~l~D~~~~Ir~~A~~~~~e~~~~~ 94 (178)
T PF12717_consen 70 KLLVDENPEIRSLARSFFSELLKKR 94 (178)
T ss_pred HHHcCCCHHHHHHHHHHHHHHHHhc
Confidence 7778889999999999999888874
No 115
>PRK00927 tryptophanyl-tRNA synthetase; Reviewed
Probab=37.03 E-value=1.1e+02 Score=25.92 Aligned_cols=70 Identities=16% Similarity=0.242 Sum_probs=45.2
Q ss_pred HHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 34 VSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 34 I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
+..+.+-...|...+...|+..+..+..+-. -.+.+..+..+ ..-|-++++.|+.+||+.|...|+..++
T Consensus 256 ~eel~~~~~~g~~~~~~lK~~la~~i~~~l~pire~~~~~~~~---~~~~~~il~~G~~~a~~~a~~~l~~v~~ 326 (333)
T PRK00927 256 IEELEAEYEAGGKGYGDFKKDLAEAVVEFLAPIRERYEELLAD---PAYLDEILAEGAEKARAVASKTLKEVRE 326 (333)
T ss_pred HHHHHHHHhcCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHcC---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444421245678888787777765 22233333322 3357888999999999999999988765
No 116
>PF12936 Kri1_C: KRI1-like family C-terminal; InterPro: IPR024626 The yeast member of the Kri1-like family (Kri1p) is found to be required for 40S ribosome biogenesis in the nucleolus []. This entry represents the C-terminal domain of this protein family.
Probab=36.80 E-value=16 Score=25.84 Aligned_cols=29 Identities=28% Similarity=0.564 Sum_probs=22.2
Q ss_pred CCCcchhhhhcccCCCccccccc-cCCCCC
Q 032259 110 DDEDVDWEGVLDSGGPTRSRYRV-GMTGYG 138 (144)
Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 138 (144)
+.-..||+|+|.+++|+|=+|+- -...+|
T Consensus 11 Eyy~ld~Ed~igd~~p~RFkYr~V~p~~fG 40 (93)
T PF12936_consen 11 EYYKLDYEDIIGDDMPTRFKYREVPPNSFG 40 (93)
T ss_pred HHhcCcHHHHcCCCCCCceeeeecCcccCC
Confidence 34567999999888999999986 444444
No 117
>PTZ00429 beta-adaptin; Provisional
Probab=36.58 E-value=1.2e+02 Score=28.82 Aligned_cols=67 Identities=15% Similarity=0.103 Sum_probs=50.3
Q ss_pred HHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 36 ILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 36 ~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
++.+.|.+ .+|--|..|+-++..+...++. .+...|.++.|.+|+.+..+.....|..+|.-+....
T Consensus 144 ~lkk~L~D---~~pYVRKtAalai~Kly~~~pe---lv~~~~~~~~L~~LL~D~dp~Vv~nAl~aL~eI~~~~ 210 (746)
T PTZ00429 144 PLRRAVAD---PDPYVRKTAAMGLGKLFHDDMQ---LFYQQDFKKDLVELLNDNNPVVASNAAAIVCEVNDYG 210 (746)
T ss_pred HHHHHhcC---CCHHHHHHHHHHHHHHHhhCcc---cccccchHHHHHHHhcCCCccHHHHHHHHHHHHHHhC
Confidence 34444544 3788999999999999875543 3345678888999888888999999988888886543
No 118
>PHA02102 hypothetical protein
Probab=35.90 E-value=45 Score=22.29 Aligned_cols=16 Identities=38% Similarity=0.586 Sum_probs=12.6
Q ss_pred CCCCcchhhhhcccCC
Q 032259 109 DDDEDVDWEGVLDSGG 124 (144)
Q Consensus 109 ~~~~~~~~~~~~~~~~ 124 (144)
|+|+.++|+|-+++.-
T Consensus 35 n~~nev~f~DWLsSSC 50 (72)
T PHA02102 35 NDDNEVRFEDWLSSSC 50 (72)
T ss_pred CCCCcEeHHHhhcccc
Confidence 7889999999777643
No 119
>KOG1242 consensus Protein containing adaptin N-terminal region [Translation, ribosomal structure and biogenesis]
Probab=35.45 E-value=1.2e+02 Score=28.08 Aligned_cols=60 Identities=13% Similarity=-0.052 Sum_probs=49.7
Q ss_pred ChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 48 SEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 48 S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
..++|.+++..|-.|....+. +-....+-+||.|.+.+.+=-+.-|+-+-..|..+...-
T Consensus 267 kWrtK~aslellg~m~~~ap~-qLs~~lp~iiP~lsevl~DT~~evr~a~~~~l~~~~svi 326 (569)
T KOG1242|consen 267 KWRTKMASLELLGAMADCAPK-QLSLCLPDLIPVLSEVLWDTKPEVRKAGIETLLKFGSVI 326 (569)
T ss_pred hhhhHHHHHHHHHHHHHhchH-HHHHHHhHhhHHHHHHHccCCHHHHHHHHHHHHHHHHhh
Confidence 568999999999988876554 233346789999999999999999999999988888765
No 120
>KOG3431 consensus Apoptosis-related protein/predicted DNA-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=35.32 E-value=22 Score=26.64 Aligned_cols=28 Identities=14% Similarity=0.241 Sum_probs=14.7
Q ss_pred HHHHHhccChHHHHHHHHcCcHHHHHHHH
Q 032259 58 ALFALGHGNLRFKGLAKEARAAEVLREVE 86 (144)
Q Consensus 58 ~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll 86 (144)
.|..+.... .+...+.++++|..|-.+.
T Consensus 75 ~lirma~~g-Qvs~Kise~~lisiLe~is 102 (129)
T KOG3431|consen 75 YLIRMAQTG-QVSHKISEAELISILEKIS 102 (129)
T ss_pred HHHHHHHhC-CccccccHHHHHHHHHHHH
Confidence 344555422 2344455666666666554
No 121
>PF08167 RIX1: rRNA processing/ribosome biogenesis
Probab=35.22 E-value=1.9e+02 Score=21.68 Aligned_cols=82 Identities=17% Similarity=0.188 Sum_probs=51.6
Q ss_pred Chhh-HHHHHhcC--cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc---cChH-HHHHHH--HcCcHHHHHHHHHhCCH
Q 032259 21 SNEG-RSAILDAN--GVSILVGMLKESGSDSEATRENCVAALFALGH---GNLR-FKGLAK--EARAAEVLREVEERGSQ 91 (144)
Q Consensus 21 ~~eg-r~aI~~~g--~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~---~~~~-~~~~a~--~~Gai~~L~~Ll~~Gt~ 91 (144)
++++ .+.+.+.+ =+..|+.+|..- .++..++.|+.+|-.|.. +-+. .|+++. -++.++.|+.+++. +
T Consensus 53 ~~~~~~e~l~~~~~~W~~~Ll~~L~~~--~~~~~~~~ai~~L~~l~~~~~~~p~l~Rei~tp~l~~~i~~ll~l~~~--~ 128 (165)
T PF08167_consen 53 VEQCSWEILLSHGSQWLRALLSILEKP--DPPSVLEAAIITLTRLFDLIRGKPTLTREIATPNLPKFIQSLLQLLQD--S 128 (165)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHcCC--CCHHHHHHHHHHHHHHHHHhcCCCchHHHHhhccHHHHHHHHHHHHhc--c
Confidence 4455 45554554 378899999874 367789999999988876 3333 334332 24456777777765 4
Q ss_pred HHHHHHHHHHHHhhc
Q 032259 92 RAKEKAKRILGMLKG 106 (144)
Q Consensus 92 raK~kA~~lL~~L~~ 106 (144)
.....+-.+|..+-.
T Consensus 129 ~~~~~~l~~L~~ll~ 143 (165)
T PF08167_consen 129 SCPETALDALATLLP 143 (165)
T ss_pred ccHHHHHHHHHHHHH
Confidence 555566666655544
No 122
>KOG1242 consensus Protein containing adaptin N-terminal region [Translation, ribosomal structure and biogenesis]
Probab=35.10 E-value=1.5e+02 Score=27.60 Aligned_cols=102 Identities=16% Similarity=0.228 Sum_probs=73.6
Q ss_pred hHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc--cChHHHHHH-----------
Q 032259 7 STSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH--GNLRFKGLA----------- 73 (144)
Q Consensus 7 ~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~--~~~~~~~~a----------- 73 (144)
....++.+|..++.|..-.-....+..||.|.+.|-+- .++-|+.+..+|..+|+ .++.....+
T Consensus 270 tK~aslellg~m~~~ap~qLs~~lp~iiP~lsevl~DT---~~evr~a~~~~l~~~~svidN~dI~~~ip~Lld~l~dp~ 346 (569)
T KOG1242|consen 270 TKMASLELLGAMADCAPKQLSLCLPDLIPVLSEVLWDT---KPEVRKAGIETLLKFGSVIDNPDIQKIIPTLLDALADPS 346 (569)
T ss_pred hHHHHHHHHHHHHHhchHHHHHHHhHhhHHHHHHHccC---CHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCcc
Confidence 34668899999999888888877788999999999875 68999999999999997 333222111
Q ss_pred -------HHcC-----------cHHHHHHHHHhC----CHHHHHHHHHHHHHhhcCCCCC
Q 032259 74 -------KEAR-----------AAEVLREVEERG----SQRAKEKAKRILGMLKGREDDD 111 (144)
Q Consensus 74 -------~~~G-----------ai~~L~~Ll~~G----t~raK~kA~~lL~~L~~~~~~~ 111 (144)
...| .+..++.+++.| +...||+++.+..++..-.+|.
T Consensus 347 ~~~~e~~~~L~~ttFV~~V~~psLalmvpiL~R~l~eRst~~kr~t~~IidNm~~LveDp 406 (569)
T KOG1242|consen 347 CYTPECLDSLGATTFVAEVDAPSLALMVPILKRGLAERSTSIKRKTAIIIDNMCKLVEDP 406 (569)
T ss_pred cchHHHHHhhcceeeeeeecchhHHHHHHHHHHHHhhccchhhhhHHHHHHHHHHhhcCH
Confidence 1111 134556666655 4566899999999988877443
No 123
>PF10954 DUF2755: Protein of unknown function (DUF2755); InterPro: IPR020513 This entry contains membrane proteins with no known function.; GO: 0016021 integral to membrane
Probab=34.21 E-value=64 Score=22.98 Aligned_cols=47 Identities=19% Similarity=0.258 Sum_probs=33.3
Q ss_pred ChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHH
Q 032259 48 SEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAK 94 (144)
Q Consensus 48 S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK 94 (144)
++.+.-+|+-+|+..=.+..-.--++-.+|+.+-|.++..+|.||.+
T Consensus 20 t~gNiAYAlFVLfcfWaGaQlLn~LvHAPGV~EhLmQ~Qds~RPrve 66 (100)
T PF10954_consen 20 TPGNIAYALFVLFCFWAGAQLLNMLVHAPGVYEHLMQVQDSGRPRVE 66 (100)
T ss_pred CcchhHHHHHHHHHHHhhHHHHHHHHhCcHHHHHHHHHHhcCCCceE
Confidence 34556666666655544655555667789999999999999988753
No 124
>COG5209 RCD1 Uncharacterized protein involved in cell differentiation/sexual development [General function prediction only]
Probab=33.30 E-value=3e+02 Score=23.24 Aligned_cols=87 Identities=11% Similarity=0.082 Sum_probs=55.6
Q ss_pred hHHHHHHHHHHhcCC--hhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccCh---HHHHHHHHcCcHHH
Q 032259 7 STSRVLLILCNLAAS--NEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNL---RFKGLAKEARAAEV 81 (144)
Q Consensus 7 ~~E~AlavL~~La~~--~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~---~~~~~a~~~Gai~~ 81 (144)
+.=.++.+++.|..+ ++-..-+.....||...++|..| |.-.|--|+-++..+-.++. +.++.-.+.-+|..
T Consensus 162 LRltsLGVIgaLvkNdsq~vi~fLltTeivPLcLrIme~g---SElSktvaifI~qkil~dDvGLqYiCqT~eRFyAv~~ 238 (315)
T COG5209 162 LRLTSLGVIGALVKNDSQYVIKFLLTTEIVPLCLRIMELG---SELSKTVAIFIFQKILGDDVGLQYICQTFERFYAVNL 238 (315)
T ss_pred eeehHHHHHHHHHhCCCHHHHHHHHhhhHHHHHHHHHHhh---hHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHH
Confidence 344688999999974 45555566778999999999998 78888888888776544332 33333333333322
Q ss_pred -----HHHHHHhCCHHHHHH
Q 032259 82 -----LREVEERGSQRAKEK 96 (144)
Q Consensus 82 -----L~~Ll~~Gt~raK~k 96 (144)
..+++..|+.|--+.
T Consensus 239 vln~mv~qlVs~~~~RLlKh 258 (315)
T COG5209 239 VLNSMVSQLVSLGSTRLLKH 258 (315)
T ss_pred HHHHHHHHHhhccchhHHHH
Confidence 223445666554443
No 125
>PF06025 DUF913: Domain of Unknown Function (DUF913); InterPro: IPR010314 This is a domain of unknown function found towards the N terminus of a family of E3 ubiquitin protein ligases, including yeast TOM1, many of which appear to play a role in mRNA transcription and processing. This domain is found in association with and immediately C-terminal to another domain of unknown function: IPR010309 from INTERPRO.
Probab=32.95 E-value=3.3e+02 Score=23.62 Aligned_cols=81 Identities=14% Similarity=0.093 Sum_probs=55.5
Q ss_pred chHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhh-cCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 6 ESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLK-ESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 6 ~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~-~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
.+.-.|+.++..+-- .|-.-..|.++|-++.+++.+. .|...|.+.--.--.+|-.||.++...... .+.++++.+.
T Consensus 124 ~v~s~a~~ivs~~I~nePT~~~~l~e~Gl~~~~L~~i~~~~i~~s~e~l~~lP~~l~AicLN~~Gl~~~-~~~~~l~~~f 202 (379)
T PF06025_consen 124 SVFSLAINIVSDFIHNEPTSFSILQEAGLIDAFLDAITAKGILPSSEVLTSLPNVLSAICLNNRGLEKV-KSSNPLDKLF 202 (379)
T ss_pred HHHHHHHHHHHHHHhcCCchhHHHHHcCChHHHHHHHhccCCCCcHHHHHHHHHHHhHHhcCHHHHHHH-HhcChHHHHH
Confidence 345667777877775 5666777788899999999888 763234444444557788889876654433 4567888777
Q ss_pred HHHH
Q 032259 84 EVEE 87 (144)
Q Consensus 84 ~Ll~ 87 (144)
++..
T Consensus 203 ~if~ 206 (379)
T PF06025_consen 203 EIFT 206 (379)
T ss_pred HHhC
Confidence 7654
No 126
>KOG3678 consensus SARM protein (with sterile alpha and armadillo motifs) [Extracellular structures]
Probab=32.02 E-value=2.5e+02 Score=26.17 Aligned_cols=69 Identities=10% Similarity=0.086 Sum_probs=44.4
Q ss_pred HHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHH
Q 032259 8 TSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAE 80 (144)
Q Consensus 8 ~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~ 80 (144)
.-+|..+|..+-. .+++..|++-| +..++.+-+.. ..++.+...+++|-+|-.++++.......+|++.
T Consensus 199 Rve~~rlLEq~~~-aeN~d~va~~~-~~~Il~lAK~~--e~~e~aR~~~~il~~mFKHSeet~~~Lvaa~~lD 267 (832)
T KOG3678|consen 199 RVEAARLLEQILV-AENRDRVARIG-LGVILNLAKER--EPVELARSVAGILEHMFKHSEETCQRLVAAGGLD 267 (832)
T ss_pred HHHHHHHHHHHHh-hhhhhHHhhcc-chhhhhhhhhc--CcHHHHHHHHHHHHHHhhhhHHHHHHHHhhcccc
Confidence 4566677776544 78899998865 33333333332 1677888888999999887765444444566553
No 127
>PF12031 DUF3518: Domain of unknown function (DUF3518); InterPro: IPR021906 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 260 amino acids in length. This domain is found associated with PF01388 from PFAM.
Probab=31.90 E-value=78 Score=26.42 Aligned_cols=43 Identities=21% Similarity=0.376 Sum_probs=33.6
Q ss_pred cCCcchHHHHHHHHHHhcCChhhHH-HH-HhcCcHHHHHHHhhcC
Q 032259 2 VKSGESTSRVLLILCNLAASNEGRS-AI-LDANGVSILVGMLKES 44 (144)
Q Consensus 2 ~~~~~~~E~AlavL~~La~~~egr~-aI-~~~g~I~~LV~lL~~G 44 (144)
-++.-+.|-|+.+|.+|+.-++.-. .| .+.++|..|+..+..+
T Consensus 184 ~e~~v~REfAvvlL~~La~~~~~~~r~iA~q~~~i~~Li~FiE~a 228 (257)
T PF12031_consen 184 REDQVCREFAVVLLSNLAQGDEAAARAIAMQKPCISHLIAFIEDA 228 (257)
T ss_pred ccchhHHHHHHHHHHHHhcccHHHHHHHHHhhchHHHHHHHHHHH
Confidence 3556678999999999998665555 34 4578999999999874
No 128
>PF06345 Drf_DAD: DRF Autoregulatory Domain; InterPro: IPR010465 This domain is found in Diaphanous-related formins (Drfs). It binds the N-terminal GTPase-binding domain; this link is broken when GTP-bound Rho binds to the GBD and activates the protein. The addition of diaphanous activating domains (DAD) to mammalian cells induces actin filament formation, stabilises microtubules, and activates serum-response mediated transcription [].; PDB: 3O4X_H 3OBV_E 2BAP_C 2F31_B.
Probab=31.09 E-value=50 Score=15.66 Aligned_cols=14 Identities=29% Similarity=0.314 Sum_probs=10.0
Q ss_pred CcHHHHHHHHHhCC
Q 032259 77 RAAEVLREVEERGS 90 (144)
Q Consensus 77 Gai~~L~~Ll~~Gt 90 (144)
|++..|++-+++|.
T Consensus 1 gvmdsllealqtg~ 14 (15)
T PF06345_consen 1 GVMDSLLEALQTGS 14 (15)
T ss_dssp -HHHHHHHHHHHST
T ss_pred CcHHHHHHHHHccC
Confidence 56778888888774
No 129
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=28.56 E-value=1.8e+02 Score=25.62 Aligned_cols=26 Identities=23% Similarity=0.243 Sum_probs=13.6
Q ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHh
Q 032259 79 AEVLREVEERGSQRAKEKAKRILGML 104 (144)
Q Consensus 79 i~~L~~Ll~~Gt~raK~kA~~lL~~L 104 (144)
.+.|..++++.++..+..|...|-.+
T Consensus 149 ~~~L~~~L~d~d~~Vra~A~raLG~l 174 (410)
T TIGR02270 149 GPALEAALTHEDALVRAAALRALGEL 174 (410)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHhh
Confidence 34555555555555555555555443
No 130
>KOG1981 consensus SOK1 kinase belonging to the STE20/SPS1/GC kinase family [Signal transduction mechanisms]
Probab=28.22 E-value=3.3e+02 Score=25.02 Aligned_cols=89 Identities=13% Similarity=0.051 Sum_probs=54.5
Q ss_pred cCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc--cChHHHHHHHHcCcHHHHHHHHHhCCHHHHHH
Q 032259 19 AASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH--GNLRFKGLAKEARAAEVLREVEERGSQRAKEK 96 (144)
Q Consensus 19 a~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~--~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~k 96 (144)
+.+.--|..|.+.--...|-+.+..|+-.=..--|+.+.+|..||. .++.++..-...-+|+.|-.+++-=.---++.
T Consensus 148 p~~~r~r~eInevLD~dll~Qele~G~Ldi~~L~~fvl~ll~~lCAPaRDe~V~~l~~itdvV~~~R~Ilq~l~lMK~Di 227 (513)
T KOG1981|consen 148 PVHTRLRAEINEVLDTDLLRQELESGTLDISYLSEFVLDLLSRLCAPARDEEVAKLRSITDVVDGFRGILQLLELMKLDI 227 (513)
T ss_pred CccHHHHHHHHHHHhHHHHHHHHHcCCchHHHHHHHHHHHHHHhcCCcccHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566678888776666788889999832223448999999999998 45555544444444554444443333333455
Q ss_pred HHHHHHHhhcC
Q 032259 97 AKRILGMLKGR 107 (144)
Q Consensus 97 A~~lL~~L~~~ 107 (144)
|..-|++++-.
T Consensus 228 aN~~I~~lrp~ 238 (513)
T KOG1981|consen 228 ANYQIRILRPA 238 (513)
T ss_pred HHHHHHHhhHH
Confidence 55555555543
No 131
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=27.81 E-value=2.6e+02 Score=20.83 Aligned_cols=88 Identities=10% Similarity=0.089 Sum_probs=55.5
Q ss_pred HHHHHH-HhcCChhhHHHH-HhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHh
Q 032259 11 VLLILC-NLAASNEGRSAI-LDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEER 88 (144)
Q Consensus 11 AlavL~-~La~~~egr~aI-~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~ 88 (144)
++.+|. .+...|+--..| ...|.++.++.++.+. ..++..+.++.=|++.++.+..+|..+.+. .++.|-.+..+
T Consensus 63 ~~~~l~~lfp~~~dv~~~l~~~eg~~~~l~~~~~~~--~~~~~~~~~~lell~aAc~d~~~r~~I~~~-~~~~L~~~~~~ 139 (157)
T PF11701_consen 63 AFSALTALFPGPPDVGSELFLSEGFLESLLPLASRK--SKDRKVQKAALELLSAACIDKSCRTFISKN-YVSWLKELYKN 139 (157)
T ss_dssp HHHHHHHHCTTTHHHHHHHCCTTTHHHHHHHHHH-C--TS-HHHHHHHHHHHHHHTTSHHHHHCCHHH-CHHHHHHHTTT
T ss_pred HHHHHHHHhCCCHHHHHHHHhhhhHHHHHHHHHhcc--cCCHHHHHHHHHHHHHHHccHHHHHHHHHH-HHHHHHHHHcc
Confidence 344444 345678888888 4578899999999822 145666666666666666666666666555 57888888866
Q ss_pred CC-HH-HHHHHHHHH
Q 032259 89 GS-QR-AKEKAKRIL 101 (144)
Q Consensus 89 Gt-~r-aK~kA~~lL 101 (144)
.. +. -|-.|.-.|
T Consensus 140 ~~~~~~ir~~A~v~L 154 (157)
T PF11701_consen 140 SKDDSEIRVLAAVGL 154 (157)
T ss_dssp CC-HH-CHHHHHHHH
T ss_pred ccchHHHHHHHHHHH
Confidence 54 44 355554433
No 132
>KOG1248 consensus Uncharacterized conserved protein [Function unknown]
Probab=27.70 E-value=3.1e+02 Score=27.71 Aligned_cols=61 Identities=10% Similarity=-0.036 Sum_probs=48.7
Q ss_pred ChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcCC
Q 032259 48 SEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGRE 108 (144)
Q Consensus 48 S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~~ 108 (144)
+++-..-|+..+-.+|..-+.....--.+-++|.+..++++++-.-|.|..-+|..|-+..
T Consensus 840 sreI~kaAI~fikvlv~~~pe~~l~~~~~~LL~sll~ls~d~k~~~r~Kvr~LlekLirkf 900 (1176)
T KOG1248|consen 840 SREIAKAAIGFIKVLVYKFPEECLSPHLEELLPSLLALSHDHKIKVRKKVRLLLEKLIRKF 900 (1176)
T ss_pred CHHHHHHHHHHHHHHHHcCCHHHHhhhHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHh
Confidence 7888999999998888866653444445558999999999999999999999998775533
No 133
>COG5231 VMA13 Vacuolar H+-ATPase V1 sector, subunit H [Energy production and conversion]
Probab=26.99 E-value=4.5e+02 Score=23.30 Aligned_cols=77 Identities=14% Similarity=0.103 Sum_probs=52.5
Q ss_pred HHHHHHHhcCChhhHHHHHhcC-cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHh
Q 032259 11 VLLILCNLAASNEGRSAILDAN-GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEER 88 (144)
Q Consensus 11 AlavL~~La~~~egr~aI~~~g-~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~ 88 (144)
|+..|.+|...++-|..+-... +-..+++++++.- .-.+-|=|..-++|.|.....-.+.+-.....|..|+.+++.
T Consensus 169 av~cl~~l~~~~e~R~i~waentcs~r~~e~l~n~v-g~~qlQY~SL~~iw~lTf~~~~aqdi~K~~dli~dli~iVk~ 246 (432)
T COG5231 169 AVSCLSNLEFDVEKRKIEWAENTCSRRFMEILQNYV-GVKQLQYNSLIIIWILTFSKECAQDIDKMDDLINDLIAIVKE 246 (432)
T ss_pred HHHHHhhhhhhHHHHHHHHHHhhHHHHHHHHHHhhh-hhhhhHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5677889999999999995554 6678899998831 023458889999999986543222333445566666666653
No 134
>PRK12556 tryptophanyl-tRNA synthetase; Provisional
Probab=26.73 E-value=1.3e+02 Score=25.62 Aligned_cols=56 Identities=21% Similarity=0.204 Sum_probs=39.0
Q ss_pred ChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 48 SEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 48 S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
+...|+..+.+|..+.. -.+.+.....+. .-|-.++..|+.+||+.|...|+-.++
T Consensus 271 ~~~~K~~lae~i~~~l~pire~~~~~~~~~---~~~~~il~~G~~kA~~~A~~tl~~v~~ 327 (332)
T PRK12556 271 WGDVKKELFRVVDRELAGPREKYAMYMNEP---SLLDEALEKGAERAREIAKPNLAEIKK 327 (332)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHcCH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34567777777777665 222333333333 358888999999999999999988765
No 135
>PF08045 CDC14: Cell division control protein 14, SIN component; InterPro: IPR012535 Cdc14 is a component of the septation initiation network (SIN) and is required for the localisation and activity of Sid1. Sid1 is a protein kinase that localises asymmetrically to one spindle pole body (SPB) in anaphase disappears prior to cell separation [], [].
Probab=26.70 E-value=2e+02 Score=23.83 Aligned_cols=50 Identities=24% Similarity=0.429 Sum_probs=38.7
Q ss_pred HHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc
Q 032259 14 ILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGH 64 (144)
Q Consensus 14 vL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~ 64 (144)
+++.|..+|.+...+.+.+|+..++++++... .+..-|-.++..|+-...
T Consensus 158 Lv~iLld~p~N~r~FE~~~Gl~~v~~llk~~~-~~~~~r~K~~EFL~fyl~ 207 (257)
T PF08045_consen 158 LVCILLDSPENQRDFEELNGLSTVCSLLKSKS-TDRELRLKCIEFLYFYLM 207 (257)
T ss_pred HHHHHHcChHHHHHHHHhCCHHHHHHHHcccc-ccHHHhHHHHHHHHHHHc
Confidence 34556679999999999999999999998752 234457888888876665
No 136
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=26.36 E-value=2.2e+02 Score=19.62 Aligned_cols=69 Identities=16% Similarity=0.107 Sum_probs=44.8
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHHHHHH---HHhC---CHHHHHHHHHHHHHh
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEVLREV---EERG---SQRAKEKAKRILGML 104 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~L~~L---l~~G---t~raK~kA~~lL~~L 104 (144)
++..|.+.|... +++.+-.|..+|-.++. +++.+...+.......-|+.+ -..| ....|+++..+++..
T Consensus 38 ~~~~l~kRl~~~---~~~~~lkaL~lLe~lvkN~g~~f~~~i~~~~~~~~l~~~~~~~~~~~~~~~~Vr~k~~~l~~~w 113 (115)
T cd00197 38 AVDAIKKRINNK---NPHVVLKALTLLEYCVKNCGERFHQEVASNDFAVELLKFDKSKLLGDDVSTNVREKAIELVQLW 113 (115)
T ss_pred HHHHHHHHhcCC---cHHHHHHHHHHHHHHHHHccHHHHHHHHHhHHHHHHHHhhccccccCCCChHHHHHHHHHHHHH
Confidence 344566666654 78999999999999988 555556655555444444332 1233 567788888887764
No 137
>KOG2611 consensus Neurochondrin/leucine-rich protein (Neurochondrin) [Function unknown]
Probab=26.35 E-value=1.6e+02 Score=27.43 Aligned_cols=56 Identities=14% Similarity=0.192 Sum_probs=42.1
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHh
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALG 63 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc 63 (144)
.|+|++-.-|..+|+++.|-..+.+.|+|+.+.++-.-- .-.-..+-|.-+|+.+.
T Consensus 126 ~m~~d~Y~cL~~Va~~e~G~~~Lia~G~~~~~~Q~y~~~--~~~~d~alal~Vlll~~ 181 (698)
T KOG2611|consen 126 IMLEDCYECLYLVATAEAGLMTLIASGGLRVIAQMYELP--DGSHDMALALKVLLLLV 181 (698)
T ss_pred HHHHHHHHHHHHHhcCCchhHHHHhcCchHHHHHHHhCC--CCchhHHHHHHHHHHHH
Confidence 488999999999999999999999999999998654321 12344555555555544
No 138
>KOG2259 consensus Uncharacterized conserved protein [Function unknown]
Probab=26.01 E-value=2.3e+02 Score=27.26 Aligned_cols=88 Identities=24% Similarity=0.294 Sum_probs=61.0
Q ss_pred chHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHH
Q 032259 6 ESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREV 85 (144)
Q Consensus 6 ~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~L 85 (144)
++...|++-|+.|+.+..+-+ ..++..||+++.+. =..-|..|.-+|--++.. +..++..++..+.-
T Consensus 388 EVR~AAV~Sl~~La~ssP~FA----~~aldfLvDMfNDE---~~~VRL~ai~aL~~Is~~------l~i~eeql~~il~~ 454 (823)
T KOG2259|consen 388 EVRRAAVASLCSLATSSPGFA----VRALDFLVDMFNDE---IEVVRLKAIFALTMISVH------LAIREEQLRQILES 454 (823)
T ss_pred HHHHHHHHHHHHHHcCCCCcH----HHHHHHHHHHhccH---HHHHHHHHHHHHHHHHHH------heecHHHHHHHHHH
Confidence 356789999999997433211 13567889999774 356799999888888753 33445566777777
Q ss_pred HHhCCHHHHHHHHHHHHHhhc
Q 032259 86 EERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 86 l~~Gt~raK~kA~~lL~~L~~ 106 (144)
+.+-++..|+....+|+..+-
T Consensus 455 L~D~s~dvRe~l~elL~~~~~ 475 (823)
T KOG2259|consen 455 LEDRSVDVREALRELLKNARV 475 (823)
T ss_pred HHhcCHHHHHHHHHHHHhcCC
Confidence 777778777777777766543
No 139
>PRK12282 tryptophanyl-tRNA synthetase II; Reviewed
Probab=25.46 E-value=2.5e+02 Score=23.88 Aligned_cols=70 Identities=23% Similarity=0.207 Sum_probs=44.9
Q ss_pred HHHHHHHhhcCCCCChHHHHHHHHHHHHHhccC-hHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhc
Q 032259 34 VSILVGMLKESGSDSEATRENCVAALFALGHGN-LRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKG 106 (144)
Q Consensus 34 I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~-~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~ 106 (144)
+.-|.+-...|...+...|+.++..+..+-..- +.......+ ..-|..+++.|+.+|++.|...|+-.++
T Consensus 255 ~e~l~~~y~~g~~~~~dlK~~lae~i~~~l~pirer~~~~~~~---~~~~~~vl~~G~~ka~~~A~~~~~~v~~ 325 (333)
T PRK12282 255 VAELKAHYQRGGLGDVKCKRYLEEVLQELLAPIRERRAEFAKD---PGYVLEILKAGSEKAREVAAQTLSEVKD 325 (333)
T ss_pred HHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHcC---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444442123456888888888776522 222233333 3458889999999999999999988765
No 140
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=24.24 E-value=1.2e+02 Score=30.63 Aligned_cols=97 Identities=13% Similarity=0.096 Sum_probs=69.4
Q ss_pred HhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhC-CHHHHH
Q 032259 17 NLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERG-SQRAKE 95 (144)
Q Consensus 17 ~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~G-t~raK~ 95 (144)
.||..+.+...+++.++=.-.+.+|..+...+++.|--|+-+|-.++.+-..-+....+.+.|..+++.+.++ -+.-|+
T Consensus 539 ILAvD~SCQ~dLvKe~g~~YF~~vL~~~~~~~~EqrtmaAFVLAviv~nf~lGQ~acl~~~li~iCle~lnd~~~pLLrQ 618 (1387)
T KOG1517|consen 539 ILAVDPSCQADLVKENGYKYFLQVLDPSQAIPPEQRTMAAFVLAVIVRNFKLGQKACLNGNLIGICLEHLNDDPEPLLRQ 618 (1387)
T ss_pred HHhcCchhHHHHHhccCceeEEEEecCcCCCCHHHHHHHHHHHHHHHcccchhHHHhccccHHHHHHHHhcCCccHHHHH
Confidence 4667788888888777666666777664335679999999999999997665566678999999999999988 366666
Q ss_pred HHHHHHHHhhcCCCCCCcchh
Q 032259 96 KAKRILGMLKGREDDDEDVDW 116 (144)
Q Consensus 96 kA~~lL~~L~~~~~~~~~~~~ 116 (144)
=..-.|- +-+.|.+...|
T Consensus 619 W~~icLG---~LW~d~~~Arw 636 (1387)
T KOG1517|consen 619 WLCICLG---RLWEDYDEARW 636 (1387)
T ss_pred HHHHHHH---HHhhhcchhhh
Confidence 5444443 33434444555
No 141
>PF03130 HEAT_PBS: PBS lyase HEAT-like repeat; InterPro: IPR004155 These proteins contain a short bi-helical repeat that is related to HEAT. Cyanobacteria and red algae harvest light energy using macromolecular complexes known as phycobilisomes (PBS), peripherally attached to the photosynthetic membrane. The major components of PBS are the phycobiliproteins. These heterodimeric proteins are covalently attached to phycobilins: open-chain tetrapyrrole chromophores, which function as the photosynthetic light-harvesting pigments. Phycobiliproteins differ in sequence and in the nature and number of attached phycobilins to each of their subunits. These proteins include the lyase enzymes that specifically attach particular phycobilins to apophycobiliprotein subunits. The most comprehensively studied of these is the CpcE/Flyase P31967 from SWISSPROT, P31968 from SWISSPROT, which attaches phycocyanobilin (PCB) to the alpha subunit of apophycocyanin []. Similarly, MpeU/V attaches phycoerythrobilin to phycoerythrin II, while CpeY/Z is thought to be involved in phycoerythrobilin (PEB) attachment to phycoerythrin (PE) I (PEs I and II differ in sequence and in the number of attached molecules of PEB: PE I has five, PE II has six) []. All the reactions of the above lyases involve an apoprotein cysteine SH addition to a terminal delta 3,3'-double bond. Such a reaction is not possible in the case of phycoviolobilin (PVB), the phycobilin of alpha-phycoerythrocyanin (alpha-PEC). It is thought that in this case, PCB, not PVB, is first added to apo-alpha-PEC, and is then isomerized to PVB. The addition reaction has been shown to occur in the presence of either of the components of alpha-PEC-PVB lyase PecE or PecF (or both). The isomerisation reaction occurs only when both PecE and PecF components are present, i.e. the PecE/F phycobiliprotein lyase is also a phycobilin isomerase []. Another member of this family is the NblB protein, whose similarity to the phycobiliprotein lyases was previously noted []. This constitutively expressed protein is not known to have any lyase activity. It is thought to be involved in the coordination of PBS degradation with environmental nutrient limitation. It has been suggested that the similarity of NblB to the phycobiliprotein lyases is due to the ability to bind tetrapyrrole phycobilins via the common repeated motif [].; PDB: 1TE4_A.
Probab=23.77 E-value=79 Score=16.63 Aligned_cols=10 Identities=40% Similarity=0.245 Sum_probs=6.6
Q ss_pred cHHHHHHHHH
Q 032259 78 AAEVLREVEE 87 (144)
Q Consensus 78 ai~~L~~Ll~ 87 (144)
++|+|++.++
T Consensus 17 ai~~L~~~L~ 26 (27)
T PF03130_consen 17 AIPALIEALE 26 (27)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHhc
Confidence 5677776664
No 142
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=22.96 E-value=4.7e+02 Score=22.47 Aligned_cols=72 Identities=15% Similarity=0.056 Sum_probs=50.7
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHHHHHHhCCHHHHHHHHHHHHHhhcC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLREVEERGSQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~Gt~raK~kA~~lL~~L~~~ 107 (144)
.+|.|++..+.. +...|++...+|-.+..+-+.-...-.-+-++|.|++=+...++..+.-+-..|..+-..
T Consensus 324 ~~p~L~~~~~~~---~~~~k~~yL~ALs~ll~~vP~~vl~~~l~~LlPLLlqsL~~~~~~v~~s~L~tL~~~l~~ 395 (415)
T PF12460_consen 324 VLPKLLEGFKEA---DDEIKSNYLTALSHLLKNVPKSVLLPELPTLLPLLLQSLSLPDADVLLSSLETLKMILEE 395 (415)
T ss_pred HHHHHHHHHhhc---ChhhHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHc
Confidence 577888888774 455899999999988875443223333355788888888777777777777777666554
No 143
>KOG2759 consensus Vacuolar H+-ATPase V1 sector, subunit H [Energy production and conversion]
Probab=22.43 E-value=4.9e+02 Score=23.49 Aligned_cols=50 Identities=16% Similarity=0.094 Sum_probs=36.3
Q ss_pred HHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccC
Q 032259 15 LCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGN 66 (144)
Q Consensus 15 L~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~ 66 (144)
+..|.-+|.-.+-+...+-|+.|+++++..+ ..+-..-+++++.+++...
T Consensus 224 iWlLtFn~~~ae~~~~~~li~~L~~Ivk~~~--KEKV~Rivlai~~Nll~k~ 273 (442)
T KOG2759|consen 224 IWLLTFNPHAAEKLKRFDLIQDLSDIVKEST--KEKVTRIVLAIFRNLLDKG 273 (442)
T ss_pred HHHhhcCHHHHHHHhhccHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhccC
Confidence 4556667777777755678999999998741 3445667788899998844
No 144
>KOG2734 consensus Uncharacterized conserved protein [Function unknown]
Probab=22.10 E-value=4.5e+02 Score=24.14 Aligned_cols=77 Identities=18% Similarity=0.158 Sum_probs=50.1
Q ss_pred chHHHHHHHHHHhcC-ChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHHH
Q 032259 6 ESTSRVLLILCNLAA-SNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVLR 83 (144)
Q Consensus 6 ~~~E~AlavL~~La~-~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L~ 83 (144)
.-+..+++++.||.. .++....+++.|-+.-|...+..- ..-..++.+|.-+|.-+-.++...+........|..|+
T Consensus 197 ~gv~~~L~vveNlv~~r~~~~~~~~e~~ll~WLL~rl~~k-~~f~aNk~YasEiLaillq~s~e~~~~~~~l~GiD~lL 274 (536)
T KOG2734|consen 197 DGVHNTLAVVENLVEVRPAICTEIVEQGLLSWLLKRLKGK-AAFDANKQYASEILAILLQNSDENRKLLGPLDGIDVLL 274 (536)
T ss_pred hhhHHHHHHHHHHHhccHHHHHHHHHhhHHHHHHHHHhcc-cCcchhHHHHHHHHHHHhccCchhhhhhcCcccHHHHH
Confidence 345678899999986 677777778876555555544321 12456899999999988875554444444444444443
No 145
>KOG1077 consensus Vesicle coat complex AP-2, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=22.03 E-value=7.5e+02 Score=24.19 Aligned_cols=83 Identities=19% Similarity=0.228 Sum_probs=55.5
Q ss_pred CCcchHHHHHHHHHHhcCChhhHHHHHhcCcHHHHHHHhhcCCCCChHHHHHHHHHHHHHhccChHHHHHHHHcCcHHHH
Q 032259 3 KSGESTSRVLLILCNLAASNEGRSAILDANGVSILVGMLKESGSDSEATRENCVAALFALGHGNLRFKGLAKEARAAEVL 82 (144)
Q Consensus 3 ~~~~~~E~AlavL~~La~~~egr~aI~~~g~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~~~~~~~~~a~~~Gai~~L 82 (144)
++.++.=.|+.-++.|+++.....++-.+ ...+...|+.-. ...-+..|+-.|+.+|-.+ +.+.+ +.-|
T Consensus 341 rE~NiRYLaLEsm~~L~ss~~s~davK~h--~d~Ii~sLkter--DvSirrravDLLY~mcD~~-Nak~I------V~el 409 (938)
T KOG1077|consen 341 RETNIRYLALESMCKLASSEFSIDAVKKH--QDTIINSLKTER--DVSIRRRAVDLLYAMCDVS-NAKQI------VAEL 409 (938)
T ss_pred ccccchhhhHHHHHHHHhccchHHHHHHH--HHHHHHHhcccc--chHHHHHHHHHHHHHhchh-hHHHH------HHHH
Confidence 45667777888888888887777777665 677888887431 3456888999999999743 22333 3345
Q ss_pred HHHHHhCCHHHHHH
Q 032259 83 REVEERGSQRAKEK 96 (144)
Q Consensus 83 ~~Ll~~Gt~raK~k 96 (144)
++-+.+-.+..|+.
T Consensus 410 LqYL~tAd~siree 423 (938)
T KOG1077|consen 410 LQYLETADYSIREE 423 (938)
T ss_pred HHHHhhcchHHHHH
Confidence 55555555555544
No 146
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=21.95 E-value=3.3e+02 Score=20.06 Aligned_cols=77 Identities=13% Similarity=0.047 Sum_probs=49.8
Q ss_pred cHHHHHHHhhcCCCCChHHHHHHHHHHHHHhc-cChHHHHHHHHcCcHHH-HHHHHHh-C--CHHHHHHHHHHHHHhhcC
Q 032259 33 GVSILVGMLKESGSDSEATRENCVAALFALGH-GNLRFKGLAKEARAAEV-LREVEER-G--SQRAKEKAKRILGMLKGR 107 (144)
Q Consensus 33 ~I~~LV~lL~~G~~~S~~~kE~Ava~L~~Lc~-~~~~~~~~a~~~Gai~~-L~~Ll~~-G--t~raK~kA~~lL~~L~~~ 107 (144)
++..|-+.|..+ .++..+-.|..+|-.+.. ....+...+..-..+.- |+.++.. . ....|+|...+++.....
T Consensus 39 a~ralkkRl~~~--~n~~v~l~aL~LLe~~vkNCG~~fh~eiask~Fl~e~L~~~i~~~~~~~~~Vk~kil~li~~W~~~ 116 (141)
T cd03565 39 AVRALKKRLNGN--KNHKEVMLTLTVLETCVKNCGHRFHVLVAKKDFIKDVLVKLINPKNNPPTIVQEKVLALIQAWADA 116 (141)
T ss_pred HHHHHHHHHccC--CCHHHHHHHHHHHHHHHHHccHHHHHHHHHHHhhhHHHHHHHcccCCCcHHHHHHHHHHHHHHHHH
Confidence 566777777543 257777888776666655 33445555555567765 8888852 2 347789988888877765
Q ss_pred CCCC
Q 032259 108 EDDD 111 (144)
Q Consensus 108 ~~~~ 111 (144)
..++
T Consensus 117 f~~~ 120 (141)
T cd03565 117 FRGS 120 (141)
T ss_pred hCCC
Confidence 5333
No 147
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=21.57 E-value=5.6e+02 Score=22.50 Aligned_cols=26 Identities=23% Similarity=0.175 Sum_probs=15.2
Q ss_pred HHHHHHHhhcCCCCChHHHHHHHHHHHHH
Q 032259 34 VSILVGMLKESGSDSEATRENCVAALFAL 62 (144)
Q Consensus 34 I~~LV~lL~~G~~~S~~~kE~Ava~L~~L 62 (144)
.+.|..+|.+. ++..+..|+.+|-.+
T Consensus 149 ~~~L~~~L~d~---d~~Vra~A~raLG~l 174 (410)
T TIGR02270 149 GPALEAALTHE---DALVRAAALRALGEL 174 (410)
T ss_pred HHHHHHHhcCC---CHHHHHHHHHHHHhh
Confidence 55666666543 566666666666543
No 148
>PF04064 DUF384: Domain of unknown function (DUF384); InterPro: IPR007206 This is a protein of unknown function. It is found C-terminal to another domain of unknown function (IPR007205 from INTERPRO).
Probab=21.34 E-value=2.4e+02 Score=18.08 Aligned_cols=52 Identities=21% Similarity=0.311 Sum_probs=39.1
Q ss_pred HHHHHhccChHHHHHHHHcCcHHHHHHHHHhC-CHHHHHHHHHHHHHhhcCCCC
Q 032259 58 ALFALGHGNLRFKGLAKEARAAEVLREVEERG-SQRAKEKAKRILGMLKGREDD 110 (144)
Q Consensus 58 ~L~~Lc~~~~~~~~~a~~~Gai~~L~~Ll~~G-t~raK~kA~~lL~~L~~~~~~ 110 (144)
+|+.||... .-|......|+-+.|-++=... .+..+..--.+.++|-+.+++
T Consensus 1 ~LllL~~T~-~GR~~lR~~~vY~IlRe~h~~E~d~~V~e~~erlV~iLirdE~e 53 (58)
T PF04064_consen 1 ALLLLCATR-EGREYLREKGVYPILRELHKWEEDEEVQEACERLVQILIRDEPE 53 (58)
T ss_pred CHhHHhccH-HHHHHHHHcCchHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCc
Confidence 367788754 3466777888988888886655 688888888899999887633
Done!