Query psy4702
Match_columns 138
No_of_seqs 104 out of 120
Neff 4.6
Searched_HMMs 46136
Date Fri Aug 16 23:41:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4702.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4702hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4151|consensus 99.8 8.6E-21 1.9E-25 174.7 6.4 125 8-132 623-748 (748)
2 cd00020 ARM Armadillo/beta-cat 98.6 6.2E-07 1.3E-11 61.3 9.0 111 13-123 8-118 (120)
3 PF10508 Proteasom_PSMB: Prote 97.2 0.0036 7.7E-08 55.9 9.9 121 9-130 116-246 (503)
4 cd00020 ARM Armadillo/beta-cat 96.9 0.004 8.6E-08 42.3 6.4 74 53-126 6-79 (120)
5 PLN03200 cellulose synthase-in 96.6 0.011 2.4E-07 61.2 9.5 112 11-122 403-514 (2102)
6 PLN03200 cellulose synthase-in 96.4 0.023 5.1E-07 59.0 10.3 112 14-125 448-559 (2102)
7 PF13646 HEAT_2: HEAT repeats; 94.9 0.097 2.1E-06 34.4 5.5 80 19-120 7-87 (88)
8 KOG0166|consensus 94.7 0.2 4.2E-06 46.0 8.7 104 14-117 239-343 (514)
9 KOG4224|consensus 94.6 0.14 3E-06 46.3 7.3 117 5-123 152-278 (550)
10 KOG1048|consensus 94.3 0.12 2.6E-06 49.1 6.6 109 17-125 524-684 (717)
11 PF12717 Cnd1: non-SMC mitotic 94.3 0.16 3.4E-06 39.1 6.2 90 26-126 2-93 (178)
12 PF04826 Arm_2: Armadillo-like 93.0 0.65 1.4E-05 38.6 8.0 69 15-85 16-85 (254)
13 PF09759 Atx10homo_assoc: Spin 92.2 0.27 5.8E-06 36.1 4.2 56 34-89 8-65 (102)
14 PRK09687 putative lyase; Provi 91.8 0.49 1.1E-05 39.5 5.9 99 14-121 92-217 (280)
15 PF00514 Arm: Armadillo/beta-c 91.2 0.3 6.6E-06 28.9 3.1 40 43-82 1-40 (41)
16 PF10508 Proteasom_PSMB: Prote 90.8 0.79 1.7E-05 41.0 6.7 75 53-127 76-150 (503)
17 PF03224 V-ATPase_H_N: V-ATPas 90.7 0.53 1.2E-05 39.1 5.2 101 24-126 162-270 (312)
18 PF13513 HEAT_EZ: HEAT-like re 90.6 0.43 9.4E-06 29.5 3.5 54 27-81 2-55 (55)
19 KOG0166|consensus 90.4 1.5 3.3E-05 40.3 8.2 112 13-124 110-223 (514)
20 PF08324 PUL: PUL domain; Int 90.0 0.36 7.7E-06 38.9 3.4 91 25-118 170-267 (268)
21 PF12755 Vac14_Fab1_bd: Vacuol 88.4 1.7 3.6E-05 31.1 5.6 70 53-125 26-96 (97)
22 PF11701 UNC45-central: Myosin 87.9 1.6 3.5E-05 33.3 5.6 92 27-121 59-155 (157)
23 PF13513 HEAT_EZ: HEAT-like re 87.7 1.4 3.1E-05 27.1 4.4 53 69-122 2-54 (55)
24 smart00185 ARM Armadillo/beta- 86.8 0.82 1.8E-05 25.9 2.7 38 45-82 3-40 (41)
25 KOG4500|consensus 86.0 2.7 5.8E-05 38.9 6.7 91 29-122 291-387 (604)
26 KOG4224|consensus 85.8 0.9 1.9E-05 41.3 3.6 103 19-123 92-194 (550)
27 KOG1048|consensus 85.7 1.5 3.4E-05 41.8 5.3 93 15-108 236-331 (717)
28 PF05804 KAP: Kinesin-associat 85.6 4.8 0.0001 38.3 8.5 93 14-108 292-384 (708)
29 KOG4199|consensus 84.3 3.8 8.3E-05 36.9 6.7 96 24-120 254-354 (461)
30 KOG2160|consensus 83.0 16 0.00035 32.1 10.1 95 14-108 126-221 (342)
31 PF11698 V-ATPase_H_C: V-ATPas 82.7 0.87 1.9E-05 34.3 1.9 69 14-83 45-115 (119)
32 PRK09687 putative lyase; Provi 82.4 5.2 0.00011 33.3 6.6 25 97-121 129-153 (280)
33 PF00514 Arm: Armadillo/beta-c 82.4 6.1 0.00013 23.1 5.2 38 86-123 2-39 (41)
34 PRK13800 putative oxidoreducta 81.9 4.6 0.0001 38.7 6.8 97 14-121 777-895 (897)
35 PF13646 HEAT_2: HEAT repeats; 81.9 3 6.5E-05 27.2 4.1 57 57-124 2-59 (88)
36 smart00185 ARM Armadillo/beta- 81.0 5 0.00011 22.5 4.4 36 88-123 4-39 (41)
37 COG5064 SRP1 Karyopherin (impo 80.8 3.7 7.9E-05 37.2 5.3 93 15-107 246-338 (526)
38 PF02985 HEAT: HEAT repeat; I 79.8 1.5 3.2E-05 24.7 1.8 28 56-83 2-29 (31)
39 PF09324 DUF1981: Domain of un 79.6 5.2 0.00011 27.8 4.8 65 54-122 17-85 (86)
40 KOG4199|consensus 79.4 6.8 0.00015 35.4 6.5 98 24-122 342-441 (461)
41 PF12348 CLASP_N: CLASP N term 78.3 15 0.00033 28.4 7.6 103 17-125 99-206 (228)
42 PF04826 Arm_2: Armadillo-like 78.3 11 0.00023 31.3 7.1 86 20-109 103-188 (254)
43 KOG2160|consensus 77.9 16 0.00034 32.3 8.3 100 24-124 95-195 (342)
44 cd00256 VATPase_H VATPase_H, r 76.4 14 0.00029 33.3 7.6 101 23-126 155-259 (429)
45 PF11701 UNC45-central: Myosin 76.2 2.9 6.2E-05 31.9 2.9 63 14-78 88-154 (157)
46 PF08167 RIX1: rRNA processing 75.8 16 0.00035 27.9 7.0 71 53-126 24-98 (165)
47 PF12755 Vac14_Fab1_bd: Vacuol 75.6 3.8 8.2E-05 29.3 3.3 56 14-72 29-86 (97)
48 PF05804 KAP: Kinesin-associat 74.4 27 0.00059 33.4 9.4 97 18-116 497-599 (708)
49 KOG1061|consensus 72.4 13 0.00029 35.7 6.9 96 21-126 95-190 (734)
50 PRK13800 putative oxidoreducta 70.9 17 0.00037 34.9 7.4 58 13-81 622-679 (897)
51 PF01602 Adaptin_N: Adaptin N 64.4 32 0.0007 29.6 7.2 89 24-123 91-179 (526)
52 PF03224 V-ATPase_H_N: V-ATPas 63.5 33 0.00071 28.5 6.8 100 20-120 113-219 (312)
53 KOG1789|consensus 63.1 41 0.00089 35.0 8.3 113 17-132 1777-1890(2235)
54 PF09759 Atx10homo_assoc: Spin 61.8 29 0.00062 25.4 5.5 52 75-126 7-60 (102)
55 PF10521 DUF2454: Protein of u 61.4 23 0.00051 29.3 5.6 70 56-125 121-203 (282)
56 PF10363 DUF2435: Protein of u 61.2 39 0.00084 23.9 6.0 64 60-125 9-72 (92)
57 cd04447 DEP_BRCC3 DEP (Disheve 60.8 9.8 0.00021 27.7 2.8 41 91-131 29-73 (92)
58 KOG4413|consensus 59.9 54 0.0012 29.8 7.8 98 11-109 127-226 (524)
59 KOG4646|consensus 56.9 24 0.00052 28.2 4.6 92 7-100 8-103 (173)
60 COG3355 Predicted transcriptio 55.2 20 0.00044 27.3 3.9 60 72-135 16-78 (126)
61 cd03569 VHS_Hrs_Vps27p VHS dom 54.5 88 0.0019 23.6 7.3 73 46-118 28-107 (142)
62 KOG4500|consensus 53.5 39 0.00084 31.6 6.0 93 13-107 316-413 (604)
63 cd00256 VATPase_H VATPase_H, r 53.2 40 0.00088 30.4 6.0 83 43-125 85-173 (429)
64 cd03568 VHS_STAM VHS domain fa 52.5 98 0.0021 23.5 7.3 72 45-116 23-101 (144)
65 PF05918 API5: Apoptosis inhib 51.7 47 0.001 31.0 6.4 66 53-123 58-123 (556)
66 PF12719 Cnd3: Nuclear condens 49.8 1.1E+02 0.0025 25.2 7.8 103 24-131 39-149 (298)
67 KOG1293|consensus 45.9 86 0.0019 30.2 7.1 99 24-123 389-488 (678)
68 COG1846 MarR Transcriptional r 45.6 59 0.0013 21.7 4.7 51 70-137 23-74 (126)
69 cd04446 DEP_DEPDC4 DEP (Dishev 44.9 23 0.00051 25.6 2.7 42 91-132 27-73 (95)
70 KOG2259|consensus 44.3 45 0.00098 32.4 5.1 57 23-84 382-440 (823)
71 PF08045 CDC14: Cell division 44.3 1E+02 0.0022 26.1 6.7 63 46-108 124-188 (257)
72 PF10274 ParcG: Parkin co-regu 44.2 1.4E+02 0.0029 24.1 7.2 100 24-127 51-166 (183)
73 KOG3678|consensus 44.1 32 0.0007 32.6 4.0 96 14-109 223-320 (832)
74 PF12830 Nipped-B_C: Sister ch 43.3 53 0.0011 25.5 4.7 74 46-125 1-74 (187)
75 cd04444 DEP_PLEK2 DEP (Disheve 42.7 32 0.0007 25.7 3.2 40 90-133 30-70 (109)
76 smart00288 VHS Domain present 42.5 1.4E+02 0.003 22.0 7.3 73 45-117 23-103 (133)
77 cd04450 DEP_RGS7-like DEP (Dis 40.7 35 0.00075 23.6 3.0 40 92-135 29-69 (88)
78 PF05918 API5: Apoptosis inhib 39.6 32 0.00069 32.2 3.3 49 15-68 62-110 (556)
79 PF14947 HTH_45: Winged helix- 38.6 55 0.0012 21.9 3.6 35 97-131 8-51 (77)
80 cd04448 DEP_PIKfyve DEP (Dishe 38.2 53 0.0012 22.6 3.5 41 91-134 28-68 (81)
81 PF00610 DEP: Domain found in 37.5 58 0.0013 21.1 3.5 40 93-135 19-60 (74)
82 KOG1993|consensus 37.2 1.2E+02 0.0027 30.2 6.9 82 53-134 115-229 (978)
83 KOG3678|consensus 35.6 72 0.0016 30.4 4.9 41 70-110 238-278 (832)
84 KOG4646|consensus 33.8 85 0.0018 25.2 4.4 110 11-122 57-167 (173)
85 COG1413 FOG: HEAT repeat [Ener 33.0 1.8E+02 0.0039 23.7 6.5 92 14-127 45-137 (335)
86 KOG1293|consensus 32.8 1.8E+02 0.0039 28.1 7.1 91 14-104 463-554 (678)
87 cd04371 DEP DEP domain, named 32.4 62 0.0013 21.0 3.0 40 92-134 29-68 (81)
88 PF12348 CLASP_N: CLASP N term 32.2 70 0.0015 24.6 3.7 78 15-92 134-215 (228)
89 COG5096 Vesicle coat complex, 31.5 2.2E+02 0.0048 27.8 7.6 94 23-126 103-196 (757)
90 PRK14707 hypothetical protein; 31.2 2.3E+02 0.005 31.5 8.0 106 16-121 378-483 (2710)
91 PF08216 CTNNBL: Catenin-beta- 30.9 59 0.0013 24.2 3.0 36 35-71 69-104 (108)
92 KOG1061|consensus 30.6 1.1E+02 0.0023 29.8 5.3 61 22-85 131-191 (734)
93 COG5064 SRP1 Karyopherin (impo 30.4 1.3E+02 0.0028 27.6 5.5 101 27-127 342-445 (526)
94 PRK09334 30S ribosomal protein 30.1 41 0.00089 24.1 1.9 50 75-132 15-74 (86)
95 PF01022 HTH_5: Bacterial regu 30.1 89 0.0019 18.8 3.3 34 98-131 5-47 (47)
96 TIGR02270 conserved hypothetic 30.0 2.3E+02 0.0051 25.2 7.1 19 53-71 116-134 (410)
97 PF00790 VHS: VHS domain; Int 29.0 2.4E+02 0.0051 20.8 6.3 64 57-120 45-113 (140)
98 PF12530 DUF3730: Protein of u 28.7 3.1E+02 0.0067 22.0 8.1 109 16-130 24-156 (234)
99 KOG3665|consensus 28.6 1.2E+02 0.0025 29.0 5.1 89 20-108 566-679 (699)
100 PF14394 DUF4423: Domain of un 28.6 52 0.0011 25.7 2.5 22 113-134 55-76 (171)
101 PF03810 IBN_N: Importin-beta 28.6 68 0.0015 20.4 2.7 32 54-85 14-47 (77)
102 TIGR02270 conserved hypothetic 28.4 2.4E+02 0.0051 25.1 6.8 37 91-129 264-300 (410)
103 PF01602 Adaptin_N: Adaptin N 25.8 3E+02 0.0065 23.6 6.9 69 19-89 159-227 (526)
104 PF11841 DUF3361: Domain of un 25.6 1.5E+02 0.0032 23.5 4.5 42 89-130 4-51 (160)
105 KOG1767|consensus 25.3 49 0.0011 24.9 1.7 54 78-132 37-93 (110)
106 KOG2122|consensus 24.0 1.5E+02 0.0033 31.8 5.2 83 26-109 365-450 (2195)
107 cd03561 VHS VHS domain family; 23.8 2.9E+02 0.0064 20.1 7.1 65 56-120 39-107 (133)
108 COG1413 FOG: HEAT repeat [Ener 23.7 4E+02 0.0087 21.7 7.3 22 20-41 114-135 (335)
109 PF13730 HTH_36: Helix-turn-he 23.7 88 0.0019 18.9 2.4 17 113-129 39-55 (55)
110 PF12802 MarR_2: MarR family; 23.5 57 0.0012 20.0 1.5 23 114-136 36-58 (62)
111 PF07409 GP46: Phage protein G 23.4 75 0.0016 23.7 2.4 21 111-131 61-81 (116)
112 KOG1059|consensus 23.0 1.9E+02 0.0042 28.5 5.5 41 53-93 335-377 (877)
113 PF01047 MarR: MarR family; I 22.7 45 0.00098 20.4 1.0 23 114-136 32-54 (59)
114 PF13463 HTH_27: Winged helix 22.6 50 0.0011 20.6 1.1 22 114-135 33-54 (68)
115 PF05536 Neurochondrin: Neuroc 22.6 4.6E+02 0.01 24.1 7.8 95 27-125 72-168 (543)
116 PF07814 WAPL: Wings apart-lik 22.4 90 0.0019 26.9 3.0 50 13-64 65-116 (361)
117 PF04063 DUF383: Domain of unk 22.3 2.7E+02 0.0058 22.2 5.5 72 32-106 78-154 (192)
118 PTZ00429 beta-adaptin; Provisi 22.0 1.2E+02 0.0027 29.2 4.1 60 22-84 150-209 (746)
119 TIGR02147 Fsuc_second hypothet 21.5 85 0.0018 26.5 2.6 23 113-135 153-175 (271)
120 PF00392 GntR: Bacterial regul 20.9 82 0.0018 19.9 1.9 21 113-133 38-58 (64)
121 smart00049 DEP Domain found in 20.9 1.2E+02 0.0026 19.7 2.8 40 92-134 21-60 (77)
122 PF10007 DUF2250: Uncharacteri 20.9 91 0.002 22.4 2.3 21 113-133 35-55 (92)
123 PF03297 Ribosomal_S25: S25 ri 20.6 73 0.0016 23.5 1.8 20 113-132 73-92 (105)
124 KOG4413|consensus 20.6 1.2E+02 0.0027 27.6 3.5 77 54-130 410-486 (524)
125 PF06371 Drf_GBD: Diaphanous G 20.4 3.6E+02 0.0078 19.9 6.7 56 66-122 128-184 (187)
126 cd04443 DEP_GPR155 DEP (Dishev 20.2 1.8E+02 0.0039 20.2 3.6 41 91-134 30-70 (83)
127 KOG0168|consensus 20.0 2.3E+02 0.005 28.6 5.4 70 53-124 210-280 (1051)
128 cd04442 DEP_1_DEP6 DEP (Dishev 20.0 1.6E+02 0.0034 20.6 3.3 41 91-134 28-68 (82)
No 1
>KOG4151|consensus
Probab=99.82 E-value=8.6e-21 Score=174.74 Aligned_cols=125 Identities=36% Similarity=0.519 Sum_probs=118.5
Q ss_pred ccCCChhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcCh
Q psy4702 8 LFYCSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGK 86 (138)
Q Consensus 8 ~~~n~RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~ 86 (138)
..+|+|+|+|.++++++|+++++|++||+|+.|++++.+|.++.. .+|.+++..+..|+++++||||+||+.||+.++.
T Consensus 623 ~e~~~~l~~w~~~~e~~~E~~~lA~a~a~a~I~sv~~n~c~~~~~~~~~~e~~~~~i~~~~~~~qhrgl~~~ln~~~~~~ 702 (748)
T KOG4151|consen 623 VEYKDRLKLWNLNLEVADEKFELAGAGALAAITSVVENHCSRILELLEWLEILVRAIQDEDDEIQHRGLVIILNLFEALF 702 (748)
T ss_pred hccccCchHHHHHHHhhhhHHhhhccccccchhhcchhhhhhHHHhhcchHHHHHhhcCchhhhhhhhhhhhhhHHHHHH
Confidence 345999999999999999999999999999888889999999777 9999999999999999999999999999999999
Q ss_pred HHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCC
Q psy4702 87 ELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 87 e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~ 132 (138)
++++++++++.+++|....|.++.+..+.+..||.++.+||||+|+
T Consensus 703 ei~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~l~~a~~~~~~~~~ 748 (748)
T KOG4151|consen 703 EIAEKIFETEVMELLSGLQKLNRAPKREDAAPCLSAAEEYGLIKPK 748 (748)
T ss_pred HHHHHhccchHHHHHHHHHHhhhhhhhhhhhhHHHHHHHhhccCCC
Confidence 9999999999999999999995558899999999999999999985
No 2
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=98.58 E-value=6.2e-07 Score=61.31 Aligned_cols=111 Identities=23% Similarity=0.159 Sum_probs=95.8
Q ss_pred hhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHH
Q psy4702 13 QVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERV 92 (138)
Q Consensus 13 RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl 92 (138)
=++.++-+....|...+..|..+|+.++..+|..+..+++...++.+..+..++++.++...+-++.|+.....+..+++
T Consensus 8 ~i~~l~~~l~~~~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~~~~~~~~~ 87 (120)
T cd00020 8 GLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAGPEDNKLIV 87 (120)
T ss_pred ChHHHHHHHHcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHccCcHHHHHHH
Confidence 35556666677788999999999999999668889888887899999999999999999999999999998888888999
Q ss_pred hhhhHHHHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 93 LSTELMEILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 93 ~es~~leiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
.+.|++..|..+....+.++.+.+..+|.-+
T Consensus 88 ~~~g~l~~l~~~l~~~~~~~~~~a~~~l~~l 118 (120)
T cd00020 88 LEAGGVPKLVNLLDSSNEDIQKNATGALSNL 118 (120)
T ss_pred HHCCChHHHHHHHhcCCHHHHHHHHHHHHHh
Confidence 9999999999998776557788887777654
No 3
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=97.16 E-value=0.0036 Score=55.88 Aligned_cols=121 Identities=22% Similarity=0.204 Sum_probs=97.5
Q ss_pred cCCChhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHH
Q psy4702 9 FYCSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKEL 88 (138)
Q Consensus 9 ~~n~RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~ 88 (138)
.+++=+..++-+-..+|...+.+|+.+|..|.+ ++..+..++++..+.-|..++.++++.+|+|..=++-++...+.+.
T Consensus 116 ~~~~l~~~i~~~L~~~d~~Va~~A~~~L~~l~~-~~~~~~~l~~~~~~~~L~~l~~~~~~~vR~Rv~el~v~i~~~S~~~ 194 (503)
T PF10508_consen 116 VDNELLPLIIQCLRDPDLSVAKAAIKALKKLAS-HPEGLEQLFDSNLLSKLKSLMSQSSDIVRCRVYELLVEIASHSPEA 194 (503)
T ss_pred cCccHHHHHHHHHcCCcHHHHHHHHHHHHHHhC-CchhHHHHhCcchHHHHHHHHhccCHHHHHHHHHHHHHHHhcCHHH
Confidence 344555666666677899999999999999998 7888888989667999999999999999999999999999999999
Q ss_pred HHHHhhhhHHHHHHHhhccccHHHHHHH----------HHHHHHHHhCCCcc
Q psy4702 89 AERVLSTELMEILMALTLLEGEEVRKLA----------LQALAKGEEYGIIR 130 (138)
Q Consensus 89 a~kl~es~~leiL~~~~k~~~~~v~~~a----------~eaLk~~~~~glIk 130 (138)
++.+.++|+++.+....+.++--+.-.+ ...++-+++.|+++
T Consensus 195 ~~~~~~sgll~~ll~eL~~dDiLvqlnalell~~La~~~~g~~yL~~~gi~~ 246 (503)
T PF10508_consen 195 AEAVVNSGLLDLLLKELDSDDILVQLNALELLSELAETPHGLQYLEQQGIFD 246 (503)
T ss_pred HHHHHhccHHHHHHHHhcCccHHHHHHHHHHHHHHHcChhHHHHHHhCCHHH
Confidence 9999999999999998888652222233 33455556666664
No 4
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=96.93 E-value=0.004 Score=42.28 Aligned_cols=74 Identities=24% Similarity=0.204 Sum_probs=64.1
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhC
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEEY 126 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~ 126 (138)
...++.|..++.++++.+++.++.++.|+...+.+....+++.++++.|..+...+..+++..+..+|.-+...
T Consensus 6 ~~~i~~l~~~l~~~~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~~~~~v~~~a~~~L~~l~~~ 79 (120)
T cd00020 6 AGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAG 79 (120)
T ss_pred cCChHHHHHHHHcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhCCCHHHHHHHHHHHHHHccC
Confidence 45788899999999999999999999999988888999999999999999998765448888888888777544
No 5
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=96.62 E-value=0.011 Score=61.24 Aligned_cols=112 Identities=13% Similarity=0.061 Sum_probs=95.2
Q ss_pred CChhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHH
Q psy4702 11 CSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAE 90 (138)
Q Consensus 11 n~RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~ 90 (138)
.+=.|+|+-|-...|.+.+..|..+|..|+...+..++.|.+..+++.|..++.++++.+|.-++..++|+...+.+...
T Consensus 403 ~daik~LV~LL~~~~~evQ~~Av~aL~~L~~~~~e~~~aIi~~ggIp~LV~LL~s~s~~iQ~~A~~~L~nLa~~ndenr~ 482 (2102)
T PLN03200 403 AEAKKVLVGLITMATADVQEELIRALSSLCCGKGGLWEALGGREGVQLLISLLGLSSEQQQEYAVALLAILTDEVDESKW 482 (2102)
T ss_pred ccchhhhhhhhccCCHHHHHHHHHHHHHHhCCCHHHHHHHHHcCcHHHHHHHHcCCCHHHHHHHHHHHHHHHcCCHHHHH
Confidence 34457777777788899999999999999975688888888877999999999999999999999999999887778788
Q ss_pred HHhhhhHHHHHHHhhccccHHHHHHHHHHHHH
Q psy4702 91 RVLSTELMEILMALTLLEGEEVRKLALQALAK 122 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~ 122 (138)
.+++.|++..|..+...++.++++.|.-+|--
T Consensus 483 aIieaGaIP~LV~LL~s~~~~iqeeAawAL~N 514 (2102)
T PLN03200 483 AITAAGGIPPLVQLLETGSQKAKEDSATVLWN 514 (2102)
T ss_pred HHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 89999999999999976544777777766653
No 6
>PLN03200 cellulose synthase-interactive protein; Provisional
Probab=96.41 E-value=0.023 Score=58.95 Aligned_cols=112 Identities=17% Similarity=0.127 Sum_probs=92.5
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHh
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVL 93 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~ 93 (138)
+..|+-|-..++...++.|..++..||..++.....|.+...++.|..|+.++++.+|...+.++.|+...+.++...+.
T Consensus 448 Ip~LV~LL~s~s~~iQ~~A~~~L~nLa~~ndenr~aIieaGaIP~LV~LL~s~~~~iqeeAawAL~NLa~~~~qir~iV~ 527 (2102)
T PLN03200 448 VQLLISLLGLSSEQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETGSQKAKEDSATVLWNLCCHSEDIRACVE 527 (2102)
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHHcCCHHHHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHHhCCcHHHHHHHH
Confidence 33455445556788999999999999875778888999977999999999999999999999999999887777766666
Q ss_pred hhhHHHHHHHhhccccHHHHHHHHHHHHHHHh
Q psy4702 94 STELMEILMALTLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 94 es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~ 125 (138)
+.|++.-|..+.+..+.++++.|..+|..+..
T Consensus 528 ~aGAIppLV~LL~sgd~~~q~~Aa~AL~nLi~ 559 (2102)
T PLN03200 528 SAGAVPALLWLLKNGGPKGQEIAAKTLTKLVR 559 (2102)
T ss_pred HCCCHHHHHHHHhCCCHHHHHHHHHHHHHHHh
Confidence 88999999999877655777888888877753
No 7
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=94.91 E-value=0.097 Score=34.43 Aligned_cols=80 Identities=28% Similarity=0.304 Sum_probs=59.5
Q ss_pred hhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHH
Q psy4702 19 LLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELM 98 (138)
Q Consensus 19 ll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~l 98 (138)
.|..++|+..|..|.=+|+-+= + ++-++.|..++.|+++.+|+..+..+.++= ..+.+
T Consensus 7 ~l~~~~~~~vr~~a~~~L~~~~--~---------~~~~~~L~~~l~d~~~~vr~~a~~aL~~i~-----------~~~~~ 64 (88)
T PF13646_consen 7 LLQNDPDPQVRAEAARALGELG--D---------PEAIPALIELLKDEDPMVRRAAARALGRIG-----------DPEAI 64 (88)
T ss_dssp HHHTSSSHHHHHHHHHHHHCCT--H---------HHHHHHHHHHHTSSSHHHHHHHHHHHHCCH-----------HHHTH
T ss_pred HHhcCCCHHHHHHHHHHHHHcC--C---------HhHHHHHHHHHcCCCHHHHHHHHHHHHHhC-----------CHHHH
Confidence 3447888999988777777321 1 356888888999999999999988888771 34578
Q ss_pred HHHHHhhccccH-HHHHHHHHHH
Q psy4702 99 EILMALTLLEGE-EVRKLALQAL 120 (138)
Q Consensus 99 eiL~~~~k~~~~-~v~~~a~eaL 120 (138)
+.|..+.+.+++ .|...|.++|
T Consensus 65 ~~L~~~l~~~~~~~vr~~a~~aL 87 (88)
T PF13646_consen 65 PALIKLLQDDDDEVVREAAAEAL 87 (88)
T ss_dssp HHHHHHHTC-SSHHHHHHHHHHH
T ss_pred HHHHHHHcCCCcHHHHHHHHhhc
Confidence 888888877655 6678888876
No 8
>KOG0166|consensus
Probab=94.71 E-value=0.2 Score=46.02 Aligned_cols=104 Identities=25% Similarity=0.290 Sum_probs=89.0
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHh
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVL 93 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~ 93 (138)
|..|.-|-...|+....-|.=|++.||.-++.....+++...+.-|..|+.|+++.++-=++-.+.|++.++.+-..-++
T Consensus 239 Lp~L~~ll~~~D~~Vl~Da~WAlsyLsdg~ne~iq~vi~~gvv~~LV~lL~~~~~~v~~PaLRaiGNIvtG~d~QTq~vi 318 (514)
T KOG0166|consen 239 LPALLRLLHSTDEEVLTDACWALSYLTDGSNEKIQMVIDAGVVPRLVDLLGHSSPKVVTPALRAIGNIVTGSDEQTQVVI 318 (514)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHhcCChHHHHHHHHccchHHHHHHHcCCCcccccHHHhhccceeeccHHHHHHHH
Confidence 34556666778999999999999999965788888888877899999999999999999999999999999999999999
Q ss_pred hhhHHHHHHHhhccccH-HHHHHHH
Q psy4702 94 STELMEILMALTLLEGE-EVRKLAL 117 (138)
Q Consensus 94 es~~leiL~~~~k~~~~-~v~~~a~ 117 (138)
..|++..|+.+...+.+ .+.+.|.
T Consensus 319 ~~~~L~~l~~ll~~s~~~~ikkEAc 343 (514)
T KOG0166|consen 319 NSGALPVLSNLLSSSPKESIKKEAC 343 (514)
T ss_pred hcChHHHHHHHhccCcchhHHHHHH
Confidence 99999999999987654 4555443
No 9
>KOG4224|consensus
Probab=94.63 E-value=0.14 Score=46.34 Aligned_cols=117 Identities=23% Similarity=0.219 Sum_probs=87.3
Q ss_pred eeeccCCChhhh--------hhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhh
Q psy4702 5 YLILFYCSQVRF--------LFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALV 76 (138)
Q Consensus 5 ~~~~~~n~Rlkl--------Lvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~v 76 (138)
||.-++..|.|+ +.=|+-.+|-..++-|.|||--+|. +..-=+-++...++.+|..++.+.|+++|||...
T Consensus 152 nLaT~d~nk~kiA~sGaL~pltrLakskdirvqrnatgaLlnmTh-s~EnRr~LV~aG~lpvLVsll~s~d~dvqyyctt 230 (550)
T KOG4224|consen 152 NLATFDSNKVKIARSGALEPLTRLAKSKDIRVQRNATGALLNMTH-SRENRRVLVHAGGLPVLVSLLKSGDLDVQYYCTT 230 (550)
T ss_pred hhhccccchhhhhhccchhhhHhhcccchhhHHHHHHHHHHHhhh-hhhhhhhhhccCCchhhhhhhccCChhHHHHHHH
Confidence 344455566554 4557888999999999999999998 4555444555889999999999999999999999
Q ss_pred hhhhhhhcChHHHHHHhhhh--HHHHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 77 IVGNILSAGKELAERVLSTE--LMEILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 77 iv~Nmi~a~~e~a~kl~es~--~leiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
-+.|+.-. .---+++.++| ++..|..+-..++.+|.=.|--||.-+
T Consensus 231 aisnIaVd-~~~Rk~Laqaep~lv~~Lv~Lmd~~s~kvkcqA~lALrnl 278 (550)
T KOG4224|consen 231 AISNIAVD-RRARKILAQAEPKLVPALVDLMDDGSDKVKCQAGLALRNL 278 (550)
T ss_pred Hhhhhhhh-HHHHHHHHhcccchHHHHHHHHhCCChHHHHHHHHHHhhh
Confidence 99998532 23346788888 888888887665446555555555544
No 10
>KOG1048|consensus
Probab=94.35 E-value=0.12 Score=49.08 Aligned_cols=109 Identities=22% Similarity=0.164 Sum_probs=80.2
Q ss_pred hhhhcCCCCHHHHHhhhhhhhhhcc----CCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhh-------------
Q psy4702 17 LFLLTCEEDTETACAAAGALAMLTS----VSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIV------------- 78 (138)
Q Consensus 17 Lvll~~eED~~tr~AA~GALAmLT~----~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv------------- 78 (138)
|.++...-...|.-|++|||=.||. ++..+|..++. ++++.+|..|+..+++.+..=+.-.+
T Consensus 524 l~Ll~~s~n~~TlEasaGaLQNltA~~~~~~~~~~~~v~~kekgl~~l~~ll~~~~~~vv~s~a~~LrNls~d~rnk~li 603 (717)
T KOG1048|consen 524 LLLLALSKNDNTLEASAGALQNLTAGLWTWSEYMRGAVFRKEKGLPPLVELLRNDDSDVVRSAAGALRNLSRDIRNKELI 603 (717)
T ss_pred HHHHHHhcchHHHHHhhhhHhhhhccCCcchhHHHhhhhhhccCccHHHHHHhcCCchHHHHHHHHHhhhccCchhhhhh
Confidence 6677766666799999999999997 34578888867 99999999999888777655444444
Q ss_pred ---------------------------------hhhhhcChHHHHHHhhhhHHHHHHHhhccccH-HHHHHHHHHHHHHH
Q psy4702 79 ---------------------------------GNILSAGKELAERVLSTELMEILMALTLLEGE-EVRKLALQALAKGE 124 (138)
Q Consensus 79 ---------------------------------~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~-~v~~~a~eaLk~~~ 124 (138)
.||+..+-+.|+.+.+.++++-|+.+.|.... ++.+.|-.-|.-+-
T Consensus 604 gk~a~~~lv~~Lp~~~~~~~~sedtv~~vc~tl~niv~~~~~nAkdl~~~~g~~kL~~I~~s~~S~k~~kaAs~vL~~lW 683 (717)
T KOG1048|consen 604 GKYAIPDLVRCLPGSGPSTSLSEDTVRAVCHTLNNIVRKNVLNAKDLLEIKGIPKLRLISKSQHSPKEFKAASSVLDVLW 683 (717)
T ss_pred hcchHHHHHHhCcCCCCCcCchHHHHHHHHHhHHHHHHHhHHHHHHHHhccChHHHHHHhcccCCHHHHHHHHHHHHHHH
Confidence 44554555566778888888888888887554 77777766555444
Q ss_pred h
Q psy4702 125 E 125 (138)
Q Consensus 125 ~ 125 (138)
.
T Consensus 684 ~ 684 (717)
T KOG1048|consen 684 Q 684 (717)
T ss_pred H
Confidence 3
No 11
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=94.33 E-value=0.16 Score=39.15 Aligned_cols=90 Identities=20% Similarity=0.206 Sum_probs=62.5
Q ss_pred HHHHHhhhhhhhhhccCCHHHHhhhhh--ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHH
Q psy4702 26 TETACAAAGALAMLTSVSTPVCSMLLE--KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMA 103 (138)
Q Consensus 26 ~~tr~AA~GALAmLT~~~~~~c~~I~~--~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~ 103 (138)
+..|.-+--+++- .|.+-+. .+|.+.+...+.|+++.+|...+.++.+++..+ --|+...-+.+++..
T Consensus 2 ~~vR~n~i~~l~D-------L~~r~~~~ve~~~~~l~~~L~D~~~~VR~~al~~Ls~Li~~d---~ik~k~~l~~~~l~~ 71 (178)
T PF12717_consen 2 PSVRNNAIIALGD-------LCIRYPNLVEPYLPNLYKCLRDEDPLVRKTALLVLSHLILED---MIKVKGQLFSRILKL 71 (178)
T ss_pred HHHHHHHHHHHHH-------HHHhCcHHHHhHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcC---ceeehhhhhHHHHHH
Confidence 4445544444443 4445555 679999999999999999999999999998764 234444444677765
Q ss_pred hhccccHHHHHHHHHHHHHHHhC
Q psy4702 104 LTLLEGEEVRKLALQALAKGEEY 126 (138)
Q Consensus 104 ~~k~~~~~v~~~a~eaLk~~~~~ 126 (138)
+ ..+.++|.+.|..++......
T Consensus 72 l-~D~~~~Ir~~A~~~~~e~~~~ 93 (178)
T PF12717_consen 72 L-VDENPEIRSLARSFFSELLKK 93 (178)
T ss_pred H-cCCCHHHHHHHHHHHHHHHHh
Confidence 5 333338999999988766543
No 12
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=92.96 E-value=0.65 Score=38.55 Aligned_cols=69 Identities=20% Similarity=0.238 Sum_probs=57.8
Q ss_pred hhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcC
Q psy4702 15 RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAG 85 (138)
Q Consensus 15 klLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~ 85 (138)
+++-+|-..+|+..+..|-.+++..++ ++ ..+.++. -.++.++..++.+|++.++=+++-.+.||-...
T Consensus 16 ~Ll~lL~~t~dp~i~e~al~al~n~aa-f~-~nq~~Ir~~Ggi~lI~~lL~~p~~~vr~~AL~aL~Nls~~~ 85 (254)
T PF04826_consen 16 KLLCLLESTEDPFIQEKALIALGNSAA-FP-FNQDIIRDLGGISLIGSLLNDPNPSVREKALNALNNLSVND 85 (254)
T ss_pred HHHHHHhcCCChHHHHHHHHHHHhhcc-Ch-hHHHHHHHcCCHHHHHHHcCCCChHHHHHHHHHHHhcCCCh
Confidence 344555667999999999999999987 56 6666666 889999999999999999999999999995543
No 13
>PF09759 Atx10homo_assoc: Spinocerebellar ataxia type 10 protein domain; InterPro: IPR019156 This is the conserved C-terminal 100 residues of Ataxin-10. Ataxin-10 belongs to the family of armadillo repeat proteins and in solution it tends to form homotrimeric complexes, which associate via a tip-to-tip association in a horseshoe-shaped contact with the concave sides of the molecules facing each other. This domain may represent the homo-association site since that is located near the C terminus of Ataxin-10. The protein does not contain a signal sequence for secretion or any subcellular compartment confirming its cytoplasmic localisation, specifically to the olivocerebellar region [].
Probab=92.17 E-value=0.27 Score=36.07 Aligned_cols=56 Identities=13% Similarity=0.076 Sum_probs=47.1
Q ss_pred hhhhhhccCCHHHHhhhhhccHHHHHHHHhcC--CCcccchhhhhhhhhhhhcChHHH
Q psy4702 34 GALAMLTSVSTPVCSMLLEKNWVDIFKQILAH--PDVGVQHRALVIVGNILSAGKELA 89 (138)
Q Consensus 34 GALAmLT~~~~~~c~~I~~~~wleil~~L~~~--~~~~lqHRG~viv~Nmi~a~~e~a 89 (138)
-.+|.||.-++.++..+.+-.+++.++.-|.- .||-++-++++++||+.....+.-
T Consensus 8 rlianl~~~~~~~Qd~vr~~~Gi~liL~~c~iD~~nP~irEwai~aiRnL~e~n~eNQ 65 (102)
T PF09759_consen 8 RLIANLCYKNKEVQDLVRELGGIPLILSCCNIDDHNPFIREWAIFAIRNLCEGNPENQ 65 (102)
T ss_pred HHHHHHHhCCHHHHHHHHHcCChHHHHHhcCCCcccHHHHHHHHHHHHHHHhCCHHHH
Confidence 35788888889999999996689999998864 688899999999999998876543
No 14
>PRK09687 putative lyase; Provisional
Probab=91.81 E-value=0.49 Score=39.45 Aligned_cols=99 Identities=18% Similarity=0.164 Sum_probs=58.9
Q ss_pred hhhhhhh-cCCCCHHHHHhhhhhhhhhccCC----H---------------HH----Hhhhhh---ccHHHHHHHHhcCC
Q psy4702 14 VRFLFLL-TCEEDTETACAAAGALAMLTSVS----T---------------PV----CSMLLE---KNWVDIFKQILAHP 66 (138)
Q Consensus 14 lklLvll-~~eED~~tr~AA~GALAmLT~~~----~---------------~~----c~~I~~---~~wleil~~L~~~~ 66 (138)
+.+|..+ ..+.|...|.+|.++|.-+.+.. + .+ +..+-. +.-++.|..++.|+
T Consensus 92 ~~~L~~l~~~D~d~~VR~~A~~aLG~~~~~~~~~~~~a~~~l~~~~~D~~~~VR~~a~~aLg~~~~~~ai~~L~~~L~d~ 171 (280)
T PRK09687 92 FNILNNLALEDKSACVRASAINATGHRCKKNPLYSPKIVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLINLLKDP 171 (280)
T ss_pred HHHHHHHHhcCCCHHHHHHHHHHHhcccccccccchHHHHHHHHHhhCCCHHHHHHHHHHHhccCCHHHHHHHHHHhcCC
Confidence 5566544 67788888989999988773200 1 11 111111 56678888888899
Q ss_pred CcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHH
Q psy4702 67 DVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALA 121 (138)
Q Consensus 67 ~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk 121 (138)
+++++|..+..+.++=..+.+ ..+.|....+.....|...|..+|.
T Consensus 172 ~~~VR~~A~~aLg~~~~~~~~---------~~~~L~~~L~D~~~~VR~~A~~aLg 217 (280)
T PRK09687 172 NGDVRNWAAFALNSNKYDNPD---------IREAFVAMLQDKNEEIRIEAIIGLA 217 (280)
T ss_pred CHHHHHHHHHHHhcCCCCCHH---------HHHHHHHHhcCCChHHHHHHHHHHH
Confidence 999999988888887222222 2333444433322255555555554
No 15
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=91.18 E-value=0.3 Score=28.86 Aligned_cols=40 Identities=28% Similarity=0.316 Sum_probs=35.5
Q ss_pred CHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhh
Q psy4702 43 STPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNIL 82 (138)
Q Consensus 43 ~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi 82 (138)
++.-+..|++..+++.|..|+.++++++|.-.+-.++|+.
T Consensus 1 ~~~~~~~i~~~g~i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 1 SPENKQAIVEAGGIPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp SHHHHHHHHHTTHHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHcccHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 3566777888889999999999999999999999999985
No 16
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=90.84 E-value=0.79 Score=41.05 Aligned_cols=75 Identities=23% Similarity=0.270 Sum_probs=66.1
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCC
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYG 127 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~g 127 (138)
++..+.|+..+.||++.+|.-++-.+.|++.++....+.+.+.+++..+..|...++..|.+.|..+|+.+....
T Consensus 76 ~~~~~~L~~gL~h~~~~Vr~l~l~~l~~~~~~~~~~~~~~~~~~l~~~i~~~L~~~d~~Va~~A~~~L~~l~~~~ 150 (503)
T PF10508_consen 76 PQYQPFLQRGLTHPSPKVRRLALKQLGRIARHSEGAAQLLVDNELLPLIIQCLRDPDLSVAKAAIKALKKLASHP 150 (503)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHHhcCCHHHHHHhcCccHHHHHHHHHcCCcHHHHHHHHHHHHHHhCCc
Confidence 567788999999999999999999999999988888899999999999999987776688999999999887543
No 17
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=90.72 E-value=0.53 Score=39.13 Aligned_cols=101 Identities=21% Similarity=0.164 Sum_probs=75.5
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHh-------cCCCcccchhhhhhhhhhhhcChHHHHHHhhhh
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQIL-------AHPDVGVQHRALVIVGNILSAGKELAERVLSTE 96 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~-------~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~ 96 (138)
+|...+..|..+|+.|-. .+..=..+.+.+++..|..++ ...+..+||-.+.|+--+- =++++++.+...+
T Consensus 162 ~~~~~~~~av~~L~~LL~-~~~~R~~f~~~~~v~~l~~iL~~~~~~~~~~~~Ql~Y~~ll~lWlLS-F~~~~~~~~~~~~ 239 (312)
T PF03224_consen 162 SDSELQYIAVQCLQNLLR-SKEYRQVFWKSNGVSPLFDILRKQATNSNSSGIQLQYQALLCLWLLS-FEPEIAEELNKKY 239 (312)
T ss_dssp HHH---HHHHHHHHHHHT-SHHHHHHHHTHHHHHHHHHHHH---------HHHHHHHHHHHHHHHT-TSHHHHHHHHTTS
T ss_pred CCcchHHHHHHHHHHHhC-cchhHHHHHhcCcHHHHHHHHHhhcccCCCCchhHHHHHHHHHHHHh-cCHHHHHHHhccc
Confidence 455556788888998876 465555555588999999988 2356678999999998764 4568899999999
Q ss_pred HHHHHHHhhccccH-HHHHHHHHHHHHHHhC
Q psy4702 97 LMEILMALTLLEGE-EVRKLALQALAKGEEY 126 (138)
Q Consensus 97 ~leiL~~~~k~~~~-~v~~~a~eaLk~~~~~ 126 (138)
++-.|..+.|.+.. +|+.++..+|+-+++.
T Consensus 240 ~i~~L~~i~~~~~KEKvvRv~la~l~Nl~~~ 270 (312)
T PF03224_consen 240 LIPLLADILKDSIKEKVVRVSLAILRNLLSK 270 (312)
T ss_dssp HHHHHHHHHHH--SHHHHHHHHHHHHHTTSS
T ss_pred hHHHHHHHHHhcccchHHHHHHHHHHHHHhc
Confidence 99999999998754 9999999999877754
No 18
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=90.63 E-value=0.43 Score=29.50 Aligned_cols=54 Identities=22% Similarity=0.168 Sum_probs=41.2
Q ss_pred HHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhh
Q psy4702 27 ETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNI 81 (138)
Q Consensus 27 ~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nm 81 (138)
..|.+|.-+|+.+....+...+. ..+.-++.|..++.|+++.+|...+..+.||
T Consensus 2 ~vR~~A~~aLg~l~~~~~~~~~~-~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 2 RVRRAAAWALGRLAEGCPELLQP-YLPELLPALIPLLQDDDDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHCTTTTTHHHHHH-HHHHHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHhhHhcccHHHHHH-HHHHHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence 57889999998876544444444 2277889999999999999988888877765
No 19
>KOG0166|consensus
Probab=90.40 E-value=1.5 Score=40.31 Aligned_cols=112 Identities=22% Similarity=0.246 Sum_probs=88.7
Q ss_pred hhhhhhhhc-CCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHH
Q psy4702 13 QVRFLFLLT-CEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAER 91 (138)
Q Consensus 13 RlklLvll~-~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~k 91 (138)
.++-+|-+- ..+++.++-.|+=||.-+.+-+...-+.+++..-+.+|..|..+++++++-.++-.+.|++..+......
T Consensus 110 ~v~~lV~~l~~~~~~~lq~eAAWaLTnIAsgtse~T~~vv~agavp~fi~Ll~s~~~~v~eQavWALgNIagds~~~Rd~ 189 (514)
T KOG0166|consen 110 VVPRLVEFLSRDDNPTLQFEAAWALTNIASGTSEQTKVVVDAGAVPIFIQLLSSPSADVREQAVWALGNIAGDSPDCRDY 189 (514)
T ss_pred cHHHHHHHHccCCChhHHHHHHHHHHHHhcCchhhccccccCCchHHHHHHhcCCcHHHHHHHHHHHhccccCChHHHHH
Confidence 455555544 4677899999999998887755566666667778899999999999999999999999999999899999
Q ss_pred HhhhhHHHHHHHhhccccH-HHHHHHHHHHHHHH
Q psy4702 92 VLSTELMEILMALTLLEGE-EVRKLALQALAKGE 124 (138)
Q Consensus 92 l~es~~leiL~~~~k~~~~-~v~~~a~eaLk~~~ 124 (138)
+++.|+|+-|......+.. ..+..+.-+|..+-
T Consensus 190 vl~~g~l~pLl~~l~~~~~~~~lRn~tW~LsNlc 223 (514)
T KOG0166|consen 190 VLSCGALDPLLRLLNKSDKLSMLRNATWTLSNLC 223 (514)
T ss_pred HHhhcchHHHHHHhccccchHHHHHHHHHHHHHH
Confidence 9999999999988766554 44555555555443
No 20
>PF08324 PUL: PUL domain; InterPro: IPR013535 The PUL (after PLAP, UFD3 and lub1) domain is a predicted predominantly alpha helical globular domain found in eukaryotes. It is found in association with either WD repeats (see PDOC00574 from PROSITEDOC) and the PFU domain (see PDOC51394 from PROSITEDOC) or PPPDE and thioredoxin (see PDOC00172 from PROSITEDOC) domains. The PUL domain is a protein-protein interaction domain [, ]. Some proteins known to contain a PUL domain are listed below: Saccharomyces cerevisiae DOA1 (UFD3, ZZZ4), involved in ubiquitin conjugation pathway. DOA1 participates in the regulation of the ubiquitin conjugation pathway involving CDC48 by hindering multiubiquitination of substrates at the CDC48 chaperone. Schizosaccharomyces pombe ubiquitin homeostasis protein lub1, acts as a negative regulator of vacuole-dependent ubiquitin degradation. Mammalian phospholipase A-2-activating protein (PLA2P, PLAA), the homologue of DOA1. PLA2P plays an important role in the regulation of specific inflammatory disease processes. ; PDB: 3EBB_A 3L3F_X 3GAE_B 3PST_A 3PSP_A.
Probab=90.01 E-value=0.36 Score=38.85 Aligned_cols=91 Identities=16% Similarity=0.188 Sum_probs=60.0
Q ss_pred CHHHHHhhhhhhhhhccCCHHHHhhhhh----ccHHHHHHH-HhcC-CCcccchhhhhhhhhhhhcChHHHHHHhhhhHH
Q psy4702 25 DTETACAAAGALAMLTSVSTPVCSMLLE----KNWVDIFKQ-ILAH-PDVGVQHRALVIVGNILSAGKELAERVLSTELM 98 (138)
Q Consensus 25 D~~tr~AA~GALAmLT~~~~~~c~~I~~----~~wleil~~-L~~~-~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~l 98 (138)
+..+|.|++..+--++.. .++.-.+ ..-+..+.. +... .++|..||.+|.+.|++..+.+..+....-|+-
T Consensus 170 ~kn~~~A~ATl~~Nlsv~---~~~~~~~~~~~~~ll~~i~~~~~~~~~d~Ea~~R~LvAlGtL~~~~~~~~~~~~~l~~~ 246 (268)
T PF08324_consen 170 NKNVRIALATLLLNLSVL---LHKNRSDEEWQSELLSSIIEVLSREESDEEALYRLLVALGTLLSSSDSAKQLAKSLDVK 246 (268)
T ss_dssp HHHHHHHHHHHHHHHHHH---HHHCTS-CCHHHHHHHHHHHHCHCCHTSHHHHHHHHHHHHHHHCCSHHHHHHCCCCTHH
T ss_pred cHHHHHHHHHHHHHHHHH---HHhcCCChHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHhccChhHHHHHHHcChH
Confidence 667777777766555431 1222112 123444444 2222 799999999999999998787777766666788
Q ss_pred HHHHHhhccccH-HHHHHHHH
Q psy4702 99 EILMALTLLEGE-EVRKLALQ 118 (138)
Q Consensus 99 eiL~~~~k~~~~-~v~~~a~e 118 (138)
.++....+.+.+ .+.+++.|
T Consensus 247 ~~~~~~~~~~~e~ri~~v~~e 267 (268)
T PF08324_consen 247 SVLSKKANKSKEPRIKEVAAE 267 (268)
T ss_dssp HHHHHHHHHTTSHHHHHHHHH
T ss_pred HHHHHHHhcccchHHHHHhcc
Confidence 888777766555 77888876
No 21
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=88.44 E-value=1.7 Score=31.14 Aligned_cols=70 Identities=20% Similarity=0.182 Sum_probs=44.4
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcCh-HHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHh
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGK-ELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~-e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~ 125 (138)
+.-++.++....|+|..+||=..-.++|+....+ ++-..+ .++++.|..+...+++.|...| +-|..++.
T Consensus 26 ~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~k~~~~~~l~~f--~~IF~~L~kl~~D~d~~Vr~~a-~~Ld~llk 96 (97)
T PF12755_consen 26 DEILPPVLKCFDDQDSRVRYYACEALYNISKVARGEILPYF--NEIFDALCKLSADPDENVRSAA-ELLDRLLK 96 (97)
T ss_pred HHHHHHHHHHcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHcCCchhHHHHH-HHHHHHhc
Confidence 6677778888899999999999999999986433 222222 2344444444333333554444 55776653
No 22
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=87.93 E-value=1.6 Score=33.27 Aligned_cols=92 Identities=16% Similarity=0.142 Sum_probs=60.5
Q ss_pred HHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhc--CCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHH
Q psy4702 27 ETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILA--HPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMA 103 (138)
Q Consensus 27 ~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~--~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~ 103 (138)
+...|.+-.-+.+- .-|.+|..|+. +..++.+..++. .+++.+ ...+.=+.|-. +.++-....+-+..+++|..
T Consensus 59 ~~i~~~~~l~~lfp-~~~dv~~~l~~~eg~~~~l~~~~~~~~~~~~~-~~~~lell~aA-c~d~~~r~~I~~~~~~~L~~ 135 (157)
T PF11701_consen 59 SLIIAFSALTALFP-GPPDVGSELFLSEGFLESLLPLASRKSKDRKV-QKAALELLSAA-CIDKSCRTFISKNYVSWLKE 135 (157)
T ss_dssp HHHHHHHHHHHHCT-TTHHHHHHHCCTTTHHHHHHHHHH-CTS-HHH-HHHHHHHHHHH-TTSHHHHHCCHHHCHHHHHH
T ss_pred hHHHHHHHHHHHhC-CCHHHHHHHHhhhhHHHHHHHHHhcccCCHHH-HHHHHHHHHHH-HccHHHHHHHHHHHHHHHHH
Confidence 34444444444444 47999999999 999999999999 455544 34433333333 34556677888899999999
Q ss_pred hhccccH-H-HHHHHHHHHH
Q psy4702 104 LTLLEGE-E-VRKLALQALA 121 (138)
Q Consensus 104 ~~k~~~~-~-v~~~a~eaLk 121 (138)
..|.+.+ . +.-.|.=+|-
T Consensus 136 ~~~~~~~~~~ir~~A~v~L~ 155 (157)
T PF11701_consen 136 LYKNSKDDSEIRVLAAVGLC 155 (157)
T ss_dssp HTTTCC-HH-CHHHHHHHHH
T ss_pred HHccccchHHHHHHHHHHHh
Confidence 9988765 3 5444444443
No 23
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=87.74 E-value=1.4 Score=27.11 Aligned_cols=53 Identities=26% Similarity=0.295 Sum_probs=41.4
Q ss_pred ccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHH
Q psy4702 69 GVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAK 122 (138)
Q Consensus 69 ~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~ 122 (138)
.+|+.++-.+.|+.....+..+. .-.+++..|..+.+.+...|+..|..||..
T Consensus 2 ~vR~~A~~aLg~l~~~~~~~~~~-~~~~~~~~L~~~L~d~~~~VR~~A~~aLg~ 54 (55)
T PF13513_consen 2 RVRRAAAWALGRLAEGCPELLQP-YLPELLPALIPLLQDDDDSVRAAAAWALGN 54 (55)
T ss_dssp HHHHHHHHHHHCTTTTTHHHHHH-HHHHHHHHHHHHTTSSSHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHhhHhcccHHHHHH-HHHHHHHHHHHHHcCCCHHHHHHHHHHHhc
Confidence 46778888888877777666665 677889999998876655999999988864
No 24
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=86.79 E-value=0.82 Score=25.89 Aligned_cols=38 Identities=13% Similarity=0.128 Sum_probs=32.1
Q ss_pred HHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhh
Q psy4702 45 PVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNIL 82 (138)
Q Consensus 45 ~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi 82 (138)
..+..+.+...++.|..|+.++++++|.=.+-.++|+.
T Consensus 3 ~~~~~i~~~g~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 3 EQKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred HHHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 34555556779999999999999999999999999985
No 25
>KOG4500|consensus
Probab=86.01 E-value=2.7 Score=38.93 Aligned_cols=91 Identities=15% Similarity=0.256 Sum_probs=70.5
Q ss_pred HHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhcc
Q psy4702 29 ACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLL 107 (138)
Q Consensus 29 r~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~ 107 (138)
-++|---.-.||+ ++ --.++.. |++++.+..-..++|.+++--|..-+.|....| +++..+++.+.++-|+.|.-+
T Consensus 291 k~~~el~vllltG-De-SMq~L~~~p~~l~~~~sw~~S~d~~l~t~g~LaigNfaR~D-~~ci~~v~~~~~nkL~~~l~~ 367 (604)
T KOG4500|consen 291 KRIAELDVLLLTG-DE-SMQKLHADPQFLDFLESWFRSDDSNLITMGSLAIGNFARRD-DICIQLVQKDFLNKLISCLMQ 367 (604)
T ss_pred HhhhhHhhhhhcC-ch-HHHHHhcCcHHHHHHHHHhcCCchhHHHHHHHHHHhhhccc-hHHHHHHHHHHHHHHHHHHHH
Confidence 4555555667787 34 4567777 889999999999999999999999999998766 789999999999999999866
Q ss_pred cc----H-HHHHHHHHHHHH
Q psy4702 108 EG----E-EVRKLALQALAK 122 (138)
Q Consensus 108 ~~----~-~v~~~a~eaLk~ 122 (138)
+. + +++-.+-.||.-
T Consensus 368 ~~~vdgnV~~qhA~lsALRn 387 (604)
T KOG4500|consen 368 EKDVDGNVERQHACLSALRN 387 (604)
T ss_pred hcCCCccchhHHHHHHHHHh
Confidence 32 3 545555555543
No 26
>KOG4224|consensus
Probab=85.84 E-value=0.9 Score=41.27 Aligned_cols=103 Identities=20% Similarity=0.160 Sum_probs=73.1
Q ss_pred hhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHH
Q psy4702 19 LLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELM 98 (138)
Q Consensus 19 ll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~l 98 (138)
++...-|....+||+-++.-|.- .-.=|--|..-.|++.|..=.+.|..++|.-.+=|+-|++.- .+.--|+-.+|++
T Consensus 92 ~llqs~d~~Iq~aa~~alGnlAV-n~enk~liv~l~Gl~~Li~qmmtd~vevqcnaVgCitnLaT~-d~nk~kiA~sGaL 169 (550)
T KOG4224|consen 92 ALLQSCDKCIQCAAGEALGNLAV-NMENKGLIVSLLGLDLLILQMMTDGVEVQCNAVGCITNLATF-DSNKVKIARSGAL 169 (550)
T ss_pred HHHhCcchhhhhhhhhhhcccee-ccCCceEEEeccChHHHHHHhcCCCcEEEeeehhhhhhhhcc-ccchhhhhhccch
Confidence 34444556666776666654432 233333344456888888888899999999999999999875 3556789999999
Q ss_pred HHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 99 EILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 99 eiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
++|+.++|..+-.+...+..+|-.+
T Consensus 170 ~pltrLakskdirvqrnatgaLlnm 194 (550)
T KOG4224|consen 170 EPLTRLAKSKDIRVQRNATGALLNM 194 (550)
T ss_pred hhhHhhcccchhhHHHHHHHHHHHh
Confidence 9999988876546666676666443
No 27
>KOG1048|consensus
Probab=85.70 E-value=1.5 Score=41.78 Aligned_cols=93 Identities=14% Similarity=0.052 Sum_probs=71.8
Q ss_pred hhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChH--HHHH
Q psy4702 15 RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKE--LAER 91 (138)
Q Consensus 15 klLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e--~a~k 91 (138)
+=.+.|++--|...+..|++=+.+++- .+.-+++-+. -+++.-|..|+.|++.++|-...=.+||++..+-. ---.
T Consensus 236 pe~i~mL~~q~~~~qsnaaaylQHlcf-gd~~ik~~vrqlggI~kLv~Ll~~~~~evq~~acgaLRNLvf~~~~~~NKla 314 (717)
T KOG1048|consen 236 PEVISMLMSQDPSVQSNAAAYLQHLCF-GDNKIKSRVRQLGGIPKLVALLDHRNDEVQRQACGALRNLVFGKSTDSNKLA 314 (717)
T ss_pred HHHHHHHhccChhhhHHHHHHHHHHHh-hhHHHHHHHHHhccHHHHHHHhcCCcHHHHHHHHHHHHhhhcccCCcccchh
Confidence 445778888888899999999999997 5555555444 88999999999999999999999999999975322 2233
Q ss_pred HhhhhHHHHHHHhhccc
Q psy4702 92 VLSTELMEILMALTLLE 108 (138)
Q Consensus 92 l~es~~leiL~~~~k~~ 108 (138)
+.+-+++--|+.+.+.+
T Consensus 315 i~~~~Gv~~l~~~Lr~t 331 (717)
T KOG1048|consen 315 IKELNGVPTLVRLLRHT 331 (717)
T ss_pred hhhcCChHHHHHHHHhh
Confidence 66677777777776654
No 28
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=85.61 E-value=4.8 Score=38.33 Aligned_cols=93 Identities=20% Similarity=0.146 Sum_probs=79.2
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHh
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVL 93 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~ 93 (138)
+++|+-+-+.++.++...+.+.|--||- ++.--..|.+..-++-|..|..++++++++=++=+++|+- .+.++..+++
T Consensus 292 V~~Lv~~Ldr~n~ellil~v~fLkkLSi-~~ENK~~m~~~giV~kL~kLl~s~~~~l~~~aLrlL~NLS-fd~~~R~~mV 369 (708)
T PF05804_consen 292 VSLLVKCLDRENEELLILAVTFLKKLSI-FKENKDEMAESGIVEKLLKLLPSENEDLVNVALRLLFNLS-FDPELRSQMV 369 (708)
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHcC-CHHHHHHHHHcCCHHHHHHHhcCCCHHHHHHHHHHHHHhC-cCHHHHHHHH
Confidence 4566666678899999999999999997 6776666766779999999999999999999999999985 5668889999
Q ss_pred hhhHHHHHHHhhccc
Q psy4702 94 STELMEILMALTLLE 108 (138)
Q Consensus 94 es~~leiL~~~~k~~ 108 (138)
+.|++.-|..+.+.+
T Consensus 370 ~~GlIPkLv~LL~d~ 384 (708)
T PF05804_consen 370 SLGLIPKLVELLKDP 384 (708)
T ss_pred HCCCcHHHHHHhCCC
Confidence 999999888887754
No 29
>KOG4199|consensus
Probab=84.27 E-value=3.8 Score=36.89 Aligned_cols=96 Identities=11% Similarity=0.239 Sum_probs=64.1
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCc-ccc--hhhhhhhhhhhhcChHHHHHHhhhhHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDV-GVQ--HRALVIVGNILSAGKELAERVLSTELMEI 100 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~-~lq--HRG~viv~Nmi~a~~e~a~kl~es~~lei 100 (138)
-|++.-.-++|+|-.|.- .+..|+.|.+..|+++|.++..|-|+ +.+ -|-+.-..--++.+.++-..|++.|++++
T Consensus 254 ~dp~~L~~l~~tl~~lAV-r~E~C~~I~e~GGl~tl~~~i~d~n~~~~r~l~k~~lslLralAG~DsvKs~IV~~gg~~~ 332 (461)
T KOG4199|consen 254 IDPDSLVSLSTTLKALAV-RDEICKSIAESGGLDTLLRCIDDSNEQGNRTLAKTCLSLLRALAGSDSVKSTIVEKGGLDK 332 (461)
T ss_pred CCccHHHHHHHHHHHHHH-HHHHHHHHHHccCHHHHHHHHhhhchhhHHHHHHHHHHHHHHHhCCCchHHHHHHhcChHH
Confidence 456666667788877765 79999999998899999998887443 332 12222222233445566678999999999
Q ss_pred HHHhhcc-ccH-HHHHHHHHHH
Q psy4702 101 LMALTLL-EGE-EVRKLALQAL 120 (138)
Q Consensus 101 L~~~~k~-~~~-~v~~~a~eaL 120 (138)
+..+.-. +++ .|++.+--|+
T Consensus 333 ii~l~~~h~~~p~Vi~~~~a~i 354 (461)
T KOG4199|consen 333 IITLALRHSDDPLVIQEVMAII 354 (461)
T ss_pred HHHHHHHcCCChHHHHHHHHHH
Confidence 9988764 223 5555444443
No 30
>KOG2160|consensus
Probab=83.03 E-value=16 Score=32.13 Aligned_cols=95 Identities=16% Similarity=0.139 Sum_probs=79.4
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCc-ccchhhhhhhhhhhhcChHHHHHH
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDV-GVQHRALVIVGNILSAGKELAERV 92 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~-~lqHRG~viv~Nmi~a~~e~a~kl 92 (138)
+..++.|-...|..+|..|+=.++...+..|++-..+++..+++.|..++.++++ ..+--.++-+.++|...+-.-.++
T Consensus 126 l~~ll~~l~~~~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~~ls~~~~~~~r~kaL~AissLIRn~~~g~~~f 205 (342)
T KOG2160|consen 126 LVPLLGYLENSDAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLKILSSDDPNTVRTKALFAISSLIRNNKPGQDEF 205 (342)
T ss_pred HHHHHHHhcCCcHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHHHHccCCCchHHHHHHHHHHHHHhcCcHHHHHH
Confidence 3455557777899999999999999888899999999996699988888776544 455778899999999888777899
Q ss_pred hhhhHHHHHHHhhccc
Q psy4702 93 LSTELMEILMALTLLE 108 (138)
Q Consensus 93 ~es~~leiL~~~~k~~ 108 (138)
...++..+|+.+.+..
T Consensus 206 l~~~G~~~L~~vl~~~ 221 (342)
T KOG2160|consen 206 LKLNGYQVLRDVLQSN 221 (342)
T ss_pred HhcCCHHHHHHHHHcC
Confidence 9999999999998874
No 31
>PF11698 V-ATPase_H_C: V-ATPase subunit H; InterPro: IPR011987 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents the C-terminal domain of subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=82.69 E-value=0.87 Score=34.33 Aligned_cols=69 Identities=17% Similarity=0.215 Sum_probs=50.0
Q ss_pred hhhhhh-hcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhh
Q psy4702 14 VRFLFL-LTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILS 83 (138)
Q Consensus 14 lklLvl-l~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~ 83 (138)
+|.|+- |...+|..+-.-|..=+..+--.+|. .++|++ -+..+.+-.|+.|+|+++++.++.+|.=+|.
T Consensus 45 lk~L~~lL~~s~d~~~laVac~Dig~~vr~~p~-gr~ii~~lg~K~~vM~Lm~h~d~eVr~eAL~avQklm~ 115 (119)
T PF11698_consen 45 LKKLIKLLDKSDDPTTLAVACHDIGEFVRHYPN-GRNIIEKLGAKERVMELMNHEDPEVRYEALLAVQKLMV 115 (119)
T ss_dssp HHHHHHHH-SHHHHHHHHHHHHHHHHHHHH-GG-GHHHHHHHSHHHHHHHHTS-SSHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHccCCCcceeehhhcchHHHHHHChh-HHHHHHhcChHHHHHHHhcCCCHHHHHHHHHHHHHHHH
Confidence 444444 44557777776677667766654554 677777 7899999999999999999999999987664
No 32
>PRK09687 putative lyase; Provisional
Probab=82.44 E-value=5.2 Score=33.31 Aligned_cols=25 Identities=20% Similarity=0.126 Sum_probs=12.5
Q ss_pred HHHHHHHhhccccHHHHHHHHHHHH
Q psy4702 97 LMEILMALTLLEGEEVRKLALQALA 121 (138)
Q Consensus 97 ~leiL~~~~k~~~~~v~~~a~eaLk 121 (138)
.++.|....+.+...|+..|..+|-
T Consensus 129 a~~~l~~~~~D~~~~VR~~a~~aLg 153 (280)
T PRK09687 129 IVEQSQITAFDKSTNVRFAVAFALS 153 (280)
T ss_pred HHHHHHHHhhCCCHHHHHHHHHHHh
Confidence 3444444433332266666666663
No 33
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=82.37 E-value=6.1 Score=23.07 Aligned_cols=38 Identities=21% Similarity=0.110 Sum_probs=30.7
Q ss_pred hHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 86 KELAERVLSTELMEILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 86 ~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
.+..+.+++.|++..|..+.+.++.++++.|.-||.-+
T Consensus 2 ~~~~~~i~~~g~i~~Lv~ll~~~~~~v~~~a~~al~nl 39 (41)
T PF00514_consen 2 PENKQAIVEAGGIPPLVQLLKSPDPEVQEEAAWALGNL 39 (41)
T ss_dssp HHHHHHHHHTTHHHHHHHHTTSSSHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHcccHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 34567799999999999999955558889998888644
No 34
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=81.91 E-value=4.6 Score=38.67 Aligned_cols=97 Identities=24% Similarity=0.256 Sum_probs=53.8
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHH---hhhhh-ccH------------------HHHHHHHhcCCCcccc
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVC---SMLLE-KNW------------------VDIFKQILAHPDVGVQ 71 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c---~~I~~-~~w------------------leil~~L~~~~~~~lq 71 (138)
+..|..+..++|...|.+|..+|+.+-. .+.+. ...++ ++| ++.|..++.|+++.+|
T Consensus 777 ~~~L~~ll~D~d~~VR~aA~~aLg~~g~-~~~~~~~l~~aL~d~d~~VR~~Aa~aL~~l~~~~a~~~L~~~L~D~~~~VR 855 (897)
T PRK13800 777 GDAVRALTGDPDPLVRAAALAALAELGC-PPDDVAAATAALRASAWQVRQGAARALAGAAADVAVPALVEALTDPHLDVR 855 (897)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHhcCC-cchhHHHHHHHhcCCChHHHHHHHHHHHhccccchHHHHHHHhcCCCHHHH
Confidence 3556666677788888888888887764 22221 11111 222 3445555566666666
Q ss_pred hhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHH
Q psy4702 72 HRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALA 121 (138)
Q Consensus 72 HRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk 121 (138)
...+-.+..+ .. .....+.|....+.++..|++.|.++|.
T Consensus 856 ~~A~~aL~~~-~~---------~~~a~~~L~~al~D~d~~Vr~~A~~aL~ 895 (897)
T PRK13800 856 KAAVLALTRW-PG---------DPAARDALTTALTDSDADVRAYARRALA 895 (897)
T ss_pred HHHHHHHhcc-CC---------CHHHHHHHHHHHhCCCHHHHHHHHHHHh
Confidence 6555555443 11 1123445555545443477777777775
No 35
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=81.86 E-value=3 Score=27.17 Aligned_cols=57 Identities=28% Similarity=0.313 Sum_probs=39.7
Q ss_pred HHHHHHh-cCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHH
Q psy4702 57 DIFKQIL-AHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGE 124 (138)
Q Consensus 57 eil~~L~-~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~ 124 (138)
+.|...+ .++++.+|+..+.++.++ ...+.+..|..+.+.+...|+..|..+|...-
T Consensus 2 ~~L~~~l~~~~~~~vr~~a~~~L~~~-----------~~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i~ 59 (88)
T PF13646_consen 2 PALLQLLQNDPDPQVRAEAARALGEL-----------GDPEAIPALIELLKDEDPMVRRAAARALGRIG 59 (88)
T ss_dssp HHHHHHHHTSSSHHHHHHHHHHHHCC-----------THHHHHHHHHHHHTSSSHHHHHHHHHHHHCCH
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHHc-----------CCHhHHHHHHHHHcCCCHHHHHHHHHHHHHhC
Confidence 4556655 899999999887777743 22255778888875443388888888887553
No 36
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=81.03 E-value=5 Score=22.49 Aligned_cols=36 Identities=31% Similarity=0.195 Sum_probs=28.9
Q ss_pred HHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 88 LAERVLSTELMEILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 88 ~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
....+++.|+++.|..+.+.++.++++.|..+|.-+
T Consensus 4 ~~~~i~~~g~i~~L~~ll~~~~~~i~~~a~~aL~nl 39 (41)
T smart00185 4 QKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNL 39 (41)
T ss_pred HHHHHHHCCCHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 456788999999999998855448888888888654
No 37
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=80.77 E-value=3.7 Score=37.20 Aligned_cols=93 Identities=19% Similarity=0.251 Sum_probs=75.6
Q ss_pred hhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhh
Q psy4702 15 RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLS 94 (138)
Q Consensus 15 klLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~e 94 (138)
-+|.=|--..|+++-.-|-=|+..|+...-.+...|++..-+.-|..|+.||+..+|--.+-.|.|++..+..--.-++.
T Consensus 246 piL~KLiys~D~evlvDA~WAiSYlsDg~~E~i~avld~g~~~RLvElLs~~sa~iqtPalR~vGNIVTG~D~QTqviI~ 325 (526)
T COG5064 246 PILAKLIYSRDPEVLVDACWAISYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIVTGSDDQTQVIIN 325 (526)
T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHhccCcHHHHHHHHhcCCcHHHHHHhcCccccccCHHHHhhcCeeecCccceehhee
Confidence 34444455689999999999999999876677788888445555999999999999999999999999887666677889
Q ss_pred hhHHHHHHHhhcc
Q psy4702 95 TELMEILMALTLL 107 (138)
Q Consensus 95 s~~leiL~~~~k~ 107 (138)
.|.+-.+..+...
T Consensus 326 ~G~L~a~~~lLs~ 338 (526)
T COG5064 326 CGALKAFRSLLSS 338 (526)
T ss_pred cccHHHHHHHhcC
Confidence 9988888877644
No 38
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=79.78 E-value=1.5 Score=24.68 Aligned_cols=28 Identities=29% Similarity=0.417 Sum_probs=23.7
Q ss_pred HHHHHHHhcCCCcccchhhhhhhhhhhh
Q psy4702 56 VDIFKQILAHPDVGVQHRALVIVGNILS 83 (138)
Q Consensus 56 leil~~L~~~~~~~lqHRG~viv~Nmi~ 83 (138)
++.|+.++.||++++|.-.+.++.++..
T Consensus 2 lp~l~~~l~D~~~~VR~~a~~~l~~i~~ 29 (31)
T PF02985_consen 2 LPILLQLLNDPSPEVRQAAAECLGAIAE 29 (31)
T ss_dssp HHHHHHHHT-SSHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHcCCCCHHHHHHHHHHHHHHHh
Confidence 5788999999999999999999888764
No 39
>PF09324 DUF1981: Domain of unknown function (DUF1981); InterPro: IPR015403 This domain is functionally uncharacterised and found in various plant and yeast protein transport proteins. It is noramlly associated with and C-termianl to the SEC7 domain. The SEC7 domain was named after the first protein found to contain such a region []. It has been shown to be linked with guanine nucleotide exchange function [, ].
Probab=79.62 E-value=5.2 Score=27.83 Aligned_cols=65 Identities=23% Similarity=0.348 Sum_probs=48.5
Q ss_pred cHHHHHHHH-hcCCCcccchhhhhhhhhhhhcChHHHHHHhhhh---HHHHHHHhhccccHHHHHHHHHHHHH
Q psy4702 54 NWVDIFKQI-LAHPDVGVQHRALVIVGNILSAGKELAERVLSTE---LMEILMALTLLEGEEVRKLALQALAK 122 (138)
Q Consensus 54 ~wleil~~L-~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~---~leiL~~~~k~~~~~v~~~a~eaLk~ 122 (138)
..+.=|..+ ..+++.++|-+-+-|+.+|+.+.+ .++ .+| +++||...++.+.+.++..|-++++.
T Consensus 17 ~fL~Pf~~i~~~~~~~~vre~il~ci~qil~~~~---~~i-~SGW~~if~il~~aa~~~~e~lv~~af~~~~~ 85 (86)
T PF09324_consen 17 DFLKPFEYIMSNNPSIDVRELILECILQILQSRG---ENI-KSGWKVIFSILRAAAKDNDESLVRLAFQIVQL 85 (86)
T ss_pred HHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHhH---HHH-HhccHHHHHHHHHHHhCCCccHHHHHHHHHhh
Confidence 344445555 455788999999999999998764 344 666 78888888887655889999888763
No 40
>KOG4199|consensus
Probab=79.40 E-value=6.8 Score=35.36 Aligned_cols=98 Identities=20% Similarity=0.168 Sum_probs=66.3
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCc-ccchhhhhhhhhhhhcChHHHHHHhhhhHHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDV-GVQHRALVIVGNILSAGKELAERVLSTELMEIL 101 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~-~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL 101 (138)
+|+-...++-.++.+|+=.+|.=..++++ .-..=+++.|=+||.. .+|..|...+|||+..+.+-.+-+..-| .|-|
T Consensus 342 ~~p~Vi~~~~a~i~~l~LR~pdhsa~~ie~G~a~~avqAmkahP~~a~vQrnac~~IRNiv~rs~~~~~~~l~~G-iE~L 420 (461)
T KOG4199|consen 342 DDPLVIQEVMAIISILCLRSPDHSAKAIEAGAADLAVQAMKAHPVAAQVQRNACNMIRNIVVRSAENRTILLANG-IEKL 420 (461)
T ss_pred CChHHHHHHHHHHHHHHhcCcchHHHHHhcchHHHHHHHHHhCcHHHHHHHHHHHHHHHHHHhhhhccchHHhcc-HHHH
Confidence 45545555555556666556666667777 5667788999999854 6888899999999998877776666554 5555
Q ss_pred HHhhccccHHHHHHHHHHHHH
Q psy4702 102 MALTLLEGEEVRKLALQALAK 122 (138)
Q Consensus 102 ~~~~k~~~~~v~~~a~eaLk~ 122 (138)
.--+|.+.+..-+.|..||.-
T Consensus 421 i~~A~~~h~tce~~akaALRD 441 (461)
T KOG4199|consen 421 IRTAKANHETCEAAAKAALRD 441 (461)
T ss_pred HHHHHhcCccHHHHHHHHHHh
Confidence 555566544444666666543
No 41
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=78.35 E-value=15 Score=28.36 Aligned_cols=103 Identities=18% Similarity=0.103 Sum_probs=58.2
Q ss_pred hhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccH-HHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhh
Q psy4702 17 LFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNW-VDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLST 95 (138)
Q Consensus 17 Lvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~w-leil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es 95 (138)
|+-.++..-.-.+.+|.-+|-.+... -.. .++- ++++...+.|.++.+|.+-+-.+.+++..-+.-...+-.+
T Consensus 99 Ll~~~~~~~~~i~~~a~~~L~~i~~~-~~~-----~~~~~~~~l~~~~~~Kn~~vR~~~~~~l~~~l~~~~~~~~~l~~~ 172 (228)
T PF12348_consen 99 LLKKLGDSKKFIREAANNALDAIIES-CSY-----SPKILLEILSQGLKSKNPQVREECAEWLAIILEKWGSDSSVLQKS 172 (228)
T ss_dssp HHHGGG---HHHHHHHHHHHHHHHTT-S-H-------HHHHHHHHHHTT-S-HHHHHHHHHHHHHHHTT-----GGG--H
T ss_pred HHHHHccccHHHHHHHHHHHHHHHHH-CCc-----HHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHccchHhhhccc
Confidence 33344444455667776666655541 111 1233 7788888999999999999999999987654222222222
Q ss_pred ----hHHHHHHHhhccccHHHHHHHHHHHHHHHh
Q psy4702 96 ----ELMEILMALTLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 96 ----~~leiL~~~~k~~~~~v~~~a~eaLk~~~~ 125 (138)
.+...+..+......+|++.|++++..+..
T Consensus 173 ~~~~~l~~~l~~~l~D~~~~VR~~Ar~~~~~l~~ 206 (228)
T PF12348_consen 173 AFLKQLVKALVKLLSDADPEVREAARECLWALYS 206 (228)
T ss_dssp HHHHHHHHHHHHHHTSS-HHHHHHHHHHHHHHHH
T ss_pred chHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
Confidence 344455444444434999999999987754
No 42
>PF04826 Arm_2: Armadillo-like; InterPro: IPR006911 This entry consists of mammalian proteins of unknown function.
Probab=78.25 E-value=11 Score=31.34 Aligned_cols=86 Identities=14% Similarity=0.224 Sum_probs=67.3
Q ss_pred hcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHH
Q psy4702 20 LTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELME 99 (138)
Q Consensus 20 l~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~le 99 (138)
.++.-|.+.+.|+-++|.-||- .+.. ..++.+ .+.-|..|+..++..+|+..+-++.|+- .+.+.++.++..+++.
T Consensus 103 ~s~~lns~~Q~agLrlL~nLtv-~~~~-~~~l~~-~i~~ll~LL~~G~~~~k~~vLk~L~nLS-~np~~~~~Ll~~q~~~ 178 (254)
T PF04826_consen 103 VSSPLNSEVQLAGLRLLTNLTV-TNDY-HHMLAN-YIPDLLSLLSSGSEKTKVQVLKVLVNLS-ENPDMTRELLSAQVLS 178 (254)
T ss_pred hcCCCCCHHHHHHHHHHHccCC-Ccch-hhhHHh-hHHHHHHHHHcCChHHHHHHHHHHHHhc-cCHHHHHHHHhccchh
Confidence 3445677899999999999986 4444 334432 4555678999999999999999999986 5568889999999999
Q ss_pred HHHHhhcccc
Q psy4702 100 ILMALTLLEG 109 (138)
Q Consensus 100 iL~~~~k~~~ 109 (138)
-++.+-..+.
T Consensus 179 ~~~~Lf~~~~ 188 (254)
T PF04826_consen 179 SFLSLFNSSE 188 (254)
T ss_pred HHHHHHccCC
Confidence 9998877753
No 43
>KOG2160|consensus
Probab=77.94 E-value=16 Score=32.26 Aligned_cols=100 Identities=18% Similarity=0.159 Sum_probs=74.0
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMA 103 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~ 103 (138)
=|.+.+-.|-==|.++-+ +=+-+..+..-.|+..+.....|.+++||-+++-+|.-.+......=+.++|.|+++.|+.
T Consensus 95 ~~le~ke~ald~Le~lve-~iDnAndl~~~ggl~~ll~~l~~~~~~lR~~Aa~Vigt~~qNNP~~Qe~v~E~~~L~~Ll~ 173 (342)
T KOG2160|consen 95 VDLEDKEDALDNLEELVE-DIDNANDLISLGGLVPLLGYLENSDAELRELAARVIGTAVQNNPKSQEQVIELGALSKLLK 173 (342)
T ss_pred CCHHHHHHHHHHHHHHHH-hhhhHHhHhhccCHHHHHHHhcCCcHHHHHHHHHHHHHHHhcCHHHHHHHHHcccHHHHHH
Confidence 455566666666666665 4455566666334445555999999999999999999999999999999999999999999
Q ss_pred hhccccH-HHHHHHHHHHHHHH
Q psy4702 104 LTLLEGE-EVRKLALQALAKGE 124 (138)
Q Consensus 104 ~~k~~~~-~v~~~a~eaLk~~~ 124 (138)
....+.. .++..|--|+-.+.
T Consensus 174 ~ls~~~~~~~r~kaL~AissLI 195 (342)
T KOG2160|consen 174 ILSSDDPNTVRTKALFAISSLI 195 (342)
T ss_pred HHccCCCchHHHHHHHHHHHHH
Confidence 9887654 66555555554443
No 44
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=76.39 E-value=14 Score=33.34 Aligned_cols=101 Identities=14% Similarity=0.102 Sum_probs=77.5
Q ss_pred CCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCC--CcccchhhhhhhhhhhhcChHHHHHHhhhhHHH
Q psy4702 23 EEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHP--DVGVQHRALVIVGNILSAGKELAERVLSTELME 99 (138)
Q Consensus 23 eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~--~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~le 99 (138)
.++...+.-|...|+.|-. .+..+.++- .+++..|..++... +..+||..+.|+-=+--.. +.++.+.+.+++-
T Consensus 155 ~~~~~~~~~~v~~L~~LL~--~~~~R~~f~~~~~v~~L~~~L~~~~~~~Ql~Y~~ll~lWlLSF~~-~~~~~~~~~~~i~ 231 (429)
T cd00256 155 ITNNDYVQTAARCLQMLLR--VDEYRFAFVLADGVPTLVKLLSNATLGFQLQYQSIFCIWLLTFNP-HAAEVLKRLSLIQ 231 (429)
T ss_pred cCCcchHHHHHHHHHHHhC--CchHHHHHHHccCHHHHHHHHhhccccHHHHHHHHHHHHHHhccH-HHHHhhccccHHH
Confidence 3456667677788988766 345566665 66999999988653 4588999999988665443 4777888889999
Q ss_pred HHHHhhccccH-HHHHHHHHHHHHHHhC
Q psy4702 100 ILMALTLLEGE-EVRKLALQALAKGEEY 126 (138)
Q Consensus 100 iL~~~~k~~~~-~v~~~a~eaLk~~~~~ 126 (138)
.|..+.|.+.. +|+.++..+|+-+.+.
T Consensus 232 ~l~~i~k~s~KEKvvRv~l~~l~Nll~~ 259 (429)
T cd00256 232 DLSDILKESTKEKVIRIVLAIFRNLISK 259 (429)
T ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHhhc
Confidence 99999999765 9999999999888763
No 45
>PF11701 UNC45-central: Myosin-binding striated muscle assembly central; InterPro: IPR024660 The UNC-45 or small muscle protein 1 of Caenorhabditis elegans is expressed in two forms from different genomic positions in mammals: as a general tissue protein (UNC-45a) and as a specific form (UNC-45b) expressed only in striated and skeletal muscle. Myofibril formation requires both UNC-45 forms, consistent with the fact that the cytoskeleton is necessary for the development and maintenance of organised myofibrils []. Rng3 (Ring assembly protein 3), the homologue in Schizosaccharomyces pombe, is crucial for cell shape, normal actin cytoskeleton, and contractile ring assembly, and is essential for assembly of the myosin II-containing progenitors of the contractile ring. Widespread defects in the cytoskeleton are found in null mutants of all three fungal proteins []. Mammalian Unc45 is found to act as a specific chaperone during the folding of myosin and the assembly of striated muscle by forming a stable complex with the general chaperone Hsp90 []. All members carry up to three amino-terminal tetratricopeptide repeat (TPR) and a UCS domain at the C terminus that contains a number of Arm repeats. ; PDB: 3OPB_A 3NOW_A.
Probab=76.16 E-value=2.9 Score=31.91 Aligned_cols=63 Identities=22% Similarity=0.291 Sum_probs=47.0
Q ss_pred hhhhhhhcC--CCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhc-CCCcc-cchhhhhhh
Q psy4702 14 VRFLFLLTC--EEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILA-HPDVG-VQHRALVIV 78 (138)
Q Consensus 14 lklLvll~~--eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~-~~~~~-lqHRG~viv 78 (138)
+..++-++. .+|..+..|+.=+|..=.. ++-|+..+.++++++|..+.. ++|+. ++-|++|++
T Consensus 88 ~~~l~~~~~~~~~~~~~~~~~lell~aAc~--d~~~r~~I~~~~~~~L~~~~~~~~~~~~ir~~A~v~L 154 (157)
T PF11701_consen 88 LESLLPLASRKSKDRKVQKAALELLSAACI--DKSCRTFISKNYVSWLKELYKNSKDDSEIRVLAAVGL 154 (157)
T ss_dssp HHHHHHHHH-CTS-HHHHHHHHHHHHHHTT--SHHHHHCCHHHCHHHHHHHTTTCC-HH-CHHHHHHHH
T ss_pred HHHHHHHHhcccCCHHHHHHHHHHHHHHHc--cHHHHHHHHHHHHHHHHHHHccccchHHHHHHHHHHH
Confidence 456677777 7999999999999986554 556666666678999999995 55565 899999875
No 46
>PF08167 RIX1: rRNA processing/ribosome biogenesis
Probab=75.78 E-value=16 Score=27.94 Aligned_cols=71 Identities=14% Similarity=0.165 Sum_probs=56.0
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcC-hHHHHHHhhh--hHHHHHHHhhccccH-HHHHHHHHHHHHHHhC
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAG-KELAERVLST--ELMEILMALTLLEGE-EVRKLALQALAKGEEY 126 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~-~e~a~kl~es--~~leiL~~~~k~~~~-~v~~~a~eaLk~~~~~ 126 (138)
.+|..-+..|+.+.+++-++-|++.++=.+..+ .|+ +.+. .-++.|+.+.++++. .+.+.+..+|..++++
T Consensus 24 ~~l~~ri~~LL~s~~~~~rw~G~~Ll~~~~~~~~~e~---l~~~~~~W~~~Ll~~L~~~~~~~~~~~ai~~L~~l~~~ 98 (165)
T PF08167_consen 24 HKLVTRINSLLQSKSAYSRWAGLCLLKVTVEQCSWEI---LLSHGSQWLRALLSILEKPDPPSVLEAAIITLTRLFDL 98 (165)
T ss_pred HHHHHHHHHHhCCCChhhHHHHHHHHHHHHHHhhHHH---HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH
Confidence 789999999999999999999999998888654 332 2222 256677777777665 8899999999998864
No 47
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=75.60 E-value=3.8 Score=29.28 Aligned_cols=56 Identities=20% Similarity=0.226 Sum_probs=42.2
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh--ccHHHHHHHHhcCCCcccch
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE--KNWVDIFKQILAHPDVGVQH 72 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~--~~wleil~~L~~~~~~~lqH 72 (138)
++.++...+++|...|-+|.=+|-.++-+ .=..++. +.-++.|..++.|+++.+|.
T Consensus 29 l~pVL~~~~D~d~rVRy~AcEaL~ni~k~---~~~~~l~~f~~IF~~L~kl~~D~d~~Vr~ 86 (97)
T PF12755_consen 29 LPPVLKCFDDQDSRVRYYACEALYNISKV---ARGEILPYFNEIFDALCKLSADPDENVRS 86 (97)
T ss_pred HHHHHHHcCCCcHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHcCCchhHHH
Confidence 45667778889999999999999998643 3334444 56666777888999999854
No 48
>PF05804 KAP: Kinesin-associated protein (KAP)
Probab=74.39 E-value=27 Score=33.42 Aligned_cols=97 Identities=20% Similarity=0.263 Sum_probs=70.5
Q ss_pred hhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCC--CcccchhhhhhhhhhhhcChHHHHHHhh
Q psy4702 18 FLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHP--DVGVQHRALVIVGNILSAGKELAERVLS 94 (138)
Q Consensus 18 vll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~--~~~lqHRG~viv~Nmi~a~~e~a~kl~e 94 (138)
-++...++++...-+-|.||-||. ++.=-.++++ .+.+++|..++... +++++--.++.+.++ +.+.+.|..+.+
T Consensus 497 ~~v~~~~~ee~~vE~LGiLaNL~~-~~ld~~~ll~~~~llp~L~~~L~~g~~~dDl~LE~Vi~~gtl-a~d~~~A~lL~~ 574 (708)
T PF05804_consen 497 KIVSSGDSEEFVVECLGILANLTI-PDLDWAQLLQEYNLLPWLKDLLKPGASEDDLLLEVVILLGTL-ASDPECAPLLAK 574 (708)
T ss_pred HHhhcCCcHHHHHHHHHHHHhccc-CCcCHHHHHHhCCHHHHHHHHhCCCCCChHHHHHHHHHHHHH-HCCHHHHHHHHh
Confidence 344555788999999999999996 4444455565 67888888888543 345766677666655 478899999999
Q ss_pred hhHHHHHHHhhcc--ccH-HHHHHH
Q psy4702 95 TELMEILMALTLL--EGE-EVRKLA 116 (138)
Q Consensus 95 s~~leiL~~~~k~--~~~-~v~~~a 116 (138)
+|+++.|..+.+. ++. -|+|..
T Consensus 575 sgli~~Li~LL~~kqeDdE~VlQil 599 (708)
T PF05804_consen 575 SGLIPTLIELLNAKQEDDEIVLQIL 599 (708)
T ss_pred CChHHHHHHHHHhhCchHHHHHHHH
Confidence 9999999999865 333 445544
No 49
>KOG1061|consensus
Probab=72.40 E-value=13 Score=35.72 Aligned_cols=96 Identities=21% Similarity=0.184 Sum_probs=71.2
Q ss_pred cCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHH
Q psy4702 21 TCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEI 100 (138)
Q Consensus 21 ~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~lei 100 (138)
|..+++..|.=|.+.+..++- +++...++++ +.....|+++-+|.-.++|+.++-.-+++. ..++|.++.
T Consensus 95 ~~d~np~iR~lAlrtm~~l~v--~~i~ey~~~P-----l~~~l~d~~~yvRktaa~~vakl~~~~~~~---~~~~gl~~~ 164 (734)
T KOG1061|consen 95 CEDPNPLIRALALRTMGCLRV--DKITEYLCDP-----LLKCLKDDDPYVRKTAAVCVAKLFDIDPDL---VEDSGLVDA 164 (734)
T ss_pred CCCCCHHHHHHHhhceeeEee--hHHHHHHHHH-----HHHhccCCChhHHHHHHHHHHHhhcCChhh---ccccchhHH
Confidence 455777888888888888874 7777777653 566778899999999999999997766544 456788999
Q ss_pred HHHhhccccHHHHHHHHHHHHHHHhC
Q psy4702 101 LMALTLLEGEEVRKLALQALAKGEEY 126 (138)
Q Consensus 101 L~~~~k~~~~~v~~~a~eaLk~~~~~ 126 (138)
|+.+...++..|+..|+.+|....+.
T Consensus 165 L~~ll~D~~p~VVAnAlaaL~eI~e~ 190 (734)
T KOG1061|consen 165 LKDLLSDSNPMVVANALAALSEIHES 190 (734)
T ss_pred HHHHhcCCCchHHHHHHHHHHHHHHh
Confidence 99887743337777777777666553
No 50
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=70.88 E-value=17 Score=34.88 Aligned_cols=58 Identities=16% Similarity=0.091 Sum_probs=38.3
Q ss_pred hhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhh
Q psy4702 13 QVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNI 81 (138)
Q Consensus 13 RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nm 81 (138)
-+-.|.-+..++|+..|++|.-+|+-+.. + .-++.|..++.|++++++...+--+..+
T Consensus 622 ~~~~L~~~L~D~d~~VR~~Av~~L~~~~~--~---------~~~~~L~~aL~D~d~~VR~~Aa~aL~~l 679 (897)
T PRK13800 622 SVAELAPYLADPDPGVRRTAVAVLTETTP--P---------GFGPALVAALGDGAAAVRRAAAEGLREL 679 (897)
T ss_pred hHHHHHHHhcCCCHHHHHHHHHHHhhhcc--h---------hHHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence 34455666678999999999998886543 2 2344555566677777776665555544
No 51
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=64.38 E-value=32 Score=29.59 Aligned_cols=89 Identities=19% Similarity=0.176 Sum_probs=62.3
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMA 103 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~ 103 (138)
.|+..|..|--+++.+.. +..+.. -...+..++.|+++-+|..++.++..+...+++..+. +.++.|..
T Consensus 91 ~n~~~~~lAL~~l~~i~~--~~~~~~-----l~~~v~~ll~~~~~~VRk~A~~~l~~i~~~~p~~~~~----~~~~~l~~ 159 (526)
T PF01602_consen 91 PNPYIRGLALRTLSNIRT--PEMAEP-----LIPDVIKLLSDPSPYVRKKAALALLKIYRKDPDLVED----ELIPKLKQ 159 (526)
T ss_dssp SSHHHHHHHHHHHHHH-S--HHHHHH-----HHHHHHHHHHSSSHHHHHHHHHHHHHHHHHCHCCHHG----GHHHHHHH
T ss_pred CCHHHHHHHHhhhhhhcc--cchhhH-----HHHHHHHHhcCCchHHHHHHHHHHHHHhccCHHHHHH----HHHHHHhh
Confidence 456667777777776653 555544 4566788889999999999999999999877665433 14677777
Q ss_pred hhccccHHHHHHHHHHHHHH
Q psy4702 104 LTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 104 ~~k~~~~~v~~~a~eaLk~~ 123 (138)
+.+.++..|+..|..++...
T Consensus 160 lL~d~~~~V~~~a~~~l~~i 179 (526)
T PF01602_consen 160 LLSDKDPSVVSAALSLLSEI 179 (526)
T ss_dssp HTTHSSHHHHHHHHHHHHHH
T ss_pred hccCCcchhHHHHHHHHHHH
Confidence 76444337777777777655
No 52
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=63.48 E-value=33 Score=28.48 Aligned_cols=100 Identities=21% Similarity=0.160 Sum_probs=64.1
Q ss_pred hcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh---ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhh
Q psy4702 20 LTCEEDTETACAAAGALAMLTSVSTPVCSMLLE---KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTE 96 (138)
Q Consensus 20 l~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~---~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~ 96 (138)
+.+..|..+...|+-.|+.|-+..+.--....+ +..+.++.....+++.++|+=++-++.|++ ..++...-+.+.+
T Consensus 113 ll~~~D~~i~~~a~~iLt~Ll~~~~~~~~~~~~~~l~~ll~~L~~~l~~~~~~~~~~av~~L~~LL-~~~~~R~~f~~~~ 191 (312)
T PF03224_consen 113 LLDRNDSFIQLKAAFILTSLLSQGPKRSEKLVKEALPKLLQWLSSQLSSSDSELQYIAVQCLQNLL-RSKEYRQVFWKSN 191 (312)
T ss_dssp H-S-SSHHHHHHHHHHHHHHHTSTTT--HHHHHHHHHHHHHHHH-TT-HHHH---HHHHHHHHHHH-TSHHHHHHHHTHH
T ss_pred HhcCCCHHHHHHHHHHHHHHHHcCCccccchHHHHHHHHHHHHHHhhcCCCcchHHHHHHHHHHHh-CcchhHHHHHhcC
Confidence 566779999999999999887643332222111 556677777777788999999999999997 6678888899999
Q ss_pred HHHHHHHhhcc---ccH-HHHHHHHHHH
Q psy4702 97 LMEILMALTLL---EGE-EVRKLALQAL 120 (138)
Q Consensus 97 ~leiL~~~~k~---~~~-~v~~~a~eaL 120 (138)
++..|+.+... ..+ ...|.-++++
T Consensus 192 ~v~~l~~iL~~~~~~~~~~~~Ql~Y~~l 219 (312)
T PF03224_consen 192 GVSPLFDILRKQATNSNSSGIQLQYQAL 219 (312)
T ss_dssp HHHHHHHHHH---------HHHHHHHHH
T ss_pred cHHHHHHHHHhhcccCCCCchhHHHHHH
Confidence 99999998832 112 3355555544
No 53
>KOG1789|consensus
Probab=63.11 E-value=41 Score=34.98 Aligned_cols=113 Identities=18% Similarity=0.222 Sum_probs=81.5
Q ss_pred hhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhh
Q psy4702 17 LFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTE 96 (138)
Q Consensus 17 Lvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~ 96 (138)
.+++..--|.....=|---+-.+|+ +..|+..|.....+-.|+.|+ |.-|..+.|.+.++|-+. +++++++.-.+.|
T Consensus 1777 ~~~lr~~~~~~iq~LaL~Vi~~~Ta-n~~Cv~~~a~~~vL~~LL~lL-HS~PS~R~~vL~vLYAL~-S~~~i~keA~~hg 1853 (2235)
T KOG1789|consen 1777 ITYLRCRKHPKLQILALQVILLATA-NKECVTDLATCNVLTTLLTLL-HSQPSMRARVLDVLYALS-SNGQIGKEALEHG 1853 (2235)
T ss_pred HHHHHHcCCchHHHHHHHHHHHHhc-ccHHHHHHHhhhHHHHHHHHH-hcChHHHHHHHHHHHHHh-cCcHHHHHHHhcC
Confidence 4455556777777778888889999 799999998854555565554 677888888888888765 6678887777776
Q ss_pred HH-HHHHHhhccccHHHHHHHHHHHHHHHhCCCccCC
Q psy4702 97 LM-EILMALTLLEGEEVRKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 97 ~l-eiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~ 132 (138)
++ +|+.-+.-.+..+++..|.+-|.++.-..++-|.
T Consensus 1854 ~l~yil~~~c~~~~~QqRAqaAeLlaKl~Adkl~GPr 1890 (2235)
T KOG1789|consen 1854 GLMYILSILCLTNSDQQRAQAAELLAKLQADKLTGPR 1890 (2235)
T ss_pred chhhhhHHHhccCcHHHHHHHHHHHHHhhhccccCCc
Confidence 54 5554443333237777788889999988888764
No 54
>PF09759 Atx10homo_assoc: Spinocerebellar ataxia type 10 protein domain; InterPro: IPR019156 This is the conserved C-terminal 100 residues of Ataxin-10. Ataxin-10 belongs to the family of armadillo repeat proteins and in solution it tends to form homotrimeric complexes, which associate via a tip-to-tip association in a horseshoe-shaped contact with the concave sides of the molecules facing each other. This domain may represent the homo-association site since that is located near the C terminus of Ataxin-10. The protein does not contain a signal sequence for secretion or any subcellular compartment confirming its cytoplasmic localisation, specifically to the olivocerebellar region [].
Probab=61.77 E-value=29 Score=25.39 Aligned_cols=52 Identities=15% Similarity=0.196 Sum_probs=44.5
Q ss_pred hhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccH--HHHHHHHHHHHHHHhC
Q psy4702 75 LVIVGNILSAGKELAERVLSTELMEILMALTLLEGE--EVRKLALQALAKGEEY 126 (138)
Q Consensus 75 ~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~--~v~~~a~eaLk~~~~~ 126 (138)
+-++.||....+++-.++.+-|++..+..++.-+++ -+.|-|.-|++-+.+.
T Consensus 7 vrlianl~~~~~~~Qd~vr~~~Gi~liL~~c~iD~~nP~irEwai~aiRnL~e~ 60 (102)
T PF09759_consen 7 VRLIANLCYKNKEVQDLVRELGGIPLILSCCNIDDHNPFIREWAIFAIRNLCEG 60 (102)
T ss_pred HHHHHHHHhCCHHHHHHHHHcCChHHHHHhcCCCcccHHHHHHHHHHHHHHHhC
Confidence 347889999999999999999999999999887654 7789999999888763
No 55
>PF10521 DUF2454: Protein of unknown function (DUF2454); InterPro: IPR018870 Putative protein of unknown function; subunit of the ASTRA complex which is part of the chromatin remodeling machinery; similar to Schizosaccharomyces pombe (Fission yeast) Tti2p; may interact with Rsm23p [].
Probab=61.45 E-value=23 Score=29.27 Aligned_cols=70 Identities=11% Similarity=0.064 Sum_probs=53.2
Q ss_pred HHHHHHHhcCCCcccchhhhhhhhhhhhcCh-HHHHHHhhhhHHHHHHHhhcc--c--------c-H-HHHHHHHHHHHH
Q psy4702 56 VDIFKQILAHPDVGVQHRALVIVGNILSAGK-ELAERVLSTELMEILMALTLL--E--------G-E-EVRKLALQALAK 122 (138)
Q Consensus 56 leil~~L~~~~~~~lqHRG~viv~Nmi~a~~-e~a~kl~es~~leiL~~~~k~--~--------~-~-~v~~~a~eaLk~ 122 (138)
+..++.|+-|.+++++-+|+.+++.++..-. ....-+..+|..+++....+. . + . .+++.|..||-.
T Consensus 121 iP~iL~llDD~~~~~K~~G~~lL~~ll~~~~~~~~~~L~~tGl~~v~~~al~~~L~~LP~~tp~~~s~~Ll~~ay~~L~~ 200 (282)
T PF10521_consen 121 IPPILNLLDDYSPEIKIQGCQLLHHLLEKVPAAEWDILRRTGLFSVFEDALFPCLYYLPPITPEDESLELLQAAYPALLS 200 (282)
T ss_pred HhhHHHHhcCCCHHHHHHHHHHHHHHHHhCChhhhHHHHHcChHHHHHHHHHHHhhcCCCCCCchhhHHHHHHHHHHHHH
Confidence 4567888899999999999999999997432 112338899988888776653 1 2 2 789999999988
Q ss_pred HHh
Q psy4702 123 GEE 125 (138)
Q Consensus 123 ~~~ 125 (138)
+..
T Consensus 201 L~~ 203 (282)
T PF10521_consen 201 LLK 203 (282)
T ss_pred HHH
Confidence 854
No 56
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=61.21 E-value=39 Score=23.89 Aligned_cols=64 Identities=16% Similarity=0.183 Sum_probs=43.6
Q ss_pred HHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHh
Q psy4702 60 KQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 60 ~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~ 125 (138)
...+.||.+-+|=.|++.+++++.+.. ...+--..++.++....+.++.=|-=.|.++|..+.+
T Consensus 9 l~~L~dp~~PvRa~gL~~L~~Li~~~~--~~~~~~~~il~l~l~~L~d~DsyVYL~aI~~L~~La~ 72 (92)
T PF10363_consen 9 LSDLNDPLPPVRAHGLVLLRKLIESKS--EPVIDIPKILDLFLSQLKDEDSYVYLNAIKGLAALAD 72 (92)
T ss_pred HHHccCCCcchHHHHHHHHHHHHHcCC--cchhhHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHH
Confidence 344578999999999999999998665 2223335678888888887765343445555555443
No 57
>cd04447 DEP_BRCC3 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in BBRC3-like proteins. BBRC3, also known as DEPDC1B, is a DEP containing protein of unknown function.
Probab=60.82 E-value=9.8 Score=27.65 Aligned_cols=41 Identities=24% Similarity=0.224 Sum_probs=33.4
Q ss_pred HHhhhhHHHHHHHhhccccH---HH-HHHHHHHHHHHHhCCCccC
Q psy4702 91 RVLSTELMEILMALTLLEGE---EV-RKLALQALAKGEEYGIIRR 131 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~---~v-~~~a~eaLk~~~~~glIk~ 131 (138)
-+..|++++.|+.....+.+ ++ .+.|++.++++++.++|.+
T Consensus 29 CF~gsEAVDwL~~~l~~n~~fg~~vtR~~av~l~qkll~~hVie~ 73 (92)
T cd04447 29 CFTASEAVDWLHELLRSNSNFGPEVTRQQTVQLLKKFLKNHVIED 73 (92)
T ss_pred ccchHHHHHHHHHHHHhccccCCCCCHHHHHHHHHHHHHcCCchh
Confidence 47788999999999877543 33 6778888999999999974
No 58
>KOG4413|consensus
Probab=59.91 E-value=54 Score=29.84 Aligned_cols=98 Identities=18% Similarity=0.182 Sum_probs=79.0
Q ss_pred CChhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHH--HHHHhcCCCcccchhhhhhhhhhhhcChHH
Q psy4702 11 CSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDI--FKQILAHPDVGVQHRALVIVGNILSAGKEL 88 (138)
Q Consensus 11 n~RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wlei--l~~L~~~~~~~lqHRG~viv~Nmi~a~~e~ 88 (138)
+.=+|+++=+-+-+|...+.||+-.+--++- +|..-+.|+...-++- +..|.+.-|.=.|+|=.-.+--+.+-+.+.
T Consensus 127 aeilklildcIggeddeVAkAAiesikrial-fpaaleaiFeSellDdlhlrnlaakcndiaRvRVleLIieifSiSpes 205 (524)
T KOG4413|consen 127 AEILKLILDCIGGEDDEVAKAAIESIKRIAL-FPAALEAIFESELLDDLHLRNLAAKCNDIARVRVLELIIEIFSISPES 205 (524)
T ss_pred hhHHHHHHHHHcCCcHHHHHHHHHHHHHHHh-cHHHHHHhcccccCChHHHhHHHhhhhhHHHHHHHHHHHHHHhcCHHH
Confidence 4567888888888999999999999999986 7999999999333333 344667777778888777776777778999
Q ss_pred HHHHhhhhHHHHHHHhhcccc
Q psy4702 89 AERVLSTELMEILMALTLLEG 109 (138)
Q Consensus 89 a~kl~es~~leiL~~~~k~~~ 109 (138)
+.....||+++.|-+=.|.+.
T Consensus 206 aneckkSGLldlLeaElkGte 226 (524)
T KOG4413|consen 206 ANECKKSGLLDLLEAELKGTE 226 (524)
T ss_pred HhHhhhhhHHHHHHHHhcCCc
Confidence 999999999999999888854
No 59
>KOG4646|consensus
Probab=56.87 E-value=24 Score=28.24 Aligned_cols=92 Identities=15% Similarity=0.081 Sum_probs=68.8
Q ss_pred eccCCChhhhhhhhcCC----CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhh
Q psy4702 7 ILFYCSQVRFLFLLTCE----EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNIL 82 (138)
Q Consensus 7 ~~~~n~RlklLvll~~e----ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi 82 (138)
-.+|.+|+.++-.|-++ -|.++..---.-||-..- +|.=-.-+.+-+-+|.|..=+..+|+-+.--|+--+.|+
T Consensus 8 ~~hgi~Rl~Ylq~LV~efq~tt~~eakeqv~ANLANFAY-DP~Nys~Lrql~vLdlFvdsl~e~ne~LvefgIgglCNl- 85 (173)
T KOG4646|consen 8 PAHGIDRLEYLQHLVDEFQTTTNIEAKEQVTANLANFAY-DPINYSHLRQLDVLDLFVDSLEEQNELLVEFGIGGLCNL- 85 (173)
T ss_pred CCccCcHHHHHHHHHHHHHHhccHHHHHHHHHHHHhhcc-CcchHHHHHHhhHHHHHHHHhhcccHHHHHHhHHHHHhh-
Confidence 35788999999999887 455555544555666543 454444444477899999999999999999999999997
Q ss_pred hcChHHHHHHhhhhHHHH
Q psy4702 83 SAGKELAERVLSTELMEI 100 (138)
Q Consensus 83 ~a~~e~a~kl~es~~lei 100 (138)
+.++..++-|++.+++-.
T Consensus 86 C~d~~n~~~I~ea~g~pl 103 (173)
T KOG4646|consen 86 CLDKTNAKFIREALGLPL 103 (173)
T ss_pred ccChHHHHHHHHhcCCce
Confidence 577788888888876543
No 60
>COG3355 Predicted transcriptional regulator [Transcription]
Probab=55.18 E-value=20 Score=27.28 Aligned_cols=60 Identities=17% Similarity=0.160 Sum_probs=40.2
Q ss_pred hhhhhhhhhhhhcChHHHHHHhh-hh--HHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCCC
Q psy4702 72 HRALVIVGNILSAGKELAERVLS-TE--LMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGEG 135 (138)
Q Consensus 72 HRG~viv~Nmi~a~~e~a~kl~e-s~--~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~~ 135 (138)
++=+-|++++-.+|.++-.+|.+ .| -++=|......+ ...+..+|+.|++-|||++....
T Consensus 16 ~dvl~c~~GLs~~Dv~v~~~LL~~~~~~tvdelae~lnr~----rStv~rsl~~L~~~GlV~Rek~~ 78 (126)
T COG3355 16 EDVLKCVYGLSELDVEVYKALLEENGPLTVDELAEILNRS----RSTVYRSLQNLLEAGLVEREKVN 78 (126)
T ss_pred HHHHHHHhCCcHHHHHHHHHHHhhcCCcCHHHHHHHHCcc----HHHHHHHHHHHHHcCCeeeeeec
Confidence 34456788887788777777763 33 233333333333 47788999999999999986543
No 61
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=54.47 E-value=88 Score=23.59 Aligned_cols=73 Identities=27% Similarity=0.391 Sum_probs=49.4
Q ss_pred HHhhhhh-----ccHHHHHHHHhcCCCcccchhhhhhhhhhhhc-ChHHHHHHhhhhHHHHHHHhhccccH-HHHHHHHH
Q psy4702 46 VCSMLLE-----KNWVDIFKQILAHPDVGVQHRALVIVGNILSA-GKELAERVLSTELMEILMALTLLEGE-EVRKLALQ 118 (138)
Q Consensus 46 ~c~~I~~-----~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a-~~e~a~kl~es~~leiL~~~~k~~~~-~v~~~a~e 118 (138)
+|..|-. +.-+..|..=+.++|+.+|+|.+..+-=++.. +......+...+.++-|..+.+.... +|.+...+
T Consensus 28 icD~In~~~~~~k~a~ral~krl~~~n~~vql~AL~LLe~~vkNCG~~fh~evas~~fl~~l~~l~~~~~~~~Vk~kil~ 107 (142)
T cd03569 28 ICDMIRSKDVQPKYAMRALKKRLLSKNPNVQLYALLLLESCVKNCGTHFHDEVASREFMDELKDLIKTTKNEEVRQKILE 107 (142)
T ss_pred HHHHHhCCCCCHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHCCHHHHHHHhhHHHHHHHHHHHcccCCHHHHHHHHH
Confidence 6666655 22334444446889999999999988888875 44466777788888888888765322 55544443
No 62
>KOG4500|consensus
Probab=53.46 E-value=39 Score=31.59 Aligned_cols=93 Identities=16% Similarity=0.224 Sum_probs=70.7
Q ss_pred hhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhc-----CCCcccchhhhhhhhhhhhcChH
Q psy4702 13 QVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILA-----HPDVGVQHRALVIVGNILSAGKE 87 (138)
Q Consensus 13 RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~-----~~~~~lqHRG~viv~Nmi~a~~e 87 (138)
=++.++-.-+.+|-.+..+++-|++-+-- .+..|..+++++.++-|..++. ++|-++||--+--+||++-.- .
T Consensus 316 ~l~~~~sw~~S~d~~l~t~g~LaigNfaR-~D~~ci~~v~~~~~nkL~~~l~~~~~vdgnV~~qhA~lsALRnl~IPv-~ 393 (604)
T KOG4500|consen 316 FLDFLESWFRSDDSNLITMGSLAIGNFAR-RDDICIQLVQKDFLNKLISCLMQEKDVDGNVERQHACLSALRNLMIPV-S 393 (604)
T ss_pred HHHHHHHHhcCCchhHHHHHHHHHHhhhc-cchHHHHHHHHHHHHHHHHHHHHhcCCCccchhHHHHHHHHHhccccC-C
Confidence 34555556677888999999989888876 7999999999888888888775 468889999999999976421 1
Q ss_pred HHHHHhhhhHHHHHHHhhcc
Q psy4702 88 LAERVLSTELMEILMALTLL 107 (138)
Q Consensus 88 ~a~kl~es~~leiL~~~~k~ 107 (138)
--.++..+|+.|.+.--.|.
T Consensus 394 nka~~~~aGvteaIL~~lk~ 413 (604)
T KOG4500|consen 394 NKAHFAPAGVTEAILLQLKL 413 (604)
T ss_pred chhhccccchHHHHHHHHHh
Confidence 12467788888766555554
No 63
>cd00256 VATPase_H VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Probab=53.18 E-value=40 Score=30.37 Aligned_cols=83 Identities=13% Similarity=0.045 Sum_probs=50.0
Q ss_pred CHHHHhhhhh-----ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccH-HHHHHH
Q psy4702 43 STPVCSMLLE-----KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGE-EVRKLA 116 (138)
Q Consensus 43 ~~~~c~~I~~-----~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~-~v~~~a 116 (138)
.|..++.+.+ +.-...+..++.++|.-++|-...++-.+++.+..-.......-.+..|....+.+.+ ..+..|
T Consensus 85 ~~~~~~~f~~~~~~~~~~~~~fl~lL~~~d~~i~~~a~~iLt~l~~~~~~~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~ 164 (429)
T cd00256 85 DDTRVKLFHDDALLKKKTWEPFFNLLNRQDQFIVHMSFSILAKLACFGLAKMEGSDLDYYFNWLKEQLNNITNNDYVQTA 164 (429)
T ss_pred chHHHHHHHHHhhccccchHHHHHHHcCCchhHHHHHHHHHHHHHhcCccccchhHHHHHHHHHHHHhhccCCcchHHHH
Confidence 4555554444 1223445558888999999999999999998654321111111133445555554433 667888
Q ss_pred HHHHHHHHh
Q psy4702 117 LQALAKGEE 125 (138)
Q Consensus 117 ~eaLk~~~~ 125 (138)
+.||..+..
T Consensus 165 v~~L~~LL~ 173 (429)
T cd00256 165 ARCLQMLLR 173 (429)
T ss_pred HHHHHHHhC
Confidence 888877654
No 64
>cd03568 VHS_STAM VHS domain family, STAM subfamily; members include STAM (Signal Transducing Adaptor Molecule), EAST (EGFR-associated protein with SH3 and TAM domains) and Hbp (Hrs-binding protein). Collectively, they are referred to as STAM. All STAMs have at their N-termini a VHS domain, which is involved in cytokine-mediated intracellular signal transduction and has a superhelical structure similar to the structure of ARM (Armadillo) repeats, followed by a SH3 (Src homology 3) domain, a well-established protein-protein interaction domain. At the C-termini of most vertebrate STAMS, an ITAM (Immunoreceptor Tyrosine-based Activation) motif is present, which mediates the binding of HRS (hepatocyte growth factor-regulated tyrosine kinase substrate) in endocytic and exocytic machineries.
Probab=52.51 E-value=98 Score=23.48 Aligned_cols=72 Identities=25% Similarity=0.318 Sum_probs=47.5
Q ss_pred HHHhhhhh-----ccHHHHHHHHhcCCCcccchhhhhhhhhhhhc-ChHHHHHHhhhhHHHHHHHhhccccH-HHHHHH
Q psy4702 45 PVCSMLLE-----KNWVDIFKQILAHPDVGVQHRALVIVGNILSA-GKELAERVLSTELMEILMALTLLEGE-EVRKLA 116 (138)
Q Consensus 45 ~~c~~I~~-----~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a-~~e~a~kl~es~~leiL~~~~k~~~~-~v~~~a 116 (138)
.+|..|-. +.-+..|..=+.+.|+-+|.|.+..+--++.. +...-..+-..+.++.|..+...... +|.+..
T Consensus 23 ~icD~I~~~~~~~k~a~ral~KRl~~~n~~v~l~AL~LLe~~vkNCG~~fh~evask~Fl~eL~kl~~~~~~~~Vk~ki 101 (144)
T cd03568 23 DVCDKVKSDENGAKDCLKAIMKRLNHKDPNVQLRALTLLDACAENCGKRFHQEVASRDFTQELKKLINDRVHPTVKEKL 101 (144)
T ss_pred HHHHHHhcCCccHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHCCHHHHHHHhhHHHHHHHHHHhcccCCHHHHHHH
Confidence 35666654 22333444445789999999999999998874 44455677777888888877766322 444433
No 65
>PF05918 API5: Apoptosis inhibitory protein 5 (API5); InterPro: IPR008383 This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterised by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in Homo sapiens chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors [,].; PDB: 3U0R_A 3V6A_A.
Probab=51.71 E-value=47 Score=31.04 Aligned_cols=66 Identities=12% Similarity=0.143 Sum_probs=41.2
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHH
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~ 123 (138)
..-++-+..||.|+|..||--|+--+-.+-...++...|+ .+||..+...++..-+..+..+|--+
T Consensus 58 ~~Ai~a~~DLcEDed~~iR~~aik~lp~~ck~~~~~v~kv-----aDvL~QlL~tdd~~E~~~v~~sL~~l 123 (556)
T PF05918_consen 58 EEAINAQLDLCEDEDVQIRKQAIKGLPQLCKDNPEHVSKV-----ADVLVQLLQTDDPVELDAVKNSLMSL 123 (556)
T ss_dssp HHHHHHHHHHHT-SSHHHHHHHHHHGGGG--T--T-HHHH-----HHHHHHHTT---HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhcccHHHHHHHHHhHHHHHHhHHHHHhHH-----HHHHHHHHhcccHHHHHHHHHHHHHH
Confidence 4567889999999999998877766666666677777775 77888887755433345555555433
No 66
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=49.84 E-value=1.1e+02 Score=25.16 Aligned_cols=103 Identities=22% Similarity=0.244 Sum_probs=71.5
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhh-cChHHHHHHh-------hh
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILS-AGKELAERVL-------ST 95 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~-a~~e~a~kl~-------es 95 (138)
.|.+.|..|-=.|+...-.++..+. +.+.+|......++++++-.++=++-.++. .+.+.-..-. ..
T Consensus 39 ~~~~vR~~al~cLGl~~Lld~~~a~-----~~l~l~~~~~~~~~~~v~~~al~~l~Dll~~~g~~~~~~~~~~~~~~~~~ 113 (298)
T PF12719_consen 39 SDPAVRELALKCLGLCCLLDKELAK-----EHLPLFLQALQKDDEEVKITALKALFDLLLTHGIDIFDSESDNDESVDSK 113 (298)
T ss_pred CCHHHHHHHHHHHHHHHHhChHHHH-----HHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHcCchhccchhccCccchHh
Confidence 6778888887777665544443333 344455555566799999999988888754 4434333322 35
Q ss_pred hHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccC
Q psy4702 96 ELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRR 131 (138)
Q Consensus 96 ~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~ 131 (138)
.+++++.........++...|.|++-|++=.|.|.+
T Consensus 114 ~l~~~l~~~l~~~~~~~~~~a~EGl~KLlL~~~i~~ 149 (298)
T PF12719_consen 114 SLLKILTKFLDSENPELQAIAVEGLCKLLLSGRISD 149 (298)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHHhcCCCCc
Confidence 677777777666634899999999999999888876
No 67
>KOG1293|consensus
Probab=45.89 E-value=86 Score=30.18 Aligned_cols=99 Identities=18% Similarity=0.167 Sum_probs=67.0
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILM 102 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~ 102 (138)
-|..-++||--.+-.++- +=.+-+.=++ ..+.+-+..+..+|+--++---+=.+-|++.--...-.|++++|++++|.
T Consensus 389 kd~~~~aaa~l~~~s~sr-sV~aL~tg~~~~dv~~plvqll~dp~~~i~~~~lgai~NlVmefs~~kskfl~~ngId~l~ 467 (678)
T KOG1293|consen 389 KDHDFVAAALLCLKSFSR-SVSALRTGLKRNDVAQPLVQLLMDPEIMIMGITLGAICNLVMEFSNLKSKFLRNNGIDILE 467 (678)
T ss_pred ccHHHHHHHHHHHHHHHH-HHHHHHcCCccchhHHHHHHHhhCcchhHHHHHHHHHHHHHhhcccHHHHHHHcCcHHHHH
Confidence 566666665444443322 2222222244 67888888888888888877777788899887777889999999999999
Q ss_pred HhhccccHHHHHHHHHHHHHH
Q psy4702 103 ALTLLEGEEVRKLALQALAKG 123 (138)
Q Consensus 103 ~~~k~~~~~v~~~a~eaLk~~ 123 (138)
..++..+.-+++.+.-.|.-+
T Consensus 468 s~~~~~~~n~r~~~~~~Lr~l 488 (678)
T KOG1293|consen 468 SMLTDPDFNSRANSLWVLRHL 488 (678)
T ss_pred HHhcCCCchHHHHHHHHHHHH
Confidence 998875534445555555443
No 68
>COG1846 MarR Transcriptional regulators [Transcription]
Probab=45.63 E-value=59 Score=21.71 Aligned_cols=51 Identities=31% Similarity=0.356 Sum_probs=30.8
Q ss_pred cchhhhhhhhhhhhcC-hHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCCCCC
Q psy4702 70 VQHRALVIVGNILSAG-KELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGEGEE 137 (138)
Q Consensus 70 lqHRG~viv~Nmi~a~-~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~~~~ 137 (138)
-|++-+.++++.-..+ +++|+++ ..+ .......++.+++.|+|++..+.++
T Consensus 23 ~q~~~L~~l~~~~~~~~~~la~~l-------------~i~----~~~vt~~l~~Le~~glv~r~~~~~D 74 (126)
T COG1846 23 PQYQVLLALYEAGGITVKELAERL-------------GLD----RSTVTRLLKRLEDKGLIERLRDPED 74 (126)
T ss_pred HHHHHHHHHHHhCCCcHHHHHHHH-------------CCC----HHHHHHHHHHHHHCCCeeecCCccc
Confidence 3455555555554444 6666553 111 2345567999999999998666543
No 69
>cd04446 DEP_DEPDC4 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in DEPDC4-like proteins. DEPDC4 is a DEP domain containing protein of unknown function.
Probab=44.93 E-value=23 Score=25.63 Aligned_cols=42 Identities=12% Similarity=0.152 Sum_probs=32.4
Q ss_pred HHhhhhHHHHHHHhhcccc----HH-HHHHHHHHHHHHHhCCCccCC
Q psy4702 91 RVLSTELMEILMALTLLEG----EE-VRKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~----~~-v~~~a~eaLk~~~~~glIk~~ 132 (138)
-++-+++++.|.....++. .. -.+.|++-.+++++.|+|.+-
T Consensus 27 CF~GsEaVDwL~~~l~~n~~f~~~~~tR~~Av~l~q~Ll~~gvi~~V 73 (95)
T cd04446 27 CFLGSEAVDVVLAHLMQNKYFGDVDVPRAKAVRLCQALMDCRVFEAV 73 (95)
T ss_pred ccchHHHHHHHHHHHhhccccCcccCCHHHHHHHHHHHHHcCCeeec
Confidence 4778899999988775542 12 367788889999999999975
No 70
>KOG2259|consensus
Probab=44.31 E-value=45 Score=32.43 Aligned_cols=57 Identities=21% Similarity=0.176 Sum_probs=45.1
Q ss_pred CCCH--HHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhc
Q psy4702 23 EEDT--ETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSA 84 (138)
Q Consensus 23 eED~--~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a 84 (138)
=||| +.|+||-..+.+|.-.+|+-. .+-++.|..+.-|+.+++|--.+.-++-+..+
T Consensus 382 lEDEf~EVR~AAV~Sl~~La~ssP~FA-----~~aldfLvDMfNDE~~~VRL~ai~aL~~Is~~ 440 (823)
T KOG2259|consen 382 LEDEFYEVRRAAVASLCSLATSSPGFA-----VRALDFLVDMFNDEIEVVRLKAIFALTMISVH 440 (823)
T ss_pred chHHHHHHHHHHHHHHHHHHcCCCCcH-----HHHHHHHHHHhccHHHHHHHHHHHHHHHHHHH
Confidence 3777 899999999999966566544 35688999999999999998888777665544
No 71
>PF08045 CDC14: Cell division control protein 14, SIN component; InterPro: IPR012535 Cdc14 is a component of the septation initiation network (SIN) and is required for the localisation and activity of Sid1. Sid1 is a protein kinase that localises asymmetrically to one spindle pole body (SPB) in anaphase disappears prior to cell separation [], [].
Probab=44.29 E-value=1e+02 Score=26.09 Aligned_cols=63 Identities=13% Similarity=0.101 Sum_probs=51.3
Q ss_pred HHhhhhh-ccHHHHHHHHh-cCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccc
Q psy4702 46 VCSMLLE-KNWVDIFKQIL-AHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLE 108 (138)
Q Consensus 46 ~c~~I~~-~~wleil~~L~-~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~ 108 (138)
-|+.++. .+|+++|..|+ ....+.+|-.-+-.+-.++-.+......+-+-++++++..+.|..
T Consensus 124 ~Sr~lF~r~~~m~lll~LL~~~~~~~i~~a~L~tLv~iLld~p~N~r~FE~~~Gl~~v~~llk~~ 188 (257)
T PF08045_consen 124 PSRKLFHREQNMELLLDLLSPSNPPAIQSACLDTLVCILLDSPENQRDFEELNGLSTVCSLLKSK 188 (257)
T ss_pred hHHHHHhhhhhHHHHHHHhccCCCchHHHHHHHHHHHHHHcChHHHHHHHHhCCHHHHHHHHccc
Confidence 3677777 89999999999 446788887777777777666777778888999999999999875
No 72
>PF10274 ParcG: Parkin co-regulated protein; InterPro: IPR019399 This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system []. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
Probab=44.17 E-value=1.4e+02 Score=24.10 Aligned_cols=100 Identities=17% Similarity=0.156 Sum_probs=66.8
Q ss_pred CCHHHHHhhhhhhhhhccCCHHHHhhhhh--ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHH
Q psy4702 24 EDTETACAAAGALAMLTSVSTPVCSMLLE--KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEIL 101 (138)
Q Consensus 24 ED~~tr~AA~GALAmLT~~~~~~c~~I~~--~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL 101 (138)
+.+=.-.|--|.-=||.. ..-.+|+- ++-+-=+..=+.+.|+++..+++-++..++.++.-+++.++-- .=.+|
T Consensus 51 ~~Py~flA~~g~~dll~~---~~~~kilPvlPqLI~plk~AL~tr~~~V~~~~L~~Lq~Lv~~~~~vG~aLvPy-yrqLL 126 (183)
T PF10274_consen 51 EHPYRFLARQGIKDLLER---GGGEKILPVLPQLIIPLKRALNTRDPEVFCATLKALQQLVTSSDMVGEALVPY-YRQLL 126 (183)
T ss_pred CccHHHHHHHHHHHHHHh---cchhHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHhhhhhhHHHHHH-HHHHH
Confidence 344344555566666654 34455665 6667777777889999999999999999998888888776532 22233
Q ss_pred HHhh----cc---------ccH-HHHHHHHHHHHHHHhCC
Q psy4702 102 MALT----LL---------EGE-EVRKLALQALAKGEEYG 127 (138)
Q Consensus 102 ~~~~----k~---------~~~-~v~~~a~eaLk~~~~~g 127 (138)
-.+. |. ..+ .+.+...++|+.++.+|
T Consensus 127 p~ln~f~~k~~n~gd~i~y~~~~~~~dlI~etL~~lE~~G 166 (183)
T PF10274_consen 127 PVLNLFKNKNVNLGDGIDYRKRKNLGDLIQETLELLERNG 166 (183)
T ss_pred HHHHHHHhcccCCCcccccccccchhHHHHHHHHHHHHhc
Confidence 3333 22 112 66788888898888876
No 73
>KOG3678|consensus
Probab=44.08 E-value=32 Score=32.59 Aligned_cols=96 Identities=21% Similarity=0.281 Sum_probs=73.2
Q ss_pred hhhhhhhcCC-CCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhh-cChHHHHH
Q psy4702 14 VRFLFLLTCE-EDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILS-AGKELAER 91 (138)
Q Consensus 14 lklLvll~~e-ED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~-a~~e~a~k 91 (138)
++.++.++-+ |-.+++|-.+|-|--+---|+..|.++++..+++-++.-|.-.+|++-.-....+.|..- ...++-++
T Consensus 223 ~~~Il~lAK~~e~~e~aR~~~~il~~mFKHSeet~~~Lvaa~~lD~vl~~~rRt~P~lLRH~ALAL~N~~L~~~~a~qrr 302 (832)
T KOG3678|consen 223 LGVILNLAKEREPVELARSVAGILEHMFKHSEETCQRLVAAGGLDAVLYWCRRTDPALLRHCALALGNCALHGGQAVQRR 302 (832)
T ss_pred chhhhhhhhhcCcHHHHHHHHHHHHHHhhhhHHHHHHHHhhcccchheeecccCCHHHHHHHHHHhhhhhhhchhHHHHH
Confidence 4556666666 677888888888875544489999999998899999999999888865555666778654 45556688
Q ss_pred HhhhhHHHHHHHhhcccc
Q psy4702 92 VLSTELMEILMALTLLEG 109 (138)
Q Consensus 92 l~es~~leiL~~~~k~~~ 109 (138)
++|..+-|-|.-++-..+
T Consensus 303 mveKr~~EWLF~LA~skD 320 (832)
T KOG3678|consen 303 MVEKRAAEWLFPLAFSKD 320 (832)
T ss_pred HHHhhhhhhhhhhhcchH
Confidence 999999999887765543
No 74
>PF12830 Nipped-B_C: Sister chromatid cohesion C-terminus
Probab=43.34 E-value=53 Score=25.55 Aligned_cols=74 Identities=28% Similarity=0.360 Sum_probs=0.0
Q ss_pred HHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHh
Q psy4702 46 VCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 46 ~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~ 125 (138)
+|..|.+.-|.+|+. +|.+++..+++.++-++.=++..+ =+-=...+-.|.++.-.+...+...|.+.|+.+-+
T Consensus 1 v~s~l~Qryl~~Il~-~~~~~~~~vr~~Al~~l~~il~qG-----LvnP~~cvp~lIAL~ts~~~~ir~~A~~~l~~l~e 74 (187)
T PF12830_consen 1 VCSALVQRYLKNILE-LCLSSDDSVRLAALQVLELILRQG-----LVNPKQCVPTLIALETSPNPSIRSRAYQLLKELHE 74 (187)
T ss_pred CcHHHHHHHHHHHHH-HHhCCCHHHHHHHHHHHHHHHhcC-----CCChHHHHhHhhhhhCCCChHHHHHHHHHHHHHHH
No 75
>cd04444 DEP_PLEK2 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in pleckstrin 2-like proteins. Pleckstrin 2 is found in a wide variety of cell types, which suggest a more general role in signaling than pleckstrin 1. Pleckstrin-like proteins contain a central DEP domain, flanked by 2 PH (pleckstrin homology) domains.
Probab=42.73 E-value=32 Score=25.65 Aligned_cols=40 Identities=23% Similarity=0.282 Sum_probs=30.6
Q ss_pred HHHhhhhHHHHHHHhhccccH-HHHHHHHHHHHHHHhCCCccCCC
Q psy4702 90 ERVLSTELMEILMALTLLEGE-EVRKLALQALAKGEEYGIIRRPG 133 (138)
Q Consensus 90 ~kl~es~~leiL~~~~k~~~~-~v~~~a~eaLk~~~~~glIk~~~ 133 (138)
.-++-+++++-|+.......+ +.+.+|++ ++++|+|++-.
T Consensus 30 ~CF~Gse~VDWLv~~~~~i~R~EAv~l~q~----Lmd~gli~hV~ 70 (109)
T cd04444 30 KTFLGSALVDWLISNSFAASRLEAVTLASM----LMEENFLRPVG 70 (109)
T ss_pred ccccchHHHHHHHHCCCCCCHHHHHHHHHH----HHhCCchhhHH
Confidence 457888899999999665444 77777765 89999999743
No 76
>smart00288 VHS Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
Probab=42.50 E-value=1.4e+02 Score=22.05 Aligned_cols=73 Identities=22% Similarity=0.303 Sum_probs=48.9
Q ss_pred HHHhhhhh-c-c---HHHHHHHHhcCCCcccchhhhhhhhhhhhc-ChHHHHHHhhhhHHHHHHHhhccccH-H-HHHHH
Q psy4702 45 PVCSMLLE-K-N---WVDIFKQILAHPDVGVQHRALVIVGNILSA-GKELAERVLSTELMEILMALTLLEGE-E-VRKLA 116 (138)
Q Consensus 45 ~~c~~I~~-~-~---wleil~~L~~~~~~~lqHRG~viv~Nmi~a-~~e~a~kl~es~~leiL~~~~k~~~~-~-v~~~a 116 (138)
.+|..|-. + . -+..|..=+.++|+.+|+|.+.++-=++.. +......+-..+.++-|..+.+.... + |.+..
T Consensus 23 ~icD~i~~~~~~~k~a~r~l~krl~~~n~~v~l~AL~lLe~~vkNcg~~f~~ev~s~~fl~~L~~l~~~~~~~~~Vk~ki 102 (133)
T smart00288 23 EICDLINSTPDGPKDAVRLLKKRLNNKNPHVALLALTLLDACVKNCGSKFHLEVASKEFLNELVKLIKPKYPLPLVKKRI 102 (133)
T ss_pred HHHHHHhCCCccHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHhHHHHHHHHHHHcCCCCcHHHHHHH
Confidence 35666644 2 2 233333335689999999999988888764 55677778888889999888776544 3 44433
Q ss_pred H
Q psy4702 117 L 117 (138)
Q Consensus 117 ~ 117 (138)
.
T Consensus 103 l 103 (133)
T smart00288 103 L 103 (133)
T ss_pred H
Confidence 3
No 77
>cd04450 DEP_RGS7-like DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in RGS (regulator of G-protein signaling) proteins of the subfamily R7. This subgroup contains RGS7, RGS6, RGS9 and RGS11. They share a common domain architecture, containing, beside the RGS domain, a DEP domain and a GGL (G-protein gamma subunit-like ) domain. RGS proteins are GTPase-activating (GAP) proteins of heterotrimeric G proteins by increasing the rate of GTP hydrolysis of the alpha subunit. The fungal homologs, like yeast Sst2, share a related common domain architecture, containing RGS and DEP domains. Sst2 has been identified as the principal regulator of mating pheromone signaling and recently the DEP domain of Sst2 has been shown to be necessary and sufficient to mediate receptor interaction.
Probab=40.72 E-value=35 Score=23.62 Aligned_cols=40 Identities=20% Similarity=0.225 Sum_probs=27.5
Q ss_pred HhhhhHHHHHHHhhccccH-HHHHHHHHHHHHHHhCCCccCCCCC
Q psy4702 92 VLSTELMEILMALTLLEGE-EVRKLALQALAKGEEYGIIRRPGEG 135 (138)
Q Consensus 92 l~es~~leiL~~~~k~~~~-~v~~~a~eaLk~~~~~glIk~~~~~ 135 (138)
+.-+++++=|+.++.-.++ +.++. -..++++|||.+-.+.
T Consensus 29 F~G~~~v~WL~~~~~~~~~~EA~~~----~~~ll~~gli~~V~~~ 69 (88)
T cd04450 29 FTGKAIVQWLMDCTDVVDPSEALEI----AALFVKYGLITPVSDH 69 (88)
T ss_pred eEhHHHHHHHHHCCCCCCHHHHHHH----HHHHHHCCCEEEecCC
Confidence 5566778889988764333 54444 4568999999985543
No 78
>PF05918 API5: Apoptosis inhibitory protein 5 (API5); InterPro: IPR008383 This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterised by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in Homo sapiens chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors [,].; PDB: 3U0R_A 3V6A_A.
Probab=39.64 E-value=32 Score=32.15 Aligned_cols=49 Identities=22% Similarity=0.184 Sum_probs=29.2
Q ss_pred hhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCc
Q psy4702 15 RFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDV 68 (138)
Q Consensus 15 klLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~ 68 (138)
.-++=||++||...|..|--+|--+.-..+.-+. +-.+||..|+..+++
T Consensus 62 ~a~~DLcEDed~~iR~~aik~lp~~ck~~~~~v~-----kvaDvL~QlL~tdd~ 110 (556)
T PF05918_consen 62 NAQLDLCEDEDVQIRKQAIKGLPQLCKDNPEHVS-----KVADVLVQLLQTDDP 110 (556)
T ss_dssp HHHHHHHT-SSHHHHHHHHHHGGGG--T--T-HH-----HHHHHHHHHTT---H
T ss_pred HHHHHHHhcccHHHHHHHHHhHHHHHHhHHHHHh-----HHHHHHHHHHhcccH
Confidence 3456699999999999998887655542344444 456777777776664
No 79
>PF14947 HTH_45: Winged helix-turn-helix; PDB: 1XSX_B 1R7J_A.
Probab=38.61 E-value=55 Score=21.94 Aligned_cols=35 Identities=26% Similarity=0.352 Sum_probs=22.6
Q ss_pred HHHHHHHhhccccH--HH-------HHHHHHHHHHHHhCCCccC
Q psy4702 97 LMEILMALTLLEGE--EV-------RKLALQALAKGEEYGIIRR 131 (138)
Q Consensus 97 ~leiL~~~~k~~~~--~v-------~~~a~eaLk~~~~~glIk~ 131 (138)
+++||..+.+...+ ++ -..+.+-|+.|++.|||+.
T Consensus 8 i~~IL~~l~~~~~~~t~i~~~~~L~~~~~~~yL~~L~~~gLI~~ 51 (77)
T PF14947_consen 8 IFDILKILSKGGAKKTEIMYKANLNYSTLKKYLKELEEKGLIKK 51 (77)
T ss_dssp HHHHHHHH-TT-B-HHHHHTTST--HHHHHHHHHHHHHTTSEEE
T ss_pred HHHHHHHHHcCCCCHHHHHHHhCcCHHHHHHHHHHHHHCcCeeC
Confidence 45677777533221 21 4667888999999999965
No 80
>cd04448 DEP_PIKfyve DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
Probab=38.15 E-value=53 Score=22.65 Aligned_cols=41 Identities=17% Similarity=0.087 Sum_probs=27.8
Q ss_pred HHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 91 RVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
-+.-+++++-|..-.+-..+ +.|+.--..|++.|+|++-.+
T Consensus 28 cF~GselVdWL~~~~~~~~R---~eAv~~gq~Ll~~g~i~hV~~ 68 (81)
T cd04448 28 CILGKELVNWLIRQGKAATR---VQAIAIGQALLDAGWIECVSD 68 (81)
T ss_pred ccChHHHHHHHHHcCCCCCH---HHHHHHHHHHHHCCCEEecCC
Confidence 46777888888876543322 345555566999999998554
No 81
>PF00610 DEP: Domain found in Dishevelled, Egl-10, and Pleckstrin (DEP); InterPro: IPR000591 This entry represents the DEP (Dishevelled, Egl-10 and Pleckstrin) domain, a globular domain of about 80 residues that is found in over 50 proteins involved in G-protein signalling pathways. It was named after the three proteins it was initially found in: Dishevelled (Dsh and Dvl), which play a key role in the transduction of the Wg/Wnt signal from the cell surface to the nucleus; it is a segment polarity protein required to establish coherent arrays of polarized cells and segments in embryos, and plays a role in wingless signalling. Egl-10, which regulates G-protein signalling in the central nervous system. Pleckstrin, the major substrate of protein kinase C in platelets; Pleckstrin contains two PH domains flanking the DEP domain. Mammalian regulators of G-protein signalling also contain these domains, and regulate signal transduction by increasing the GTPase activity of G-protein alpha subunits, thereby driving them into their inactive GDP-bound form. It has been proposed that the DEP domain could play a selective role in targeting DEP domain-containing proteins to specific subcellular membranous sites, perhaps even to specific G protein-coupled signaling pathways [, ]. Nuclear magnetic resonance spectroscopy has revealed that the DEP domain comprises a three-helix bundle, a beta-hairpin 'arm' composed of two beta-strands and two short beta-strands in the C-terminal region [].; GO: 0035556 intracellular signal transduction; PDB: 1UHW_A 1V3F_A 2YSR_A 2CSO_A 1W4M_A 2PBI_C 1O7F_A 2BYV_E 1FSH_A 3ML6_D ....
Probab=37.52 E-value=58 Score=21.05 Aligned_cols=40 Identities=18% Similarity=0.076 Sum_probs=25.8
Q ss_pred hhhhHHHHHHHhhcc--ccHHHHHHHHHHHHHHHhCCCccCCCCC
Q psy4702 93 LSTELMEILMALTLL--EGEEVRKLALQALAKGEEYGIIRRPGEG 135 (138)
Q Consensus 93 ~es~~leiL~~~~k~--~~~~v~~~a~eaLk~~~~~glIk~~~~~ 135 (138)
.-+++++-|+..... ..+ +.|++-...+++.|+|++..+.
T Consensus 19 ~G~e~v~WL~~~~~~~~~~r---~eA~~l~q~Ll~~g~i~~v~~~ 60 (74)
T PF00610_consen 19 TGSEAVDWLMDNFEGFVRDR---EEAVQLGQELLDHGFIEHVSDK 60 (74)
T ss_dssp EHHHHHHHHHHTSCTSTSSH---HHHHHHHHHHHHCTSEEESSSS
T ss_pred EhHHHHHHHHHhccccccCH---HHHHHHHHHHHHCCCEEECCCC
Confidence 346678889876542 222 4445555568899999986544
No 82
>KOG1993|consensus
Probab=37.21 E-value=1.2e+02 Score=30.15 Aligned_cols=82 Identities=29% Similarity=0.395 Sum_probs=56.4
Q ss_pred ccHHHHHHHHh---cC----CCcccchhhhhhhhhhhh--cChHHH--HHHhhhhHHHHHHHhhc---------------
Q psy4702 53 KNWVDIFKQIL---AH----PDVGVQHRALVIVGNILS--AGKELA--ERVLSTELMEILMALTL--------------- 106 (138)
Q Consensus 53 ~~wleil~~L~---~~----~~~~lqHRG~viv~Nmi~--a~~e~a--~kl~es~~leiL~~~~k--------------- 106 (138)
..|++.+-.|. .+ .|...|||-+.+++-++. +++.++ .|.+.+.+-+|+.-++-
T Consensus 115 reWP~Lf~~L~~~Lq~~~~~gD~~~~~RiLi~l~~ilK~Lat~RL~a~rk~F~el~~~I~~~l~~~l~s~lt~~~lq~~s 194 (978)
T KOG1993|consen 115 REWPDLFPDLLGQLQSSLGTGDSLVQHRILITLHHILKALATKRLLADRKAFYELAPEILTILAPILWSSLTMMFLQSVS 194 (978)
T ss_pred ccchhHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHhHHHHHHHHHHHhcchHHHHHHHhh
Confidence 57888877764 33 488999999999999876 466665 67777777777762221
Q ss_pred cccH-------HHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 107 LEGE-------EVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 107 ~~~~-------~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
..++ ++--.+.++|.+++=+|+.+|.++
T Consensus 195 s~~ea~~LsalQ~s~~~lk~lRrlvv~G~~~P~ks 229 (978)
T KOG1993|consen 195 SIKEATLLSALQRSYLTLKVLRRLVVFGFQNPSKS 229 (978)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhccCCcchh
Confidence 1011 334455677778888999998754
No 83
>KOG3678|consensus
Probab=35.56 E-value=72 Score=30.36 Aligned_cols=41 Identities=7% Similarity=0.161 Sum_probs=30.0
Q ss_pred cchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccH
Q psy4702 70 VQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGE 110 (138)
Q Consensus 70 lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~ 110 (138)
++.--+=|+-||..++.+.+..++++|+++...--.+.++.
T Consensus 238 ~aR~~~~il~~mFKHSeet~~~Lvaa~~lD~vl~~~rRt~P 278 (832)
T KOG3678|consen 238 LARSVAGILEHMFKHSEETCQRLVAAGGLDAVLYWCRRTDP 278 (832)
T ss_pred HHHHHHHHHHHHhhhhHHHHHHHHhhcccchheeecccCCH
Confidence 33333446778888899999999999999876665566544
No 84
>KOG4646|consensus
Probab=33.78 E-value=85 Score=25.16 Aligned_cols=110 Identities=15% Similarity=0.108 Sum_probs=78.7
Q ss_pred CChhhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHH
Q psy4702 11 CSQVRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAE 90 (138)
Q Consensus 11 n~RlklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~ 90 (138)
-|=|.+.+.--.++|+.+.-=+-|+|+-|.. ++...+-|.+.-++..+..-..+|.+...|-.+--.+-|--.++.--.
T Consensus 57 l~vLdlFvdsl~e~ne~LvefgIgglCNlC~-d~~n~~~I~ea~g~plii~~lssp~e~tv~sa~~~l~~l~~~~Rt~r~ 135 (173)
T KOG4646|consen 57 LDVLDLFVDSLEEQNELLVEFGIGGLCNLCL-DKTNAKFIREALGLPLIIFVLSSPPEITVHSAALFLQLLEFGERTERD 135 (173)
T ss_pred hhHHHHHHHHhhcccHHHHHHhHHHHHhhcc-ChHHHHHHHHhcCCceEEeecCCChHHHHHHHHHHHHHhcCcccchhH
Confidence 3445566666677999999999999999998 799999998866777777777888888888888777766555555556
Q ss_pred HHhhhhHHHHHHHhhccccH-HHHHHHHHHHHH
Q psy4702 91 RVLSTELMEILMALTLLEGE-EVRKLALQALAK 122 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~-~v~~~a~eaLk~ 122 (138)
.+....+++..-... .+.. .-.-+|...|++
T Consensus 136 ell~p~Vv~~v~r~~-~s~s~~~rnLa~~fl~~ 167 (173)
T KOG4646|consen 136 ELLSPAVVRTVQRWR-ESKSHDERNLASAFLDK 167 (173)
T ss_pred HhccHHHHHHHHHHH-HHhhHHHHHHHHHHHHh
Confidence 777777777666554 2222 233455555554
No 85
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=33.01 E-value=1.8e+02 Score=23.72 Aligned_cols=92 Identities=27% Similarity=0.241 Sum_probs=56.1
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHh
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVL 93 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~ 93 (138)
+..++-+-..+|..+|..|+=.+..+-+ +.-++-+..++.++++.+|+-++..+.++ .
T Consensus 45 ~~~~~~~l~~~~~~vr~~aa~~l~~~~~-----------~~av~~l~~~l~d~~~~vr~~a~~aLg~~-----------~ 102 (335)
T COG1413 45 ADELLKLLEDEDLLVRLSAAVALGELGS-----------EEAVPLLRELLSDEDPRVRDAAADALGEL-----------G 102 (335)
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHhhhch-----------HHHHHHHHHHhcCCCHHHHHHHHHHHHcc-----------C
Confidence 3444444444577777777766444322 35566677788888887777777744433 2
Q ss_pred hhhHHHHHHHhhccccH-HHHHHHHHHHHHHHhCC
Q psy4702 94 STELMEILMALTLLEGE-EVRKLALQALAKGEEYG 127 (138)
Q Consensus 94 es~~leiL~~~~k~~~~-~v~~~a~eaLk~~~~~g 127 (138)
....++.|..+.+.+.+ .|+..+..+|..+-+..
T Consensus 103 ~~~a~~~li~~l~~d~~~~vR~~aa~aL~~~~~~~ 137 (335)
T COG1413 103 DPEAVPPLVELLENDENEGVRAAAARALGKLGDER 137 (335)
T ss_pred ChhHHHHHHHHHHcCCcHhHHHHHHHHHHhcCchh
Confidence 33345566666555444 77777777777665544
No 86
>KOG1293|consensus
Probab=32.81 E-value=1.8e+02 Score=28.10 Aligned_cols=91 Identities=13% Similarity=0.096 Sum_probs=65.4
Q ss_pred hhhhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHH
Q psy4702 14 VRFLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERV 92 (138)
Q Consensus 14 lklLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl 92 (138)
++.+.-+...-|.-+|.-+-=+|==+...++..-+.-+. +=+...+..+|.||+.++|--..-..||+++...+.-.-+
T Consensus 463 Id~l~s~~~~~~~n~r~~~~~~Lr~l~f~~de~~k~~~~~ki~a~~i~~l~nd~d~~Vqeq~fqllRNl~c~~~~svdfl 542 (678)
T KOG1293|consen 463 IDILESMLTDPDFNSRANSLWVLRHLMFNCDEEEKFQLLAKIPANLILDLINDPDWAVQEQCFQLLRNLTCNSRKSVDFL 542 (678)
T ss_pred HHHHHHHhcCCCchHHHHHHHHHHHHHhcchHHHHHHHHHHhhHHHHHHHHhCCCHHHHHHHHHHHHHhhcCcHHHHHHH
Confidence 455566677777777766656665555434444444444 5567788899999999999999999999999988877777
Q ss_pred hhhhHHHHHHHh
Q psy4702 93 LSTELMEILMAL 104 (138)
Q Consensus 93 ~es~~leiL~~~ 104 (138)
+++-.+++.+.-
T Consensus 543 l~~~~~~ld~i~ 554 (678)
T KOG1293|consen 543 LEKFKDVLDKID 554 (678)
T ss_pred HHhhhHHHHHHH
Confidence 777666655543
No 87
>cd04371 DEP DEP domain, named after Dishevelled, Egl-10, and Pleckstrin, where this domain was first discovered. The function of this domain is still not clear, but it is believed to be important for the membrane association of the signaling proteins in which it is present. New studies show that the DEP domain of Sst2, a yeast RGS protein is necessary and sufficient for receptor interaction.
Probab=32.42 E-value=62 Score=21.05 Aligned_cols=40 Identities=15% Similarity=0.097 Sum_probs=27.9
Q ss_pred HhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 92 VLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 92 l~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
+..+++++-|+......++ +.|++-...++++|+|.+...
T Consensus 29 F~G~e~v~WL~~~~~~~~r---~ea~~~~~~ll~~g~i~~v~~ 68 (81)
T cd04371 29 FTGSELVDWLLDNLEAITR---EEAVELGQALLKHGLIHHVSD 68 (81)
T ss_pred eEcHHHHHHHHHhCCCCCH---HHHHHHHHHHHHCCCEEEeCC
Confidence 4566778888888764332 445555667889999998654
No 88
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=32.19 E-value=70 Score=24.60 Aligned_cols=78 Identities=19% Similarity=0.080 Sum_probs=49.9
Q ss_pred hhhhhhcCCCCHHHHHhhhhhhhhhccCCH---HHHhhhhh-ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHH
Q psy4702 15 RFLFLLTCEEDTETACAAAGALAMLTSVST---PVCSMLLE-KNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAE 90 (138)
Q Consensus 15 klLvll~~eED~~tr~AA~GALAmLT~~~~---~~c~~I~~-~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~ 90 (138)
.++...+..-.+..|..++-.+...-..++ ....+-.. ++..+.+..+..|+++++|.-+-.++..+...-++-|+
T Consensus 134 ~~l~~~~~~Kn~~vR~~~~~~l~~~l~~~~~~~~~l~~~~~~~~l~~~l~~~l~D~~~~VR~~Ar~~~~~l~~~~~~~a~ 213 (228)
T PF12348_consen 134 EILSQGLKSKNPQVREECAEWLAIILEKWGSDSSVLQKSAFLKQLVKALVKLLSDADPEVREAARECLWALYSHFPERAE 213 (228)
T ss_dssp HHHHHHTT-S-HHHHHHHHHHHHHHHTT-----GGG--HHHHHHHHHHHHHHHTSS-HHHHHHHHHHHHHHHHHH-HHH-
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHHHccchHhhhcccchHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHhhc
Confidence 566666777888888888877776644334 22222222 56788888899999999999999888888776666666
Q ss_pred HH
Q psy4702 91 RV 92 (138)
Q Consensus 91 kl 92 (138)
.+
T Consensus 214 ~~ 215 (228)
T PF12348_consen 214 SI 215 (228)
T ss_dssp --
T ss_pred cc
Confidence 54
No 89
>COG5096 Vesicle coat complex, various subunits [Intracellular trafficking and secretion]
Probab=31.46 E-value=2.2e+02 Score=27.80 Aligned_cols=94 Identities=24% Similarity=0.269 Sum_probs=70.0
Q ss_pred CCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHH
Q psy4702 23 EEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILM 102 (138)
Q Consensus 23 eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~ 102 (138)
..++.+|..|-=++.+|=- +.+. ++.+.-++.+..|+++.+|--++.+|.++-.-+++. +.+.|..+++.
T Consensus 103 d~N~~iR~~AlR~ls~l~~--~el~-----~~~~~~ik~~l~d~~ayVRk~Aalav~kly~ld~~l---~~~~g~~~~l~ 172 (757)
T COG5096 103 DPNEEIRGFALRTLSLLRV--KELL-----GNIIDPIKKLLTDPHAYVRKTAALAVAKLYRLDKDL---YHELGLIDILK 172 (757)
T ss_pred CCCHHHHHHHHHHHHhcCh--HHHH-----HHHHHHHHHHccCCcHHHHHHHHHHHHHHHhcCHhh---hhcccHHHHHH
Confidence 4667788777666655532 2222 446777899999999999999999999999877654 33456889999
Q ss_pred HhhccccHHHHHHHHHHHHHHHhC
Q psy4702 103 ALTLLEGEEVRKLALQALAKGEEY 126 (138)
Q Consensus 103 ~~~k~~~~~v~~~a~eaLk~~~~~ 126 (138)
.+...++..|+..|.-+|...-..
T Consensus 173 ~l~~D~dP~Vi~nAl~sl~~i~~e 196 (757)
T COG5096 173 ELVADSDPIVIANALASLAEIDPE 196 (757)
T ss_pred HHhhCCCchHHHHHHHHHHHhchh
Confidence 998777668888888888766544
No 90
>PRK14707 hypothetical protein; Provisional
Probab=31.23 E-value=2.3e+02 Score=31.46 Aligned_cols=106 Identities=22% Similarity=0.167 Sum_probs=83.5
Q ss_pred hhhhhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhh
Q psy4702 16 FLFLLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLST 95 (138)
Q Consensus 16 lLvll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es 95 (138)
+|-+||-=-|.+..++|+-+||+=-..++...+.+-....-..|-.||.=|+..+-.-++.-+.--+.++.++.+.+---
T Consensus 378 ~lNalsKWp~~~~c~~aa~~LA~~l~~d~~l~~~~~~Q~van~lnalsKWPd~~~C~~aa~~lA~~la~d~~l~~~~~p~ 457 (2710)
T PRK14707 378 VLNALSKWPDTPVCAAAASALAEHVVDDLELRKGLDPQGVSNALNALAKWPDLPICGQAVSALAGRLAHDTELCKALDPI 457 (2710)
T ss_pred HHhhhhcCCCchHHHHHHHHHHHHhccChhhhhhcchhhHHHHHHHhhcCCcchhHHHHHHHHHHHHhccHHHHhhcChH
Confidence 56677888889999999999998666566776665446677788889999999987777777777788999999999999
Q ss_pred hHHHHHHHhhccccHHHHHHHHHHHH
Q psy4702 96 ELMEILMALTLLEGEEVRKLALQALA 121 (138)
Q Consensus 96 ~~leiL~~~~k~~~~~v~~~a~eaLk 121 (138)
++..+|-+++|=.+.++...|.+.|-
T Consensus 458 ~va~~LnalSKWPd~p~c~~aa~~La 483 (2710)
T PRK14707 458 NVTQALDALSKWPDTPICGQTASALA 483 (2710)
T ss_pred HHHHHHHHhhcCCCChhHHHHHHHHH
Confidence 99999999999876555544444443
No 91
>PF08216 CTNNBL: Catenin-beta-like, Arm-motif containing nuclear; InterPro: IPR013180 This domain is found in eukaryotic proteins. A human nuclear protein with this domain (Q8WYA6 from SWISSPROT) is thought to have a role in apoptosis [].
Probab=30.88 E-value=59 Score=24.20 Aligned_cols=36 Identities=19% Similarity=0.328 Sum_probs=27.9
Q ss_pred hhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccc
Q psy4702 35 ALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQ 71 (138)
Q Consensus 35 ALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lq 71 (138)
.|-.|++ .|..=.-+++-+.+.+|.+|+.|+|.+|.
T Consensus 69 ~l~~La~-~P~LYp~lv~l~~v~sL~~LL~HeN~DIa 104 (108)
T PF08216_consen 69 KLSVLAT-APELYPELVELGAVPSLLGLLSHENTDIA 104 (108)
T ss_pred HHHHccC-ChhHHHHHHHcCCHHHHHHHHCCCCccee
Confidence 4556666 68776666666799999999999999873
No 92
>KOG1061|consensus
Probab=30.58 E-value=1.1e+02 Score=29.82 Aligned_cols=61 Identities=21% Similarity=0.231 Sum_probs=47.9
Q ss_pred CCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcC
Q psy4702 22 CEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAG 85 (138)
Q Consensus 22 ~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~ 85 (138)
..+|.-+|++|+-..|-|=..++..|.. ...++.|+.|..|++|.+.--.+.-++++.+.+
T Consensus 131 ~d~~~yvRktaa~~vakl~~~~~~~~~~---~gl~~~L~~ll~D~~p~VVAnAlaaL~eI~e~~ 191 (734)
T KOG1061|consen 131 KDDDPYVRKTAAVCVAKLFDIDPDLVED---SGLVDALKDLLSDSNPMVVANALAALSEIHESH 191 (734)
T ss_pred cCCChhHHHHHHHHHHHhhcCChhhccc---cchhHHHHHHhcCCCchHHHHHHHHHHHHHHhC
Confidence 3478889999999999887755555554 478999999999999998777777777776654
No 93
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=30.43 E-value=1.3e+02 Score=27.57 Aligned_cols=101 Identities=16% Similarity=0.229 Sum_probs=79.7
Q ss_pred HHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcC---hHHHHHHhhhhHHHHHHH
Q psy4702 27 ETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAG---KELAERVLSTELMEILMA 103 (138)
Q Consensus 27 ~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~---~e~a~kl~es~~leiL~~ 103 (138)
..|.-|.=++.-.|+-.-.-.+++++.+-..-|..|+..-+--++.-+.--|.|...+. .++-+.+++.|...-|-.
T Consensus 342 ~irKEaCWTiSNITAGnteqiqavid~nliPpLi~lls~ae~k~kKEACWAisNatsgg~~~PD~iryLv~qG~IkpLc~ 421 (526)
T COG5064 342 NIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEYKIKKEACWAISNATSGGLNRPDIIRYLVSQGFIKPLCD 421 (526)
T ss_pred hhhhhhheeecccccCCHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHhhhccccCCchHHHHHHHccchhHHHH
Confidence 67777777777777756677778888667777788888888888888888999988753 577788999999999988
Q ss_pred hhccccHHHHHHHHHHHHHHHhCC
Q psy4702 104 LTLLEGEEVRKLALQALAKGEEYG 127 (138)
Q Consensus 104 ~~k~~~~~v~~~a~eaLk~~~~~g 127 (138)
+.--.+++|.+.+..|+.-...-|
T Consensus 422 ~L~~~dNkiiev~LD~~eniLk~G 445 (526)
T COG5064 422 LLDVVDNKIIEVALDAIENILKVG 445 (526)
T ss_pred HHhccCccchhhhHHHHHHHHhhh
Confidence 887777788888888776655443
No 94
>PRK09334 30S ribosomal protein S25e; Provisional
Probab=30.12 E-value=41 Score=24.08 Aligned_cols=50 Identities=18% Similarity=0.209 Sum_probs=34.3
Q ss_pred hhhhhhhhhcChHHHHHHhhhhHHHHHHHhhcc--------ccH-HH-HHHHHHHHHHHHhCCCccCC
Q psy4702 75 LVIVGNILSAGKELAERVLSTELMEILMALTLL--------EGE-EV-RKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 75 ~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~--------~~~-~v-~~~a~eaLk~~~~~glIk~~ 132 (138)
-..+.|++.=|++.-+| ++..+-+. +++ .| ..+|+.+|+-|++.|+|++-
T Consensus 15 ~~~~~n~v~~dk~t~dk--------l~kEV~~~K~ITps~lserlkI~~SlAr~~Lr~L~~kG~Ik~V 74 (86)
T PRK09334 15 KEIKSTIITLDEELLKR--------VAKEVKKEKIVTPYTLASKYGIKISVAKKVLRELEKRGVLVLY 74 (86)
T ss_pred hhhhhceeecCHHHHHH--------HHHHhccCcEEcHHHHHHHhcchHHHHHHHHHHHHHCCCEEEE
Confidence 34566777777777776 34444332 112 44 78999999999999999874
No 95
>PF01022 HTH_5: Bacterial regulatory protein, arsR family; InterPro: IPR001845 Bacterial transcription regulatory proteins that bind DNA via a helix-turn-helix (HTH) motif can be grouped into families on the basis of sequence similarities. One such group, termed arsR, includes several proteins that appear to dissociate from DNA in the presence of metal ions: arsR, which functions as a transcriptional repressor of an arsenic resistance operon; smtB from Synechococcus sp. (strain PCC 7942), which acts as a transcriptional repressor of the smtA gene that codes for a metallothionein; cadC, a protein required for cadmium-resistance; and hypothetical protein yqcJ from Bacillus subtilis. The HTH motif is thought to be located in the central part of these proteins []. The motif is characterised by a number of well-conserved residues: at its N-terminal extremity is a cysteine residue; a second Cys is found in arsR and cadC, but not in smtA; and at the C terminus lie one or two histidines. These residues may be involved in metal-binding (Zn in smtB; metal-oxyanions such as arsenite, antimonite and arsenate for arsR; and cadmium for cadC) []. It is believed that binding of a metal ion could induce a conformational change that would prevent the protein from binding DNA []. The crystal structure of the cyanobacterial smtB shows a fold of five alpha-helices (H) and a pair of antiparallel beta-strands (B) in the topology H1-H2-H3-H4-B1-B2-H5. Helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing as in other wHTH, such as the dtxR-type or the merR-type. Helix 4 is termed the recognition helix, like in other HTHs where it binds the DNA major groove. Most arsR/smtB-like metalloregulators form homodimers []. The dimer interface is formed by helix 5 and an N-terminal part []. Two distinct metal-binding sites have been identified. The first site comprises cysteine thiolates located in the HTH in helix 3 and for some cases in the N terminus, called the alpha3(N) site []. The second metal-binding site is located in helix 5 (and C terminus) and is called the alpha5(C) site. The alpha3N site binds large thiophilic, toxic metals including Cd, Pb, and Bi, as in S. aureus cadC. ArsR lacks the N-terminal arm and its alpha3 site coordinates smaller thiophilic ions like As and Sb. The alpha5 site contains carboxylate and imidazole ligands and interacts preferentially with biologically required metal ions including Zn, Co, and Ni. ArsR-type metalloregulators contain one of these sites, both, or other potential metal-binding sites [, ]. Binding of metal ions to these sites leads to allosteric changes that can derepress the operator/promotor DNA. The metal-inducible operons contain one or two imperfect 12-2-12 inverted repeats, which can be recognised by multimeric arsR-type metalloregulators. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3CUO_A 1U2W_C 3F72_C 3F6V_A 3JTH_B 2P4W_B 1KU9_B 2LKP_B 1SMT_A 1R22_B ....
Probab=30.10 E-value=89 Score=18.78 Aligned_cols=34 Identities=24% Similarity=0.255 Sum_probs=22.1
Q ss_pred HHHHHHhhccccH--HH-------HHHHHHHHHHHHhCCCccC
Q psy4702 98 MEILMALTLLEGE--EV-------RKLALQALAKGEEYGIIRR 131 (138)
Q Consensus 98 leiL~~~~k~~~~--~v-------~~~a~eaLk~~~~~glIk~ 131 (138)
++||..+.+.+.. ++ ......=|+.+.+.|+|++
T Consensus 5 ~~Il~~L~~~~~~~~el~~~l~~s~~~vs~hL~~L~~~glV~~ 47 (47)
T PF01022_consen 5 LRILKLLSEGPLTVSELAEELGLSQSTVSHHLKKLREAGLVEK 47 (47)
T ss_dssp HHHHHHHTTSSEEHHHHHHHHTS-HHHHHHHHHHHHHTTSEEE
T ss_pred HHHHHHHHhCCCchhhHHHhccccchHHHHHHHHHHHCcCeeC
Confidence 5677777775431 11 3344555999999999974
No 96
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=29.98 E-value=2.3e+02 Score=25.15 Aligned_cols=19 Identities=11% Similarity=0.085 Sum_probs=13.4
Q ss_pred ccHHHHHHHHhcCCCcccc
Q psy4702 53 KNWVDIFKQILAHPDVGVQ 71 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lq 71 (138)
+.+...|..++.++++.+|
T Consensus 116 ~~a~~~L~~~L~~~~p~vR 134 (410)
T TIGR02270 116 RQAEPWLEPLLAASEPPGR 134 (410)
T ss_pred hHHHHHHHHHhcCCChHHH
Confidence 4567777788877777654
No 97
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM. Based on regions surrounding the domain, VHS-proteins can be divided into 4 groups []: STAM/EAST/Hbp which all share the domain composition VHS-SH3-ITAM and carry one or two ubiquitin-interacting motifs Proteins with a FYVE domain (IPR000306 from INTERPRO) C-terminal to VHS which also carry one or two ubiquitin-interacting motifs GGA proteins with a domain composition VHS-GAT (GGA and Tom1) homology domain VHS domain alone or in combination with domains other than those listed above The VHS domain is always found at the N- terminus of proteins suggesting that such topology is important for function. The domain is considered to have a general membrane targeting/cargo recognition role in vesicular trafficking []. Resolution of the crystal structure of the VHS domain of Drosophila Hrs and human Tom1 revealed that it consists of eight helices arranged in a double-layer superhelix []. The existence of conserved patches of residues on the domain surface suggests that VHS domains may be involved in protein-protein recognition and docking. Overall, sequence similarity is low (approx 25%) amongst domain family members.; GO: 0006886 intracellular protein transport; PDB: 1X5B_A 2L0T_B 1DVP_A 3LDZ_C 3ZYQ_A 4AVX_A 3G2U_A 3G2W_A 1UJJ_A 3G2V_A ....
Probab=28.98 E-value=2.4e+02 Score=20.76 Aligned_cols=64 Identities=22% Similarity=0.305 Sum_probs=45.3
Q ss_pred HHHHHHhcCCCcccchhhhhhhhhhhhcC-hHHHHHHhhhhHHHHHHHhhccccH-H---HHHHHHHHH
Q psy4702 57 DIFKQILAHPDVGVQHRALVIVGNILSAG-KELAERVLSTELMEILMALTLLEGE-E---VRKLALQAL 120 (138)
Q Consensus 57 eil~~L~~~~~~~lqHRG~viv~Nmi~a~-~e~a~kl~es~~leiL~~~~k~~~~-~---v~~~a~eaL 120 (138)
..|..=+.++++.+|.|.+.++-=++... ..+...+-..+.++-|..+.+.+.. . |.+.+.+.+
T Consensus 45 ~~l~krl~~~~~~vq~~aL~lld~lvkNcg~~f~~ev~~~~fl~~l~~l~~~~~~~~~~~Vk~k~l~ll 113 (140)
T PF00790_consen 45 RALRKRLKHGNPNVQLLALTLLDALVKNCGPRFHREVASKEFLDELVKLIKSKKTDPETPVKEKILELL 113 (140)
T ss_dssp HHHHHHHTTSSHHHHHHHHHHHHHHHHHSHHHHHHHHTSHHHHHHHHHHHHHTTTHHHSHHHHHHHHHH
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHHcCCHHHHHHHhHHHHHHHHHHHHccCCCCchhHHHHHHHHHH
Confidence 33334457899999999998888887755 4566778888889999988776543 2 555554443
No 98
>PF12530 DUF3730: Protein of unknown function (DUF3730) ; InterPro: IPR022542 This domain is found in eukaryotes, and is typically between 220 and 262 amino acids in length.
Probab=28.68 E-value=3.1e+02 Score=22.00 Aligned_cols=109 Identities=19% Similarity=0.201 Sum_probs=62.5
Q ss_pred hhhhhcCCCCHHHHHhhhhhhhhhccCCHH----HHhhhh----h--ccHHHHHHHHhcC---------CCcccchhhhh
Q psy4702 16 FLFLLTCEEDTETACAAAGALAMLTSVSTP----VCSMLL----E--KNWVDIFKQILAH---------PDVGVQHRALV 76 (138)
Q Consensus 16 lLvll~~eED~~tr~AA~GALAmLT~~~~~----~c~~I~----~--~~wleil~~L~~~---------~~~~lqHRG~v 76 (138)
.|=-+|..++ ......-.+|.+|+..... ++-+++ . +++...|+.++.. .+.+-.++-.+
T Consensus 24 ~L~~l~~~~~-~~~~~v~~~L~~L~~~~~~~~~~~~~rLl~~lw~~~~r~f~~L~~~L~~~~~r~~~~~~~~~~~~~~~i 102 (234)
T PF12530_consen 24 ALPSLACHKN-VCVPPVLQTLVSLVEQGSLELRYVALRLLTLLWKANDRHFPFLQPLLLLLILRIPSSFSSKDEFWECLI 102 (234)
T ss_pred HHHHHhccCc-cchhHHHHHHHHHHcCCchhHHHHHHHHHHHHHHhCchHHHHHHHHHHHHHhhcccccCCCcchHHHHH
Confidence 3444556666 6777777777777763222 122322 2 6666788776655 33344444434
Q ss_pred ----hhhhhhhcChHHHHHHhhhhHHHHHHHhhc-cccHHHHHHHHHHHHHHHhCCCcc
Q psy4702 77 ----IVGNILSAGKELAERVLSTELMEILMALTL-LEGEEVRKLALQALAKGEEYGIIR 130 (138)
Q Consensus 77 ----iv~Nmi~a~~e~a~kl~es~~leiL~~~~k-~~~~~v~~~a~eaLk~~~~~glIk 130 (138)
.++.+.....+ .-.+++..|+.+.+ .+...+...+.++|..+-+...|.
T Consensus 103 ~~a~s~~~ic~~~p~-----~g~~ll~~ls~~L~~~~~~~~~alale~l~~Lc~~~vvd 156 (234)
T PF12530_consen 103 SIAASIRDICCSRPD-----HGVDLLPLLSGCLNQSCDEVAQALALEALAPLCEAEVVD 156 (234)
T ss_pred HHHHHHHHHHHhChh-----hHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHhhcc
Confidence 23333333433 23345667777774 333377899999999888776654
No 99
>KOG3665|consensus
Probab=28.60 E-value=1.2e+02 Score=29.04 Aligned_cols=89 Identities=28% Similarity=0.376 Sum_probs=62.3
Q ss_pred hcCCCCHHHHHhhhhhhhhhccCC---------HHHHhhhhh--ccHHH--------------HHHHHhcCCCcccchhh
Q psy4702 20 LTCEEDTETACAAAGALAMLTSVS---------TPVCSMLLE--KNWVD--------------IFKQILAHPDVGVQHRA 74 (138)
Q Consensus 20 l~~eED~~tr~AA~GALAmLT~~~---------~~~c~~I~~--~~wle--------------il~~L~~~~~~~lqHRG 74 (138)
+...++.+-.-=|.|.||.+.+.+ +.+...+.+ ++|+. +..-+-....++.||=+
T Consensus 566 ~~~w~~~ersY~~~siLa~ll~~~~~~~~~~~r~~~~~~l~e~i~~~~~~~~~~~~~~~f~~~~~~il~~s~~~g~~lWa 645 (699)
T KOG3665|consen 566 LNKWDSIERSYNAASILALLLSDSEKTTECVFRNSVNELLVEAISRWLTSEIRVINDRSFFPRILRILRLSKSDGSQLWA 645 (699)
T ss_pred HhhcchhhHHHHHHHHHHHHHhCCCcCccccchHHHHHHHHHHhhccCccceeehhhhhcchhHHHHhcccCCCchHHHH
Confidence 333344345556778888887732 344444544 44321 33334456789999999
Q ss_pred hhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccc
Q psy4702 75 LVIVGNILSAGKELAERVLSTELMEILMALTLLE 108 (138)
Q Consensus 75 ~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~ 108 (138)
+.-+.|++...++..+.+.++++++.+...-..+
T Consensus 646 l~ti~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 679 (699)
T KOG3665|consen 646 LWTIKNVLEQNKEYCKLVRESNGFELIENIRVLS 679 (699)
T ss_pred HHHHHHHHHcChhhhhhhHhccchhhhhhcchhH
Confidence 9999999999999888899999999998886655
No 100
>PF14394 DUF4423: Domain of unknown function (DUF4423)
Probab=28.59 E-value=52 Score=25.66 Aligned_cols=22 Identities=23% Similarity=0.398 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHhCCCccCCCC
Q psy4702 113 RKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glIk~~~~ 134 (138)
.+.+.++|+.|++.|||++..+
T Consensus 55 ~~ev~~sL~~L~~~gli~k~~~ 76 (171)
T PF14394_consen 55 AEEVRDSLEFLEKLGLIKKDGD 76 (171)
T ss_pred HHHHHHHHHHHHHCCCeEECCC
Confidence 4678899999999999999876
No 101
>PF03810 IBN_N: Importin-beta N-terminal domain; InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=28.57 E-value=68 Score=20.38 Aligned_cols=32 Identities=22% Similarity=0.371 Sum_probs=25.3
Q ss_pred cHHHHHHHHhcCC--CcccchhhhhhhhhhhhcC
Q psy4702 54 NWVDIFKQILAHP--DVGVQHRALVIVGNILSAG 85 (138)
Q Consensus 54 ~wleil~~L~~~~--~~~lqHRG~viv~Nmi~a~ 85 (138)
.....+..++.++ ++.+|+=+++.++|.|...
T Consensus 14 ~~~~~l~~il~~~~~~~~~R~~A~i~LKn~I~~~ 47 (77)
T PF03810_consen 14 GFWQYLLQILSSNSQDPEVRQLAAILLKNLIKKN 47 (77)
T ss_dssp CHHHHHHHHHHCTTSCHHHHHHHHHHHHHHHHHS
T ss_pred hHHHHHHHHHHccCCCHHHHHHHHHHHHHHHHHc
Confidence 6667777777554 6789999999999999765
No 102
>TIGR02270 conserved hypothetical protein. Members are found in Myxococcus xanthus (six members), Geobacter sulfurreducens, and Pseudomonas aeruginosa; a short protein homologous to the N-terminal region is found in Mesorhizobium loti. All sequence are from Proteobacteria. The function is unknown.
Probab=28.41 E-value=2.4e+02 Score=25.14 Aligned_cols=37 Identities=14% Similarity=-0.055 Sum_probs=24.8
Q ss_pred HHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCc
Q psy4702 91 RVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGII 129 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glI 129 (138)
++-....++.|......+ .+...|-+++++..+-.|-
T Consensus 264 ~lg~p~av~~L~~~l~d~--~~aR~A~eA~~~ItG~~l~ 300 (410)
T TIGR02270 264 LVGDVEAAPWCLEAMREP--PWARLAGEAFSLITGMDVA 300 (410)
T ss_pred HcCCcchHHHHHHHhcCc--HHHHHHHHHHHHhhCCCcc
Confidence 344455566666665544 5788899999988775554
No 103
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=25.84 E-value=3e+02 Score=23.65 Aligned_cols=69 Identities=22% Similarity=0.267 Sum_probs=39.0
Q ss_pred hhcCCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHH
Q psy4702 19 LLTCEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELA 89 (138)
Q Consensus 19 ll~~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a 89 (138)
-+..+.|...+.+|..++-.+.. .+..-..++ +.....|..+...++|-+|..-+=++.++...+.+..
T Consensus 159 ~lL~d~~~~V~~~a~~~l~~i~~-~~~~~~~~~-~~~~~~L~~~l~~~~~~~q~~il~~l~~~~~~~~~~~ 227 (526)
T PF01602_consen 159 QLLSDKDPSVVSAALSLLSEIKC-NDDSYKSLI-PKLIRILCQLLSDPDPWLQIKILRLLRRYAPMEPEDA 227 (526)
T ss_dssp HHTTHSSHHHHHHHHHHHHHHHC-THHHHTTHH-HHHHHHHHHHHTCCSHHHHHHHHHHHTTSTSSSHHHH
T ss_pred hhccCCcchhHHHHHHHHHHHcc-CcchhhhhH-HHHHHHhhhcccccchHHHHHHHHHHHhcccCChhhh
Confidence 33366778888888777777721 233322111 4455566666666777666666666666555444433
No 104
>PF11841 DUF3361: Domain of unknown function (DUF3361)
Probab=25.63 E-value=1.5e+02 Score=23.48 Aligned_cols=42 Identities=24% Similarity=0.318 Sum_probs=33.6
Q ss_pred HHHHhhhhHHHHHHHhhcccc-----H-HHHHHHHHHHHHHHhCCCcc
Q psy4702 89 AERVLSTELMEILMALTLLEG-----E-EVRKLALQALAKGEEYGIIR 130 (138)
Q Consensus 89 a~kl~es~~leiL~~~~k~~~-----~-~v~~~a~eaLk~~~~~glIk 130 (138)
|..++..+++..|..+...+. . .++..+-.|+-.+||.|.|-
T Consensus 4 A~EFI~~~Gl~~L~~~iE~g~~~~~~~~~~La~~L~af~eLMeHg~vs 51 (160)
T PF11841_consen 4 AQEFISRDGLTLLIKMIEEGTEIQPCKGEILAYALTAFVELMEHGIVS 51 (160)
T ss_pred HHHHHhccCHHHHHHHHHcCCccCcchHHHHHHHHHHHHHHHhcCcCc
Confidence 667888888888888876643 2 77888999999999999753
No 105
>KOG1767|consensus
Probab=25.28 E-value=49 Score=24.86 Aligned_cols=54 Identities=19% Similarity=0.291 Sum_probs=30.7
Q ss_pred hhhhhhcChHHHHHHh-hhhHHHHHHHhhccccH-HH-HHHHHHHHHHHHhCCCccCC
Q psy4702 78 VGNILSAGKELAERVL-STELMEILMALTLLEGE-EV-RKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 78 v~Nmi~a~~e~a~kl~-es~~leiL~~~~k~~~~-~v-~~~a~eaLk~~~~~glIk~~ 132 (138)
+.||+-=|.+.=.|+. |-.-...++.. -.+++ +| -.+|+.+|.-|+..|+|+.-
T Consensus 37 vnn~Vlfdqatydkl~kevp~~k~it~s-vl~dRlkIngsLAr~alr~L~~kG~Ik~V 93 (110)
T KOG1767|consen 37 VNNAVLFDQATYDKLLKEVPKYKLITPS-VLSDRLKINGSLARAALRELSNKGVIKQV 93 (110)
T ss_pred hhhheeecHHHHHHHHHhcccceeecHH-HhhhhhhhchHHHHHHHHHHHhcchHHHH
Confidence 4566655555555532 11111111111 12233 55 67899999999999999863
No 106
>KOG2122|consensus
Probab=23.99 E-value=1.5e+02 Score=31.83 Aligned_cols=83 Identities=16% Similarity=0.080 Sum_probs=67.1
Q ss_pred HHHHHhhhhhhhhhccCCHHHHhhhhh--ccHHHHHHHHhcCCCcccchhhhhhhhhhh-hcChHHHHHHhhhhHHHHHH
Q psy4702 26 TETACAAAGALAMLTSVSTPVCSMLLE--KNWVDIFKQILAHPDVGVQHRALVIVGNIL-SAGKELAERVLSTELMEILM 102 (138)
Q Consensus 26 ~~tr~AA~GALAmLT~~~~~~c~~I~~--~~wleil~~L~~~~~~~lqHRG~viv~Nmi-~a~~e~a~kl~es~~leiL~ 102 (138)
-.+|+=|+-||--||. -+..=++-+= ..++|.+..-+.+..++|+.==.=++||+- .+|.-.-+-|.|.|-+-.|+
T Consensus 365 ~aLRrYa~MALTNLTF-GDv~NKa~LCs~rgfMeavVAQL~s~peeL~QV~AsvLRNLSWRAD~nmKkvLrE~GsVtaLa 443 (2195)
T KOG2122|consen 365 NALRRYAGMALTNLTF-GDVANKATLCSQRGFMEAVVAQLISAPEELLQVYASVLRNLSWRADSNMKKVLRETGSVTALA 443 (2195)
T ss_pred HHHHHHHHHHhhcccc-ccccchhhhhhhhhHHHHHHHHHhcChHHHHHHHHHHHHhccccccccHHHHHHhhhhHHHHH
Confidence 3789999999999997 4545444444 889999999999999899877777899994 67777777788999999999
Q ss_pred Hhhcccc
Q psy4702 103 ALTLLEG 109 (138)
Q Consensus 103 ~~~k~~~ 109 (138)
.|+-.++
T Consensus 444 ~~al~~~ 450 (2195)
T KOG2122|consen 444 ACALRNK 450 (2195)
T ss_pred HHHHHhc
Confidence 8876544
No 107
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=23.82 E-value=2.9e+02 Score=20.11 Aligned_cols=65 Identities=17% Similarity=0.194 Sum_probs=42.4
Q ss_pred HHHHHHHhcCCCcccchhhhhhhhhhhhcChH-HHHHHhhhhHHHHHHHhhccc--cH-HHHHHHHHHH
Q psy4702 56 VDIFKQILAHPDVGVQHRALVIVGNILSAGKE-LAERVLSTELMEILMALTLLE--GE-EVRKLALQAL 120 (138)
Q Consensus 56 leil~~L~~~~~~~lqHRG~viv~Nmi~a~~e-~a~kl~es~~leiL~~~~k~~--~~-~v~~~a~eaL 120 (138)
+..|..-+.++|+.+|.|.+.++-=++...+. ....+-..+.++-|..+...+ .. +|.+.+.+.+
T Consensus 39 ~raL~krl~~~n~~vql~AL~lLd~~vkNcg~~f~~~i~s~~fl~~l~~l~~~~~~~~~~Vk~kil~ll 107 (133)
T cd03561 39 ARAIRKKIKYGNPHVQLLALTLLELLVKNCGKPFHLQVADKEFLLELVKIAKNSPKYDPKVREKALELI 107 (133)
T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHHHhCChHHHHHHhhHHHHHHHHHHhCCCCCCCHHHHHHHHHHH
Confidence 34444456789999999999999888876555 555555556666677776653 12 5555444433
No 108
>COG1413 FOG: HEAT repeat [Energy production and conversion]
Probab=23.74 E-value=4e+02 Score=21.66 Aligned_cols=22 Identities=36% Similarity=0.376 Sum_probs=17.4
Q ss_pred hcCCCCHHHHHhhhhhhhhhcc
Q psy4702 20 LTCEEDTETACAAAGALAMLTS 41 (138)
Q Consensus 20 l~~eED~~tr~AA~GALAmLT~ 41 (138)
|...+|..+|++|..+|+.+-.
T Consensus 114 l~~d~~~~vR~~aa~aL~~~~~ 135 (335)
T COG1413 114 LENDENEGVRAAAARALGKLGD 135 (335)
T ss_pred HHcCCcHhHHHHHHHHHHhcCc
Confidence 4447888999999999988754
No 109
>PF13730 HTH_36: Helix-turn-helix domain
Probab=23.66 E-value=88 Score=18.94 Aligned_cols=17 Identities=35% Similarity=0.515 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHhCCCc
Q psy4702 113 RKLALQALAKGEEYGII 129 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glI 129 (138)
.....++++.|++.|+|
T Consensus 39 ~~Tv~~~i~~L~~~G~I 55 (55)
T PF13730_consen 39 RRTVQRAIKELEEKGLI 55 (55)
T ss_pred HHHHHHHHHHHHHCcCC
Confidence 35677889999999987
No 110
>PF12802 MarR_2: MarR family; PDB: 3ECO_B 2QWW_B 3KP6_B 3KP4_B 3KP2_A 3KP5_A 3KP3_B 3KP7_A 3NQO_B 3K0L_B ....
Probab=23.52 E-value=57 Score=19.97 Aligned_cols=23 Identities=13% Similarity=0.238 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHhCCCccCCCCCC
Q psy4702 114 KLALQALAKGEEYGIIRRPGEGE 136 (138)
Q Consensus 114 ~~a~eaLk~~~~~glIk~~~~~~ 136 (138)
......++.+++.|||.+..+..
T Consensus 36 ~~vs~~v~~L~~~Glv~r~~~~~ 58 (62)
T PF12802_consen 36 STVSRIVKRLEKKGLVERERDPG 58 (62)
T ss_dssp HHHHHHHHHHHHTTSEEEEE-SS
T ss_pred HHHHHHHHHHHHCCCEEEeCCCC
Confidence 55667799999999999755443
No 111
>PF07409 GP46: Phage protein GP46; InterPro: IPR010877 This entry is represented by Bacteriophage Mu, Gp46. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=23.39 E-value=75 Score=23.71 Aligned_cols=21 Identities=19% Similarity=0.121 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHhCCCccC
Q psy4702 111 EVRKLALQALAKGEEYGIIRR 131 (138)
Q Consensus 111 ~v~~~a~eaLk~~~~~glIk~ 131 (138)
.+.+.|.|||+.+.+.|..++
T Consensus 61 ~A~~ya~EAL~wLv~dg~a~~ 81 (116)
T PF07409_consen 61 RAEDYAEEALQWLVDDGVASS 81 (116)
T ss_pred HHHHHHHHHHHHHHhcCceeE
Confidence 778999999999999998764
No 112
>KOG1059|consensus
Probab=23.01 E-value=1.9e+02 Score=28.53 Aligned_cols=41 Identities=20% Similarity=0.325 Sum_probs=33.0
Q ss_pred ccHHHHHHHHhcCCCcccchhhhhhhhhhhhcC--hHHHHHHh
Q psy4702 53 KNWVDIFKQILAHPDVGVQHRALVIVGNILSAG--KELAERVL 93 (138)
Q Consensus 53 ~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~--~e~a~kl~ 93 (138)
....++++..+.|.|+.+|-|++-.++-|+.-+ .++.++++
T Consensus 335 qa~kdlIlrcL~DkD~SIRlrALdLl~gmVskkNl~eIVk~LM 377 (877)
T KOG1059|consen 335 QAHKDLILRCLDDKDESIRLRALDLLYGMVSKKNLMEIVKTLM 377 (877)
T ss_pred HHhHHHHHHHhccCCchhHHHHHHHHHHHhhhhhHHHHHHHHH
Confidence 457899999999999999999999999998654 45555543
No 113
>PF01047 MarR: MarR family; InterPro: IPR000835 The MarR-type HTH domain is a DNA-binding, winged helix-turn-helix (wHTH) domain of about 135 amino acids present in transcription regulators of the MarR/SlyA family, involved in the development of antibiotic resistance. This family of transcription regulators is named after Escherichia coli MarR, a repressor of genes which activate the multiple antibiotic resistance and oxidative stress regulons, and after slyA from Salmonella typhimurium and E. coli, a transcription regulator that is required for virulence and survival in the macrophage environment. Regulators with the MarR-type HTH domain are present in bacteria and archaea and control a variety of biological functions, including resistance to multiple antibiotics, household disinfectants, organic solvents, oxidative stress agents and regulation of the virulence factor synthesis in pathogens of humans and plants. Many of the MarR-like regulators respond to aromatic compounds [, , ]. The crystal structures of MarR, MexR and SlyA have been determined and show a winged HTH DNA-binding core flanked by helices involved in dimerisation. The DNA-binding domains are ascribed to the superfamily of winged helix proteins, containing a three (four)-helix (H) bundle and a three-stranded antiparallel beta-sheet (B) in the topology: H1-(H1')-H2-B1-H3-H4-B2-B3-H5-H6. Helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing. Helix 4 is termed the recognition helix, like in other HTHs where it binds the DNA major groove. The helices 1, 5 and 6 are involved in dimerisation, as most MarR-like transcription regulators form dimers [, ]. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1JGS_A 2NYX_D 2PEX_B 2PFB_A 3BPX_A 3BPV_A 2BV6_A 3BJA_A 3E6M_B 2ETH_A ....
Probab=22.72 E-value=45 Score=20.40 Aligned_cols=23 Identities=17% Similarity=0.328 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHhCCCccCCCCCC
Q psy4702 114 KLALQALAKGEEYGIIRRPGEGE 136 (138)
Q Consensus 114 ~~a~eaLk~~~~~glIk~~~~~~ 136 (138)
......++.+++.|||.+..+++
T Consensus 32 ~~~t~~i~~L~~~g~I~r~~~~~ 54 (59)
T PF01047_consen 32 STVTRIIKRLEKKGLIERERDPD 54 (59)
T ss_dssp HHHHHHHHHHHHTTSEEEEEETT
T ss_pred hHHHHHHHHHHHCCCEEeccCCC
Confidence 45567799999999999755544
No 114
>PF13463 HTH_27: Winged helix DNA-binding domain; PDB: 3GFL_A 2YR2_B 3GFM_A 3GFJ_A 3GF2_A 3GEZ_A 2GXG_A 3GFI_A 2EB7_A.
Probab=22.63 E-value=50 Score=20.57 Aligned_cols=22 Identities=18% Similarity=0.289 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHhCCCccCCCCC
Q psy4702 114 KLALQALAKGEEYGIIRRPGEG 135 (138)
Q Consensus 114 ~~a~eaLk~~~~~glIk~~~~~ 135 (138)
..+...++.+.+.|||++..+.
T Consensus 33 ~~vs~~i~~L~~~glv~~~~~~ 54 (68)
T PF13463_consen 33 STVSRIIKKLEEKGLVEKERDP 54 (68)
T ss_dssp HHHHHHHHHHHHTTSEEEEEES
T ss_pred HHHHHHHHHHHHCCCEEecCCC
Confidence 4556789999999999765443
No 115
>PF05536 Neurochondrin: Neurochondrin
Probab=22.61 E-value=4.6e+02 Score=24.10 Aligned_cols=95 Identities=18% Similarity=0.125 Sum_probs=62.3
Q ss_pred HHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhcCCCc-ccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHh
Q psy4702 27 ETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILAHPDV-GVQHRALVIVGNILSAGKELAERVLSTELMEILMAL 104 (138)
Q Consensus 27 ~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~~~~~-~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~ 104 (138)
..+.-|-.-|+.++. .|....- .+ ..-+.+|.+.+.+.+. ++.-=..-|+.++. +..+-++.+++.|.+-.|..+
T Consensus 72 ~~~~LavsvL~~f~~-~~~~a~~-~~~~~~IP~Lle~l~~~s~~~~v~dalqcL~~Ia-s~~~G~~aLl~~g~v~~L~ei 148 (543)
T PF05536_consen 72 EYLSLAVSVLAAFCR-DPELASS-PQMVSRIPLLLEILSSSSDLETVDDALQCLLAIA-SSPEGAKALLESGAVPALCEI 148 (543)
T ss_pred HHHHHHHHHHHHHcC-ChhhhcC-HHHHHHHHHHHHHHHcCCchhHHHHHHHHHHHHH-cCcHhHHHHHhcCCHHHHHHH
Confidence 344444445555555 4544321 22 3455667777767666 77778888898888 777888999999999999888
Q ss_pred hccccHHHHHHHHHHHHHHHh
Q psy4702 105 TLLEGEEVRKLALQALAKGEE 125 (138)
Q Consensus 105 ~k~~~~~v~~~a~eaLk~~~~ 125 (138)
.... +...+.|...|-.++.
T Consensus 149 ~~~~-~~~~E~Al~lL~~Lls 168 (543)
T PF05536_consen 149 IPNQ-SFQMEIALNLLLNLLS 168 (543)
T ss_pred HHhC-cchHHHHHHHHHHHHH
Confidence 6662 2445666666665554
No 116
>PF07814 WAPL: Wings apart-like protein regulation of heterochromatin; InterPro: IPR022771 This entry contains sequences expressed in eukaryotic organisms (metazoa, fungi, plants) bearing high similarity to the WAPL conserved region of D. melanogaster wings apart-like protein. This protein is involved in the regulation of heterochromatin structure []. hWAPL (Q7Z5K2 from SWISSPROT), the human homologue, is found to play a role in the development of cervical carcinogenesis, and is thought to have similar functions to Drosophila wapl protein []. Malfunction of the hWAPL pathway is thought to activate an apoptotic pathway that consequently leads to cell death []. This entry includes proteins from metazoa, fungi and plants.
Probab=22.39 E-value=90 Score=26.90 Aligned_cols=50 Identities=22% Similarity=0.242 Sum_probs=39.2
Q ss_pred hhhhhhhhcCC-CCHHHHHhhhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHhc
Q psy4702 13 QVRFLFLLTCE-EDTETACAAAGALAMLTSVSTPVCSMLLE-KNWVDIFKQILA 64 (138)
Q Consensus 13 RlklLvll~~e-ED~~tr~AA~GALAmLT~~~~~~c~~I~~-~~wleil~~L~~ 64 (138)
.-+++-.+.|. +|.....+++..+.+|++ +..+..+++ ..|.+++..|+.
T Consensus 65 ~~~l~~~l~~~~~d~~~~l~~a~i~~~l~~--d~~~~~l~~~~~~~~ll~~Ll~ 116 (361)
T PF07814_consen 65 VKRLFKALSDAPDDDILALATAAILYVLSR--DGLNMHLLLDRDSLRLLLKLLK 116 (361)
T ss_pred HHHHHHHhccccchHHHHHHHHHHHHHHcc--CCcchhhhhchhHHHHHHHHhc
Confidence 44788888777 555888888888888886 556666767 889999888876
No 117
>PF04063 DUF383: Domain of unknown function (DUF383); InterPro: IPR007205 This is a protein of unknown function. It is found N-terminal to another domain of unknown function (IPR007206 from INTERPRO).
Probab=22.28 E-value=2.7e+02 Score=22.23 Aligned_cols=72 Identities=21% Similarity=0.272 Sum_probs=51.6
Q ss_pred hhhhhhhhccCCHHHHhhhhh-ccHHHHHHHHh---cCCCcccchhhhh-hhhhhhhcChHHHHHHhhhhHHHHHHHhhc
Q psy4702 32 AAGALAMLTSVSTPVCSMLLE-KNWVDIFKQIL---AHPDVGVQHRALV-IVGNILSAGKELAERVLSTELMEILMALTL 106 (138)
Q Consensus 32 A~GALAmLT~~~~~~c~~I~~-~~wleil~~L~---~~~~~~lqHRG~v-iv~Nmi~a~~e~a~kl~es~~leiL~~~~k 106 (138)
-|--|+-+|. .+..-+-+++ .++...++.|. .++ ..+|.||++ +++|.-- +.+....+...+-++||..+.-
T Consensus 78 la~vl~NlS~-~~~gR~~~l~~~~~~~~l~kLl~ft~~~-s~iRR~Gva~~IrNccF-d~~~H~~LL~~~~~~iLp~LLl 154 (192)
T PF04063_consen 78 LASVLANLSQ-LPEGRQFFLDPQRYDGPLQKLLPFTEHK-SVIRRGGVAGTIRNCCF-DTDSHEWLLSDDEVDILPYLLL 154 (192)
T ss_pred HHHHHHHhcC-CHHHHHHHhCchhhhhHHHHHHHHhccC-cHHHHHHHHHHHHHhhc-cHhHHHHhcCchhhhhHHHHHh
Confidence 4567899998 6888888888 66665555554 556 778788876 6788743 3355678888776888887763
No 118
>PTZ00429 beta-adaptin; Provisional
Probab=22.00 E-value=1.2e+02 Score=29.17 Aligned_cols=60 Identities=17% Similarity=0.139 Sum_probs=45.5
Q ss_pred CCCCHHHHHhhhhhhhhhccCCHHHHhhhhhccHHHHHHHHhcCCCcccchhhhhhhhhhhhc
Q psy4702 22 CEEDTETACAAAGALAMLTSVSTPVCSMLLEKNWVDIFKQILAHPDVGVQHRALVIVGNILSA 84 (138)
Q Consensus 22 ~eED~~tr~AA~GALAmLT~~~~~~c~~I~~~~wleil~~L~~~~~~~lqHRG~viv~Nmi~a 84 (138)
...|+-.|++|+-|++=+-..+|..| .+..|++.|..|+.|.|+.++.-.+..+..+-..
T Consensus 150 ~D~~pYVRKtAalai~Kly~~~pelv---~~~~~~~~L~~LL~D~dp~Vv~nAl~aL~eI~~~ 209 (746)
T PTZ00429 150 ADPDPYVRKTAAMGLGKLFHDDMQLF---YQQDFKKDLVELLNDNNPVVASNAAAIVCEVNDY 209 (746)
T ss_pred cCCCHHHHHHHHHHHHHHHhhCcccc---cccchHHHHHHHhcCCCccHHHHHHHHHHHHHHh
Confidence 34677899999888888766455432 2268999999999999999988887777666544
No 119
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=21.54 E-value=85 Score=26.46 Aligned_cols=23 Identities=22% Similarity=0.398 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHhCCCccCCCCC
Q psy4702 113 RKLALQALAKGEEYGIIRRPGEG 135 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glIk~~~~~ 135 (138)
.+.+.++|+.|++.|||+++.++
T Consensus 153 ~~ev~~sL~~L~~~glikk~~~g 175 (271)
T TIGR02147 153 AEQVKESLDLLERLGLIKKNEDG 175 (271)
T ss_pred HHHHHHHHHHHHHCCCeeECCCC
Confidence 45678999999999999997664
No 120
>PF00392 GntR: Bacterial regulatory proteins, gntR family; InterPro: IPR000524 Many bacterial transcription regulation proteins bind DNA through a helix-turn-helix (HTH) motif, which can be classified into subfamilies on the basis of sequence similarities. The HTH GntR family has many members distributed among diverse bacterial groups that regulate various biological processes. It was named GntR after the Bacillus subtilis repressor of the gluconate operon []. Family members include GntR, HutC, KorA, NtaR, FadR, ExuR, FarR, DgoR and PhnF. The crystal structure of the FadR protein has been determined []. In general, these proteins contain a DNA-binding HTH domain at the N terminus, and an effector-binding or oligomerisation domain at the C terminus (IPR011711 from INTERPRO). The DNA-binding domain is well conserved in structure for the whole of the GntR family, consisting of a 3-helical bundle core with a small beta-sheet (wing); the GntR winged helix structure is similar to that found in several other transcriptional regulator families. The regions outside the DNA-binding domain are more variable and are consequently used to define GntR subfamilies []. This entry represents the N-terminal DNA-binding domain of the GntR family.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1HW1_B 1H9T_A 1HW2_A 1H9G_A 1E2X_A 3IHU_A 3C7J_A 2RA5_A 3BY6_C 3IC7_A ....
Probab=20.92 E-value=82 Score=19.93 Aligned_cols=21 Identities=33% Similarity=0.384 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHhCCCccCCC
Q psy4702 113 RKLALQALAKGEEYGIIRRPG 133 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glIk~~~ 133 (138)
...+++||+.|.+.|+|....
T Consensus 38 r~tvr~al~~L~~~g~i~~~~ 58 (64)
T PF00392_consen 38 RTTVREALRRLEAEGLIERRP 58 (64)
T ss_dssp HHHHHHHHHHHHHTTSEEEET
T ss_pred CcHHHHHHHHHHHCCcEEEEC
Confidence 467889999999999997543
No 121
>smart00049 DEP Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
Probab=20.92 E-value=1.2e+02 Score=19.68 Aligned_cols=40 Identities=20% Similarity=0.150 Sum_probs=26.5
Q ss_pred HhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 92 VLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 92 l~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
+.-+++++-|+......+ .+.|++-...++++|+|.+-.+
T Consensus 21 F~G~e~v~wL~~~~~~~~---r~eA~~l~~~ll~~g~i~~v~~ 60 (77)
T smart00049 21 FTGSELVDWLMDNLEIID---REEAVHLGQLLLDEGLIHHVNG 60 (77)
T ss_pred eEcHHHHHHHHHcCCcCC---HHHHHHHHHHHHHCCCEEEeCC
Confidence 344556777777655322 2556666778999999998653
No 122
>PF10007 DUF2250: Uncharacterized protein conserved in archaea (DUF2250); InterPro: IPR019254 Members of this family of hypothetical archaeal proteins have no known function.
Probab=20.91 E-value=91 Score=22.40 Aligned_cols=21 Identities=33% Similarity=0.362 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHhCCCccCCC
Q psy4702 113 RKLALQALAKGEEYGIIRRPG 133 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glIk~~~ 133 (138)
.+.+.++|+++++.|||++.+
T Consensus 35 ~~~v~~~l~~Le~~GLler~~ 55 (92)
T PF10007_consen 35 LEEVREALEKLEEMGLLERVE 55 (92)
T ss_pred HHHHHHHHHHHHHCCCeEEec
Confidence 577889999999999998754
No 123
>PF03297 Ribosomal_S25: S25 ribosomal protein; InterPro: IPR004977 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. The S25 ribosomal protein is a component of the 40S ribosomal subunit.; PDB: 2XZM_8 2XZN_8 3O30_Q 3U5G_Z 3IZB_V 3U5C_Z 3O2Z_Q 3IZ6_V.
Probab=20.58 E-value=73 Score=23.46 Aligned_cols=20 Identities=35% Similarity=0.376 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHhCCCccCC
Q psy4702 113 RKLALQALAKGEEYGIIRRP 132 (138)
Q Consensus 113 ~~~a~eaLk~~~~~glIk~~ 132 (138)
..+|+.+|+.|++.|+|++-
T Consensus 73 ~SlAr~~Lr~L~~kG~Ik~V 92 (105)
T PF03297_consen 73 GSLARKALRELESKGLIKPV 92 (105)
T ss_dssp CHHHHHHHHHHHHCCSSEEE
T ss_pred HHHHHHHHHHHHHCCCEEEE
Confidence 57999999999999999863
No 124
>KOG4413|consensus
Probab=20.55 E-value=1.2e+02 Score=27.64 Aligned_cols=77 Identities=17% Similarity=0.152 Sum_probs=49.1
Q ss_pred cHHHHHHHHhcCCCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCcc
Q psy4702 54 NWVDIFKQILAHPDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIR 130 (138)
Q Consensus 54 ~wleil~~L~~~~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk 130 (138)
.-++.|++....|.++++--|.-..--+++..=-.-+-+-..|++||.+.-+........+.-.+|-|+..+..++-
T Consensus 410 dPleLFlgilqQpfpEihcAalktfTAiaaqPWalkeifakeefieiVtDastEhaKaakdAkYeccKAiaeaflca 486 (524)
T KOG4413|consen 410 DPLELFLGILQQPFPEIHCAALKTFTAIAAQPWALKEIFAKEEFIEIVTDASTEHAKAAKDAKYECCKAIAEAFLCA 486 (524)
T ss_pred ChHHHHHHHHcCCChhhHHHHHHHHHHHHcCcHHHHHHhcCccceeeecccchhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 35788999999999999655554444444443333344556778888877654433233566677777776655543
No 125
>PF06371 Drf_GBD: Diaphanous GTPase-binding Domain; InterPro: IPR010473 Diaphanous-related formins (Drfs) are a family of formin homology (FH) proteins that act as effectors of Rho small GTPases during growth factor-induced cytoskeletal remodelling, stress fibre formation, and cell division []. Drf proteins are characterised by a variety of shared domains: an N-terminal GTPase-binding domain (GBD), formin-homology domains FH1, FH2 (IPR003104 from INTERPRO) and FH3 (IPR010472 from INTERPRO), and a C-terminal conserved Dia-autoregulatory domain (DAD) that binds the GBD. This entry represents the GBD, which is a bifunctional autoinhibitory domain that interacts with and is regulated by activated Rho family members. Mammalian Drf3 contains a CRIB-like motif within its GBD for binding to Cdc42, which is required for Cdc42 to activate and guide Drf3 towards the cell cortex where it remodels the actin skeleton [].; GO: 0003779 actin binding, 0017048 Rho GTPase binding, 0030036 actin cytoskeleton organization; PDB: 3OBV_A 2BNX_A 3EG5_D 2BAP_B 3O4X_B 1Z2C_B 2F31_A.
Probab=20.40 E-value=3.6e+02 Score=19.88 Aligned_cols=56 Identities=20% Similarity=0.302 Sum_probs=36.8
Q ss_pred CCcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccH-HHHHHHHHHHHH
Q psy4702 66 PDVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGE-EVRKLALQALAK 122 (138)
Q Consensus 66 ~~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~-~v~~~a~eaLk~ 122 (138)
++.+++|=-+-|++-++++.--+..-+-..+.+..|..+.- +.+ ++...+.+.|..
T Consensus 128 ~~~~~~~~~l~Clkal~n~~~G~~~v~~~~~~v~~i~~~L~-s~~~~~r~~~leiL~~ 184 (187)
T PF06371_consen 128 EDIDIEHECLRCLKALMNTKYGLEAVLSHPDSVNLIALSLD-SPNIKTRKLALEILAA 184 (187)
T ss_dssp TCHHHHHHHHHHHHHHTSSHHHHHHHHCSSSHHHHHHHT---TTSHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHccHHHHHHHHcCcHHHHHHHHHHC-CCCHHHHHHHHHHHHH
Confidence 45578888899999988775333334445778888777754 333 777777776654
No 126
>cd04443 DEP_GPR155 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in GPR155-like proteins. GRP155-like proteins, also known as PGR22, contain an N-terminal permease domain, a central transmembrane region and a C-terminal DEP domain. They are orphan receptors of the class B G protein-coupled receptors. Their function is unknown.
Probab=20.16 E-value=1.8e+02 Score=20.24 Aligned_cols=41 Identities=12% Similarity=0.099 Sum_probs=26.8
Q ss_pred HHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 91 RVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
-+.-+++++-|........+ +.|+.--.++++.|+|.+-.+
T Consensus 30 cF~GselVdWL~~~~~~~sR---~eAv~lg~~Ll~~G~i~HV~~ 70 (83)
T cd04443 30 VFCGCDLVSWLIEVGLAQDR---GEAVLYGRRLLQGGVLQHITN 70 (83)
T ss_pred cccHHHHHHHHHHcCCCCCH---HHHHHHHHHHHHCCCEEecCC
Confidence 36777888888876432222 344444556999999998544
No 127
>KOG0168|consensus
Probab=20.01 E-value=2.3e+02 Score=28.57 Aligned_cols=70 Identities=19% Similarity=0.164 Sum_probs=48.5
Q ss_pred ccHHHHHHHHhcCC-CcccchhhhhhhhhhhhcChHHHHHHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHH
Q psy4702 53 KNWVDIFKQILAHP-DVGVQHRALVIVGNILSAGKELAERVLSTELMEILMALTLLEGEEVRKLALQALAKGE 124 (138)
Q Consensus 53 ~~wleil~~L~~~~-~~~lqHRG~viv~Nmi~a~~e~a~kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~ 124 (138)
+..+..|..|+.|+ |.+|+--+.=++.||...-..-..-+++.+.+-+|.+=... - +-+++|.++|.+++
T Consensus 210 ~slvp~Lv~LL~~E~n~DIMl~AcRaltyl~evlP~S~a~vV~~~aIPvl~~kL~~-I-eyiDvAEQ~LqALE 280 (1051)
T KOG0168|consen 210 KSLVPVLVALLSHEHNFDIMLLACRALTYLCEVLPRSSAIVVDEHAIPVLLEKLLT-I-EYIDVAEQSLQALE 280 (1051)
T ss_pred HHHHHHHHHHHhccccHHHHHHHHHHHHHHHhhccchhheeecccchHHHHHhhhh-h-hhhHHHHHHHHHHH
Confidence 77889999999885 78999999999999988765556667777777776543211 1 33445555554443
No 128
>cd04442 DEP_1_DEP6 DEP (Dishevelled, Egl-10, and Pleckstrin) domain 1 found in DEP6-like proteins. DEP6 proteins contain two DEP and a PDZ domain. Their function is unknown.
Probab=20.01 E-value=1.6e+02 Score=20.59 Aligned_cols=41 Identities=17% Similarity=0.213 Sum_probs=28.3
Q ss_pred HHhhhhHHHHHHHhhccccHHHHHHHHHHHHHHHhCCCccCCCC
Q psy4702 91 RVLSTELMEILMALTLLEGEEVRKLALQALAKGEEYGIIRRPGE 134 (138)
Q Consensus 91 kl~es~~leiL~~~~k~~~~~v~~~a~eaLk~~~~~glIk~~~~ 134 (138)
-++-+++++-|........+ +.|+.--.++++.|+|+...+
T Consensus 28 cF~GselVdWL~~~~~~~sR---~eAv~lgq~Ll~~gvi~HV~~ 68 (82)
T cd04442 28 CFVGKELIDWLIEHKEASDR---ETAIKIMQKLLDHSIIHHVCD 68 (82)
T ss_pred eeEcHHHHHHHHHcCCCCCH---HHHHHHHHHHHHCCCEEeccC
Confidence 46778889999877653222 445555566899999998543
Done!